0% found this document useful (0 votes)

101 views41 pages

XML Dom and Sax Parsers

The document discusses XML parsers which read XML files and convert them into a format applications can use, comparing tree-based parsers which build a document object model (DOM) tree and event-based parsers which use callbacks to report parsing events, as well as describing DOM which defines a standard way to access and manipulate XML documents.

Uploaded by

Aaditya Pandey

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

101 views41 pages

XML Dom and Sax Parsers

Uploaded by

Aaditya Pandey

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 41

XML

DOM and SAX

Parsers
Introduction to parsers

 The word parser comes from

compilers

 In a compiler, a parser is the module

that reads and interprets the
programming language.
Introduction to Parsers

 In XML, a
parser is a
software
component
that sits
between the
application
and the XML
files.
Introduction to parsers

 It reads a text-formatted XML file or

stream and converts it to a
document to be manipulated by the
application.
Well-formedness and validity

 Well-formed documents respect the

syntactic rules.

 Valid documents not only respect the

syntactic rules but also conform to a
structure as described in a DTD.
Validating vs. Non-validating
parsers

 Both parsers enforce syntactic rules

 only validating parsers know how to

validate documents against their
DTDs
Tree-based parsers

 These map an XML document into an

internal tree structure, and then
allow an application to navigate that
tree.

 Ideal for browsers, editors, XSL

processors.
Event-based

 An event-based API reports parsing

events (such as the start and end of
elements) directly to the application
through callbacks.

 The application implements handlers

to deal with the different events
Event-based vs. Tree-based
parsers

 Tree-based parsers deal generally

small documents.

 Event-based parsers deal generally

used for large documents.
Event-based vs. Tree-based
parsers

 Tree-based parsers are generally

easier to implement.

 Event-based parsers are more

complex and give hard time for the
programmer
What is DOM?

 The Document Object Model (DOM)

is an application programming
interface (API) for HTML and XML
documents.

 It defines the logical structure of

documents and the way a document
is accessed and manipulated
Properties of DOM
 Programmers can build documents,
navigate their structure, and add, modify,
or delete elements and content.

 Provides a standard programming

interface that can be used in a wide
variety of environments and applications.

 structural isomorphism.
DOM Identifies

 The interfaces and objects used to

represent and manipulate a document.

 The semantics of these interfaces and

objects - including both behavior and
attributes.

 The relationships and collaborations

among these interfaces and objects.
What DOM is not!!

 The Document Object Model is not a

binary specification.

 The Document Object Model is not a way

of persisting objects to XML or HTML.

 The Document Object Model does not

define "the true inner semantics" of XML
or HTML.
What DOM is not!!

 The Document Object Model is not a

set of data structures, it is an object
model that specifies interfaces.

 The Document Object Model is not a

competitor to the Component Object
Model (COM).
DOM into work
<?xml version="1.0"?>
<products>
<product>
<name>XML Editor</name>
<price>499.00</price>
</product>
<product>
<name>DTD Editor</name>
<price>199.00</price>
</product>
<product>
<name>XML Book</name>
<price>19.99</price>
</product>
<product>
<name>XML Training</name>
<price>699.00</price>
</product>
</products>
DOM into work
DOM levels: level 0

 DOM Level 0 is a mix of Netscape

Navigator 3.0 and MS Internet
Explorer 3.0 document
functionalities.
DOM levels: DOM 1

 It contains functionality for document

navigation and manipulation.

i.e.: functions for creating, deleting

and changing elements and their
attributes.
DOM level 1 limitations
 A structure model for the internal
subset and the external subset.
 Validation against a schema.
 Control for rendering documents via
style sheets.
 Access control.
 Thread-safety.
 Events
DOM levels: DOM 2
 A style sheet object model and
defines functionality for manipulating
the style information attached to a
document.
 Enables of the traversal on the

document.
 Defines an event model.

 Provides support for XML

namespaces
DOM levels: DOM 3
 Document loading and saving as well
as content models (such as DTD’s
and schemas) with document
validation support.

 Document views and formatting, key

events and event groups
An Application of DOM
<HTML>
<HEAD>
<TITLE>Currency Conversion</TITLE>
<SCRIPT LANGUAGE="JavaScript" SRC="conversion.js"></SCRIPT>
</HEAD>
<BODY>
<CENTER>
<FORM ID="controls">
File: <INPUT TYPE="TEXT" NAME="fname" VALUE="prices.xml">
Rate: <INPUT TYPE="TEXT" NAME="rate" VALUE="0.95274" SIZE="4"><BR>
<INPUT TYPE="BUTTON" VALUE="Convert" ONCLICK="convert(controls,xml)">
<INPUT TYPE="BUTTON" VALUE="Clear" ONCLICK="output.value=''"><BR>
<TEXTAREA NAME="output" ROWS="10" COLS="50" READONLY> </TEXTAREA>
</FORM>
<xml id="xml"></xml>
</CENTER>
</BODY>
</HTML>
An Application of DOM
 <xml id="xml"></xml>: defines an XML
island.

 XML islands are mechanisms used to

insert XML in HTML documents.

 In this case, XML islands are used to

access Internet Explorer’s XML parser. The
price list is loaded into the island.
An Application of DOM
 The “Convert” button in the HTML file
calls the JavaScript function
convert(), which is the conversion
routine.

 convert() accepts two parameters,

the form and the XML island.
An Application for DOM
<SCRIPT LANGUAGE="JavaScript"
SRC="conversion.js"></SCRIPT>

function convert(form,xmldocument)
{var fname = form.fname.value,
output = form.output,
rate = form.rate.value;
output.value = "";
var document = parse(fname,xmldocument),
topLevel = document.documentElement;
searchPrice(topLevel,output,rate);}

function getText(node)
{return node.firstChild.data;}
An Application of DOM
 nodeType is a code representing the type of the object.

 parentNode is the parent (if any) of current Node object.

 childNode is the list of children for the current Node object.

 firstChild is the Node’s first child.

 lastChild is the Node’s last child.

 previousSibling is the Node immediately preceding the

current one.
 nextSibling is the Node immediately following the current
one.

 attributes is the list of attributes, if the current Node has

any.
An Application of DOM

 The parse() function loads the price

list in the XML island and returns its
Document object.

 The function searchPrice() tests

whether the current node is an
element.
An Application of DOM

 The function
searchPrice() visits
each node by
recursively calling
itself for all
children of the
current node.
An Application for DOM
What is SAX?
 SAX (the Simple API for XML) is an event-
based parser for xml documents.

 The parser tells the application what is in

the document by notifying the application
of a stream of parsing events.

 Application then processes those events to

act on data.
SAX History

 SAX 1.0 was released on May 11, 1998.

 SAX is a common, event-based API for

parsing XML documents, developed as a
collaborative project of the members of
the XML-DEV discussion under the
leadership of David Megginson.
Why SAX?

 For applications that are not so XML-

centric, an object-based interface is
less appealing.

 Efficiency: lower level than object-

based interfaces
Why SAX?

 Event-based interface consumes

fewer resources than an object-
based one

 With an event-based interface, the

application can start processing the
document as the parser is reading it
Limitations of SAX

 With SAX, it is not possible to

navigate through the document as
you can with a DOM.

 The application must explicitly buffer

those events it is interested in.
SAX API

 Parser events are similar to user-

interface events such as ONCLICK (in
a browser) or AWT events (in Java).

 Events alert the application that

something happened and the
application might want to react.
SAX API
 Element opening tags

 Element closing tags

 Content of elements

 Entities

 Parsing errors
SAX API
SAX Example

<?xml version="1.0"?>
<doc>
<para>Hello, world!</para>
</doc>
SAX example

 start document
 start element: doc
 start element: para
 characters: Hello, world!
 end element: para
 end element: doc
 end document

DOM
100% (1)
DOM
8 pages
Understanding DOM Structure and Levels
No ratings yet
Understanding DOM Structure and Levels
8 pages
Introduction to Dynamic HTML (DHTML)
No ratings yet
Introduction to Dynamic HTML (DHTML)
24 pages
Document Object Model (Dom) Api For Javascript: Al and Range
100% (1)
Document Object Model (Dom) Api For Javascript: Al and Range
14 pages
Module-03 FSD (BIS601) - Final
No ratings yet
Module-03 FSD (BIS601) - Final
33 pages
JavaScript DOM Methods & Examples
No ratings yet
JavaScript DOM Methods & Examples
144 pages
Javascript Note
No ratings yet
Javascript Note
81 pages
Core Web Programming - Chapter 23: Document Object Model DOM
No ratings yet
Core Web Programming - Chapter 23: Document Object Model DOM
34 pages
JavaScript DOM Manipulation Guide
No ratings yet
JavaScript DOM Manipulation Guide
13 pages
Web Module 2
No ratings yet
Web Module 2
133 pages
Dom Manipulation
No ratings yet
Dom Manipulation
13 pages
JavaScript - Operators
No ratings yet
JavaScript - Operators
13 pages
Unit 4
100% (1)
Unit 4
68 pages
ES6 Notes
100% (1)
ES6 Notes
2 pages
MongoDB 3
No ratings yet
MongoDB 3
42 pages
Object Oriented Programming With The Typescript
No ratings yet
Object Oriented Programming With The Typescript
33 pages
Module-04 FSD (BIS601)
No ratings yet
Module-04 FSD (BIS601)
53 pages
JavaScript Basics and Usage Guide
No ratings yet
JavaScript Basics and Usage Guide
72 pages
The Unofficial Guide To NDI - 6 X 9 in PDF
No ratings yet
The Unofficial Guide To NDI - 6 X 9 in PDF
171 pages
HTML1 PDF
100% (1)
HTML1 PDF
35 pages
Variables and Data Types in C#
No ratings yet
Variables and Data Types in C#
14 pages
WK 6 JavaScript Part I
No ratings yet
WK 6 JavaScript Part I
23 pages
jQuery Selector Cheat Sheet
No ratings yet
jQuery Selector Cheat Sheet
3 pages
JavaScript Overview and Features
No ratings yet
JavaScript Overview and Features
8 pages
ITT301 M5 Ktunotes - in
No ratings yet
ITT301 M5 Ktunotes - in
16 pages
Web Services
No ratings yet
Web Services
2 pages
Understanding Dynamic HTML (DHTML)
No ratings yet
Understanding Dynamic HTML (DHTML)
20 pages
Programming The Web Notes
100% (1)
Programming The Web Notes
217 pages
HTML5 WebSockets: Real-Time Web Communication
No ratings yet
HTML5 WebSockets: Real-Time Web Communication
55 pages
FSD Module 5 Notes
No ratings yet
FSD Module 5 Notes
13 pages
Module-03 FSD (BIS601)
No ratings yet
Module-03 FSD (BIS601)
25 pages
2021 Dec. ITT301-A
100% (1)
2021 Dec. ITT301-A
3 pages
C++ Question Bank
No ratings yet
C++ Question Bank
3 pages
Machine Learning Unit 1
No ratings yet
Machine Learning Unit 1
72 pages
Module-02 FSD (BIS601)
No ratings yet
Module-02 FSD (BIS601)
38 pages
Lecturer Notes On IT 2353 UNIT III
100% (3)
Lecturer Notes On IT 2353 UNIT III
30 pages
102 WDB Flexbox Responsive
No ratings yet
102 WDB Flexbox Responsive
25 pages
Adobe AIR Building-Apps
No ratings yet
Adobe AIR Building-Apps
256 pages
JavaScript Interview Q&A Quiz
No ratings yet
JavaScript Interview Q&A Quiz
15 pages
Understanding the Document Object Model
No ratings yet
Understanding the Document Object Model
54 pages
Understanding Database Management Systems
No ratings yet
Understanding Database Management Systems
9 pages
.net ppt
No ratings yet
.net ppt
20 pages
Js Interview
No ratings yet
Js Interview
245 pages
CSS Notes Basic
No ratings yet
CSS Notes Basic
8 pages
JSON.NET Integration for Developers
No ratings yet
JSON.NET Integration for Developers
9 pages
Assignments of NamasteReact Course ?
No ratings yet
Assignments of NamasteReact Course ?
68 pages
Mrcet R20 Iv 1 QB
No ratings yet
Mrcet R20 Iv 1 QB
79 pages
Visualization With Matplotlib
No ratings yet
Visualization With Matplotlib
18 pages
Residue Number System Overview
No ratings yet
Residue Number System Overview
45 pages
Blockchain and Cryptocurrency Course Overview
No ratings yet
Blockchain and Cryptocurrency Course Overview
59 pages
API in JS
No ratings yet
API in JS
10 pages
WK 8 JavaScript Part III
No ratings yet
WK 8 JavaScript Part III
74 pages
HTML5 Interview Questions PDF
No ratings yet
HTML5 Interview Questions PDF
6 pages
Introduction to LaTeX Basics
No ratings yet
Introduction to LaTeX Basics
35 pages
De Syllabus
No ratings yet
De Syllabus
2 pages
Introduction to ReactJS and ES6 Basics
No ratings yet
Introduction to ReactJS and ES6 Basics
85 pages
Understanding XML DOM in JavaScript
No ratings yet
Understanding XML DOM in JavaScript
33 pages
DOM and SAX Parsers in XML
No ratings yet
DOM and SAX Parsers in XML
19 pages
WD Assignment 2
No ratings yet
WD Assignment 2
6 pages
Mern Previous Papers
No ratings yet
Mern Previous Papers
59 pages
FPTD FDM Config Guide 660
No ratings yet
FPTD FDM Config Guide 660
798 pages
DevNet Deployment & Security Guide
No ratings yet
DevNet Deployment & Security Guide
93 pages
Git Basic Good
100% (1)
Git Basic Good
20 pages
Structured Analysis and Structured Design 3
No ratings yet
Structured Analysis and Structured Design 3
7 pages
ER - Relational Solutions PDF
No ratings yet
ER - Relational Solutions PDF
7 pages
GoldenGate - Setup Bi-Directional Replication in Multitenant Environment
No ratings yet
GoldenGate - Setup Bi-Directional Replication in Multitenant Environment
15 pages
(Ebook PDF) Database Systems Design, Implementation, & Management 13th Edition Instant Download
100% (5)
(Ebook PDF) Database Systems Design, Implementation, & Management 13th Edition Instant Download
57 pages
Object Oriented Programming-Java: OOP Vs POP, Java, Java Technologies
No ratings yet
Object Oriented Programming-Java: OOP Vs POP, Java, Java Technologies
393 pages
Paradarshia Corruption Free Economy
No ratings yet
Paradarshia Corruption Free Economy
2 pages
Fix Windows User Profile Error
No ratings yet
Fix Windows User Profile Error
5 pages
EOI for ERP Implementation at KSIDC
No ratings yet
EOI for ERP Implementation at KSIDC
4 pages
Steps To Install Oracle Database 19c On CentOS 8
No ratings yet
Steps To Install Oracle Database 19c On CentOS 8
19 pages
w95 SMB Erp Technology Value Matrix Fy22 en Us
No ratings yet
w95 SMB Erp Technology Value Matrix Fy22 en Us
19 pages
Global Directory Services
No ratings yet
Global Directory Services
10 pages
Introduction To AWS
No ratings yet
Introduction To AWS
8 pages
SAP CPQ Implementation Guide C_C4H420_94
No ratings yet
SAP CPQ Implementation Guide C_C4H420_94
2 pages
Mysql PHP Tutorial: Ferry Boender
No ratings yet
Mysql PHP Tutorial: Ferry Boender
17 pages
Oracle Team: Summary of Qualifications
No ratings yet
Oracle Team: Summary of Qualifications
2 pages
Diogenes Y. Exam Ref SC-900 Microsoft Security... 2022
100% (7)
Diogenes Y. Exam Ref SC-900 Microsoft Security... 2022
442 pages
Web Services in Cloud Computing Overview
No ratings yet
Web Services in Cloud Computing Overview
23 pages
Synopis For College Management System
100% (1)
Synopis For College Management System
16 pages
Windows Reset or Remove Windows Activation or Remove License Key
No ratings yet
Windows Reset or Remove Windows Activation or Remove License Key
30 pages
Test Driven Development
No ratings yet
Test Driven Development
4 pages
iRODS Beginner Training Overview
No ratings yet
iRODS Beginner Training Overview
50 pages
Ibm Tivoli Sample Resume 3
No ratings yet
Ibm Tivoli Sample Resume 3
3 pages
C Programming Course Syllabus PDF
71% (7)
C Programming Course Syllabus PDF
1 page
The Glamour's Leaf Management System: Mr. Narendra Singh Attri
No ratings yet
The Glamour's Leaf Management System: Mr. Narendra Singh Attri
10 pages
SAP Authorization Framework Guide
No ratings yet
SAP Authorization Framework Guide
10 pages
Database Backup
No ratings yet
Database Backup
5 pages
Big Data Analytics Overview at York University
No ratings yet
Big Data Analytics Overview at York University
6 pages

XML Dom and Sax Parsers

Uploaded by

XML Dom and Sax Parsers

Uploaded by

XML

DOM and SAX

 The word parser comes from

 In a compiler, a parser is the module

 It reads a text-formatted XML file or

 Well-formed documents respect the

 Valid documents not only respect the

 Both parsers enforce syntactic rules

 only validating parsers know how to

 These map an XML document into an

 Ideal for browsers, editors, XSL

 An event-based API reports parsing

 The application implements handlers

 Tree-based parsers deal generally

 Event-based parsers deal generally

 Tree-based parsers are generally

 Event-based parsers are more

 The Document Object Model (DOM)

 It defines the logical structure of

 Provides a standard programming

 The interfaces and objects used to

 The semantics of these interfaces and

 The relationships and collaborations

 The Document Object Model is not a

 The Document Object Model is not a way

 The Document Object Model does not

 The Document Object Model is not a

 The Document Object Model is not a

 DOM Level 0 is a mix of Netscape

 It contains functionality for document

i.e.: functions for creating, deleting

 Provides support for XML

 Document views and formatting, key

 XML islands are mechanisms used to

 In this case, XML islands are used to

 convert() accepts two parameters,

 parentNode is the parent (if any) of current Node object.

 firstChild is the Node’s first child.

 previousSibling is the Node immediately preceding the

 attributes is the list of attributes, if the current Node has

 The parse() function loads the price

 The function searchPrice() tests

 The parser tells the application what is in

 Application then processes those events to

 SAX 1.0 was released on May 11, 1998.

 SAX is a common, event-based API for

 For applications that are not so XML-

 Efficiency: lower level than object-

 Event-based interface consumes

 With an event-based interface, the

 With SAX, it is not possible to

 The application must explicitly buffer

 Parser events are similar to user-

 Events alert the application that

 Element closing tags

You might also like