XML DOM Objects and Methods (Contd.)

Extensible Markup Language
Objectives
In this session, you will learn to:
Identify the XML Document Object Model
Validate an XML document against an XML schema using the
Document Object Model
Apply a Style Sheet to an XML document
Ver. 1.0
Slide 1 of 31
Extensible Markup Language
Identifying the XML Document Object Model
DOM defines the logical structure of documents.
DOM provides an Application Programming Interface (API)
for dynamically accessing and manipulating a document.
The DOM objects have associated methods and properties
to access and manipulate a document.
A DOM-enabled parser is required to use the features
provided by DOM.
A DOM-enabled parser:
Parses an XML document to ascertain its validity.
Creates an in-memory representation of the XML document as
a tree structure.
Ver. 1.0
Slide 2 of 31
Extensible Markup Language
Implementation of DOM in MSXML Parser
MSXML parser:
Is the Microsoft implementation of DOM.
Provides fundamental as well as added interfaces to access
documents.
The following figure represents how a DOM tree is used by
applications to access data.
MSXML Library
XML
Document
Parser
Parsed
Document
DOM Tree
Root
Child
Application
Text
Child
Text
Ver. 1.0
Slide 3 of 31
Extensible Markup Language
XML DOM Objects and Methods
Following are the key DOM objects:
Document
Element
Node
NodeList
Attr
Text
CDataSection
ParseError
Ver. 1.0
Slide 4 of 31
Extensible Markup Language
XML DOM Objects and Methods (Contd.)
Following are the key DOM objects:
Document
Element
Node
NodeList
Attr
Text
CDataSection
ParseError
Ver. 1.0
It is the top-level object that implements all the
basic DOM methods.
It also has methods that support XSLT.
It has methods that can be used to navigate,
query, and modify the content and structure of
an XML document.
Some of the methods provided by this object
are createElement(), createAttribute(),
createComment() , and createTextNode().
Some of the properties provided by this object that
help in manipulating the information contained in
the object are async, childNodes, firstChild,
documentElement, xml, and readyState.
Slide 5 of 31
Extensible Markup Language
XML DOM Objects and Methods (Contd.)
Following are the key DOM objects:
Document
Element
Node
NodeList
Attr
Text
CDataSection
ParseError
Ver. 1.0
It represents all the element nodes in an XML
document.
The attributes associated with the elements are
considered to be the properties of the elements
rather than their child elements.
Some of the methods of this object are
also inherited from the Node object.
Some of the methods provided by this object are
getAttribute(),
getElementsByTagName(),
normalize(), and removeAttributeNS().
Slide 6 of 31
Extensible Markup Language
XML DOM Objects and Methods (Contd.)
Following are the key DOM objects:
Document
Element
Node
NodeList
Attr
Text
CDataSection
ParseError
Ver. 1.0
It represents a single node in the XML document
tree structure.
It provides methods to work with child elements.
Some of the methods of this object are
appendChild(newChild),
insertBefore(newNode,refNode),
and removeChild(nodeName).
Slide 7 of 31
Extensible Markup Language
XML DOM Objects and Methods (Contd.)
Following are the key DOM objects:
Document
Element
Node
NodeList
Attr
Text
CDataSection
ParseError
Ver. 1.0
It provides a list of nodes present in an XML
document for manipulation.
This object enables you to iterate through a
collection of nodes.
Some of the method of this object are item()
and nextNode().
Slide 8 of 31
Extensible Markup Language
XML DOM Objects and Methods (Contd.)
Following are the key DOM objects:
Document
Element
Node
NodeList
Attr
Text
CDataSection
ParseError
Ver. 1.0
It represents an attribute of the Element object.
It is also a Node and inherits various attributes
and methods of Node object.
An attribute is not considered by the DOM to be
a child node of an element, but rather a property.
Slide 9 of 31
Extensible Markup Language
XML DOM Objects and Methods (Contd.)
Following are the key DOM objects:
Document
Element
Node
NodeList
Attr
Text
CDataSection
ParseError
Ver. 1.0
It represents the text inside an XML element in
the node tree.
The splitText() method is associated with
this object.
Slide 10 of 31
Extensible Markup Language
XML DOM Objects and Methods (Contd.)
Following are the key DOM objects:
Document
Element
Node
NodeList
Attr
Text
CDataSection
ParseError
Ver. 1.0
It represents the CDATA sections in the node
tree.
This node is used to escape the parts of text
that normally would be recognized as markup.
Slide 11 of 31
Extensible Markup Language
XML DOM Objects and Methods (Contd.)
Following are the key DOM objects:
Document
Element
Node
NodeList
Attr
Text
CDataSection
ParseError
Ver. 1.0
It returns information about the most recent
parse error.
It provides properties to retrieve information,
such as the error code, the error text, and the
line that caused the error.
Some of the properties of this object are
errorCode, reason, line, and srcText.
Slide 12 of 31
Extensible Markup Language
XML DOM Objects in Scripts
The DOM objects can be used within scripting languages
such as JavaScript and VBScript.
Using DOM objects in scripts allow dynamically applying a
style sheet to an XML document.
The code for using DOM objects for accessing an XML
document needs to be used as an HTML page.
Ver. 1.0
Slide 13 of 31
Extensible Markup Language
Validating an XML Document Against an XML Schema Using DOM
DOM:
Used to validate XML documents against XML schemas.
Ensures proper grammar and proper formation of the XML
document.
DOMDocument object:
Main object in a DOM.
Represents the top node in every document tree.
Used to load an XML document, parse it, and validate it.
XMLSchemaCache object:
Loads the schema document associated with the XML
document.
Ver. 1.0
Slide 14 of 31
Extensible Markup Language
Accessing an XML Document by Using the XMLSchemaCache Object
Used to hold a collection of schema documents that
specifies the rules to which XML documents must conform.
Following table describes some methods provided by the
XMLSchemaCache object.
Ver. 1.0
Method
Description
add(namespaceURI,
variable)
This method adds a new schema to the collection and also
associates the specified namespaceURI with the schema.
addCollection(XMLSch
emaCollection
object)
This method adds schemas from other schema collections.
It also ensures that the namespaceURIs of the different
schemas do not clash.
get(namespaceURI)
This method returns a node that contains the <schema>
element.
namespaceURI(index
number)
This method returns the namespace that is associated with
the schema at the specified index number.
remove(namespaceURI)
This method removes a schema from a collection.
Slide 15 of 31
Extensible Markup Language
Validating an XML Document Against the Schema
To write a script that validates an XML document against the
schema, you need to follow the given tasks:
1. Create the user interface to accept the name of the XML
document and the XML schema.
2. Write the code to load the XML document.
3. Write the code to add the XML schema in the
XMLSchemaCache object.
4. Write the code to validate the XML document against the
schema.
Ver. 1.0
Slide 16 of 31
Extensible Markup Language
Demo: Validating an XML Document
Problem Statement:
The head office of CyberShoppe receives data in the form of
XML documents from its branches. In order to ensure the
consistency of data sent by the branches, the head office
maintains the definitions for the structures of documents in
schemas. After receiving data from the branches, the head
office needs to verify that the data conforms to the schema of
the respective document. For this, the head office needs to
write a script that validates the data stored in an XML
document against a schema. You will initially do the testing for
the product.xml file.
Ver. 1.0
Slide 17 of 31
Extensible Markup Language
Applying a Style Sheet to an XML Document
Style sheet object that passes into the transformNode
method needs to be recompiled every time the method is
called.
Compiling a style sheet means setting all its template rules
in an executable state.
Using the XSLTemplate object and the XSLProcessor
object to perform transformation facilitates the following:
Reduction of overheads
Increase in the performance of an XSLT application
Ver. 1.0
Slide 18 of 31
Extensible Markup Language
The XSLTemplate Object
Is a DOM object that is used to access an XSLT style sheet.
Used to hold a cached style sheet that can then be
dynamically associated with an XML document.
Before a style sheet document can be applied to an XML
document, it is converted into a tree structure by the parser.
XSLT tree structure is loaded into the memory of the
computer and used to process the XML document.
Ver. 1.0
Slide 19 of 31
Extensible Markup Language
The XSLProcessor Object (Contd.)
Used to apply a style sheet to an XML document and then
process that document.
Applies the given XSLT document to a specific XML
document.
Transforms an XML document by using the XSLT style
sheet.
JavaScript code to create an XSLProcessor object is as
follows:
var xslprocobj= xsltobj.createProcessor();
Ver. 1.0
Slide 20 of 31
Extensible Markup Language
Demo: Applying a Style Sheet to an XML Document During Run Time
Problem Statement:
CyberShoppe sells its products through an e-commerce Web
site. Product details, such as product name, description, price,
and quantity, need to be displayed.
A customer can choose to view the product details either as a
table or as a bulleted list.
Ver. 1.0
Slide 21 of 31
Extensible Markup Language
Exercises
Problem Statement:
The following XML document, customer.xml contains
customer details:
<?xml version="1.0"?>
<CUSTOMER>
<CUSTOMERNAME>Harold Johnson</CUSTOMERNAME>
<ADDRESS>56, Regent Road</ADDRESS>
<CITY>London</CITY>
<COUNTRY>UK</COUNTRY>
<PHONE>444-425-2355</PHONE>
</CUSTOMER>
Create a Web page that accepts the name of the XML file in a
text box. When you click the Submit hyperlink, a JavaScript
code should load the XML document and extract the
CUSTOMERNAME element. The address and phone number
should be changed to 94, McFarlane Avenue and 412-2332344, respectively, by using XML DOM objects.
Ver. 1.0
Slide 22 of 31
Extensible Markup Language
Practice Questions
Which of the following methods of the DOMDocument object
will you use to create an element node that has a
namespace prefix associated with it?
a.
b.
c.
d.
createElement()
createTextNode()
createNode()
createAttribute()
Answer:
c. createNode()
Ver. 1.0
Slide 23 of 31
Extensible Markup Language
Practice Questions
You have written a script to load an XML document and
validate it against a schema. Which of the following
properties of the DOMDocument object will you use to
ensure that the script waits till the XML document is loaded
before executing the rest of the code?
a.
b.
c.
d.
async
readyState
documentElement
xml
Answer:
a. async
Ver. 1.0
Slide 24 of 31
Extensible Markup Language
Practice Questions
Which of the following properties of the IXMLDOMNode
object will you use to return the text associated with a node?
a.
b.
c.
d.
nodeName
xml
nodeType
nodeValue
Answer:
d. nodeValue
Ver. 1.0
Slide 25 of 31
Extensible Markup Language
Practice Questions
Which of the following properties of the
IXMLDOMParseError object will you use to retrieve the
string containing the line that caused an error while parsing
an XML document?
a.
b.
c.
d.
srcText
linePos
reason
line
Answer:
a. srcText
Ver. 1.0
Slide 26 of 31
Extensible Markup Language
Practice Questions
Which one of the following is an object that represents the
complete XML document map?
a.
b.
c.
d.
IXMLDOMNode
DOMDocument
IXMLDOMNodeList
IXMLDOMParseError
Answer:
b. DOMDocument
Ver. 1.0
Slide 27 of 31
Extensible Markup Language
Summary
In this session, you learned that:
The XMLSchemaCache object is used to associate an XML
document with an XSD document.
The XMLSchemaCache object is used to hold a collection of
schema documents that specifies the rules to which XML
documents must conform.
The XSLTemplate object is used to access an XSLT style
sheet.
The XSLProcessor object is used to apply style sheets on a
given XML document.
The XSLProcessor object is created using the
createProcessor() method.
The XSLProcessor object is associated with an XML
document by using the input property of the XSLProcessor
object.
Ver. 1.0
Slide 28 of 31
Extensible Markup Language
Summary (Contd.)
The XSLProcessor object provides the transform()
method to transform an XML document according to the
information provided in an XSLT style sheet.
DOM is an application-programming interface that allows an
application to access the contents of an XML document.
DOM objects allow you to access and manipulate XML
documents.
The MSXML parser loads an XML document and creates a
tree structure that represents the various components of the
XML document.
The basic building block of the tree structure is a node. A node
is a container that holds information about the elements,
attributes, and content stored in an XML document.
Ver. 1.0
Slide 29 of 31
Extensible Markup Language
Summary (Contd.)
Some DOM objects that are used to manipulate data in a
document are:
Document
Element
Node
NodeList
Attr
Text
CDataSection
ParseError
The Document object is the top-level object in the XML DOM.
This object provides various properties and methods that help
you to navigate, query, and modify the content and structure of
XML documents.
The Element object represents all the element nodes in a
document.
Ver. 1.0
Slide 30 of 31
Extensible Markup Language
Summary (Contd.)
The Node object represents a node in the XML document
structure. This object provides methods to work with the child
elements.
The NodeList object allows you to iterate through a collection
of nodes.
The Attr object represents an attribute of an Element object.
The Text object represents the text inside an XML element in
the node tree.
The ParseError object returns information about the most
recent parse error.
Ver. 1.0
Slide 31 of 31