SlideShare a Scribd company logo
1
Introduction to XML
Chapter 1
Core XML / Chapter 1 / Slide 2 of 352
Chapter Objectives -1
 Discuss markup language
 List and explain drawbacks of HTML
 Discuss the architecture of XML documents
 List the benefits of XML
 Discuss Parser
Core XML / Chapter 1 / Slide 3 of 353
Chapter Objectives -2
 Build a complete XML Document:
 Character Data
 Comments
 Processing Instructions
 Entities
 General Entities
 Parameter Entities
 The DOCTYPE Declarations
Core XML / Chapter 1 / Slide 4 of 354
History of Markup
Documents recorded
using paper and pen
Typesetters formatting
documents
Tools used by typesetters
to format a document
Core XML / Chapter 1 / Slide 5 of 355
Markup Language
 A Markup language defines the rules that help to add
meaning to the content and structure of documents.
 They are classified as:
 Stylistic Markup – It determines the presentation of the
document
 Structure Markup – It defines the structure of the
document
 Semantic Markup – It determines the content of the
document
Core XML / Chapter 1 / Slide 6 of 356
SGML
 Generalized Markup Language (GML) is the
system of formatting documents.
 GML was fine-tuned and came to be known
as Standard Generalized Markup Language
(SGML).
 SGML is the source of origin of all markup
languages
Core XML / Chapter 1 / Slide 7 of 357
Features of SGML
 It describes markup language, which allows
authors to create their own tags that relate to
their content.
 It needs a separate file that will contain all
the rules for the language, for its
interpretation
 A SGML application is markup language
derived from SGML.
Core XML / Chapter 1 / Slide 8 of 358
HTML
 HTML is the most famous markup language derived
from SGML.
 It was created to mark up technical papers so that
they could be transferred across different platforms
for the scientific community.
 It is now also used by those non-scientific users who
are concerned about their document’s presentation.
Core XML / Chapter 1 / Slide 9 of 359
Drawbacks of HTML
 Fixed tag set
 Presentation technology does not relate to the
contents
 It is flat
 Clogging
 HTML is not international
 Data interchange is impossible
 Does not have a robust linking mechanism
 HTML is not reusable
Core XML / Chapter 1 / Slide 10 of 3510
HTML and XML code Examples
<UL>
<LI> TOM CRUISE
<UL>
<LI> CLIENT ID : 100
<LI> COMPANY : XYZ Corp.
<LI> Email : tom@usa.net
<LI> Phone : 3336767
<LI> Street Adress: 25th
St.
<LI> City : Toronto
<LI> State : Toronto
<LI> Zip : 20056
</UL>
</UL>
<Details>
<CONTACT>
<PERSON_NAME>TOM CRUISE
</PERSON_NAME>
<ID> 100 </ID>
<Company>XYZ Corp. </Company>
<Email> tom@usa.net</Email>
<Phone> 3336767 </Phone>
<Street> 25th St. </Street>
<City> Toronto </City>
<State> Toronto </State>
<ZIP> 20056 </ZIP>
</CONTACT>
</Details>
HTML Code XML Code
Core XML / Chapter 1 / Slide 11 of 3511
XML -1
 XML stands for Extensible Markup Language.
 It overcomes all the drawbacks of HTML.
 It allows the user to define their own set of tags, and also
makes it possible for others (people or programs) to
understand it.
 It is more flexible than HTML.
 It inherits the features of SGML and combines it with the
features of HTML.
 It is a smaller version of SGML.
Core XML / Chapter 1 / Slide 12 of 3512
XML -2
 XML is a metalanguage and it describes other
languages.
 The data contained in an XML file can be displayed
in different ways.
 It can also be offered to other applications for further
processing.
 Style sheets help transform structured data into
different HTML views. This enables data to be
displayed on different browsers.
Core XML / Chapter 1 / Slide 13 of 3513
XML Architecture - 1
 XML supports three-tier architecture for handling
and manipulating data.
 It can be generated from existing databases using a
scalable three-tier model.
 XML tags represent the logical structure of data that
can be interpreted and used in various ways by
different applications.
 The middle-tier is used to access multiple databases
and translate data into XML.
Core XML / Chapter 1 / Slide 14 of 3514
XML Architecture -2
Core XML / Chapter 1 / Slide 15 of 3515
XML – A Universal data format
 HTML is a single markup language, but XML is a
family of markup languages.
 Any type of data can be easily defined in XML.
 XML is popular because it supports a wide range of
applications and is easy to use.
 XML has a structured data format, which allows it to
store complex data
Core XML / Chapter 1 / Slide 16 of 3516
Benefits of XML
 The three-tier architecture has easier
scalability and better security.
 The benefits of XML are classified into the
following:
 Business benefits
 Technological benefits
Core XML / Chapter 1 / Slide 17 of 3517
Business Benefits
 Information sharing:
 Allows businesses to define data formats in XML
 Provides tools to read, write and transform data between
XML and other formats
 XML inside a single application:
 Powerful, flexible and extensible language
 Content Delivery:
 Supports different users and channels, like digital TV,
phone, web and multimedia kiosks
Core XML / Chapter 1 / Slide 18 of 3518
Technological Benefits
Technological
Benefits
Re-use of data
Separation of data and
presentation
ExtensibilitySemantic
information
Core XML / Chapter 1 / Slide 19 of 3519
XML Document Structure
 An XML document is composed of sets of “entities”
identified by unique names.
 All documents begin with a root or document entity.
 Entities are aliases for more complex functions.
 Documents are logically composed of declarations,
elements, comments, character references, and
processing instructions.
Core XML / Chapter 1 / Slide 20 of 3520
Well formed and Valid Documents
 An XML document is considered as well formed, if a
minimum set of requirements defined in the XML
1.0 specification are satisfied.
 The requirements ensure that correct language terms
are used in the right manner .
 A valid XML document is a well-formed XML
document, which conforms to the rules of a
Document Type Definition (DTD).
 DTD defines the rules that an XML markup in the
XML document must follow.
Core XML / Chapter 1 / Slide 21 of 3521
Parsers - 1
 Parsers help the computer interpret an XML
file.
<?xml
version=“1.0”?
>
<nxn>
</nxn>
Editor with the
XML document
Parsed document
viewed in the browser
XML document parsed by the
parser
Their are two types of parsers:
Non Validating parser
Validating parser
Core XML / Chapter 1 / Slide 22 of 3522
Parsers - 2
XML
file
Other related
files (like DTD
file)
Parsers load the XML
and other related files
to check whether the
XML document is well
formed and valid
Data tree
Core XML / Chapter 1 / Slide 23 of 3523
Data versus Markup
<NAME> Tom Cruise </NAME>
Markup
Data
Core XML / Chapter 1 / Slide 24 of 3524
Creating an XML Document
 To create an XML document:
 State an XML declaration
 Create a root element
 Create the XML code
 Verify the document
Core XML / Chapter 1 / Slide 25 of 3525
Stating an XML Declaration
 Syntax
<?xml version=“1.0” standalone=“no” encoding=“UTP-8”?>
 ‘Standalone’ and ‘encoding’ attributes are
optional, only the version number is mandatory
 ‘Standalone’ – is the external declaration
 ‘Encoding’ - specifies the character encoding
used by the author
 XML 1.0 version is default
Core XML / Chapter 1 / Slide 26 of 3526
Creating a Root Element
 There can only be one root element
 It describes the function of the document
 Every XML document must have a root
element
Example
<?xml version=“1.0” standalone=“no” encoding=“UTP-8”?>
<BOOK>
</BOOK>
Core XML / Chapter 1 / Slide 27 of 3527
Creating the XML Code -1
 It is the process of creating our own elements
and attributes as required by our application.
 Elements are the basic units of XML content.
 Tags tell the user agent to do something to the
content encased between the start and end tag.Opening Tag Content Closing Tag
<TITLE> Aptech Ltd </TITLE>
Element
Parts of an
element
Core XML / Chapter 1 / Slide 28 of 3528
Creating the XML Code -2
 Rules govern the elements:
 At least one element required
 XML tags are case sensitive
 End the tags correctly
 Nest tags Properly
 Use legal tags
 Length of markup names
 Define Valid Attributes
Core XML / Chapter 1 / Slide 29 of 3529
Verify the document
 The document should follow the
XML rules; otherwise it will not be
read by the browser or by any other
XML reader
Core XML / Chapter 1 / Slide 30 of 3530
Comments
 This is information for the understanding of
the user, and is to be ignored by the
processor.
 Syntax
<!- - Write the comment here -- >
Example
<!-- don't show these
<NAME>KATE WINSLET</NAME>
<NAME>NICOLE KIDMAN</NAME>
<NAME>ARNOLD</NAME>
-->
<NAME>TOM CRUISE</NAME>
The example given will
display only the name TOM
CRUSIE, and others are
treated as comments.
Core XML / Chapter 1 / Slide 31 of 3531
Processing Instruction
 A processing information is a bit of information meant
for the application using the XML document.
 These instructions are directly passed to the application
using the parser.
 The XML declaration is also a processing agent.
<?xml:stylesheet type=“text/xsl”?>
Name of application Instruction information
Core XML / Chapter 1 / Slide 32 of 3532
Character Data
 The text between the start and end tags is
defined as ‘character data’.
 Character data may be any legal (Unicode).
 Character data is classified into:
 PCDATA
 CDATA
Core XML / Chapter 1 / Slide 33 of 3533
PCDATA
 It stands for parsed character data.
 PCDATA is text that will be parsed by a Parser.
 Tags inside the text will be treated as markup and
entities will be expanded.
Entity Name Character
&lt; <
&gt; >
&amp; &
&quot; "
&apos; '
Predefined entities
Core XML / Chapter 1 / Slide 34 of 3534
CDATA
 It means character data.
 It will not be parsed by the Parser.
 CDATA are used to make it convenient to include
large blocks of special characters.
 The character string ]]> is not allowed within a
CDATA block as it will signal the end of the CDATA
block.
<SAMPLE>
<![CDATA[<DOCUMENT>
<NAME>TOM CRUISE</NAME>
<EMAIL>tom@usa.com</EMAIL>
</DOCUMENT>]]>
</SAMPLE>
Example
Core XML / Chapter 1 / Slide 35 of 3535
Entities
 Entities are used to avoid typing long pieces of text
repeatedly within a document.
 There are two categories of entities:
 General entities
Syntax
<!ENTITY ADDRESS "text that is to be represented
by an entity">
 Parameter entities
Syntax
<!ENTITY % ADDRESS "text that is to be represented by an entity">
Core XML / Chapter 1 / Slide 36 of 3536
Examples of Entities
An example of Parameter entities
< CLIENT = "&APTECH;" PRODUCT
= "&PRODUCT_ID;" QUANTITY
= "15">
 Entity declaration
 Syntax
%PARAMETER_ENTITY_NAME
;
 Example
%address;
An example of a General entity
<!ENTITY full_address " My
Address 12 Tenth Ave. Suite 12
Paris, France">
 Entity declaration
 Syntax
&ENTITY_NAME;
 Example
&address;
Core XML / Chapter 1 / Slide 37 of 3537
The DOCTYPE declarations
 The <!DOCTYPE [..]> declaration follows the XML
declaration in an XML document.
 Syntax
<?xml version="1.0"?>
<!DOCTYPE myDoc [
...declare the entities here....
<myDoc>
...body of the document....
</myDoc>
Example
<!DOCTYPE CUSTOMERS [
<!ENTITY firstFloor "15 Downing St Floor 1">
<!ENTITY secondFloor "15 Downing St Floor 2">
<!ENTITY thirdFloor "15 Downing St Floor 3">
]>
Core XML / Chapter 1 / Slide 38 of 3538
Attributes
 An attribute gives information about an
element.
 Attributes are embedded in the element start
tag.
 An attribute consists of an attribute name
and attribute value.
Example
<TV count="8">SONY</TV>
<LAPTOP count="10">IBM</LAPTOP>
Core XML / Chapter 1 / Slide 39 of 3539
Summary-1
 A markup language defines a set of rules that adds
meaning to the content and structure of documents
 XML is extensible, which means that we can define our
own set of tags, and make it possible for other parties
(people or programs) to know and understand these tags.
This makes XML much more flexible than HTML
 XML inherits features from SGML and includes the
features of HTML. XML can be generated from existing
databases using a scalable three-tier model. XML-based
data does not contain information about how data should
be displayed
 An XML document is composed of a set of “entities”
identified by unique names
Core XML / Chapter 1 / Slide 40 of 3540
Summary-2
 A well-formed document is one that conforms to the basic
rules of XML; a valid document is a well-formed
document that conforms to the rules of a DTD (Document
Type Definition)
 The parser helps the computer to interpret an XML file
 Steps involved in the building of an XML document are:
 Stating an XML declaration
 Creating a root element
 Creating the XML code
 Verifying the document
 Character data is classified into PCDATA and CDATA
Core XML / Chapter 1 / Slide 41 of 3541
Summary-3
 Entities are used to avoid typing long pieces of text repeatedly
in a document. The two types of entities are:
 General entities
 Parameter entities
 The <!DOCTYPE […]> declaration follows the XML
declaration in an XML document.
 An attribute gives information about an element

More Related Content

PPTX
HTML frames and HTML forms
PPTX
View of data DBMS
DOCX
Evolving role of Software,Legacy software,CASE tools,Process Models,CMMI
PPTX
Magnetic Disk
PPTX
directory structure and file system mounting
PPTX
Structured Query Language (SQL)
PPT
Page Replacement
PPTX
Data Flow Diagrams
HTML frames and HTML forms
View of data DBMS
Evolving role of Software,Legacy software,CASE tools,Process Models,CMMI
Magnetic Disk
directory structure and file system mounting
Structured Query Language (SQL)
Page Replacement
Data Flow Diagrams

What's hot (20)

PPT
EVOLUTION OF SYSTEM
PPT
Data Base System Application - Unit 7
PPTX
Adbms 3 main characteristics of the database approach
PPTX
Interfacing With High Level Programming Language
PPTX
Operating system; Multitasking
PPTX
Programming languages
PPTX
Simple xml in .net
PPTX
Database management system
PPTX
Structure of dbms
DOC
Hld and lld
PPTX
Memory organization in computer architecture
PPTX
Memory Hierarchy
PPT
Use Case Diagram
PDF
Database users
PDF
Input-Output Modules
PPT
Programming languages
PPT
Database Chapter 3
PPT
Lecture 01 introduction to database
PPT
Object Oriented Modeling and Design with UML
EVOLUTION OF SYSTEM
Data Base System Application - Unit 7
Adbms 3 main characteristics of the database approach
Interfacing With High Level Programming Language
Operating system; Multitasking
Programming languages
Simple xml in .net
Database management system
Structure of dbms
Hld and lld
Memory organization in computer architecture
Memory Hierarchy
Use Case Diagram
Database users
Input-Output Modules
Programming languages
Database Chapter 3
Lecture 01 introduction to database
Object Oriented Modeling and Design with UML
Ad

Viewers also liked (12)

PDF
IMT Lecture: Transfer of Technology from.R+D MM-Biztel-29Jul09
PPT
IP Licensing
PPT
Biotechnology regulatory process and agencies, legal aspects, IPR
PPT
Trade secrets vs. confidential information
PPT
Intellectual Capital
PPTX
Licensing & franchising
PPT
Intellectual Capital Management
PPTX
Ppt On Sm
PPTX
Media and Information Literacy (MIL) - Intellectual Property, Fair Use, and C...
PDF
Intellectual Capital: Define, Understand and Capture It
PPT
Licensing,Ppt
PPTX
LICENSING & FRANCHISING
IMT Lecture: Transfer of Technology from.R+D MM-Biztel-29Jul09
IP Licensing
Biotechnology regulatory process and agencies, legal aspects, IPR
Trade secrets vs. confidential information
Intellectual Capital
Licensing & franchising
Intellectual Capital Management
Ppt On Sm
Media and Information Literacy (MIL) - Intellectual Property, Fair Use, and C...
Intellectual Capital: Define, Understand and Capture It
Licensing,Ppt
LICENSING & FRANCHISING
Ad

Similar to Introduction to XML (20)

PPT
Xml theory 2005_[ngohaianh.info]_1_introduction-to-xml
PPT
Introduction to XML
PPT
uptu web technology unit 2 Xml2
PPTX
Unit 5 xml (1)
PPT
XML-Unit 1.ppt
PDF
Xml overview
PPTX
XML Introduction
PPTX
PDF
M.FLORENCE DAYANA WEB DESIGN -Unit 5 XML
PPT
eXtensible Markup Language (By Dr.Hatem Mohamed)
PPTX
Sgml and xml
PPS
Xml basics for beginning
PPTX
XML
PPTX
Extensible Markup Language(XML)_lecture.pptx
PPTX
Unit3wt
PPTX
Unit3wt
Xml theory 2005_[ngohaianh.info]_1_introduction-to-xml
Introduction to XML
uptu web technology unit 2 Xml2
Unit 5 xml (1)
XML-Unit 1.ppt
Xml overview
XML Introduction
M.FLORENCE DAYANA WEB DESIGN -Unit 5 XML
eXtensible Markup Language (By Dr.Hatem Mohamed)
Sgml and xml
Xml basics for beginning
XML
Extensible Markup Language(XML)_lecture.pptx
Unit3wt
Unit3wt

More from Bình Trọng Án (20)

PDF
A Developer's Guide to CQRS Using .NET Core and MediatR
PDF
Nếu con em vị nói lắp
PDF
Bài giảng chuyên đề - Lê Minh Hoàng
PDF
Tìm hiểu về NodeJs
PDF
Clean code-v2.2
PDF
Các câu chuyện toán học - Tập 3: Khẳng định trong phủ định
PDF
Luyện dịch Việt Anh
DOCX
2816 mcsa--part-11--domain-c111ntroller--join-domain-1
DOC
LinQ to XML
PDF
Chuyên đề group policy
PPT
Chapter 4 xml schema
DOCX
Tỷ lệ vàng - một phát hiện vĩ đại của hình học
PPT
Attributes & .NET components
DOCX
Ajax Control ToolKit
PPT
Linq intro
DOCX
Sách chữa tật nói lắp Version 1.0 beta
PPT
Mô hình 3 lớp
PPT
Xsd examples
PPT
Displaying XML Documents Using CSS and XSL
A Developer's Guide to CQRS Using .NET Core and MediatR
Nếu con em vị nói lắp
Bài giảng chuyên đề - Lê Minh Hoàng
Tìm hiểu về NodeJs
Clean code-v2.2
Các câu chuyện toán học - Tập 3: Khẳng định trong phủ định
Luyện dịch Việt Anh
2816 mcsa--part-11--domain-c111ntroller--join-domain-1
LinQ to XML
Chuyên đề group policy
Chapter 4 xml schema
Tỷ lệ vàng - một phát hiện vĩ đại của hình học
Attributes & .NET components
Ajax Control ToolKit
Linq intro
Sách chữa tật nói lắp Version 1.0 beta
Mô hình 3 lớp
Xsd examples
Displaying XML Documents Using CSS and XSL

Recently uploaded (20)

PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
cuic standard and advanced reporting.pdf
PPTX
Big Data Technologies - Introduction.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Approach and Philosophy of On baking technology
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
MIND Revenue Release Quarter 2 2025 Press Release
Mobile App Security Testing_ A Comprehensive Guide.pdf
Empathic Computing: Creating Shared Understanding
cuic standard and advanced reporting.pdf
Big Data Technologies - Introduction.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Spectral efficient network and resource selection model in 5G networks
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Chapter 3 Spatial Domain Image Processing.pdf
Approach and Philosophy of On baking technology
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Understanding_Digital_Forensics_Presentation.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Agricultural_Statistics_at_a_Glance_2022_0.pdf

Introduction to XML

  • 2. Core XML / Chapter 1 / Slide 2 of 352 Chapter Objectives -1  Discuss markup language  List and explain drawbacks of HTML  Discuss the architecture of XML documents  List the benefits of XML  Discuss Parser
  • 3. Core XML / Chapter 1 / Slide 3 of 353 Chapter Objectives -2  Build a complete XML Document:  Character Data  Comments  Processing Instructions  Entities  General Entities  Parameter Entities  The DOCTYPE Declarations
  • 4. Core XML / Chapter 1 / Slide 4 of 354 History of Markup Documents recorded using paper and pen Typesetters formatting documents Tools used by typesetters to format a document
  • 5. Core XML / Chapter 1 / Slide 5 of 355 Markup Language  A Markup language defines the rules that help to add meaning to the content and structure of documents.  They are classified as:  Stylistic Markup – It determines the presentation of the document  Structure Markup – It defines the structure of the document  Semantic Markup – It determines the content of the document
  • 6. Core XML / Chapter 1 / Slide 6 of 356 SGML  Generalized Markup Language (GML) is the system of formatting documents.  GML was fine-tuned and came to be known as Standard Generalized Markup Language (SGML).  SGML is the source of origin of all markup languages
  • 7. Core XML / Chapter 1 / Slide 7 of 357 Features of SGML  It describes markup language, which allows authors to create their own tags that relate to their content.  It needs a separate file that will contain all the rules for the language, for its interpretation  A SGML application is markup language derived from SGML.
  • 8. Core XML / Chapter 1 / Slide 8 of 358 HTML  HTML is the most famous markup language derived from SGML.  It was created to mark up technical papers so that they could be transferred across different platforms for the scientific community.  It is now also used by those non-scientific users who are concerned about their document’s presentation.
  • 9. Core XML / Chapter 1 / Slide 9 of 359 Drawbacks of HTML  Fixed tag set  Presentation technology does not relate to the contents  It is flat  Clogging  HTML is not international  Data interchange is impossible  Does not have a robust linking mechanism  HTML is not reusable
  • 10. Core XML / Chapter 1 / Slide 10 of 3510 HTML and XML code Examples <UL> <LI> TOM CRUISE <UL> <LI> CLIENT ID : 100 <LI> COMPANY : XYZ Corp. <LI> Email : tom@usa.net <LI> Phone : 3336767 <LI> Street Adress: 25th St. <LI> City : Toronto <LI> State : Toronto <LI> Zip : 20056 </UL> </UL> <Details> <CONTACT> <PERSON_NAME>TOM CRUISE </PERSON_NAME> <ID> 100 </ID> <Company>XYZ Corp. </Company> <Email> tom@usa.net</Email> <Phone> 3336767 </Phone> <Street> 25th St. </Street> <City> Toronto </City> <State> Toronto </State> <ZIP> 20056 </ZIP> </CONTACT> </Details> HTML Code XML Code
  • 11. Core XML / Chapter 1 / Slide 11 of 3511 XML -1  XML stands for Extensible Markup Language.  It overcomes all the drawbacks of HTML.  It allows the user to define their own set of tags, and also makes it possible for others (people or programs) to understand it.  It is more flexible than HTML.  It inherits the features of SGML and combines it with the features of HTML.  It is a smaller version of SGML.
  • 12. Core XML / Chapter 1 / Slide 12 of 3512 XML -2  XML is a metalanguage and it describes other languages.  The data contained in an XML file can be displayed in different ways.  It can also be offered to other applications for further processing.  Style sheets help transform structured data into different HTML views. This enables data to be displayed on different browsers.
  • 13. Core XML / Chapter 1 / Slide 13 of 3513 XML Architecture - 1  XML supports three-tier architecture for handling and manipulating data.  It can be generated from existing databases using a scalable three-tier model.  XML tags represent the logical structure of data that can be interpreted and used in various ways by different applications.  The middle-tier is used to access multiple databases and translate data into XML.
  • 14. Core XML / Chapter 1 / Slide 14 of 3514 XML Architecture -2
  • 15. Core XML / Chapter 1 / Slide 15 of 3515 XML – A Universal data format  HTML is a single markup language, but XML is a family of markup languages.  Any type of data can be easily defined in XML.  XML is popular because it supports a wide range of applications and is easy to use.  XML has a structured data format, which allows it to store complex data
  • 16. Core XML / Chapter 1 / Slide 16 of 3516 Benefits of XML  The three-tier architecture has easier scalability and better security.  The benefits of XML are classified into the following:  Business benefits  Technological benefits
  • 17. Core XML / Chapter 1 / Slide 17 of 3517 Business Benefits  Information sharing:  Allows businesses to define data formats in XML  Provides tools to read, write and transform data between XML and other formats  XML inside a single application:  Powerful, flexible and extensible language  Content Delivery:  Supports different users and channels, like digital TV, phone, web and multimedia kiosks
  • 18. Core XML / Chapter 1 / Slide 18 of 3518 Technological Benefits Technological Benefits Re-use of data Separation of data and presentation ExtensibilitySemantic information
  • 19. Core XML / Chapter 1 / Slide 19 of 3519 XML Document Structure  An XML document is composed of sets of “entities” identified by unique names.  All documents begin with a root or document entity.  Entities are aliases for more complex functions.  Documents are logically composed of declarations, elements, comments, character references, and processing instructions.
  • 20. Core XML / Chapter 1 / Slide 20 of 3520 Well formed and Valid Documents  An XML document is considered as well formed, if a minimum set of requirements defined in the XML 1.0 specification are satisfied.  The requirements ensure that correct language terms are used in the right manner .  A valid XML document is a well-formed XML document, which conforms to the rules of a Document Type Definition (DTD).  DTD defines the rules that an XML markup in the XML document must follow.
  • 21. Core XML / Chapter 1 / Slide 21 of 3521 Parsers - 1  Parsers help the computer interpret an XML file. <?xml version=“1.0”? > <nxn> </nxn> Editor with the XML document Parsed document viewed in the browser XML document parsed by the parser Their are two types of parsers: Non Validating parser Validating parser
  • 22. Core XML / Chapter 1 / Slide 22 of 3522 Parsers - 2 XML file Other related files (like DTD file) Parsers load the XML and other related files to check whether the XML document is well formed and valid Data tree
  • 23. Core XML / Chapter 1 / Slide 23 of 3523 Data versus Markup <NAME> Tom Cruise </NAME> Markup Data
  • 24. Core XML / Chapter 1 / Slide 24 of 3524 Creating an XML Document  To create an XML document:  State an XML declaration  Create a root element  Create the XML code  Verify the document
  • 25. Core XML / Chapter 1 / Slide 25 of 3525 Stating an XML Declaration  Syntax <?xml version=“1.0” standalone=“no” encoding=“UTP-8”?>  ‘Standalone’ and ‘encoding’ attributes are optional, only the version number is mandatory  ‘Standalone’ – is the external declaration  ‘Encoding’ - specifies the character encoding used by the author  XML 1.0 version is default
  • 26. Core XML / Chapter 1 / Slide 26 of 3526 Creating a Root Element  There can only be one root element  It describes the function of the document  Every XML document must have a root element Example <?xml version=“1.0” standalone=“no” encoding=“UTP-8”?> <BOOK> </BOOK>
  • 27. Core XML / Chapter 1 / Slide 27 of 3527 Creating the XML Code -1  It is the process of creating our own elements and attributes as required by our application.  Elements are the basic units of XML content.  Tags tell the user agent to do something to the content encased between the start and end tag.Opening Tag Content Closing Tag <TITLE> Aptech Ltd </TITLE> Element Parts of an element
  • 28. Core XML / Chapter 1 / Slide 28 of 3528 Creating the XML Code -2  Rules govern the elements:  At least one element required  XML tags are case sensitive  End the tags correctly  Nest tags Properly  Use legal tags  Length of markup names  Define Valid Attributes
  • 29. Core XML / Chapter 1 / Slide 29 of 3529 Verify the document  The document should follow the XML rules; otherwise it will not be read by the browser or by any other XML reader
  • 30. Core XML / Chapter 1 / Slide 30 of 3530 Comments  This is information for the understanding of the user, and is to be ignored by the processor.  Syntax <!- - Write the comment here -- > Example <!-- don't show these <NAME>KATE WINSLET</NAME> <NAME>NICOLE KIDMAN</NAME> <NAME>ARNOLD</NAME> --> <NAME>TOM CRUISE</NAME> The example given will display only the name TOM CRUSIE, and others are treated as comments.
  • 31. Core XML / Chapter 1 / Slide 31 of 3531 Processing Instruction  A processing information is a bit of information meant for the application using the XML document.  These instructions are directly passed to the application using the parser.  The XML declaration is also a processing agent. <?xml:stylesheet type=“text/xsl”?> Name of application Instruction information
  • 32. Core XML / Chapter 1 / Slide 32 of 3532 Character Data  The text between the start and end tags is defined as ‘character data’.  Character data may be any legal (Unicode).  Character data is classified into:  PCDATA  CDATA
  • 33. Core XML / Chapter 1 / Slide 33 of 3533 PCDATA  It stands for parsed character data.  PCDATA is text that will be parsed by a Parser.  Tags inside the text will be treated as markup and entities will be expanded. Entity Name Character &lt; < &gt; > &amp; & &quot; " &apos; ' Predefined entities
  • 34. Core XML / Chapter 1 / Slide 34 of 3534 CDATA  It means character data.  It will not be parsed by the Parser.  CDATA are used to make it convenient to include large blocks of special characters.  The character string ]]> is not allowed within a CDATA block as it will signal the end of the CDATA block. <SAMPLE> <![CDATA[<DOCUMENT> <NAME>TOM CRUISE</NAME> <EMAIL>tom@usa.com</EMAIL> </DOCUMENT>]]> </SAMPLE> Example
  • 35. Core XML / Chapter 1 / Slide 35 of 3535 Entities  Entities are used to avoid typing long pieces of text repeatedly within a document.  There are two categories of entities:  General entities Syntax <!ENTITY ADDRESS "text that is to be represented by an entity">  Parameter entities Syntax <!ENTITY % ADDRESS "text that is to be represented by an entity">
  • 36. Core XML / Chapter 1 / Slide 36 of 3536 Examples of Entities An example of Parameter entities < CLIENT = "&APTECH;" PRODUCT = "&PRODUCT_ID;" QUANTITY = "15">  Entity declaration  Syntax %PARAMETER_ENTITY_NAME ;  Example %address; An example of a General entity <!ENTITY full_address " My Address 12 Tenth Ave. Suite 12 Paris, France">  Entity declaration  Syntax &ENTITY_NAME;  Example &address;
  • 37. Core XML / Chapter 1 / Slide 37 of 3537 The DOCTYPE declarations  The <!DOCTYPE [..]> declaration follows the XML declaration in an XML document.  Syntax <?xml version="1.0"?> <!DOCTYPE myDoc [ ...declare the entities here.... <myDoc> ...body of the document.... </myDoc> Example <!DOCTYPE CUSTOMERS [ <!ENTITY firstFloor "15 Downing St Floor 1"> <!ENTITY secondFloor "15 Downing St Floor 2"> <!ENTITY thirdFloor "15 Downing St Floor 3"> ]>
  • 38. Core XML / Chapter 1 / Slide 38 of 3538 Attributes  An attribute gives information about an element.  Attributes are embedded in the element start tag.  An attribute consists of an attribute name and attribute value. Example <TV count="8">SONY</TV> <LAPTOP count="10">IBM</LAPTOP>
  • 39. Core XML / Chapter 1 / Slide 39 of 3539 Summary-1  A markup language defines a set of rules that adds meaning to the content and structure of documents  XML is extensible, which means that we can define our own set of tags, and make it possible for other parties (people or programs) to know and understand these tags. This makes XML much more flexible than HTML  XML inherits features from SGML and includes the features of HTML. XML can be generated from existing databases using a scalable three-tier model. XML-based data does not contain information about how data should be displayed  An XML document is composed of a set of “entities” identified by unique names
  • 40. Core XML / Chapter 1 / Slide 40 of 3540 Summary-2  A well-formed document is one that conforms to the basic rules of XML; a valid document is a well-formed document that conforms to the rules of a DTD (Document Type Definition)  The parser helps the computer to interpret an XML file  Steps involved in the building of an XML document are:  Stating an XML declaration  Creating a root element  Creating the XML code  Verifying the document  Character data is classified into PCDATA and CDATA
  • 41. Core XML / Chapter 1 / Slide 41 of 3541 Summary-3  Entities are used to avoid typing long pieces of text repeatedly in a document. The two types of entities are:  General entities  Parameter entities  The <!DOCTYPE […]> declaration follows the XML declaration in an XML document.  An attribute gives information about an element