SlideShare a Scribd company logo
METADATA: AN INTRODUCTIONPresented by:Hamidahbt. HJ. A. RahmanSenior LecturerFaculty of Information ManagementUiTMPuncakPerdana Campus40150 Shah AlamSELANGOR DARUL EHSAN4/5/20101
Metadata is structured data which describes the characteristics of a resource. It shares many similar characteristics to the cataloguing that takes place in libraries, museums and archives. The term "meta" derives from the Greek word denoting a nature of a higher order or more fundamental kind. A metadata record consists of a number of pre-defined elements representing specific attributes of a resource, and each element can have one or more values.4/5/20102What is metadata?
What is metadata?Structured data about resourcesLibrary cataloguesAbstracting and indexing servicesArchival finding aidsMuseum documentationCommunity informationCarriers:  MARC, HTML, SGML, XML4/5/20103
Types of metadataDescriptive MetadataAdministrative MetadataStructural Metadata4/5/20104
Types of metadataDescriptive MetadataAdministrative MetadataStructural Metadata4/5/20105
Types of metadataDescriptive Metadata: to serve the purposes of discovery (how one finds a resource), identification (how a resource can be distinguished from other, similar resources), and selection (how to determine that a resource fills a particular need, for example, for the DVD version of a video recording)4/5/20106
Types of metadataAdministrative Metadata:is information intended to facilitate the management of resources. It can include such information as when and how an object was created,who is responsible for controlling access to or archiving the content, what control or processing activities have been performed in relation to it, and what restrictions on access or use apply.4/5/20107
Types of metadataStructural Metadata (SM):can be thought of as the glue that holds compound digital objects together. A book, for example, may have many chapters, each consisting of a set of pages, each page represented by a separate digital file. Structural metadata (SM) is required to record the relationships between physical files and pages, between pages and chapters, and between chapters and the book as a whole. Presentation software uses SM to display Tables of contents and to deliver such functions as going directly to a requested chapter, or to turn pages forward or backward in order.4/5/20108
Why use metadata?Metadata provides the essential link between the information creator and the information user.4/5/20109
Aim of metadataWhile the primary aim of metadata is to improve resource discovery, metadata sets are also being developed for other reasons, including:administrative control
security
personal information
management information
content rating
rights management
preservation4/5/201010
Metadata may be deployed in a number of ways:Embedding the metadata in the Web page by the creator or their agent using META tags in the HTML coding of the page As a separate HTML document linked to the resource it describes In a database linked to the resource. The records may either have been directly created within the database or extracted from another source, such as Web pages.4/5/201011
Some of the most popular metadata schemas include:Dublin Core
AACR2 (Anglo-American Cataloging Rules)
GILS (Government Information Locator Service)
EAD (Encoded Archives Description) IMS (IMS Global Learning Consortium) AGLS (Australian Government Locator Service) 4/5/201012
The data will be unusable, unless the encoding scheme understands the semantics of the metadata schema. The encoding allows the metadata to be processed by a computer program. Important schemes include:HTML(Hyper-Text Markup Language) SGML (Standard Generalised Markup Language) XML (eXtensible Markup Language) RDF (Resource Description Framework) MARC (MAchine Readable Cataloging) MIME (Multipurpose Internet Mail Extensions) 4/5/201013
Markup languagesSGML- Standard Generalised Markup Language - controls document formatting for publicationXML- Extensible Markup Language - “next generation” SGMLHTML- Hyper Text Markup Language - SGML subset, controls display of web pagesTags (usually paired) structure text into elementse.g. headings, paragraphs, lists, etc.<title> </title>   <p> </p>   <li> </li>4/5/201014
MARC - structure Structured format
 Numeric and alpha tags
 Fixed fields
Leader, 001-008, 010-099
 Variable fields4/5/201015
MARC – elements1XX Main entry2XX Title, SR, edition, publication3XX Physical description4XX Series5XX Notes6XX Subject access7XX Added entries8XX Added entries for series9XX References and local fields4/5/201016
ONIX - structureCarrier - XML
 Primary use
 publishers to Internet booksellers
 rich product information In use first version 1999
 current version Release 2.0 (2001)
 Elements – XML reference name and tag4/5/201017
ONIX - elements Message header

More Related Content

PPT
Ontology based metadata schema for digital library projects in China
PPT
Metadata Workshop-Maastricht - November 6, 2008
PPT
Does metadata matter?
PDF
Week10
PPT
B08 A3pc 90 Diapo Damy En
PPT
Making the Web searchable
PDF
CTDA MODS Implementation Guidelines
PDF
CTDA Metadata Application Profile
Ontology based metadata schema for digital library projects in China
Metadata Workshop-Maastricht - November 6, 2008
Does metadata matter?
Week10
B08 A3pc 90 Diapo Damy En
Making the Web searchable
CTDA MODS Implementation Guidelines
CTDA Metadata Application Profile

What's hot (20)

PPTX
Metadata
PDF
The importance of metadata for datasets: The DCAT-AP European standard
PPT
Genre discovery in corpus management systems (2004)
PPT
Metadata : Concentrating on the data, not on the scheme
PPT
Metadata: A concept
PPT
Metadata harvesting Tools
PDF
Metadata Standards
PPTX
The Web Data Commons Microdata, RDFa, and Microformat Dataset Series @ ISWC2014
PDF
LOD2: State of Play WP5 - Linked Data Visualization, Browsing and Authoring
ODP
Building a Digital Library
PPT
Beyond Seamless Access: Meta-data In The Age of Content Integration
ODP
DC-2008 Tutorial 3 - Dublin Core and other metadata schemas
PPTX
Metadata primer for technical communicators
PDF
Linked (Open) Data: A quick introduction
PPT
Introduction to linked data and the semantic web
PPTX
HDL - Towards A Harmonized Dataset Model for Open Data Portals
PPT
SemanticWeb Nuts 'n Bolts
PPTX
Cataloging and Metadata Management
PDF
Introduction to RDF
PPTX
Introduction To Enterprise Search - OKCSUG 2010
Metadata
The importance of metadata for datasets: The DCAT-AP European standard
Genre discovery in corpus management systems (2004)
Metadata : Concentrating on the data, not on the scheme
Metadata: A concept
Metadata harvesting Tools
Metadata Standards
The Web Data Commons Microdata, RDFa, and Microformat Dataset Series @ ISWC2014
LOD2: State of Play WP5 - Linked Data Visualization, Browsing and Authoring
Building a Digital Library
Beyond Seamless Access: Meta-data In The Age of Content Integration
DC-2008 Tutorial 3 - Dublin Core and other metadata schemas
Metadata primer for technical communicators
Linked (Open) Data: A quick introduction
Introduction to linked data and the semantic web
HDL - Towards A Harmonized Dataset Model for Open Data Portals
SemanticWeb Nuts 'n Bolts
Cataloging and Metadata Management
Introduction to RDF
Introduction To Enterprise Search - OKCSUG 2010
Ad

Viewers also liked (20)

PPT
Understanding MARC Session 1 - Winter 2011
PPT
Understanding MARC Session 4 - Winter 2011
PPTX
Introduction to RDA Part 1
PPTX
RDA: Resource Description and Access
PPTX
Managing electronic collections
PPTX
Practical guide to architecting data lakes - Avinash Ramineni - Phoenix Data...
PDF
Introduction to RDA
PPTX
JOSA TechTalk: Metadata Management
in Big Data
PPT
Manper03 failure
PDF
BU PRSSA: What is Advertising
PDF
Data Lake for the Cloud: Extending your Hadoop Implementation
PPT
Abstracts & abstracting
PPTX
A Non-Standard use Case of Hadoop: High Scale Image Processing and Analytics
PPT
Indexing or dividing_head
PPT
Indexing
PPT
Indexing
Understanding MARC Session 1 - Winter 2011
Understanding MARC Session 4 - Winter 2011
Introduction to RDA Part 1
RDA: Resource Description and Access
Managing electronic collections
Practical guide to architecting data lakes - Avinash Ramineni - Phoenix Data...
Introduction to RDA
JOSA TechTalk: Metadata Management
in Big Data
Manper03 failure
BU PRSSA: What is Advertising
Data Lake for the Cloud: Extending your Hadoop Implementation
Abstracts & abstracting
A Non-Standard use Case of Hadoop: High Scale Image Processing and Analytics
Indexing or dividing_head
Indexing
Indexing
Ad

Similar to Introduction to Metadata (20)

PPT
Metadata Cloud
PPT
Metadata Workshop - Utrecht - November 5, 2008
PPT
Wisneski TeI workshop 2009-2010
PPT
Xml Case Learns 2008
PPT
Intro XML for archivists (2011)
PPT
Archives hub ead 2011_lifeshare
PPT
New Directions in Metadata
PPT
Semantic Web, Cataloging, & Metadata
ODP
An RDF Metadata Model for OpenDocument Format 1.2
ODP
The OpenOffice.org ODF Toolkit Project
PPT
The Rhizomer Semantic Content Management System
PPT
Dublin Core In Practice
PPT
Html for Beginners
ODP
ISO MLR semantics
PPT
Metadata first, ontologies second
PPTX
EAD, MARC and DACS
PPTX
The ISO-DCR
Metadata Cloud
Metadata Workshop - Utrecht - November 5, 2008
Wisneski TeI workshop 2009-2010
Xml Case Learns 2008
Intro XML for archivists (2011)
Archives hub ead 2011_lifeshare
New Directions in Metadata
Semantic Web, Cataloging, & Metadata
An RDF Metadata Model for OpenDocument Format 1.2
The OpenOffice.org ODF Toolkit Project
The Rhizomer Semantic Content Management System
Dublin Core In Practice
Html for Beginners
ISO MLR semantics
Metadata first, ontologies second
EAD, MARC and DACS
The ISO-DCR

More from Bhgn. Pembangunan Bibliografik & Pemprosesan Teknikal (6)

Introduction to Metadata

  • 1. METADATA: AN INTRODUCTIONPresented by:Hamidahbt. HJ. A. RahmanSenior LecturerFaculty of Information ManagementUiTMPuncakPerdana Campus40150 Shah AlamSELANGOR DARUL EHSAN4/5/20101
  • 2. Metadata is structured data which describes the characteristics of a resource. It shares many similar characteristics to the cataloguing that takes place in libraries, museums and archives. The term "meta" derives from the Greek word denoting a nature of a higher order or more fundamental kind. A metadata record consists of a number of pre-defined elements representing specific attributes of a resource, and each element can have one or more values.4/5/20102What is metadata?
  • 3. What is metadata?Structured data about resourcesLibrary cataloguesAbstracting and indexing servicesArchival finding aidsMuseum documentationCommunity informationCarriers: MARC, HTML, SGML, XML4/5/20103
  • 4. Types of metadataDescriptive MetadataAdministrative MetadataStructural Metadata4/5/20104
  • 5. Types of metadataDescriptive MetadataAdministrative MetadataStructural Metadata4/5/20105
  • 6. Types of metadataDescriptive Metadata: to serve the purposes of discovery (how one finds a resource), identification (how a resource can be distinguished from other, similar resources), and selection (how to determine that a resource fills a particular need, for example, for the DVD version of a video recording)4/5/20106
  • 7. Types of metadataAdministrative Metadata:is information intended to facilitate the management of resources. It can include such information as when and how an object was created,who is responsible for controlling access to or archiving the content, what control or processing activities have been performed in relation to it, and what restrictions on access or use apply.4/5/20107
  • 8. Types of metadataStructural Metadata (SM):can be thought of as the glue that holds compound digital objects together. A book, for example, may have many chapters, each consisting of a set of pages, each page represented by a separate digital file. Structural metadata (SM) is required to record the relationships between physical files and pages, between pages and chapters, and between chapters and the book as a whole. Presentation software uses SM to display Tables of contents and to deliver such functions as going directly to a requested chapter, or to turn pages forward or backward in order.4/5/20108
  • 9. Why use metadata?Metadata provides the essential link between the information creator and the information user.4/5/20109
  • 10. Aim of metadataWhile the primary aim of metadata is to improve resource discovery, metadata sets are also being developed for other reasons, including:administrative control
  • 17. Metadata may be deployed in a number of ways:Embedding the metadata in the Web page by the creator or their agent using META tags in the HTML coding of the page As a separate HTML document linked to the resource it describes In a database linked to the resource. The records may either have been directly created within the database or extracted from another source, such as Web pages.4/5/201011
  • 18. Some of the most popular metadata schemas include:Dublin Core
  • 20. GILS (Government Information Locator Service)
  • 21. EAD (Encoded Archives Description) IMS (IMS Global Learning Consortium) AGLS (Australian Government Locator Service) 4/5/201012
  • 22. The data will be unusable, unless the encoding scheme understands the semantics of the metadata schema. The encoding allows the metadata to be processed by a computer program. Important schemes include:HTML(Hyper-Text Markup Language) SGML (Standard Generalised Markup Language) XML (eXtensible Markup Language) RDF (Resource Description Framework) MARC (MAchine Readable Cataloging) MIME (Multipurpose Internet Mail Extensions) 4/5/201013
  • 23. Markup languagesSGML- Standard Generalised Markup Language - controls document formatting for publicationXML- Extensible Markup Language - “next generation” SGMLHTML- Hyper Text Markup Language - SGML subset, controls display of web pagesTags (usually paired) structure text into elementse.g. headings, paragraphs, lists, etc.<title> </title> <p> </p> <li> </li>4/5/201014
  • 24. MARC - structure Structured format
  • 25. Numeric and alpha tags
  • 29. MARC – elements1XX Main entry2XX Title, SR, edition, publication3XX Physical description4XX Series5XX Notes6XX Subject access7XX Added entries8XX Added entries for series9XX References and local fields4/5/201016
  • 32. publishers to Internet booksellers
  • 33. rich product information In use first version 1999
  • 34. current version Release 2.0 (2001)
  • 35. Elements – XML reference name and tag4/5/201017
  • 36. ONIX - elements Message header
  • 37. Product recordidentifiers, author, title, edition, language, subject, audience, descriptions, publisher, datesterritorial rights, dimensions, suppliers, availability, promotions Main series and sub series records4/5/201018
  • 38. ONIX record<ISBN> 0123456789 </ISBN><DistinctiveTitle> Alice in Wonderland </Distinctive Title><Contributor><ContributorRole> Author </ContributorRole> <PersonNameInverted> Carroll, Lewis </PersonNameInverted> </Contributor><PublisherName> Collins </PublisherName><PublicationDate> 2000 </Publication Date>4/5/201019
  • 39. Dublin Core - structure Simple resource discovery DCMES – Dublin Core Metadata Element Set HTML the most common ‘carrier’ Comprises 15 elements with element qualifiers element encoding schemes optional/mandatory elements Application profiles4/5/201020
  • 40. Dublin Core - elementsTitleCreatorSubjectDescriptionPublisherContributorDateResource TypeFormatResource IdentifierSourceLanguageRelationCoverageRights4/5/201021
  • 41. Dublin Core - record<Title> Alice in Wonderland </Title><Creator> Lewis Carroll </Creator><Subject> <LCSH> Fiction </LCSH> </Subject><Publisher> Project Gutenberg </Publisher><Date> 2000 </Date><Format> ASCII file via FTP </Format><Identifier> http://promo.net/pg/….. </Identifier> 4/5/201022
  • 42. Encoded Archival Description EAD1993 project to develop standard for machine-readable finding aids,Version 1 1998 SGML (and XML compliant)
  • 43. Hierarchical structure of archivesrepository, management group, fonds, series, file, item Possible to embed MARC elements4/5/201023
  • 44. EAD - structure<ead> <eadheader> </eadheader> <frontmatter> [optional] </frontmatter> <archdesc> <did> </did> </archdesc></ead>4/5/201024
  • 45. EAD - elements<eadheader> [id + bibliographic inf. for finding aid]<archdesc> [data on a body of archival materials]<did> [container, physical description, physical location, repository, date and title of unit]<admininfo> [biography, scope, access, arrangement]<controlaccess> [name, place, genre, subject, title]</archdesc>4/5/201025
  • 46. EAD record - <header><ead><eadheader><eadid> LKX-3042 </eadid<filedesc><titlestmt> <titleproper> Pitman Shorthand Collection Catalogue </titleproper> <author> Ann Chapman </author> </titlestmt><publicationstmt> <date> 1990 </date> <publisher> Bath University Library </publisher> </publicationstmt> </filedesc> </eadheader> 4/5/201026
  • 47. EAD record - <archdesc><archdesc> collection<did> <abstract> A collection of materials in and aboutshorthand collected by Sir Isaac Pitman and JamesPitman</abstract> </did><controlaccess><subject encodinganalog=“MARC650”> Shorthand </subject></controlaccess> </archdesc></ead>4/5/201027
  • 48. Collection DescriptionSchema developed May 2000Access version for RSLP – summer 2001Web version for Reveal – spring 2002General attributesSubjectDatesAssociated agentsExternal relationships4/5/201028
  • 49. Coll.Desc. - elementsGeneral: title, identifier, description, strength, physical characteristics, language, type, access control, accrual status, legal status, custodial history, note, locationSubject: concept, object, name, place, timeDates: accumulation, contentsAgents: creator, ownerRelationships: sub/super collections, catalogues and descriptions, associated collections and publications4/5/201029
  • 50. Coll. Desc. - recordTitle: Pitman CollectionStrength: Shorthand – national collectionPhys. Desc: Printed texts and manuscriptsLang: English, Spanish, Esperanto, ……Access: Written request to the Librarian, Bath Univ.Accrual: passive, deposit Location: The Library, Bath University, BathSubject: Shorthand, Sir Isaac PitmanOwner: Pitman Publishing Co.Catalogue: Bath University OPAC4/5/201030
  • 51. M21 Community InformationSame principles as MARC BibliographicLeader individual/organization/program/event/otherFixed fields001-008, 010-099 fixed fields007 disability facilities008 special aspectsVariable fields4/5/201031
  • 52. M21 Comm. Inf. - elements1XX Name 2XX Title and Address3XX Physical description4XX Series (for events)5XX Notes6XX Subject access7XX Added entries8XX Other variable fields4/5/201032
  • 53. M21 Comm. Inf. - record110 $a CILIP245 $a CILIP HQ247 $a LA HQ $f 19?? - 2002270 $a 7 Ridgmount St, London, WC1E 7AE $k 020 7255 0505 $m info@cilip.org.uk$r 9am to 6pm311 $a Ewart Room $d seats 50 $g £100 per day312 $a Overhead projector $f £10 per day581 $a Library + Information Update856 $a http://www.cilip.org.uk4/5/201033
  • 54. Metadata – fit for purpose MARC Bibliographic
  • 59. M21 Community Information4/5/201034
  • 60. How does one create metadata?DC-dot - http://www.ukoln.ac.uk/metadata/dcdot/. This service will retrieve a Web page and automatically generate Dublin Core metadata, either as HTML tags or as RDF/XML, suitable for embedding in the section of the page. DCmeta - http://www.dstc.edu.au/RDU/MetaWeb/generic_tool.html. Developed by Tasmania Online. It is based on SuperNoteTab text-editor and can be customised. HotMeta - http://www.dstc.edu.au/Research/Projects/hotmeta/. A package of software, including metadata editor, repository and search engine. 4/5/201035