SlideShare a Scribd company logo
Deconstructing Freebase
Manuel de la Villa
Department of Information Technologies
University of Huelva                     Medical-Miner Project
                                         Huelva 22/7/2010
Deconstructing Freebase
• What is it
  – A net? A tool? A kind of Data? wikipedia-turned-
    database?
• The Schema,
  – domains, types, properties. Sources.
• How it works
• How can we use it
  – MQL, API, Acre, RDF

                                               Medical-Miner Project
                                               Huelva 22/7/2010
What is it
• Freebase is a large public database that collects three
  kinds of information:
   – data;
   – texts; and
   – media, that references…
• …entities or topics (≈ 12 million). An entity is a unique
  single person, place, or thing.
       • A single concept or real-world thing.
       • A topic could also be called an entity, resource or element or thing, it
         is a fundamental unit in Freebase.
       • /common/topic
       • Each topic has a Guid or globally unique ID
           – http://www.freebase.com/view/en/barack_obama
           – http://www.freebase.com/guid/9202a8c04000641f800000000029c277


                                                                        Medical-Miner Project
                                                                        Huelva 22/7/2010
What is it (and II)
• Freebase connects entities together as a graph,
   – defines its data structure as a set of nodes and a set of
     links that establish relationships between the nodes.
• Most of our topics are associated with one or
  more types (such as people, places, books,
  films, etc) and may have additional properties like "date of
  birth" for a person or latitude and longitude for a location.
  These types and properties and related concepts are called
  Schema.


                                                          Medical-Miner Project
                                                          Huelva 22/7/2010
The Schema
• We use the term Schema to refer to the way Freebase's data is laid out or
  structured. In the Semantic web world, this is referred to as Ontology.
• Schema in Freebase is expressed through Types and Properties. Types are
  grouped together in Domains.

              What is Schema?
                   Schema describes how data in Freebase is structured.

              Schema is made by creating:
                   Types, which define a particular kind of person, place or thing and contain

                   Properties, which define the unique qualities of a type.



                                       William Shakespeare
                                              is a  →      type     → Person
                                              has a →      property → Date of Birth


                                                                                          Medical-Miner Project
                                                                                          Huelva 22/7/2010
The Schema (II)




                  Medical-Miner Project
                  Huelva 22/7/2010
The Schema (II)




                  Medical-Miner Project
                  Huelva 22/7/2010
The Schema (II)




                  Medical-Miner Project
                  Huelva 22/7/2010
The Schema (II)




                  Medical-Miner Project
                  Huelva 22/7/2010
The Schema: Medicine




                  Medical-Miner Project
                  Huelva 22/7/2010
Then… is an Ontology?
• Yes and No,

          is a kind of Folksonomy…




                                     Medical-Miner Project
                                     Huelva 22/7/2010
The sources
• Wikipedia, ChefMoz, NNDB, MusicBrainz…
• The Freebase community, with a 2.0 spirit, are:
       •   data contributors,
       •   schema builders,
       •   data curators,
       •   application developers,
       •   and many others who use Freebase
   – Expert status
   – Augmentative edition
• Creative Commons Attribution Licensing, which means
  that it's free for you to browse, query, copy, and even use the data
  in your own systems or software, even for commercial use; all we
  ask in return is that you mention that

                                                               Medical-Miner Project
                                                               Huelva 22/7/2010
How can we use it…
• As a reference or information source
• Create interesting Views and Visualizations and
  share them with others
• Embed Freebase data in your website
• Use our API or Acre, our hosted app development
  platform, to build apps that use Freebase data
• Download our Data dumps
• Use Freebase's RDF for Semantic Web applications


                                              Medical-Miner Project
                                              Huelva 22/7/2010
How can we use it…
• Ways to use Freebase:
  – Use Freebase's Ids to uniquely identify entities
    anywhere on the web
  – Query Freebase's data using MQL
  – Build applications using our API or Acre, the
    hosted development platform




                                                 Medical-Miner Project
                                                 Huelva 22/7/2010
MQL (Metaweb Query Language)
• http://api.freebase.com/api/service/mqlread?query={"query"
  :{"type":"/music/artist","name":"U2","album":[]}}
• Query Editor




                                                     Medical-Miner Project
                                                     Huelva 22/7/2010
MQL (and II)




               Medical-Miner Project
               Huelva 22/7/2010
Freebase API
Freebase's API allows you:
    –  to perform searches and queries against
      Freebase's data, or
    – to write data to Freebase.
    – There are Libraries available for many
      languages.

Searching with Search API
• The search API lets you do a fuzzily-matched
   search for a string that occurs in a topic's
   name, aliases, or description.
Reading
• The mqlread API is for doing structured
   queries against Freebase's schema.
Writing
• The mqlwrite API lets you write data to
   Freebase.


                                                     Medical-Miner Project
                                                     Huelva 22/7/2010
Acre
A hosted application
development platform,
Acre:
   • makes it easy to build and
   share Freebase applications.
   •It encapsulates many of the
   trickier steps (like OAuth
   access for writing to
   Freebase) and
   •lets you view other people's
   apps, clone them, modify
   them, and share your
   improvements.

                                          Medical-Miner Project
                                          Huelva 22/7/2010
Applications… with Acre
• Powerset (NLP search engine)

• Fmdb

• Freebase
     Sets



                                 Medical-Miner Project
                                 Huelva 22/7/2010
RDF




      Medical-Miner Project
      Huelva 22/7/2010
Summarizing
"Freebase is the bridge between the bottom up vision
  of Web 2.0 collective intelligence and the more
  structured world of the semantic web." Tim O'Reilly




                                                Medical-Miner Project
                                                Huelva 22/7/2010

More Related Content

PDF
Islandora Webinar: Research Data Repositories
PPTX
鏈結資料在圖書館的應用20131107
PPTX
Viaf and isni ifla 2013 08-16
PDF
Connecting the Dots: Constellations in the Linked Data Universe
PPT
IFLA 2012 - OCLC Linked Data round table
PPT
Ontology Web services for Semantic Applications
PDF
Working with Islandora
PDF
圖書館趨勢觀察
Islandora Webinar: Research Data Repositories
鏈結資料在圖書館的應用20131107
Viaf and isni ifla 2013 08-16
Connecting the Dots: Constellations in the Linked Data Universe
IFLA 2012 - OCLC Linked Data round table
Ontology Web services for Semantic Applications
Working with Islandora
圖書館趨勢觀察

Similar to Deconstructing freebase (20)

ZIP
Linked Open Data in Libraries, Archives & Museums
PDF
What is New in W3C land?
ZIP
Intro to Linked Open Data in Libraries, Archives & Museums
PDF
Oss and libraries enabling arabic libraries and creating opportunities
PPTX
Breaking Down Walls in Enterprise with Social Semantics
PDF
Curation and Characterization of Web Services
PDF
Here Comes Everything
PDF
Global lodlam_communities and open cultural data
PDF
Linked Data
PDF
International Image Interoperability Framework (IIIF)
PPTX
NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
ZIP
Intro to Linked Open Data in Libraries Archives & Museums.
PPTX
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
PPT
Overview AG AKSW
PDF
Open Data - Principles and Techniques
PDF
CAEPIA 2011
PDF
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
PDF
International Image Interoperability Framework (IIIF)
ZIP
Linked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries, Archives & Museums
What is New in W3C land?
Intro to Linked Open Data in Libraries, Archives & Museums
Oss and libraries enabling arabic libraries and creating opportunities
Breaking Down Walls in Enterprise with Social Semantics
Curation and Characterization of Web Services
Here Comes Everything
Global lodlam_communities and open cultural data
Linked Data
International Image Interoperability Framework (IIIF)
NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
Intro to Linked Open Data in Libraries Archives & Museums.
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Overview AG AKSW
Open Data - Principles and Techniques
CAEPIA 2011
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
International Image Interoperability Framework (IIIF)
Linked Open Data in Libraries Archives & Museums
Ad

More from Manuel de la Villa (16)

PPTX
Presentación TFG Informes de Alta Automáticos
PDF
Presentación programa Social Media UHU
PDF
Marca personal para community managers
PDF
Taller Facebook #SMUHU parte 2
PDF
Taller Facebook #SMUHU parte 1
PDF
Personal branding
PDF
Taller de Presentaciones efectivas
PDF
Presentacion Grado en Ingeniería Informática UHU
PDF
Curso personal branding profesores
PDF
Herramientas web 2.0 parte 2
PDF
Herramientas web 2.0 Parte 1
PDF
MVilla IUI 2012 Lisbon
PDF
Experiences on integrating explicit knowledge on information access tools in ...
PPSX
A Biomedical Information Retrieval System based on Clustering for Mobile Dev...
PDF
A critical and comparative study about ISO 9001, CMMI and ISO 15504
PDF
Presentación TFG Informes de Alta Automáticos
Presentación programa Social Media UHU
Marca personal para community managers
Taller Facebook #SMUHU parte 2
Taller Facebook #SMUHU parte 1
Personal branding
Taller de Presentaciones efectivas
Presentacion Grado en Ingeniería Informática UHU
Curso personal branding profesores
Herramientas web 2.0 parte 2
Herramientas web 2.0 Parte 1
MVilla IUI 2012 Lisbon
Experiences on integrating explicit knowledge on information access tools in ...
A Biomedical Information Retrieval System based on Clustering for Mobile Dev...
A critical and comparative study about ISO 9001, CMMI and ISO 15504
Ad

Recently uploaded (20)

PPTX
Cloud computing and distributed systems.
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PPTX
Big Data Technologies - Introduction.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Electronic commerce courselecture one. Pdf
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Approach and Philosophy of On baking technology
Cloud computing and distributed systems.
Encapsulation_ Review paper, used for researhc scholars
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Network Security Unit 5.pdf for BCA BBA.
Advanced methodologies resolving dimensionality complications for autism neur...
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Big Data Technologies - Introduction.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
Electronic commerce courselecture one. Pdf
Spectral efficient network and resource selection model in 5G networks
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Per capita expenditure prediction using model stacking based on satellite ima...
Review of recent advances in non-invasive hemoglobin estimation
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Approach and Philosophy of On baking technology

Deconstructing freebase

  • 1. Deconstructing Freebase Manuel de la Villa Department of Information Technologies University of Huelva Medical-Miner Project Huelva 22/7/2010
  • 2. Deconstructing Freebase • What is it – A net? A tool? A kind of Data? wikipedia-turned- database? • The Schema, – domains, types, properties. Sources. • How it works • How can we use it – MQL, API, Acre, RDF Medical-Miner Project Huelva 22/7/2010
  • 3. What is it • Freebase is a large public database that collects three kinds of information: – data; – texts; and – media, that references… • …entities or topics (≈ 12 million). An entity is a unique single person, place, or thing. • A single concept or real-world thing. • A topic could also be called an entity, resource or element or thing, it is a fundamental unit in Freebase. • /common/topic • Each topic has a Guid or globally unique ID – http://www.freebase.com/view/en/barack_obama – http://www.freebase.com/guid/9202a8c04000641f800000000029c277 Medical-Miner Project Huelva 22/7/2010
  • 4. What is it (and II) • Freebase connects entities together as a graph, – defines its data structure as a set of nodes and a set of links that establish relationships between the nodes. • Most of our topics are associated with one or more types (such as people, places, books, films, etc) and may have additional properties like "date of birth" for a person or latitude and longitude for a location. These types and properties and related concepts are called Schema. Medical-Miner Project Huelva 22/7/2010
  • 5. The Schema • We use the term Schema to refer to the way Freebase's data is laid out or structured. In the Semantic web world, this is referred to as Ontology. • Schema in Freebase is expressed through Types and Properties. Types are grouped together in Domains. What is Schema? Schema describes how data in Freebase is structured. Schema is made by creating: Types, which define a particular kind of person, place or thing and contain Properties, which define the unique qualities of a type. William Shakespeare is a → type → Person has a → property → Date of Birth Medical-Miner Project Huelva 22/7/2010
  • 6. The Schema (II) Medical-Miner Project Huelva 22/7/2010
  • 7. The Schema (II) Medical-Miner Project Huelva 22/7/2010
  • 8. The Schema (II) Medical-Miner Project Huelva 22/7/2010
  • 9. The Schema (II) Medical-Miner Project Huelva 22/7/2010
  • 10. The Schema: Medicine Medical-Miner Project Huelva 22/7/2010
  • 11. Then… is an Ontology? • Yes and No, is a kind of Folksonomy… Medical-Miner Project Huelva 22/7/2010
  • 12. The sources • Wikipedia, ChefMoz, NNDB, MusicBrainz… • The Freebase community, with a 2.0 spirit, are: • data contributors, • schema builders, • data curators, • application developers, • and many others who use Freebase – Expert status – Augmentative edition • Creative Commons Attribution Licensing, which means that it's free for you to browse, query, copy, and even use the data in your own systems or software, even for commercial use; all we ask in return is that you mention that Medical-Miner Project Huelva 22/7/2010
  • 13. How can we use it… • As a reference or information source • Create interesting Views and Visualizations and share them with others • Embed Freebase data in your website • Use our API or Acre, our hosted app development platform, to build apps that use Freebase data • Download our Data dumps • Use Freebase's RDF for Semantic Web applications Medical-Miner Project Huelva 22/7/2010
  • 14. How can we use it… • Ways to use Freebase: – Use Freebase's Ids to uniquely identify entities anywhere on the web – Query Freebase's data using MQL – Build applications using our API or Acre, the hosted development platform Medical-Miner Project Huelva 22/7/2010
  • 15. MQL (Metaweb Query Language) • http://api.freebase.com/api/service/mqlread?query={"query" :{"type":"/music/artist","name":"U2","album":[]}} • Query Editor Medical-Miner Project Huelva 22/7/2010
  • 16. MQL (and II) Medical-Miner Project Huelva 22/7/2010
  • 17. Freebase API Freebase's API allows you: – to perform searches and queries against Freebase's data, or – to write data to Freebase. – There are Libraries available for many languages. Searching with Search API • The search API lets you do a fuzzily-matched search for a string that occurs in a topic's name, aliases, or description. Reading • The mqlread API is for doing structured queries against Freebase's schema. Writing • The mqlwrite API lets you write data to Freebase. Medical-Miner Project Huelva 22/7/2010
  • 18. Acre A hosted application development platform, Acre: • makes it easy to build and share Freebase applications. •It encapsulates many of the trickier steps (like OAuth access for writing to Freebase) and •lets you view other people's apps, clone them, modify them, and share your improvements. Medical-Miner Project Huelva 22/7/2010
  • 19. Applications… with Acre • Powerset (NLP search engine) • Fmdb • Freebase Sets Medical-Miner Project Huelva 22/7/2010
  • 20. RDF Medical-Miner Project Huelva 22/7/2010
  • 21. Summarizing "Freebase is the bridge between the bottom up vision of Web 2.0 collective intelligence and the more structured world of the semantic web." Tim O'Reilly Medical-Miner Project Huelva 22/7/2010