Tetherless World Constellation



Semantic Web: “10 year update”


Jim Hendler
Tetherless World Professor of Computer and Cognitive Science
Assistant Dean of Information Technology and Web Science
Rensselaer Polytechnic Institute
http://www.cs.rpi.edu/~hendler
@jahendler (twitter)
Original Outline (July 2000)

                 (May 21, 2001)
Sem Web 2010

                  April 2010
Facebook’s Open Graph Protocol

•  Your documents (XML, HTML, XHTML) contain RDFa with some Facebook-specific vocabulary (+ links!!)
   –  og:title - The title of your object as it should appear within the graph, e.g., "The Rock".
   –  og:type - The type of your object, e.g., "movie". Depending on the type you specify, other properties may also be required.
   –  og:image - An image URL which should represent your object within the graph.
   –  og:url - The canonical URL of your object that will be used as its permanent ID in the graph.
   –  og:description - A one to two sentence description of your object.
   –  og:site_name - If your object is part of a larger web site, the name which should be displayed for the overall site, e.g., "IMDb".
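As a rough illustration of what this markup looks like to a consumer, the sketch below uses only Python's standard-library html.parser to collect og: properties from a page's meta tags. The sample HTML (including its example.com URLs) and the names OpenGraphParser and extract_open_graph are hypothetical; only the property names come from the list above.

```python
from html.parser import HTMLParser


class OpenGraphParser(HTMLParser):
    """Collects <meta property="og:..." content="..."> pairs (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.properties = {}

    def handle_starttag(self, tag, attrs):
        # Self-closing <meta ... /> tags also arrive here via the default
        # handle_startendtag implementation.
        if tag != "meta":
            return
        attr_map = dict(attrs)
        prop = attr_map.get("property") or ""
        if prop.startswith("og:") and "content" in attr_map:
            self.properties[prop] = attr_map["content"]


# Made-up sample page; the URLs are placeholders, not real resources.
SAMPLE_HTML = """
<html xmlns:og="http://ogp.me/ns#">
<head>
  <title>The Rock</title>
  <meta property="og:title" content="The Rock" />
  <meta property="og:type" content="movie" />
  <meta property="og:url" content="http://example.com/title/the-rock/" />
  <meta property="og:image" content="http://example.com/images/rock.jpg" />
  <meta property="og:site_name" content="IMDb" />
</head>
</html>
"""


def extract_open_graph(html_text):
    """Return a dict mapping og: property names to their content values."""
    parser = OpenGraphParser()
    parser.feed(html_text)
    return parser.properties


if __name__ == "__main__":
    for name, value in sorted(extract_open_graph(SAMPLE_HTML).items()):
        print(name, "=", value)
```

The point of the sketch is that a handful of flat name/value pairs is all a consumer needs; the og:url value is what lets separate pages be linked as one object in the graph.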
OGP use growing quickly
Facebook is incentivizing use of RDFa "Like" buttons

15,178 of the top 1,000,000 sites as of 3/3/11

FB reports ~10-15% of >3,000,000 likes per day!

Why are they pushing developers to use the RDFa version?
Because we need the links!

The network of likes is where their money is made!
       (predicted >$5B of advertising in next two years)
Creates a platform for SW-powered apps

Semantic Web 2010

                   July 2010
Sem Web 2010

                   July 2010
Semantic Web 2010

              Nov 4, 2010
Sem Web 2010

(Enterprise Sem Web)
Enterprise Semantic Web

The coming of “Linked Data”

•  What is different now?
  – Semantic search
  – Advertising drives Web markets
  – “Buzz” around data on the Web
    • Especially open government data
•  Maturation of RDF technologies
  – SPARQL endpoints
  – RDFa !!!
  – Lightweight knowledge
    • A little semantics goes a long way
The Evolving Web (Technology View)

•  Web is powered by the links between documents
  –  Google worked because of the link space
•  Web 2.0 is powered by "social context"
  –  The network effect is in the social network
     •  At scale, tagging runs into the usual vocabulary issues
•  Web 3.0 adds data relations and vocabulary links
  –  Controlled vocabularies express data relationships
     •  Semantic Web standards
Maturation of the “bottom” of the Semantic Web

•  What is seeing the most use??
   RDFa
On the Web -- links are critical!

[Diagram: in HTML, a Web page links to any Web resource with <a href="URI">, e.g. <a href="http://…">; in RDF, URIs link to other URIs.]

RDF is like the web!
Links in the data

DOC1
   <mind:Person rdf:ID="Hendler">
      <mind:title rdf:resource="&jobs;Professor"/>
      <jobs:placeOfWork rdf:resource="http://www.cs.rpi.edu"/>
   </mind:Person>

[Diagram: the node "Hendler" in DOC1 (Mind: vocabulary) links via Mind:title to Jobs:Professor (Jobs: vocabulary) and via Jobs:placeOfWork to the Web page http://www…]
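One way to read this slide: each statement is a (subject, predicate, object) triple whose parts are URIs, so a property value can point into another vocabulary or at an ordinary Web page. The sketch below shows this in plain Python with no RDF library. The example.org namespace URIs are placeholders (the slide names the Mind: and Jobs: prefixes but not their URIs), and match and outgoing_links are hypothetical helper names.

```python
# Placeholder namespaces: the slide names the prefixes but not their URIs.
MIND = "http://example.org/mind#"
JOBS = "http://example.org/jobs#"
DOC1 = "http://example.org/doc1#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# The statements from DOC1 as (subject, predicate, object) triples.
TRIPLES = {
    (DOC1 + "Hendler", RDF_TYPE, MIND + "Person"),
    (DOC1 + "Hendler", MIND + "title", JOBS + "Professor"),
    (DOC1 + "Hendler", JOBS + "placeOfWork", "http://www.cs.rpi.edu"),
}


def match(triples, subject=None, predicate=None, obj=None):
    """Return triples matching the pattern; None acts as a wildcard."""
    return sorted(
        (s, p, o)
        for (s, p, o) in triples
        if subject in (None, s) and predicate in (None, p) and obj in (None, o)
    )


def outgoing_links(triples, subject):
    """Every URI the subject points to, regardless of vocabulary."""
    return sorted({o for (_, _, o) in match(triples, subject=subject)})


if __name__ == "__main__":
    for target in outgoing_links(TRIPLES, DOC1 + "Hendler"):
        print(target)
```

Because every object is itself a URI, following a triple works the same way whether it lands in the same document, another vocabulary, or a Web page, which is the sense in which RDF is "like the web".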
Directly linking datasets

                   Sindice.com
Linked Data is entering many sectors

         Linkeddata.org 25 billion links
What about ontologies?
•  Consider, e.g., the US National Center for Biotechnology Information "Oncology Metathesaurus"
  –  50,000+ classes, ~8 people supporting it full time, monthly updates, mandated for use by NIH-funded cancer researchers
    •  OWL DL rigorously followed
    •  Provably consistent
•  Compare to OGP
Widely varying use

•  NCBI Oncology Ontology
  –  “High use” in medical community (~1200 users)
  –  Very "trusted" information (provenance from NCBI)
  –  Primarily terminological (relationships between cancer-related concepts), not data-oriented
•  Compare to OGP
  –  Hundreds of millions of users
     •  Generating >1M triples/day
The argument for NCBI seems compelling

•  When "folksonomy" isn't enough…




Which one do you want your doctor to use?
But the cost is VERY high
•  Formal modeling finds its use cases in verticals and enterprises
   –  Where the vocabulary can be controlled
   –  Where finding things in the data is important
•  But the modeling is very expensive and the return on investment must be very high!
   –  Which is part of why the "expert systems revolution" wasn't one
   –  Became part of the technology tool kit, a useful niche in the programming pantheon, but didn't change the world

Analogy: the pre-web hypertext world
The alternative
•  Linked Data approach is based on RDF, a language designed for the (Semantic) Web
   –  Built with Web architecture in mind
      •  Exploits Web infrastructure, respects W3C TAG recommendations
          –  Internationalization, accessibility, extensibility
   –  Fits the Web culture
      •  Open and extensible, supports communities of interest
          –  If you don't like my ontology, extend it, change it, or build your own
      •  Fits the Web application development paradigm
          –  Scales like "databases"
   –  With some new ways of linking to formal models
      •  Heavy use of a small amount of RDFS and a tiny bit of OWL
      •  Generally used "like it sounds", not like the formal model
          –  Example: "owl:sameAs" debate

“Linked data” is often used to describe this low-semantics Semantic Web

Analogy: the World Wide Web
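The owl:sameAs point can be made concrete. Under the formal OWL semantics, sameAs is an equivalence relation, so chains of links merge resources into a single identity, which is a stronger claim than the "roughly the same thing" reading the term suggests "like it sounds". The sketch below computes those equivalence classes with a small union-find. The URIs are invented for illustration, and same_as_classes is a hypothetical helper, not part of any OWL toolkit.

```python
def same_as_classes(links):
    """Group URIs into the equivalence classes implied by owl:sameAs links.

    `links` is an iterable of (a, b) pairs. The formal semantics make
    sameAs reflexive, symmetric, and transitive, so a union-find suffices.
    """
    parent = {}

    def find(node):
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]  # path halving
            node = parent[node]
        return node

    for a, b in links:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    classes = {}
    for node in list(parent):
        classes.setdefault(find(node), set()).add(node)
    return sorted(sorted(members) for members in classes.values())


# Invented example: each publisher asserts one "close enough" link.
LINKS = [
    ("http://example.org/a/Troy_NY", "http://example.org/b/Troy"),
    ("http://example.org/b/Troy", "http://example.org/c/Troy_ancient_city"),
    ("http://example.org/d/Albany", "http://example.org/e/Albany_NY"),
]

if __name__ == "__main__":
    for members in same_as_classes(LINKS):
        print(members)
```

In this made-up data no single publisher equates the modern town with the ancient city, yet the formal reading places them in one class. That kind of unintended merge is one way to picture why usage "like it sounds" and the formal model can diverge.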
Linked Data + Semantics


•  "Linked Data" approach finds its use cases in Web Applications (at Web scales)
  –  A lot of data, a little semantics
  –  Finding anything in the mess can be a win!

http://www.cs.rpi.edu/~hendler/LittleSemanticsWeb.html
Example: Government Data on the Web

Government Data Sharing

Timeline, 2009 to 2010 …

January 1, 2009: “Openness will strengthen our democracy and promote efficiency and effectiveness in Government.” --- President Obama
May 21, 2009: data.gov online (57 Data Sets)
June 30, 2009: Putting Govt Data online - Data.gov.uk beta
December 8, 2009: “Open Government Directive” released (~2000 Data Sets)
January 19, 2010: data.gov.uk online
May 21, 2010: data.gov relaunch with semantic web featured (>305,000 Data Sets)
data.gov.uk: ~6000 Data Sets
Data.gov community: International

Examples (data sets):
US                  305,000
Japan                30,184
Denmark              17,086
UK                    6,000
Korea                   833
Australia               700
World Health Org        400
Ireland                 263
Catalonia               246
Creating/Using Data “app” technologies

See more than 50 of these at http://logd.tw.rpi.edu
Linking GDP of the US and China

[Chart: GDP of the US (billion dollars) and GDP of China (billion Chinese yuan)]

[Temporal Mashup] bea.gov + federalreserve.gov + stats.gov.cn
Linking GDP of the US and China

This mashup was built in less than 8 hours, including conversion of data, web interface, and visualization!

[Temporal Mashup] bea.gov + federalreserve.gov + stats.gov.cn
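A temporal mashup of this kind boils down to joining series from different sources on a shared time key and normalizing units. The sketch below is an assumption about how such a join might look, not the code behind the slide. All figures are made-up placeholders, and treating the third source as a yuan-per-dollar exchange-rate series is an assumed role that the slide does not state.

```python
def temporal_join(us_gdp_usd, china_gdp_cny, cny_per_usd):
    """Align three yearly series on the years they all share.

    Returns a list of (year, us_gdp_usd, china_gdp_in_usd) rows, converting
    China's GDP from yuan to dollars with that year's exchange rate.
    """
    shared_years = sorted(set(us_gdp_usd) & set(china_gdp_cny) & set(cny_per_usd))
    rows = []
    for year in shared_years:
        china_usd = china_gdp_cny[year] / cny_per_usd[year]
        rows.append((year, us_gdp_usd[year], china_usd))
    return rows


# Placeholder values only (billions); NOT real statistics.
US_GDP_USD = {2001: 100.0, 2002: 110.0, 2003: 120.0}
CHINA_GDP_CNY = {2002: 400.0, 2003: 480.0, 2004: 560.0}
CNY_PER_USD = {2001: 8.0, 2002: 8.0, 2003: 8.0, 2004: 8.0}

if __name__ == "__main__":
    for year, us, china in temporal_join(US_GDP_USD, CHINA_GDP_CNY, CNY_PER_USD):
        print(year, us, china)
```

Only the years present in every source survive the join, so gaps in any one dataset show up immediately as missing rows rather than as silently misaligned points.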
Govt data linked to Social Media Metadata

There is a lot of workflow information in the mix

[Diagram: data workflow with the steps Access, Convert, Enhance, Version, and SemDiff, connected by "derive" and "revision" links]
Data Search

How can we search for data?
Metadata is crucial

What kinds of metadata are simple to create, powerful enough for search, and internationalizable (esp. beyond English)?
Example, integrating data and info search

Visualization can help identify data errors

Correlates fires, acres burned, and agency budgets
“Web 3.0”

[Diagram labels: Web 3.0; Semantic Web (RDFS, OWL); Web 2.0; Linked Data (RDF, SPARQL); Web (REST API)]

Web 3.0 extends current Web applications using Semantic Web technologies (esp. semantic and real-time search) and graph-based, open data.
Semantic Search
IEEE Computer, Jan 2010; IEEE Computing Now, Feb 2010 (free)
Semantic Search

Semantic Search Powered by RDFa
Trialx.com

Save lives
Lots More
Web 3.0 Applications
Web 3.0 excitement (hype?)

•  Significant and growing commercial interest…
  – Web: Google, Amazon, Travelocity…
  – Web 2.0: Facebook, Wikipedia, YouTube, Twitter…
  – Web 3.0: ??
Summary
•  The Semantic Web is real
   –  People are asking “how,” not “why”
•  So far the commercial driver has been “weak semantics”
   –  Very simple “ontologies”
   –  Lots of linking
   –  Metadata agreements, not ontology alignments
•  Web 3.0 adds semantics as a value-add to regular Web functionality
   –  Data mashup
   –  Semantic search
   –  Semantic match
•  Investor excitement: The big one is still out there

