Simon Price, University of Bristol
Webs of People, Webs of Data
Web 2.0 Live, Taunton, Nov 2006
Web 2.0
Web Applications (Web 1.5?)
Hybrid Web-Desktop (Web 1.6?)
Canonical Web 2.0
• Amazon
– Customer Reviews
– Amazon Recommends
• Google
– PageRank™
– Making money out of links
– Google Mail, Maps, APIs, Desktop Search, ...
Web 2.0 Technology (nothing new)
• Minimum
– CGI (e.g. Perl, PHP, Python, C/C++)
– Database (e.g. MySQL, Postgres, Oracle)
• More recent additions
– Java
– XML
– Web Services
– AJAX
– Ruby on Rails
Social Networks
A key ingredient in the Web 2.0 melting pot
Google PageRank™
• Sergey Brin and Lawrence Page (Stanford, 1995)
• Intuition behind PageRank:
– Web is a network (graph) connected by links
– A link is a "vote" for the destination page
– Strength of vote is a fraction of the PageRank
of the page casting the vote
PageRank of a page is the
probability of a random
surfer arriving at that page
after many clicks.
(By Markov Theory)
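The voting intuition and the random-surfer reading can be sketched as a small power iteration. This is an illustrative sketch only, not Google's implementation: the toy link graph, the function name `pagerank`, the damping value of 0.85 and the even spreading of rank from pages with no outgoing links are all assumptions made for the example.

```python
def pagerank(links, damping=0.85, iterations=100):
    """Approximate PageRank by repeated voting.

    links maps each page to the list of pages it links to.
    damping is the assumed probability that the random surfer
    follows a link rather than jumping to a random page.
    """
    pages = sorted(set(links) | {t for ts in links.values() for t in ts})
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}  # start with equal probability

    for _ in range(iterations):
        # Every page gets a small share from random jumps.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # Each link is a vote worth a fraction of the voter's rank.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # Assumption: a page with no links spreads its rank evenly.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


# Hypothetical four-page web: D links in, but nobody links to D.
toy_web = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
ranks = pagerank(toy_web)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 3))
```

The ranks sum to 1, so each value can be read as the long-run probability of the random surfer being on that page. In this toy graph, C collects the most votes and D, which has no incoming links, the fewest.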
Newsgroup Mining
Work by Jonathan Roberts
Web Mining
www.theyrule.net
Link Discovery
www.theyrule.net
Webs of People, Webs of Data
The Web of Data
Semantic Web
The Semantic Web is a graph-based
knowledge representation of data, spanning
the Web, traditional databases, the desktop
and mobile devices.
Friend of a Friend (FOAF)
"The FOAF project is about creating a Web of
machine-readable homepages describing people, the
links between them and the things they create and do."
http://www.foaf-project.org/
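As a rough illustration of "machine-readable homepages", FOAF-style statements can be modelled as subject-predicate-object triples. The sketch below uses plain Python tuples instead of an RDF library; `foaf:name` and `foaf:knows` are real FOAF terms, while the people, the `ex:` identifiers and the helper functions are hypothetical.

```python
FOAF = "http://xmlns.com/foaf/0.1/"  # FOAF vocabulary namespace

# Hypothetical people described as (subject, predicate, object) triples.
triples = [
    ("ex:alice", FOAF + "name", "Alice"),
    ("ex:bob", FOAF + "name", "Bob"),
    ("ex:carol", FOAF + "name", "Carol"),
    ("ex:alice", FOAF + "knows", "ex:bob"),
    ("ex:bob", FOAF + "knows", "ex:carol"),
]


def objects(subject, predicate):
    """Return every object asserted for the given subject and predicate."""
    return [o for s, p, o in triples if s == subject and p == predicate]


def friends_of_friends(person):
    """Follow foaf:knows two steps out, excluding direct friends and self."""
    direct = set(objects(person, FOAF + "knows"))
    indirect = set()
    for friend in direct:
        indirect.update(objects(friend, FOAF + "knows"))
    return indirect - direct - {person}


fof_names = sorted(
    name
    for friend in friends_of_friends("ex:alice")
    for name in objects(friend, FOAF + "name")
)
print(fof_names)
```

Because the links between people are data rather than prose, a program can discover that Alice is connected to Carol through Bob without anyone stating it directly.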
FOAF and Co-depiction
PARIP
• PARIP = Practice As Research In Performance
– 5-year national project
– Led by University of Bristol's Department of Drama:
Theatre, Film, Television
– Professor Baz Kershaw and Dr Angela Piccini
• PARIP Explorer
– Innovative contacts and research database
– Developed by ILRT
– Semantic Web technology
PARIP - Data Fusion
• contact details
• research interests
• images
• interviews
• concepts
• questionnaire responses
• institutions
• projects
• …
PARIP - User Perspective
• Dual interface:
– Text View: cross-database search engine
– Map View: visual link discovery and browsing
PARIP - Technical Perspective
• Semantic Web: RDF/XML and FOAF
• Prolog running as a Web Service (WSDL+SOAP)
• SPARQL query interface for programmatic access
• XHTML AJAX client
• Visualisation via Flash
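Programmatic access through a SPARQL interface might look something like the sketch below. It only builds the request URL, following the standard SPARQL protocol convention of passing the query text in a `query` parameter; the endpoint address and the query itself are hypothetical and are not taken from PARIP Explorer.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Hypothetical endpoint; the real PARIP address is not given in the slides.
ENDPOINT = "http://example.org/sparql"

# Hypothetical query: list the names of up to ten people in the data.
QUERY = """
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?name WHERE {
  ?person a foaf:Person ;
          foaf:name ?name .
}
LIMIT 10
"""


def build_request_url(endpoint, query):
    """URL-encode the query text into a GET request URL."""
    return endpoint + "?" + urlencode({"query": query.strip()})


url = build_request_url(ENDPOINT, QUERY)

# Round-trip check: decoding the URL gives back the original query text.
decoded = parse_qs(urlsplit(url).query)["query"][0]
print(url)
```

A client such as the AJAX front end could send a URL like this and receive the matching rows back, without needing to know how the data is stored on the server.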
Research Directions
Automated Data Fusion
Exabyte Scale Informatics
• 1 Exabyte = 10^18 bytes,
i.e. 1,000,000,000,000,000,000 bytes
• 1 Exabyte is approximately everything ever:
• written,
• composed,
• filmed,
• painted
• or in any other way 'recorded' by humans.
• Manual classification and retrieval are inadequate;
machine learning and data mining are essential.
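To give a feel for the scale, here is a back-of-envelope calculation. The scan rate of 1 GB per second is a hypothetical figure chosen only for illustration; decimal (power-of-ten) units are used throughout.

```python
EXABYTE = 10 ** 18   # bytes
TERABYTE = 10 ** 12  # bytes
SCAN_RATE = 10 ** 9  # bytes per second (hypothetical 1 GB/s reader)
SECONDS_PER_YEAR = 365 * 24 * 60 * 60

# How many terabyte-sized drives one exabyte would fill.
terabytes_per_exabyte = EXABYTE // TERABYTE

# How long a single sequential pass over the data would take.
scan_years = EXABYTE / SCAN_RATE / SECONDS_PER_YEAR

print(terabytes_per_exabyte, "terabytes per exabyte")
print(round(scan_years, 1), "years to read once at the assumed rate")
```

Under these assumptions, merely reading an exabyte once takes a single machine about three decades, before any classification has been attempted.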
Google on "Simon Price Bristol"
Contact details
