SlideShare a Scribd company logo
How to get your data into Sindice and Google with sitemap4rdfBoris Villazón-Terrazas (OEG), Richard Cyganiak (DERI)
Publishing Linked Data from a triple store
Linked Data frontends for triple storesSource: Pubby website, http://www4.wiwiss.fu-berlin.de/pubby/
Search engines
Sindice: the best RDF search engine
Sindice: the best RDF search engine120M+ documentsContinuously updating since 2006Low-latency search APIRDF/XML, Turtle, RDFa, microformats
The Sitemap protocol
Sitemap ProtocolUsed by web crawlersEfficiently find all your content & discover what has been updatedhttp://sitemaps.org/
Sitemap Protocol: Simple example<?xml version="1.0" encoding="UTF-8"?><urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9">   <url>      <loc>http://yoursite/</loc>   </url>   <url>      <loc>http://yoursite/products/53546</loc>   </url>   <url>      <loc>http://yoursite/products/98421</loc>   </url>   <url>      <loc>http://yoursite/products/41003</loc>   </url></urlset>
Sitemap Protocol: Optional parts<?xml version="1.0" encoding="UTF-8"?><urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9">   <url>      <loc>http://yoursite/</loc>      <lastmod>2010-06-24</lastmod>      <changefreq>daily</changefreq>   </url></urlset>
Sitemap Protocol: Huge sitemapsGzip-compress your sitemapLimit: 50k URLs or 10MBsplit into multiple sitemap filesadd a sitemap index file
Sitemap Protocol: DiscoveryPublish the sitemap fileAdd a line to http://yoursite/robots.txt  Sitemap: http://yoursite/sitemap.xml
sitemap4rdfGenerate Sitemap files from a SPARQL endpoint
sitemap4rdfSimple command line toolSends a SPARQL query to list all URIsGenerates sitemapsitemap4rdf http://yoursite/sparql http://yoursite/resource/
Submit the sitemap location - Sindicehttp://sindice.com/main/submit
Submit the sitemap location - Googlehttps://www.google.com/webmasters/tools/
SummarySitemap protocol informs search engines about available pagesSupported by Sindice!sitemap4rdf generates Sitemap files by listing URIs in a SPARQL endpointOpen source, Javahttp://lab.linkeddata.deri.ie/2010/sitemap4rdf/

More Related Content

PPT
Talis Platform: A Linked Data Engine
PDF
Data Curation @ SpazioDati - NEXA Lunch Seminar
PDF
Using entity extraction extension with OpenRefine and Dandelion API
PDF
ISWC 2014 - Dandelion: from raw data to dataGEMs for developers
PDF
The RDF Report Card: Beyond the Triple Count
PDF
Beyond 2022 project presentation 2021
PDF
Text analytics for Google Spreadsheets using Text Mining add-on
PDF
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Talis Platform: A Linked Data Engine
Data Curation @ SpazioDati - NEXA Lunch Seminar
Using entity extraction extension with OpenRefine and Dandelion API
ISWC 2014 - Dandelion: from raw data to dataGEMs for developers
The RDF Report Card: Beyond the Triple Count
Beyond 2022 project presentation 2021
Text analytics for Google Spreadsheets using Text Mining add-on
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...

What's hot (20)

PPT
The Power of Semantic Technologies to Explore Linked Open Data
PDF
ORCID cross-sector application and use cases, Funder workflow: National Resea...
PDF
Querying the Wikidata Knowledge Graph
PDF
S4: The Self-Service Semantic Suite
PDF
DataXDay - Real-Time Access log analysis
PDF
New Product Introductions - Minesoft
PDF
Smarter content with a Dynamic Semantic Publishing Platform
PDF
Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018
PDF
5 Ruby Gems in 10 minutes - Faraday, Hashie, Twitter, Diametric, and Adamantium
PDF
GraphDB Connectors – Powering Complex SPARQL Queries
PDF
Cloud architectures for data science
PPTX
Connected data meetup group - introduction & scope
PDF
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
PDF
New Product Introductions - FIZ Karlsruhe
PDF
Smart Data Applications powered by the Wikidata Knowledge Graph
PDF
Fast Data processing with RFX
PDF
Discovering Related Data Sources in Data Portals
PPTX
Using historical open data for family history - and the value of GB1900 data
PDF
PID Services for FAIR data
PDF
PID services - understandability and findability of data
The Power of Semantic Technologies to Explore Linked Open Data
ORCID cross-sector application and use cases, Funder workflow: National Resea...
Querying the Wikidata Knowledge Graph
S4: The Self-Service Semantic Suite
DataXDay - Real-Time Access log analysis
New Product Introductions - Minesoft
Smarter content with a Dynamic Semantic Publishing Platform
Transforming Your Data with GraphDB: GraphDB Fundamentals, Jan 2018
5 Ruby Gems in 10 minutes - Faraday, Hashie, Twitter, Diametric, and Adamantium
GraphDB Connectors – Powering Complex SPARQL Queries
Cloud architectures for data science
Connected data meetup group - introduction & scope
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
New Product Introductions - FIZ Karlsruhe
Smart Data Applications powered by the Wikidata Knowledge Graph
Fast Data processing with RFX
Discovering Related Data Sources in Data Portals
Using historical open data for family history - and the value of GB1900 data
PID Services for FAIR data
PID services - understandability and findability of data
Ad

Similar to How to get your data into Sindice and Google with sitemap4rdf (20)

PDF
Sitemap4rdf(v2 boris)
PPT
Semantic Web
PPTX
The new CIARD RING , a machine-readable directory of datasets for agriculture
PPTX
Datasets, APIs, and Web Scraping
PPT
Drupal and the Semantic Web
PPT
Dsp bbc-jem rayfield-semtech2011
PPT
PPT
Semantic web and Drupal: an introduction
PPTX
The CIARD RING , a global directory of datasets for agriculture, by Valeria P...
PPT
Getting Started With The Talis Platform
PPT
Microformats
PPT
JahiaOne - Semantic Web with Jahia
PPTX
Open belgium 2015 - open tourism
PPTX
Publishing Linked Data 3/5 Semtech2011
PDF
E017624043
PDF
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
PPTX
NCompass Live: RSS: Feed Me
PPTX
Reto2.011 APEX API
PDF
The Semantic Web Client Library - Consuming Linked Data in Your Applications
PDF
LOD技術解説
Sitemap4rdf(v2 boris)
Semantic Web
The new CIARD RING , a machine-readable directory of datasets for agriculture
Datasets, APIs, and Web Scraping
Drupal and the Semantic Web
Dsp bbc-jem rayfield-semtech2011
Semantic web and Drupal: an introduction
The CIARD RING , a global directory of datasets for agriculture, by Valeria P...
Getting Started With The Talis Platform
Microformats
JahiaOne - Semantic Web with Jahia
Open belgium 2015 - open tourism
Publishing Linked Data 3/5 Semtech2011
E017624043
Smart Crawler: A Two Stage Crawler for Concept Based Semantic Search Engine.
NCompass Live: RSS: Feed Me
Reto2.011 APEX API
The Semantic Web Client Library - Consuming Linked Data in Your Applications
LOD技術解説
Ad

More from Richard Cyganiak (12)

PPTX
SHACL: Shaping the Big Ball of Data Mud
PPTX
What's New in RDF 1.1?
PDF
EDF2012: The Web of Data and its Five Stars
PPTX
VoID: Metadata for RDF Datasets
PPTX
Practical Cross-Dataset Queries with SPARQL (Introduction)
PPTX
How to Publish Open Data
PPTX
Sigma EE: Reaping low-hanging fruits in RDF-based data integration
PPT
Investigating Community Implementation of the GoodRelations Ontology
PPTX
Self-Service Linked Government Data with dcat and Gridworks
PPTX
The State of Linked Government Data
PDF
What is SDMX-RDF?
PDF
dcat: An RDF vocabulary for interoperability of data catalogues
SHACL: Shaping the Big Ball of Data Mud
What's New in RDF 1.1?
EDF2012: The Web of Data and its Five Stars
VoID: Metadata for RDF Datasets
Practical Cross-Dataset Queries with SPARQL (Introduction)
How to Publish Open Data
Sigma EE: Reaping low-hanging fruits in RDF-based data integration
Investigating Community Implementation of the GoodRelations Ontology
Self-Service Linked Government Data with dcat and Gridworks
The State of Linked Government Data
What is SDMX-RDF?
dcat: An RDF vocabulary for interoperability of data catalogues

Recently uploaded (20)

PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Big Data Technologies - Introduction.pptx
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Approach and Philosophy of On baking technology
PDF
KodekX | Application Modernization Development
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
Spectroscopy.pptx food analysis technology
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
Cloud computing and distributed systems.
Encapsulation_ Review paper, used for researhc scholars
Big Data Technologies - Introduction.pptx
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Approach and Philosophy of On baking technology
KodekX | Application Modernization Development
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
NewMind AI Weekly Chronicles - August'25 Week I
Reach Out and Touch Someone: Haptics and Empathic Computing
“AI and Expert System Decision Support & Business Intelligence Systems”
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
MYSQL Presentation for SQL database connectivity
Programs and apps: productivity, graphics, security and other tools
Spectroscopy.pptx food analysis technology
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Cloud computing and distributed systems.

How to get your data into Sindice and Google with sitemap4rdf

  • 1. How to get your data into Sindice and Google with sitemap4rdfBoris Villazón-Terrazas (OEG), Richard Cyganiak (DERI)
  • 2. Publishing Linked Data from a triple store
  • 3. Linked Data frontends for triple storesSource: Pubby website, http://www4.wiwiss.fu-berlin.de/pubby/
  • 5. Sindice: the best RDF search engine
  • 6. Sindice: the best RDF search engine120M+ documentsContinuously updating since 2006Low-latency search APIRDF/XML, Turtle, RDFa, microformats
  • 8. Sitemap ProtocolUsed by web crawlersEfficiently find all your content & discover what has been updatedhttp://sitemaps.org/
  • 9. Sitemap Protocol: Simple example<?xml version="1.0" encoding="UTF-8"?><urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9"> <url> <loc>http://yoursite/</loc> </url> <url> <loc>http://yoursite/products/53546</loc> </url> <url> <loc>http://yoursite/products/98421</loc> </url> <url> <loc>http://yoursite/products/41003</loc> </url></urlset>
  • 10. Sitemap Protocol: Optional parts<?xml version="1.0" encoding="UTF-8"?><urlsetxmlns="http://www.sitemaps.org/schemas/sitemap/0.9"> <url> <loc>http://yoursite/</loc> <lastmod>2010-06-24</lastmod> <changefreq>daily</changefreq> </url></urlset>
  • 11. Sitemap Protocol: Huge sitemapsGzip-compress your sitemapLimit: 50k URLs or 10MBsplit into multiple sitemap filesadd a sitemap index file
  • 12. Sitemap Protocol: DiscoveryPublish the sitemap fileAdd a line to http://yoursite/robots.txt Sitemap: http://yoursite/sitemap.xml
  • 13. sitemap4rdfGenerate Sitemap files from a SPARQL endpoint
  • 14. sitemap4rdfSimple command line toolSends a SPARQL query to list all URIsGenerates sitemapsitemap4rdf http://yoursite/sparql http://yoursite/resource/
  • 15. Submit the sitemap location - Sindicehttp://sindice.com/main/submit
  • 16. Submit the sitemap location - Googlehttps://www.google.com/webmasters/tools/
  • 17. SummarySitemap protocol informs search engines about available pagesSupported by Sindice!sitemap4rdf generates Sitemap files by listing URIs in a SPARQL endpointOpen source, Javahttp://lab.linkeddata.deri.ie/2010/sitemap4rdf/