Get the                                 Feeling!

 Supporting Users in Finding Relevant Sources
      of Linked Open Data at Web-Scale


Thomas Gottron, Ansgar Scherp, Bastian Krayer, Arne Peters
System Support for Searchers
System involvement
 automatic                        Hold for later
 execution


monitor and           (skip)
recommend
                                                                      Hold for
                                                                       later
execute on                           Area of recommended
command                                  development
                   Operational
                    Systems
  display            (then)
  options


   none                               Pure user activity


                                                                                 User activity



Bates, M.J.: Where should the person stop and the information search interface start?
Information Processing and Management 26(5), 575–591 (1990)
 Get the Google Feeling                Thomas Gottron                               BTC 2012 2
System Support Helps: Query Specific Snippets




                          Recall                         Precision




                         Speed                          Satisfaction



Tombros, A., Sanderson, M.: Advantages of query biased summaries in information retrieval.
SIGIR’98. pp. 2–10 (1998)
Get the Google Feeling                 Thomas Gottron                             BTC 2012 3
System Support Helps: Query Suggestions




                                          of all queries were chosen from suggestions




    Find entry point                  Think out of the box            Identify new query terms


Kelly, D., et al. Effects of popularity and quality on the usage of query suggestions during
information search. CHI '10, p 45-54, (2010)
Get the Google Feeling                   Thomas Gottron                                 BTC 2012 4
Get the Google Feeling   Thomas Gottron   BTC 2012 5
Did you mean?



 Result Set Size




Ranked Retrieval



Result Snippets




Related Queries
Schema-based Index Design




Get the Google Feeling   Thomas Gottron   BTC 2012 7
„Under the hood“

SPARQL                   Generalize      Select




                                         Count

   Query              Retrieve
                                          Rank
translation          Datasources

                                       Snippets


            • 1 query for result set and result set size
            • N queries for ranking data and snippets
                 Specify      Select

            • 2 queries per related query

Get the Google Feeling                Thomas Gottron   BTC 2012 8
Stats

                          Use of the complete BTC 2012 dataset
                          Index size
                             133M schema triples
                             224M payload triples

                          Commodity hardware
                             Data processing
                             LODation service provision



                          Index construction (15h) and optimization (5h)
                          Response time: < 1s on a single CPU machine



Get the Google Feeling                 Thomas Gottron              BTC 2012 9
Get the                Feeling!



          Thank you!

More Related Content

PDF
Manual completo
PDF
Evaluation of Biocontrol agents against Lasiodiplodia theobromae causing Infl...
PPT
Простые и составные числительные
PPTX
Body in mind
PPT
Needle stick
PPTX
Jclic pre
PDF
De-virtualizing virtual Function Calls using various Type Analysis Technique...
PPTX
Diapos staditik
Manual completo
Evaluation of Biocontrol agents against Lasiodiplodia theobromae causing Infl...
Простые и составные числительные
Body in mind
Needle stick
Jclic pre
De-virtualizing virtual Function Calls using various Type Analysis Technique...
Diapos staditik

Viewers also liked (13)

PDF
Jamming Attacks Prevention in Wireless Networks Using Packet Hiding Methods
PDF
Classification By Clustering Based On Adjusted Cluster
PDF
A Novel PSNR-B Approach for Evaluating the Quality of De-blocked Images
PDF
A New Theoretical Approach to Location Based Power Aware Routing
PPT
Формирование универсальных учебных действий. Регулятивные УУД.
PDF
Combining both Plug-in Vehicles and Renewable Energy Resources for Unit Commi...
PDF
Requirements and Challenges for Securing Cloud Applications and Services
PPT
Virus Informaticos
PDF
Effect of Age of Spawned Catfish (Clarias Gariepinus) Broodstock on Quantity ...
PPT
Sofitel Resort progress 14 11-2012
PDF
B0520710
PDF
Efficiency of Prediction Algorithms for Mining Biological Databases
PDF
Prototyping the Future Potentials of Location Based Services in the Realm of ...
Jamming Attacks Prevention in Wireless Networks Using Packet Hiding Methods
Classification By Clustering Based On Adjusted Cluster
A Novel PSNR-B Approach for Evaluating the Quality of De-blocked Images
A New Theoretical Approach to Location Based Power Aware Routing
Формирование универсальных учебных действий. Регулятивные УУД.
Combining both Plug-in Vehicles and Renewable Energy Resources for Unit Commi...
Requirements and Challenges for Securing Cloud Applications and Services
Virus Informaticos
Effect of Age of Spawned Catfish (Clarias Gariepinus) Broodstock on Quantity ...
Sofitel Resort progress 14 11-2012
B0520710
Efficiency of Prediction Algorithms for Mining Biological Databases
Prototyping the Future Potentials of Location Based Services in the Realm of ...
Ad

Similar to Get the Google Feeling! Supporting Users in Finding Relevant Sources (20)

PDF
How google works_final
PDF
Indextank east bay ruby meetup slides
PPTX
DATAWEEK KEYNOTE: LARGE SCALE SEARCH, DISCOVERY AND ANALYSIS IN ACTION
PDF
Semantic Search Tutorial at SemTech 2012
KEY
A Service-Based Architecture for Multi-domain Search on the Web
PPTX
Introduction to Information Retrieval
PPTX
Semantic Search tutorial at SemTech 2012
PDF
Quest Trail: An Effective Approach for Construction of Personalized Search En...
PDF
DynamoDB and Amazon Cloudsearch
PDF
GSA Webinar - June 2, 2011
PDF
Intro to new Google cloud technologies: Google Storage, Prediction API, BigQuery
PDF
Exploring session search
PDF
Better Search Engine Testing - Eric Pugh
PPT
Advanced Google Ref Tools Lilrc Bitting
PPTX
Summit EU Machine Learning
PDF
Information Retrieval (for beginners)
PPT
675d614e68cce (5).ppt image retrival for UI
KEY
02 Web Search
PPTX
Introduction to Information Retrieval
How google works_final
Indextank east bay ruby meetup slides
DATAWEEK KEYNOTE: LARGE SCALE SEARCH, DISCOVERY AND ANALYSIS IN ACTION
Semantic Search Tutorial at SemTech 2012
A Service-Based Architecture for Multi-domain Search on the Web
Introduction to Information Retrieval
Semantic Search tutorial at SemTech 2012
Quest Trail: An Effective Approach for Construction of Personalized Search En...
DynamoDB and Amazon Cloudsearch
GSA Webinar - June 2, 2011
Intro to new Google cloud technologies: Google Storage, Prediction API, BigQuery
Exploring session search
Better Search Engine Testing - Eric Pugh
Advanced Google Ref Tools Lilrc Bitting
Summit EU Machine Learning
Information Retrieval (for beginners)
675d614e68cce (5).ppt image retrival for UI
02 Web Search
Introduction to Information Retrieval
Ad

More from Thomas Gottron (10)

PDF
Focused Exploration of Geospatial Context on Linked Open Data
PDF
Leveraging the Web of Data: Managing, Analysing and Making Use of Linked Open...
PPTX
Perplexity of Index Models over Evolving Linked Data
PPTX
From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources
PPTX
Of Sampling and Smoothing: Approximating Distributions over Linked Open Data
PDF
Making Use of the Linked Data Cloud: The Role of Index Structures
PPTX
 Challenges in Managing Online Business Communities
PPTX
ESWC 2013: A Systematic Investigation of Explicit and Implicit Schema Informa...
PPTX
Challenging Retrieval Scenarios: Social Media and Linked Open Data
PPTX
Finding Good URLs: Aligning Entities in Knowledge Bases with Public Web Docum...
Focused Exploration of Geospatial Context on Linked Open Data
Leveraging the Web of Data: Managing, Analysing and Making Use of Linked Open...
Perplexity of Index Models over Evolving Linked Data
From Changes to Dynamics: Dynamics Analysis of Linked Open Data Sources
Of Sampling and Smoothing: Approximating Distributions over Linked Open Data
Making Use of the Linked Data Cloud: The Role of Index Structures
 Challenges in Managing Online Business Communities
ESWC 2013: A Systematic Investigation of Explicit and Implicit Schema Informa...
Challenging Retrieval Scenarios: Social Media and Linked Open Data
Finding Good URLs: Aligning Entities in Knowledge Bases with Public Web Docum...

Recently uploaded (20)

PPTX
endocrine - management of adrenal incidentaloma.pptx
PPTX
Hypertension_Training_materials_English_2024[1] (1).pptx
PPTX
perinatal infections 2-171220190027.pptx
PDF
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
PPTX
Understanding the Circulatory System……..
PPTX
POULTRY PRODUCTION AND MANAGEMENTNNN.pptx
PDF
Packaging materials of fruits and vegetables
PPTX
PMR- PPT.pptx for students and doctors tt
PDF
Unit 5 Preparations, Reactions, Properties and Isomersim of Organic Compounds...
PPTX
limit test definition and all limit tests
PPT
Heredity-grade-9 Heredity-grade-9. Heredity-grade-9.
PPT
Presentation of a Romanian Institutee 2.
PDF
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
PDF
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
PPT
veterinary parasitology ````````````.ppt
PDF
CHAPTER 2 The Chemical Basis of Life Lecture Outline.pdf
PPT
LEC Synthetic Biology and its application.ppt
PPT
Computional quantum chemistry study .ppt
PPTX
ap-psych-ch-1-introduction-to-psychology-presentation.pptx
PPT
Biochemestry- PPT ON Protein,Nitrogenous constituents of Urine, Blood, their ...
endocrine - management of adrenal incidentaloma.pptx
Hypertension_Training_materials_English_2024[1] (1).pptx
perinatal infections 2-171220190027.pptx
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
Understanding the Circulatory System……..
POULTRY PRODUCTION AND MANAGEMENTNNN.pptx
Packaging materials of fruits and vegetables
PMR- PPT.pptx for students and doctors tt
Unit 5 Preparations, Reactions, Properties and Isomersim of Organic Compounds...
limit test definition and all limit tests
Heredity-grade-9 Heredity-grade-9. Heredity-grade-9.
Presentation of a Romanian Institutee 2.
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
veterinary parasitology ````````````.ppt
CHAPTER 2 The Chemical Basis of Life Lecture Outline.pdf
LEC Synthetic Biology and its application.ppt
Computional quantum chemistry study .ppt
ap-psych-ch-1-introduction-to-psychology-presentation.pptx
Biochemestry- PPT ON Protein,Nitrogenous constituents of Urine, Blood, their ...

Get the Google Feeling! Supporting Users in Finding Relevant Sources

  • 1. Get the Feeling! Supporting Users in Finding Relevant Sources of Linked Open Data at Web-Scale Thomas Gottron, Ansgar Scherp, Bastian Krayer, Arne Peters
  • 2. System Support for Searchers System involvement automatic Hold for later execution monitor and (skip) recommend Hold for later execute on Area of recommended command development Operational Systems display (then) options none Pure user activity User activity Bates, M.J.: Where should the person stop and the information search interface start? Information Processing and Management 26(5), 575–591 (1990) Get the Google Feeling Thomas Gottron BTC 2012 2
  • 3. System Support Helps: Query Specific Snippets Recall Precision Speed Satisfaction Tombros, A., Sanderson, M.: Advantages of query biased summaries in information retrieval. SIGIR’98. pp. 2–10 (1998) Get the Google Feeling Thomas Gottron BTC 2012 3
  • 4. System Support Helps: Query Suggestions of all queries were chosen from suggestions Find entry point Think out of the box Identify new query terms Kelly, D., et al. Effects of popularity and quality on the usage of query suggestions during information search. CHI '10, p 45-54, (2010) Get the Google Feeling Thomas Gottron BTC 2012 4
  • 5. Get the Google Feeling Thomas Gottron BTC 2012 5
  • 6. Did you mean? Result Set Size Ranked Retrieval Result Snippets Related Queries
  • 7. Schema-based Index Design Get the Google Feeling Thomas Gottron BTC 2012 7
  • 8. „Under the hood“ SPARQL Generalize Select Count Query Retrieve Rank translation Datasources Snippets • 1 query for result set and result set size • N queries for ranking data and snippets Specify Select • 2 queries per related query Get the Google Feeling Thomas Gottron BTC 2012 8
  • 9. Stats  Use of the complete BTC 2012 dataset  Index size  133M schema triples  224M payload triples  Commodity hardware  Data processing  LODation service provision  Index construction (15h) and optimization (5h)  Response time: < 1s on a single CPU machine Get the Google Feeling Thomas Gottron BTC 2012 9
  • 10. Get the Feeling! Thank you!