SlideShare a Scribd company logo
Measuring the usefulness of
Knowledge Organization Systems
in
Information Retrieval applications
Philipp Mayr
Observatory for Knowledge Organisation
Systems KNOWeSCAPE workshop, Valletta,
Malta
February 01, 2017
GESIS
• We are developing interactive information retrieval systems for
searching indexed literature and data sets
• We follow the principle „research-based service“; develop research
prototypes, test and evaluate them and implement the features which
are working for the end users
2
Intro
• Typical difficulties in searching digital libraries (DL)
– Vagueness between search and indexing terms
– How to support searchers with controlled vocabulary?
• Assumption: a user’s search (experience) should
improve by using Knowledge Organization Systems
(KOS):
– Vague search tasks
– Unfamiliar fields
– Cross domain searches
• Case studies to demonstrate the effectiveness of
KOS in different search scenarios 3
Case Study 1: Information retrieval
experiment
• Intra- and interdisciplinary cross-
concordances in the project
KoMoHe
– Social Sciences-SocSci; SocSci-
Economics; SocSci-Psychology; Politics-
Economics; Medicine-Psychology, …
• Information retrieval evaluation of the
mappings (effectiveness of
intellectual mapping)
4Controlled terms
Case Study 1
• How effective are the mappings in an actual search?
Does the application of term mappings (TT) improve
search over a non-transformed subject (i.e. controlled
vocabulary) search (CT)?
• Real queries, only equivalence relations, 13 thesaurus
mappings
5
Mayr/Petras 2008
• Overlap and more
identical terms in
intradisciplinary
mappings
• Interdisciplinary
mappings made the
strongest effect
Case Study 2: Information retrieval
experiment
• Discipline-specific Search-Term-
Recommendation (STR) Services in IRM
project
• Are recommendations from discipline
specific STRs better suited for query
expansion than general ones?
• Co-occurence of terms
in title/description and
assigned controlled
terms
• 17 STR services
– 16 discipline-spec.
– 1 global
6Lüke et al. 2012
Case Study 2
• Are recommendations from discipline specific STRs better suited for
query expansion (QE) than general ones?
• 100 topics from the GIRT corpus, top 4 recommendations to expand the
original query
• gSTR = global STR; tSTR = topical STR; bSTR = best-performing STR
7
Lüke et al. 2012
• QE with specific STRs leads to significantly better results than QE with a
general STR
• Selecting the best matching specific STR in an automatic way is a major
Case Study 3: Interactive IR
experiment
• Measuring the utility and
performance of Search Term
Recommendation (STR) Services in
AMUR project
• Logfile-based evaluation of STR
usage and later search session
success
• We defined positive signals (export,
save, email, full text …) in the
system
enter_search_term→select_term_from_reco
mmender→search→view_record_1→view_r
ecord_2→view_record_3→export_record
• Analysis of one year of log data
8
Hienert/Mutschke 2016
Case Study 3
• Usage of the STR significantly often implicates the
occurrence of positive signals during the following session
steps
9
Hienert/Mutschke 2016
Conclusions
• Information retrieval and
interactice IR settings are able to
demonstrate the utility of KOS
usage (usefulness)
– In experimental settings
– In user evaluations
• Each methodology has pros and
cons
– Effort and significance in small user
studies
– Too controlled, system-based,
without real users
• Terminology mapping projects 10
IR
Interactive
IR
Availability of corpora high low
Reproducibility high low
Control high low
Measures medium medium
Effort low high
Significance medium medium
Generalisability medium medium
Realistic Scenario no high
Outlook
• Integrate different recommender systems in
real retrieval tasks (search sessions)
• Use and evaluate recommenders for query
expansion and as dynamic features in IR, in
the retrieval process (AMUR project)
• Develop new measures of utility of
recommender systems
– E.g. measure task completion rates or goal
satisfaction
11
References
• Hienert, D. & Mutschke, P. (2016). A Usefulness-based Approach for
Measuring the Local and Global Effect of IIR Services. In Proceedings of
the 2016 ACM on Conference on Human Information Interaction and
Retrieval (CHIIR '16). ACM, New York, NY, USA, 153-162.
http://dx.doi.org/10.1145/2854946.2854962
• Lüke, T., Schaer, P., & Mayr, P. (2012). Improving Retrieval Results with
discipline-specific Query Expansion. In International Conference on Theory
and Practice of Digital Libraries (TPDL 2012) (pp. 408–413). Paphos,
Cyprus: Springer Berlin Heidelberg. http://doi.org/10.1007/978-3-642-
33290-6_44
• Mayr, P., & Petras, V. (2008). Cross-concordances: terminology mapping
and its effectiveness for information retrieval. In 74th IFLA World Library and
Information Congress. Québec, Canada: IFLA. Retrieved from
http://www.ifla.org/IV/ifla74/papers/129-Mayr_Petras-en.pdf
12
Thank you
Contact:
Dr Philipp Mayr
GESIS - Leibniz Institute for the Social Sciences,
Germany
Email: philipp.mayr@gesis.org
Twitter: @philipp_mayr
13

More Related Content

PDF
A practical guide to do primary research on meta analysis methodology - Pubrica
PDF
Recommending Scientific Papers: Investigating the User Curriculum
PPTX
The comparative study of information retrieval models used in search engines
PPTX
The Simulacrum, a Synthetic Cancer Dataset
PDF
An efficient information retrieval ontology system based indexing for context
PDF
Bibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
PDF
Ijmet 10 02_050
PPTX
Grds conferences icst and icbelsh (10)
A practical guide to do primary research on meta analysis methodology - Pubrica
Recommending Scientific Papers: Investigating the User Curriculum
The comparative study of information retrieval models used in search engines
The Simulacrum, a Synthetic Cancer Dataset
An efficient information retrieval ontology system based indexing for context
Bibliometric-enhanced Retrieval Models for Big Scholarly Information Systems
Ijmet 10 02_050
Grds conferences icst and icbelsh (10)

What's hot (20)

PDF
Profiling Users' Preferences with Text Mining '14
PDF
Peter (Yun-shao) Sung's Resume 2016III
PDF
Data and Data collection
DOCX
Ms 66 marketing research
PDF
C017510717
PPTX
Integrating research indicators for use in the repositories infrastructure
PPTX
Literature review
PPTX
environmental scanning
PPTX
Crowdsourcing Predictors of Behavioral Outcomes
DOCX
Glossary
PPTX
Introduction to Research methodology: Orientation for Doctoral Program Course...
PPT
Replicating FLOSS Research as eResearch
PPTX
Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...
PPTX
KREAM@ICCS2013
DOCX
an empirical performance evaluation of relational keyword search techniques
PDF
Strasser "Effective data management and its role in open research"
PPTX
Multi-factor Information Security Risk in Information System
PPTX
COAR Next Generation Repositories WG - Text mining and Recommender system sto...
PDF
Research design decisions and be competent in the process of reliable data co...
Profiling Users' Preferences with Text Mining '14
Peter (Yun-shao) Sung's Resume 2016III
Data and Data collection
Ms 66 marketing research
C017510717
Integrating research indicators for use in the repositories infrastructure
Literature review
environmental scanning
Crowdsourcing Predictors of Behavioral Outcomes
Glossary
Introduction to Research methodology: Orientation for Doctoral Program Course...
Replicating FLOSS Research as eResearch
Llebot "Research Data Support for Researchers: Metadata, Challenges, and Oppo...
KREAM@ICCS2013
an empirical performance evaluation of relational keyword search techniques
Strasser "Effective data management and its role in open research"
Multi-factor Information Security Risk in Information System
COAR Next Generation Repositories WG - Text mining and Recommender system sto...
Research design decisions and be competent in the process of reliable data co...
Ad

Viewers also liked (20)

PDF
Recent Advances in Bibliometric-Enhanced Information Retrieval
PDF
Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...
PPT
Adaptive Design implications for Knowledge Organization and Information Retri...
PDF
Establishing an Online Access Panel for Interactive Information Retrieval Res...
PPTX
PEP-TF: Social Media Monitoring of the Campaigns for the 2013 German Bundesta...
PPTX
Opening Scholarly Communication in Social Sciences (OSCOSS)
PPTX
Are topic-specific search term, journal name and author name recommendations ...
PPTX
Introduction of the 3rd International Workshop on Bibliometric-enhanced Infor...
PPTX
Analyzing the research output presented at European Networked Knowledge Organ...
PPTX
Opening Scholarly Communication in the Social Sciences
PPTX
Past, present and future of scientific information
PPTX
Demonstrating a Framework for KOS-based Recommendations Systems
PPTX
Recent applications of Knowledge Organization Systems
PPTX
Pennants for Descriptors
PPTX
Introduction to the 15th NKOS workshop @TPDL2016
PPTX
Introduction of the Bibliometric-enhanced Information Retrieval (BIR) workshop
PPTX
Assessing a human mediated current awareness service
PDF
Using co-authorship networks for author name disambiguation
PDF
Towards a Semantic Citation Index for the German Social Sciences
PDF
How to build your own citation index
Recent Advances in Bibliometric-Enhanced Information Retrieval
Opening Scholarly Communication in Social Sciences by Connecting Collaborativ...
Adaptive Design implications for Knowledge Organization and Information Retri...
Establishing an Online Access Panel for Interactive Information Retrieval Res...
PEP-TF: Social Media Monitoring of the Campaigns for the 2013 German Bundesta...
Opening Scholarly Communication in Social Sciences (OSCOSS)
Are topic-specific search term, journal name and author name recommendations ...
Introduction of the 3rd International Workshop on Bibliometric-enhanced Infor...
Analyzing the research output presented at European Networked Knowledge Organ...
Opening Scholarly Communication in the Social Sciences
Past, present and future of scientific information
Demonstrating a Framework for KOS-based Recommendations Systems
Recent applications of Knowledge Organization Systems
Pennants for Descriptors
Introduction to the 15th NKOS workshop @TPDL2016
Introduction of the Bibliometric-enhanced Information Retrieval (BIR) workshop
Assessing a human mediated current awareness service
Using co-authorship networks for author name disambiguation
Towards a Semantic Citation Index for the German Social Sciences
How to build your own citation index
Ad

Similar to Measuring the usefulness of Knowledge Organization Systems in Information Retrieval applications (20)

PDF
Intra- and interdisciplinary cross-concordances for information retrieval
PDF
Search term recommendation and non-textual ranking evaluated
PPTX
Query expansion for search improvement by faizulhaque
PDF
TOP 10 Cited Computer Science & Information Technology Research Articles From...
PPTX
Information Retrieval
PPTX
information retrieval
PDF
Improving search result via search keywords and data classification similarity
PPT
A Topic map-based ontology IR system versus Clustering-based IR System: A Com...
PDF
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
DOC
Semantic Search of E-Learning Documents Using Ontology Based System
PDF
Hci encyclopedia irshortefords
PDF
Hci encyclopedia irshortefords
PDF
SENSITIVITY ANALYSIS OF INFORMATION RETRIEVAL METRICS
PDF
A combination of reduction and expansion approaches to handle with long natur...
PDF
Performance Evaluation of Query Processing Techniques in Information Retrieval
PPTX
IRT Unit_I.pptx
PDF
G1803054653
PDF
Evaluation in (Music) Information Retrieval through the Audio Music Similarit...
PDF
Interactive informationretrieval 토인모_201202
Intra- and interdisciplinary cross-concordances for information retrieval
Search term recommendation and non-textual ranking evaluated
Query expansion for search improvement by faizulhaque
TOP 10 Cited Computer Science & Information Technology Research Articles From...
Information Retrieval
information retrieval
Improving search result via search keywords and data classification similarity
A Topic map-based ontology IR system versus Clustering-based IR System: A Com...
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
Semantic Search of E-Learning Documents Using Ontology Based System
Hci encyclopedia irshortefords
Hci encyclopedia irshortefords
SENSITIVITY ANALYSIS OF INFORMATION RETRIEVAL METRICS
A combination of reduction and expansion approaches to handle with long natur...
Performance Evaluation of Query Processing Techniques in Information Retrieval
IRT Unit_I.pptx
G1803054653
Evaluation in (Music) Information Retrieval through the Audio Music Similarit...
Interactive informationretrieval 토인모_201202

More from GESIS (17)

PDF
Chatting with Papers: A Hybrid Approach Using LLMs and Knowledge Graphs
PPTX
10th BIR Workshop @ECIR 2020: introduction
PPTX
From closed to open access: A case study of flipped journals
PPTX
Highly cited references in PLOS ONE and their in-text usage over time
PDF
4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...
PPTX
Bibliometric-enhanced Information Retrieval: Connecting IR with Bibliometrics
PPTX
Analyzing the network structure and gender differences of the “NKOS community”
PPTX
Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...
PPTX
Searching beyond datasets in the Social Sciences
PPTX
Bedeutung von Text Mining am Beispiel der Sozialwissenschaften
PPTX
Contextualised Browsing in a Digital Library’s Living Lab
PPTX
41st European Conference on Information Retrieval (ECIR 2019)
PPTX
Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...
PDF
A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...
PPTX
Challenges in Extracting and Managing References
PPTX
Einführung in das Vektorraummodell
PPTX
Industrie 4.0
Chatting with Papers: A Hybrid Approach Using LLMs and Knowledge Graphs
10th BIR Workshop @ECIR 2020: introduction
From closed to open access: A case study of flipped journals
Highly cited references in PLOS ONE and their in-text usage over time
4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural...
Bibliometric-enhanced Information Retrieval: Connecting IR with Bibliometrics
Analyzing the network structure and gender differences of the “NKOS community”
Recent advances in the project EXCITE – Extraction of Citations from PDF Docu...
Searching beyond datasets in the Social Sciences
Bedeutung von Text Mining am Beispiel der Sozialwissenschaften
Contextualised Browsing in a Digital Library’s Living Lab
41st European Conference on Information Retrieval (ECIR 2019)
Offenes kollaboratives Schreiben: Eine „Open Science“-Infrastruktur am Beispi...
A Complete Year of User Retrieval Sessions in a Social Sciences Academic Sear...
Challenges in Extracting and Managing References
Einführung in das Vektorraummodell
Industrie 4.0

Recently uploaded (20)

PPTX
TOTAL hIP ARTHROPLASTY Presentation.pptx
PPT
protein biochemistry.ppt for university classes
PPTX
2. Earth - The Living Planet earth and life
PPTX
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PDF
Phytochemical Investigation of Miliusa longipes.pdf
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PPTX
Fluid dynamics vivavoce presentation of prakash
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PDF
Placing the Near-Earth Object Impact Probability in Context
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PPTX
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
PDF
Looking into the jet cone of the neutrino-associated very high-energy blazar ...
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PPT
POSITIONING IN OPERATION THEATRE ROOM.ppt
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
PPTX
POULTRY PRODUCTION AND MANAGEMENTNNN.pptx
TOTAL hIP ARTHROPLASTY Presentation.pptx
protein biochemistry.ppt for university classes
2. Earth - The Living Planet earth and life
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
ECG_Course_Presentation د.محمد صقران ppt
Phytochemical Investigation of Miliusa longipes.pdf
Taita Taveta Laboratory Technician Workshop Presentation.pptx
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
Biophysics 2.pdffffffffffffffffffffffffff
Fluid dynamics vivavoce presentation of prakash
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
Placing the Near-Earth Object Impact Probability in Context
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
Looking into the jet cone of the neutrino-associated very high-energy blazar ...
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
Classification Systems_TAXONOMY_SCIENCE8.pptx
POSITIONING IN OPERATION THEATRE ROOM.ppt
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
POULTRY PRODUCTION AND MANAGEMENTNNN.pptx

Measuring the usefulness of Knowledge Organization Systems in Information Retrieval applications

  • 1. Measuring the usefulness of Knowledge Organization Systems in Information Retrieval applications Philipp Mayr Observatory for Knowledge Organisation Systems KNOWeSCAPE workshop, Valletta, Malta February 01, 2017
  • 2. GESIS • We are developing interactive information retrieval systems for searching indexed literature and data sets • We follow the principle „research-based service“; develop research prototypes, test and evaluate them and implement the features which are working for the end users 2
  • 3. Intro • Typical difficulties in searching digital libraries (DL) – Vagueness between search and indexing terms – How to support searchers with controlled vocabulary? • Assumption: a user’s search (experience) should improve by using Knowledge Organization Systems (KOS): – Vague search tasks – Unfamiliar fields – Cross domain searches • Case studies to demonstrate the effectiveness of KOS in different search scenarios 3
  • 4. Case Study 1: Information retrieval experiment • Intra- and interdisciplinary cross- concordances in the project KoMoHe – Social Sciences-SocSci; SocSci- Economics; SocSci-Psychology; Politics- Economics; Medicine-Psychology, … • Information retrieval evaluation of the mappings (effectiveness of intellectual mapping) 4Controlled terms
  • 5. Case Study 1 • How effective are the mappings in an actual search? Does the application of term mappings (TT) improve search over a non-transformed subject (i.e. controlled vocabulary) search (CT)? • Real queries, only equivalence relations, 13 thesaurus mappings 5 Mayr/Petras 2008 • Overlap and more identical terms in intradisciplinary mappings • Interdisciplinary mappings made the strongest effect
  • 6. Case Study 2: Information retrieval experiment • Discipline-specific Search-Term- Recommendation (STR) Services in IRM project • Are recommendations from discipline specific STRs better suited for query expansion than general ones? • Co-occurence of terms in title/description and assigned controlled terms • 17 STR services – 16 discipline-spec. – 1 global 6Lüke et al. 2012
  • 7. Case Study 2 • Are recommendations from discipline specific STRs better suited for query expansion (QE) than general ones? • 100 topics from the GIRT corpus, top 4 recommendations to expand the original query • gSTR = global STR; tSTR = topical STR; bSTR = best-performing STR 7 Lüke et al. 2012 • QE with specific STRs leads to significantly better results than QE with a general STR • Selecting the best matching specific STR in an automatic way is a major
  • 8. Case Study 3: Interactive IR experiment • Measuring the utility and performance of Search Term Recommendation (STR) Services in AMUR project • Logfile-based evaluation of STR usage and later search session success • We defined positive signals (export, save, email, full text …) in the system enter_search_term→select_term_from_reco mmender→search→view_record_1→view_r ecord_2→view_record_3→export_record • Analysis of one year of log data 8 Hienert/Mutschke 2016
  • 9. Case Study 3 • Usage of the STR significantly often implicates the occurrence of positive signals during the following session steps 9 Hienert/Mutschke 2016
  • 10. Conclusions • Information retrieval and interactice IR settings are able to demonstrate the utility of KOS usage (usefulness) – In experimental settings – In user evaluations • Each methodology has pros and cons – Effort and significance in small user studies – Too controlled, system-based, without real users • Terminology mapping projects 10 IR Interactive IR Availability of corpora high low Reproducibility high low Control high low Measures medium medium Effort low high Significance medium medium Generalisability medium medium Realistic Scenario no high
  • 11. Outlook • Integrate different recommender systems in real retrieval tasks (search sessions) • Use and evaluate recommenders for query expansion and as dynamic features in IR, in the retrieval process (AMUR project) • Develop new measures of utility of recommender systems – E.g. measure task completion rates or goal satisfaction 11
  • 12. References • Hienert, D. & Mutschke, P. (2016). A Usefulness-based Approach for Measuring the Local and Global Effect of IIR Services. In Proceedings of the 2016 ACM on Conference on Human Information Interaction and Retrieval (CHIIR '16). ACM, New York, NY, USA, 153-162. http://dx.doi.org/10.1145/2854946.2854962 • Lüke, T., Schaer, P., & Mayr, P. (2012). Improving Retrieval Results with discipline-specific Query Expansion. In International Conference on Theory and Practice of Digital Libraries (TPDL 2012) (pp. 408–413). Paphos, Cyprus: Springer Berlin Heidelberg. http://doi.org/10.1007/978-3-642- 33290-6_44 • Mayr, P., & Petras, V. (2008). Cross-concordances: terminology mapping and its effectiveness for information retrieval. In 74th IFLA World Library and Information Congress. Québec, Canada: IFLA. Retrieved from http://www.ifla.org/IV/ifla74/papers/129-Mayr_Petras-en.pdf 12
  • 13. Thank you Contact: Dr Philipp Mayr GESIS - Leibniz Institute for the Social Sciences, Germany Email: philipp.mayr@gesis.org Twitter: @philipp_mayr 13