SlideShare a Scribd company logo
Introduction Method Results Summary
Contextualization of Topics
browsing through terms, authors, journals and cluster allocations
Rob Koopman1 Shenghui Wang1 Andrea Scharnhorst2
1OCLC Research 2DANS-KNAW
ISSI 2015
Introduction Method Results Summary
Introduction
What are essence and boundary of a scientific field?
Different ways to find clusters in scientific literature based on
connectivity in terms of authorship, citations, language
similarity, etc.
Ambiguous nature in science
Introduction Method Results Summary
Ariadne: interactive context explorer
Ariadne is an interactive interface which allows users to
explore the context of entities such as authors, journals,
topical terms, etc.
It builds on semantic indexing statistically computed from a
large scale bibliographic corpus
It was originally implemented to explore 1M topical terms, 3M
authors, 35K journals and 700+ Dewey decimal classes
associated with 65M articles.
Introduction Method Results Summary
Research questions
Q1: How does the Ariadne algorithm work on a much smaller,
field specific dataset?
Q2: Can we use Ariadne to label the clusters produce by the
different methods?
Q3: Can we use Ariadne to compare different clustering
solutions?
Introduction Method Results Summary
LittleAriadne
LittleAriadne: context explorer over Astrophysics data
Offline: generates a semantic representation for each entity
Online: finds the most related entities and using
multidimensional scaling to display
Introduction Method Results Summary
LittleAriadne
An example article
Article ID ISI:000276828000006
Title On the Mass Transfer Rate in SS Cyg
Abstract The mass transfer rate in SS Cyg at quiescence, estimated
from the observed luminosity of the hot spot, is log M-tr
= 16.8 +/- 0.3. This is safely below the critical mass
transfer rates of log M-crit = 18.1 (corresponding to log
T-crit(0) = 3.88) or log M-crit = 17.2 (corresponding to
the “revised” value of log T-crit(0) = 3.65). The mass
transfer rate during outbursts is strongly enhanced
Author [author:smak j]
ISSN [issn:0001-5237]
Subject [subject:accretion, accretion disks] [subject:cataclysmic
variables] [subject:disc instability model] [subject:dwarf novae]
[subject:novae, cataclysmic variables] [subject:outbursts]
[subject:parameters] [subject:stars] [subject:stars dwarf novae]
[subject:stars individual ss cyg] [subject:state] [subject:
superoutbursts]
Introduction Method Results Summary
LittleAriadne
An example article
Article ID ISI:000276828000006
Title On the Mass Transfer Rate in SS Cyg
Abstract The mass transfer rate in SS Cyg at quiescence, estimated
from the observed luminosity of the hot spot, is log M-tr
= 16.8 +/- 0.3. This is safely below the critical mass
transfer rates of log M-crit = 18.1 (corresponding to log
T-crit(0) = 3.88) or log M-crit = 17.2 (corresponding to
the “revised” value of log T-crit(0) = 3.65). The mass
transfer rate during outbursts is strongly enhanced
Author [author:smak j]
ISSN [issn:0001-5237]
Subject [subject:accretion, accretion disks] [subject:cataclysmic
variables] [subject:disc instability model] [subject:dwarf novae]
[subject:novae, cataclysmic variables] [subject:outbursts]
[subject:parameters] [subject:stars] [subject:stars dwarf novae]
[subject:stars individual ss cyg] [subject:state] [subject:
superoutbursts]
Cluster label [cluster:a 19] [cluster:b 16] [cluster:c 15]
[cluster:d 51] [cluster:e 17] [cluster:f 1]
Introduction Method Results Summary
LittleAriadne
Six different clustering solutions
x Source y=#Cluster #Cluster in LittleAriadne
a cwts 1.8 23 23
b UMSI 23 23
c oclc 20 20 20
d hu 139 48
e sts 5664 229
f ECOOM 15 15
Introduction Method Results Summary
LittleAriadne
Entities in the Astrophysis dataset
There are in total 90,343 entities associated with 111,616
astrophysics articles
59 journals
27,027 author names (no disambiguation applied)
39,577 topical terms
23,322 subjects (extracted from ”Author Keywords” and
”Keywords Plus”)
358 cluster labels (source + cluster id)
Introduction Method Results Summary
LittleAriadne
Build semantic representation
Basic assumptions
Entities can be represented by its context
Entities which share more context are more likely to be related
Context is the textual environment where an entity occurs
Introduction Method Results Summary
LittleAriadne
An example article
Article ID ISI:000276828000006
Title On the Mass Transfer Rate in SS Cyg
Abstract The mass transfer rate in SS Cyg at quiescence, estimated
from the observed luminosity of the hot spot, is log M-tr
= 16.8 +/- 0.3. This is safely below the critical mass
transfer rates of log M-crit = 18.1 (corresponding to log
T-crit(0) = 3.88) or log M-crit = 17.2 (corresponding to
the “revised” value of log T-crit(0) = 3.65). The mass
transfer rate during outbursts is strongly enhanced
Author [author:smak j]
ISSN [issn:0001-5237]
Subject [subject:accretion, accretion disks] [subject:cataclysmic
variables] [subject:disc instability model] [subject:dwarf novae]
[subject:novae, cataclysmic variables] [subject:outbursts]
[subject:parameters] [subject:stars] [subject:stars dwarf novae]
[subject:stars individual ss cyg] [subject:state] [subject:
superoutbursts]
Cluster label [cluster:a 19] [cluster:b 16] [cluster:c 15]
[cluster:d 51] [cluster:e 17] [cluster:f 1]
Introduction Method Results Summary
LittleAriadne
Dimension reduction using Random Projection
mass
transfer
rate
[subject:outburst]
[subject:sstars]
[subject:parameters]
[author:smak j]
[cluster: a19]
[issn:0001-5237]
Introduction Method Results Summary
LittleAriadne
Dimension reduction using Random Projection
mass
transfer
rate
[subject:outburst]
[subject:sstars]
[subject:parameters]
[author:smak j]
[cluster: a19]
[issn:0001-5237]
Introduction Method Results Summary
LittleAriadne
From semantic representation to visualisation and more
Each entity has its semantic representation
Cosine similarity between entities can be computed very fast,
based on which the 2D visualisation is implemented
Introduction Method Results Summary
LittleAriadne
From semantic representation to visualisation and more
Each entity has its semantic representation
Cosine similarity between entities can be computed very fast,
based on which the 2D visualisation is implemented
For each article, we collected the semantic representation of
all the entities in which it involves, and take an average as its
semantic representation
We applied a standard K-means clustering method to cluster
these articles based on their semantic representations
Introduction Method Results Summary
Experiment 1: Exploring context
Experiment 1: Exploring context
Now we can explore
Let’s start with stars
An overview of all journals
Introduction Method Results Summary
Experiment 1: Exploring context
Contextual view of stars
Introduction Method Results Summary
Experiment 2: Labelling clusters
Experiment 2: Labelling clusters
What is cluster a 2?
Introduction Method Results Summary
Experiment 2: Labelling clusters
Experiment 2: Labelling clusters
Introduction Method Results Summary
Experiment 2: Labelling clusters
Experiment 2: Labelling clusters
Cluster ID Top 9 most related topical terms
a 2 ”cosmology” ”dark energy” ”density perturbations”
”cosmologies” ”planck” ”cosmological” ”spatial
curvature” ”inflationary” ”inflation”
b 2 ”cosmology” ”cosmological constant” ”cosmologies”
”cosmological” ”universes” ”dark energy” ”quadratic”
”tensor” ”planck”
c 17 ”power spectrum” ”cosmological parameters” ”cmb”
”last scattering” ”anisotropies” ”microwave background”
”power spectra” ”planck” ”cosmic microwave”
d 28 ”density perturbations” ”inflationary” ”inflation”
”dark energy” ”scale invariant” ”spatial curvature”
”cosmological perturbations” ”inflationary models”
”cosmologies”
Introduction Method Results Summary
Experiment 3: Comparing clustering solutions
Experiment 3: Comparing clustering solutions
Cluster labels are treated as entities
Let’s compare
Introduction Method Results Summary
Experiment 3: Comparing clustering solutions
Highly similar clustering solutions
Introduction Method Results Summary
Experiment 3: Comparing clustering solutions
Partially agreeing clustering solutions
Introduction Method Results Summary
Experiment 3: Comparing clustering solutions
An overview of all clustering solutions
Introduction Method Results Summary
Summary
Summary
We present a method and an interface that allows visual
exploration through the contexts of entities
We can provide the most related topical terms to clusters
although expert knowledge is needed to transform them into
real labels/topics
LittleAriadne provides a visual way of comparing different
clustering solutions
Our na¨ıve way of clustering is worth exploring further
Introduction Method Results Summary
Future extensions
Future extensions
Add more types of entities, such as citations, publishers,
conferences, etc, to provide richer context
Add direct links to articles to answer information retrieval
needs
Study context sensitivity
compare ”young” and ”young”
Introduction Method Results Summary
Thank you
Thank you
http://thoth.pica.nl/astro/relate
Rob Koopman (rob.koopman@oclc.org)
Shenghui Wang (shenghui.wang@oclc.org)
Andrea Scharnhorst (andrea.scharnhorst@dans.knaw.nl)

More Related Content

PDF
Surface Tension in Liquid Nickel
PDF
Nucleon electromagnetic form factors at high-momentum transfer from Lattice QCD
PDF
6. Proton Spin and Tensorgluons
PPTX
How JSR 385 Could have Saved the Mars Climate Orbiter DWX June 2019
PDF
A Holographic origin for the big bang
PDF
How JSR-385 Could Have Saved the Mars Climate Orbiter
PDF
MM2020:D17.00009
PDF
BNL_Research_Poster
Surface Tension in Liquid Nickel
Nucleon electromagnetic form factors at high-momentum transfer from Lattice QCD
6. Proton Spin and Tensorgluons
How JSR 385 Could have Saved the Mars Climate Orbiter DWX June 2019
A Holographic origin for the big bang
How JSR-385 Could Have Saved the Mars Climate Orbiter
MM2020:D17.00009
BNL_Research_Poster

What's hot (14)

PDF
Go2411811184
PDF
Paolo Creminelli "Dark Energy after GW170817"
PDF
Prediction of just suspended speed for mixed slurries at
PDF
DOI: 10.1007/s10773-009-0027-9
PDF
Aurelian Isar - Decoherence And Transition From Quantum To Classical In Open ...
PDF
MHD Newtonian and non-Newtonian Nano Fluid Flow Passing On A Magnetic Sphere ...
PPTX
Torsion and gravity seminar
PPTX
Origin of Universe (Twin)
PDF
"Warm tachyon matter" - N. Bilic
PPTX
Universe (Twin)
PDF
Kk graviton redo.july5,2012
PDF
Climate Extremes Workshop - Decompositions of Dependence for High-Dimensiona...
PDF
S. Duplij, "Higher braid groups and regular semigroups from polyadic binary c...
PDF
DL_FinalProposal
Go2411811184
Paolo Creminelli "Dark Energy after GW170817"
Prediction of just suspended speed for mixed slurries at
DOI: 10.1007/s10773-009-0027-9
Aurelian Isar - Decoherence And Transition From Quantum To Classical In Open ...
MHD Newtonian and non-Newtonian Nano Fluid Flow Passing On A Magnetic Sphere ...
Torsion and gravity seminar
Origin of Universe (Twin)
"Warm tachyon matter" - N. Bilic
Universe (Twin)
Kk graviton redo.july5,2012
Climate Extremes Workshop - Decompositions of Dependence for High-Dimensiona...
S. Duplij, "Higher braid groups and regular semigroups from polyadic binary c...
DL_FinalProposal
Ad

Similar to Contextualization of topics - browsing through terms, authors, journals and cluster allocations (20)

PDF
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
PDF
EP Thesis Defence 2016
PDF
Nucleating Nematic Droplets
PDF
Kinetic pathways to the isotropic-nematic phase transformation: a mean field ...
PDF
My Prize Winning Physics Poster from 2006
PDF
Multi wavelenth observations and surveys of galaxy clusters
PDF
Stellar hydro
PDF
UNF Undergrad Physics
PDF
Simulations of Strong Lensing
PDF
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
PDF
ON APPROACH TO INCREASE DENSITY OF FIELD- EFFECT TRANSISTORS IN AN INVERTER C...
PDF
Comparative study of results obtained by analysis of structures using ANSYS, ...
PPT
(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...
PDF
Introduction to Hadron Structure from Lattice QCD
PDF
The distribution and_annihilation_of_dark_matter_around_black_holes
PDF
Group Cohomology of the Poincare Group and Invariant States
PDF
Dragoljub Dimitrijević "Tachyon Inflation in the RSII Framework"
PDF
International Journal of Engineering Inventions (IJEI)
PDF
Topology of charge density from pseudopotential density functional theory cal...
Gwendolyn Eadie: A New Method for Time Series Analysis in Astronomy with an A...
EP Thesis Defence 2016
Nucleating Nematic Droplets
Kinetic pathways to the isotropic-nematic phase transformation: a mean field ...
My Prize Winning Physics Poster from 2006
Multi wavelenth observations and surveys of galaxy clusters
Stellar hydro
UNF Undergrad Physics
Simulations of Strong Lensing
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
ON APPROACH TO INCREASE DENSITY OF FIELD- EFFECT TRANSISTORS IN AN INVERTER C...
Comparative study of results obtained by analysis of structures using ANSYS, ...
(PhD Dissertation Defense) Theoretical and Numerical Investigations on Crysta...
Introduction to Hadron Structure from Lattice QCD
The distribution and_annihilation_of_dark_matter_around_black_holes
Group Cohomology of the Poincare Group and Invariant States
Dragoljub Dimitrijević "Tachyon Inflation in the RSII Framework"
International Journal of Engineering Inventions (IJEI)
Topology of charge density from pseudopotential density functional theory cal...
Ad

More from Shenghui Wang (13)

PDF
Non-parametric Subject Prediction
PDF
Our journey with semantic embedding
PPTX
Linking entities via semantic indexing
PDF
Semantic indexing for KOS
PPTX
Exploring a world of networked information built from free-text metadata
PPTX
Ariadne's Thread -- Exploring a world of networked information built from fre...
PDF
Learning Concept Mappings from Instance Similarity
PDF
Measuring the dynamic bi-directional influence between content and social ne...
PDF
Similarity Features, and their Role in Concept Alignment Learning
PDF
What is concept dirft and how to measure it?
PDF
ICA Slides
PPT
ECCS 2010
PDF
Study concept drift in political ontologies
Non-parametric Subject Prediction
Our journey with semantic embedding
Linking entities via semantic indexing
Semantic indexing for KOS
Exploring a world of networked information built from free-text metadata
Ariadne's Thread -- Exploring a world of networked information built from fre...
Learning Concept Mappings from Instance Similarity
Measuring the dynamic bi-directional influence between content and social ne...
Similarity Features, and their Role in Concept Alignment Learning
What is concept dirft and how to measure it?
ICA Slides
ECCS 2010
Study concept drift in political ontologies

Recently uploaded (20)

PPTX
Cell Membrane: Structure, Composition & Functions
PDF
bbec55_b34400a7914c42429908233dbd381773.pdf
PPTX
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
PPTX
Derivatives of integument scales, beaks, horns,.pptx
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PPTX
TOTAL hIP ARTHROPLASTY Presentation.pptx
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PPTX
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PPTX
2. Earth - The Living Planet Module 2ELS
PPTX
neck nodes and dissection types and lymph nodes levels
PPTX
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
PPTX
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
PDF
AlphaEarth Foundations and the Satellite Embedding dataset
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PDF
An interstellar mission to test astrophysical black holes
PPTX
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PPTX
2. Earth - The Living Planet earth and life
Cell Membrane: Structure, Composition & Functions
bbec55_b34400a7914c42429908233dbd381773.pdf
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
Derivatives of integument scales, beaks, horns,.pptx
Biophysics 2.pdffffffffffffffffffffffffff
TOTAL hIP ARTHROPLASTY Presentation.pptx
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
2. Earth - The Living Planet Module 2ELS
neck nodes and dissection types and lymph nodes levels
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
AlphaEarth Foundations and the Satellite Embedding dataset
Taita Taveta Laboratory Technician Workshop Presentation.pptx
An interstellar mission to test astrophysical black holes
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
2. Earth - The Living Planet earth and life

Contextualization of topics - browsing through terms, authors, journals and cluster allocations

  • 1. Introduction Method Results Summary Contextualization of Topics browsing through terms, authors, journals and cluster allocations Rob Koopman1 Shenghui Wang1 Andrea Scharnhorst2 1OCLC Research 2DANS-KNAW ISSI 2015
  • 2. Introduction Method Results Summary Introduction What are essence and boundary of a scientific field? Different ways to find clusters in scientific literature based on connectivity in terms of authorship, citations, language similarity, etc. Ambiguous nature in science
  • 3. Introduction Method Results Summary Ariadne: interactive context explorer Ariadne is an interactive interface which allows users to explore the context of entities such as authors, journals, topical terms, etc. It builds on semantic indexing statistically computed from a large scale bibliographic corpus It was originally implemented to explore 1M topical terms, 3M authors, 35K journals and 700+ Dewey decimal classes associated with 65M articles.
  • 4. Introduction Method Results Summary Research questions Q1: How does the Ariadne algorithm work on a much smaller, field specific dataset? Q2: Can we use Ariadne to label the clusters produce by the different methods? Q3: Can we use Ariadne to compare different clustering solutions?
  • 5. Introduction Method Results Summary LittleAriadne LittleAriadne: context explorer over Astrophysics data Offline: generates a semantic representation for each entity Online: finds the most related entities and using multidimensional scaling to display
  • 6. Introduction Method Results Summary LittleAriadne An example article Article ID ISI:000276828000006 Title On the Mass Transfer Rate in SS Cyg Abstract The mass transfer rate in SS Cyg at quiescence, estimated from the observed luminosity of the hot spot, is log M-tr = 16.8 +/- 0.3. This is safely below the critical mass transfer rates of log M-crit = 18.1 (corresponding to log T-crit(0) = 3.88) or log M-crit = 17.2 (corresponding to the “revised” value of log T-crit(0) = 3.65). The mass transfer rate during outbursts is strongly enhanced Author [author:smak j] ISSN [issn:0001-5237] Subject [subject:accretion, accretion disks] [subject:cataclysmic variables] [subject:disc instability model] [subject:dwarf novae] [subject:novae, cataclysmic variables] [subject:outbursts] [subject:parameters] [subject:stars] [subject:stars dwarf novae] [subject:stars individual ss cyg] [subject:state] [subject: superoutbursts]
  • 7. Introduction Method Results Summary LittleAriadne An example article Article ID ISI:000276828000006 Title On the Mass Transfer Rate in SS Cyg Abstract The mass transfer rate in SS Cyg at quiescence, estimated from the observed luminosity of the hot spot, is log M-tr = 16.8 +/- 0.3. This is safely below the critical mass transfer rates of log M-crit = 18.1 (corresponding to log T-crit(0) = 3.88) or log M-crit = 17.2 (corresponding to the “revised” value of log T-crit(0) = 3.65). The mass transfer rate during outbursts is strongly enhanced Author [author:smak j] ISSN [issn:0001-5237] Subject [subject:accretion, accretion disks] [subject:cataclysmic variables] [subject:disc instability model] [subject:dwarf novae] [subject:novae, cataclysmic variables] [subject:outbursts] [subject:parameters] [subject:stars] [subject:stars dwarf novae] [subject:stars individual ss cyg] [subject:state] [subject: superoutbursts] Cluster label [cluster:a 19] [cluster:b 16] [cluster:c 15] [cluster:d 51] [cluster:e 17] [cluster:f 1]
  • 8. Introduction Method Results Summary LittleAriadne Six different clustering solutions x Source y=#Cluster #Cluster in LittleAriadne a cwts 1.8 23 23 b UMSI 23 23 c oclc 20 20 20 d hu 139 48 e sts 5664 229 f ECOOM 15 15
  • 9. Introduction Method Results Summary LittleAriadne Entities in the Astrophysis dataset There are in total 90,343 entities associated with 111,616 astrophysics articles 59 journals 27,027 author names (no disambiguation applied) 39,577 topical terms 23,322 subjects (extracted from ”Author Keywords” and ”Keywords Plus”) 358 cluster labels (source + cluster id)
  • 10. Introduction Method Results Summary LittleAriadne Build semantic representation Basic assumptions Entities can be represented by its context Entities which share more context are more likely to be related Context is the textual environment where an entity occurs
  • 11. Introduction Method Results Summary LittleAriadne An example article Article ID ISI:000276828000006 Title On the Mass Transfer Rate in SS Cyg Abstract The mass transfer rate in SS Cyg at quiescence, estimated from the observed luminosity of the hot spot, is log M-tr = 16.8 +/- 0.3. This is safely below the critical mass transfer rates of log M-crit = 18.1 (corresponding to log T-crit(0) = 3.88) or log M-crit = 17.2 (corresponding to the “revised” value of log T-crit(0) = 3.65). The mass transfer rate during outbursts is strongly enhanced Author [author:smak j] ISSN [issn:0001-5237] Subject [subject:accretion, accretion disks] [subject:cataclysmic variables] [subject:disc instability model] [subject:dwarf novae] [subject:novae, cataclysmic variables] [subject:outbursts] [subject:parameters] [subject:stars] [subject:stars dwarf novae] [subject:stars individual ss cyg] [subject:state] [subject: superoutbursts] Cluster label [cluster:a 19] [cluster:b 16] [cluster:c 15] [cluster:d 51] [cluster:e 17] [cluster:f 1]
  • 12. Introduction Method Results Summary LittleAriadne Dimension reduction using Random Projection mass transfer rate [subject:outburst] [subject:sstars] [subject:parameters] [author:smak j] [cluster: a19] [issn:0001-5237]
  • 13. Introduction Method Results Summary LittleAriadne Dimension reduction using Random Projection mass transfer rate [subject:outburst] [subject:sstars] [subject:parameters] [author:smak j] [cluster: a19] [issn:0001-5237]
  • 14. Introduction Method Results Summary LittleAriadne From semantic representation to visualisation and more Each entity has its semantic representation Cosine similarity between entities can be computed very fast, based on which the 2D visualisation is implemented
  • 15. Introduction Method Results Summary LittleAriadne From semantic representation to visualisation and more Each entity has its semantic representation Cosine similarity between entities can be computed very fast, based on which the 2D visualisation is implemented For each article, we collected the semantic representation of all the entities in which it involves, and take an average as its semantic representation We applied a standard K-means clustering method to cluster these articles based on their semantic representations
  • 16. Introduction Method Results Summary Experiment 1: Exploring context Experiment 1: Exploring context Now we can explore Let’s start with stars An overview of all journals
  • 17. Introduction Method Results Summary Experiment 1: Exploring context Contextual view of stars
  • 18. Introduction Method Results Summary Experiment 2: Labelling clusters Experiment 2: Labelling clusters What is cluster a 2?
  • 19. Introduction Method Results Summary Experiment 2: Labelling clusters Experiment 2: Labelling clusters
  • 20. Introduction Method Results Summary Experiment 2: Labelling clusters Experiment 2: Labelling clusters Cluster ID Top 9 most related topical terms a 2 ”cosmology” ”dark energy” ”density perturbations” ”cosmologies” ”planck” ”cosmological” ”spatial curvature” ”inflationary” ”inflation” b 2 ”cosmology” ”cosmological constant” ”cosmologies” ”cosmological” ”universes” ”dark energy” ”quadratic” ”tensor” ”planck” c 17 ”power spectrum” ”cosmological parameters” ”cmb” ”last scattering” ”anisotropies” ”microwave background” ”power spectra” ”planck” ”cosmic microwave” d 28 ”density perturbations” ”inflationary” ”inflation” ”dark energy” ”scale invariant” ”spatial curvature” ”cosmological perturbations” ”inflationary models” ”cosmologies”
  • 21. Introduction Method Results Summary Experiment 3: Comparing clustering solutions Experiment 3: Comparing clustering solutions Cluster labels are treated as entities Let’s compare
  • 22. Introduction Method Results Summary Experiment 3: Comparing clustering solutions Highly similar clustering solutions
  • 23. Introduction Method Results Summary Experiment 3: Comparing clustering solutions Partially agreeing clustering solutions
  • 24. Introduction Method Results Summary Experiment 3: Comparing clustering solutions An overview of all clustering solutions
  • 25. Introduction Method Results Summary Summary Summary We present a method and an interface that allows visual exploration through the contexts of entities We can provide the most related topical terms to clusters although expert knowledge is needed to transform them into real labels/topics LittleAriadne provides a visual way of comparing different clustering solutions Our na¨ıve way of clustering is worth exploring further
  • 26. Introduction Method Results Summary Future extensions Future extensions Add more types of entities, such as citations, publishers, conferences, etc, to provide richer context Add direct links to articles to answer information retrieval needs Study context sensitivity compare ”young” and ”young”
  • 27. Introduction Method Results Summary Thank you Thank you http://thoth.pica.nl/astro/relate Rob Koopman (rob.koopman@oclc.org) Shenghui Wang (shenghui.wang@oclc.org) Andrea Scharnhorst (andrea.scharnhorst@dans.knaw.nl)