Mining biomedical texts Lars Juhl Jensen >10 km
exponential growth
 
 
some things are constant
 
~45 seconds per paper
information retrieval
find the relevant texts
still too much to read
computer
as smart as a dog
teach it specific tricks
 
 
named entity recognition
identify the concepts
comprehensive lexicon
small molecules
proteins
cellular components
organisms
diseases
orthographic variation
“ black list”
Reflect.ws
augmented browsing
browser add-on
Pafilis, O’Donoghue, Jensen et al.,  Nature Biotechnology , 2009 O’Donoghue et al.,  Journal of Web Semantics , 2010
Firefox
Internet Explorer
Google Chrome
Safari
Utopia Documents
web services
~150 years of publishing
 
dead wood
 
dead e-wood
added value
collaboration
 
 
SciVerse application
 
 
 
 
 
STITCH
Kuhn et al.,  Nucleic Acids Research , 2010
curated knowledge
drug targets
pathways
Letunic & Bork,  Trends in Biochemical Sciences , 2008
experimental data
physical interactions
Jensen & Bork,  Science , 2008
text mining
co-mentioning
 
NLP Natural Language Processing
 
abstracts
full text
restricted access
 
collaboration
electronic patient journals
a hard problem
in Danish
no lexicon
by busy doctors
acronyms
typos
about psychiatric patients
delusions
domain specific system
F20 F200 Negation Family
diagnoses
patient stratification
Roque et al.,  PLoS Computational Biology , 2011
disease comorbidity
Roque et al.,  PLoS Computational Biology , 2011
medication
adverse drug events
pharmacovigilance
phenotype
genotype
Thank you! Reflect.ws Sune Frankild Heiko Horn Evangelos Pafilis Michael Kuhn Reinhardt Schneider Sean O’Donoghue SciVerse app Juan-Carlos Silla-Castro Sean O’Donoghue EPJ-mining Francisco S Roque Peter B Jensen Robert Eriksson Henriette Schmock Marlene Dalgaard Massimo Andreatta Thomas Hansen Karen Søeby Søren Bredkjær Anders Juul Thomas Werge Søren Brunak
larsjuhljensen

More Related Content

PPT
Mining literature and medical records
PPT
Networks of proteins and diseases
PPT
Biomedical text mining
PPT
Data and Text Mining
PPT
Biomedical text mining and network analysis
PPT
Text and data mining
PPT
Network integration of data and text
PPT
Large-scale data and text mining
Mining literature and medical records
Networks of proteins and diseases
Biomedical text mining
Data and Text Mining
Biomedical text mining and network analysis
Text and data mining
Network integration of data and text
Large-scale data and text mining

Similar to Mining biomedical texts (20)

PPT
The researcher perspective, Jean-Fred Fontaine, MDC Berlin
PPT
Mining text and data on chemicals
PPTX
ContentMine: Mining the Scientific Literature
PPT
Medical data and text mining - Linking diseases, drugs, and adverse reactions
PPT
Mining and communicating biomedical knowledge
PPT
Network biology - A basis for large-scale biomedica data mining
PPTX
Biovision2017 Accessing the scientific literature
PPT
Computational Biology - Signaling networks and drug repositioning
PPT
Network biology: Large-scale biomedical data and text mining
PPT
Large-scale integration of data and text
PPT
Network biology
PDF
Deep learning for biomedical discovery and data mining I
PPTX
Big Data and ContentMining for Libraries
PPT
Turning big data and text collections into web resrouces
PPT
The pragmatic text miner: It’s just another type of poorly standardized data
PPTX
2016 davis-biotech
PDF
Semantic Web for 360-degree Health: State-of-the-Art & Vision for Better Inte...
PPTX
ContentMining for France and Europe; Lessons from 2 years in UK
PDF
Biomedical Literature Mining 1st Edition Vinod D Kumar Hannah Jane Tipney Eds
PDF
MOLIERE: Automatic Biomedical Hypothesis Generation System
The researcher perspective, Jean-Fred Fontaine, MDC Berlin
Mining text and data on chemicals
ContentMine: Mining the Scientific Literature
Medical data and text mining - Linking diseases, drugs, and adverse reactions
Mining and communicating biomedical knowledge
Network biology - A basis for large-scale biomedica data mining
Biovision2017 Accessing the scientific literature
Computational Biology - Signaling networks and drug repositioning
Network biology: Large-scale biomedical data and text mining
Large-scale integration of data and text
Network biology
Deep learning for biomedical discovery and data mining I
Big Data and ContentMining for Libraries
Turning big data and text collections into web resrouces
The pragmatic text miner: It’s just another type of poorly standardized data
2016 davis-biotech
Semantic Web for 360-degree Health: State-of-the-Art & Vision for Better Inte...
ContentMining for France and Europe; Lessons from 2 years in UK
Biomedical Literature Mining 1st Edition Vinod D Kumar Hannah Jane Tipney Eds
MOLIERE: Automatic Biomedical Hypothesis Generation System
Ad

More from Lars Juhl Jensen (20)

PPT
One tagger, many uses: Illustrating the power of dictionary-based named entit...
PPT
One tagger, many uses: Simple text-mining strategies for biomedicine
PPT
Extract 2.0: Text-mining-assisted interactive annotation
PPT
Network visualization: A crash course on using Cytoscape
PPT
STRING & STITCH : Network integration of heterogeneous data
PPT
Biomedical text mining: Automatic processing of unstructured text
PPT
Medical network analysis: Linking diseases and genes through data and text mi...
PPT
Network Biology: A crash course on STRING and Cytoscape
PPT
Cellular networks
PPT
Cellular Network Biology: Large-scale integration of data and text
PPT
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
PPT
STRING & related databases: Large-scale integration of heterogeneous data
PPT
Tagger: Rapid dictionary-based named entity recognition
PPT
Network Biology: Large-scale integration of data and text
PPT
Medical text mining: Linking diseases, drugs, and adverse reactions
PPT
Network biology: Large-scale integration of data and text
PPT
Medical data and text mining: Linking diseases, drugs, and adverse reactions
PPT
Cellular Network Biology
PPT
Network biology: Large-scale integration of data and text
PPT
Biomarker bioinformatics: Network-based candidate prioritization
One tagger, many uses: Illustrating the power of dictionary-based named entit...
One tagger, many uses: Simple text-mining strategies for biomedicine
Extract 2.0: Text-mining-assisted interactive annotation
Network visualization: A crash course on using Cytoscape
STRING & STITCH : Network integration of heterogeneous data
Biomedical text mining: Automatic processing of unstructured text
Medical network analysis: Linking diseases and genes through data and text mi...
Network Biology: A crash course on STRING and Cytoscape
Cellular networks
Cellular Network Biology: Large-scale integration of data and text
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
STRING & related databases: Large-scale integration of heterogeneous data
Tagger: Rapid dictionary-based named entity recognition
Network Biology: Large-scale integration of data and text
Medical text mining: Linking diseases, drugs, and adverse reactions
Network biology: Large-scale integration of data and text
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Cellular Network Biology
Network biology: Large-scale integration of data and text
Biomarker bioinformatics: Network-based candidate prioritization
Ad

Recently uploaded (20)

PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
August Patch Tuesday
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PDF
A review of recent deep learning applications in wood surface defect identifi...
PDF
A comparative study of natural language inference in Swahili using monolingua...
PPTX
Web Crawler for Trend Tracking Gen Z Insights.pptx
PDF
Zenith AI: Advanced Artificial Intelligence
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
WOOl fibre morphology and structure.pdf for textiles
PPTX
Tartificialntelligence_presentation.pptx
PPTX
Benefits of Physical activity for teenagers.pptx
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Architecture types and enterprise applications.pdf
PDF
Getting Started with Data Integration: FME Form 101
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PPT
What is a Computer? Input Devices /output devices
PPTX
Chapter 5: Probability Theory and Statistics
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
August Patch Tuesday
NewMind AI Weekly Chronicles – August ’25 Week III
A review of recent deep learning applications in wood surface defect identifi...
A comparative study of natural language inference in Swahili using monolingua...
Web Crawler for Trend Tracking Gen Z Insights.pptx
Zenith AI: Advanced Artificial Intelligence
Final SEM Unit 1 for mit wpu at pune .pptx
Hindi spoken digit analysis for native and non-native speakers
WOOl fibre morphology and structure.pdf for textiles
Tartificialntelligence_presentation.pptx
Benefits of Physical activity for teenagers.pptx
Assigned Numbers - 2025 - Bluetooth® Document
Architecture types and enterprise applications.pdf
Getting Started with Data Integration: FME Form 101
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
What is a Computer? Input Devices /output devices
Chapter 5: Probability Theory and Statistics
Enhancing emotion recognition model for a student engagement use case through...
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor

Mining biomedical texts