SlideShare a Scribd company logo
Making gene networks through
data integration
Lars Juhl Jensen
association networks
guilt by association
Making gene networks through data integration
molecular networks
proteins
string-db.org
small molecules
stitch-db.org
non-coding RNAs
compartments
compartments.jensenlab.org
tissues
tissues.jensenlab.org
diseases
data integration
computational predictions
gene neighborhood
Korbel et al., Nature Biotechnology, 2004
TargetScan
experimental data
gene expression
Making gene networks through data integration
protein interactions
Jensen & Bork, Science, 2008
miRTarBase
curated knowledge
metabolic pathways
Letunic & Bork, Trends in Biochemical Sciences, 2008
signaling pathways
many databases
different formats
different identifiers
variable quality
not comparable
hard work
(Ph.D. students)
common identifiers
quality scores
von Mering et al., Nucleic Acids Research, 2005
score calibration
von Mering et al., Nucleic Acids Research, 2005
homology-based transfer
Franceschini et al., Nucleic Acids Research, 2013
missing most of the data
text mining
>10 km
too much to read
computer
as smart as a dog
teach it specific tricks
Making gene networks through data integration
Making gene networks through data integration
named entity recognition
comprehensive lexicon
let-7a-3p
let-7a*
flexible matching
let-7a
let7a
name expansions
let-7a
miR-let-7a
“black list”
SDS
co-mentioning
counting
within documents
within paragraphs
within sentences
Making gene networks through data integration
Making gene networks through data integration
high recall
high precision
fuzzy associations
NLP
Natural Language Processing
Gene and protein names
Cue words for entity
recognition
Verbs for relation extraction
[nxexpr The expression of
[nxgene the cytochrome
genes
[nxpg CYC1 and CYC7]]]
is controlled by
[nxpg HAP1]
extract stated facts
high precision
poor recall
Jensen et al., Nature Reviews Genetics, 2006
questions?

More Related Content

PPT
Gene association networks: Large-scale integration of data and text
PPT
Gene association networks - Large-scale integration of data and text
PPT
Network biology - Large-scale integration of data and text
PPT
Data integration with STRING
PPT
Gene association networks: Large-scale integration of data and text
PPT
Gene association networks: Large-scale integration of data and text
PPT
Introduction to STRING
PPT
STRING - Protein networks from data and text mining
Gene association networks: Large-scale integration of data and text
Gene association networks - Large-scale integration of data and text
Network biology - Large-scale integration of data and text
Data integration with STRING
Gene association networks: Large-scale integration of data and text
Gene association networks: Large-scale integration of data and text
Introduction to STRING
STRING - Protein networks from data and text mining

What's hot (20)

PPT
Network biology: Large-scale data and text mining
PPT
Gene association networks - Large-scale integration of data and text
PPT
Protein association networks: Large-scale integration of data and text
PPT
STRING - Large-scale integration of data and text
PPT
STRING: Protein networks from data and text mining
PPT
Gene association networks - Large-scale integration of data and text
PPT
Gene association networks - Large-scale integration of data and text
PPT
Advanced bioinformatics of proteomics datasets
PPT
Biomarker bioinformatics: Network-based candidate prioritization
PPT
Large-scale integration of data and text
KEY
STRING/STITCH tutorial
PPT
Network Biology: Large-scale integration of data and text
PPT
In silico and Text-Based Analysis of Cellular Networks
PPT
STRING & STITCH : Network integration of heterogeneous data
PPT
Gene Association Networks: Large-scale integration of data and text
PPT
Network Biology: A crash course on STRING and Cytoscape
PPT
Network biology: Large-scale data and text mining
PPT
Networks of proteins and diseases
PPT
STRING & related databases: Large-scale integration of heterogeneous data
PPT
STRING: Large-scale data and text mining
Network biology: Large-scale data and text mining
Gene association networks - Large-scale integration of data and text
Protein association networks: Large-scale integration of data and text
STRING - Large-scale integration of data and text
STRING: Protein networks from data and text mining
Gene association networks - Large-scale integration of data and text
Gene association networks - Large-scale integration of data and text
Advanced bioinformatics of proteomics datasets
Biomarker bioinformatics: Network-based candidate prioritization
Large-scale integration of data and text
STRING/STITCH tutorial
Network Biology: Large-scale integration of data and text
In silico and Text-Based Analysis of Cellular Networks
STRING & STITCH : Network integration of heterogeneous data
Gene Association Networks: Large-scale integration of data and text
Network Biology: A crash course on STRING and Cytoscape
Network biology: Large-scale data and text mining
Networks of proteins and diseases
STRING & related databases: Large-scale integration of heterogeneous data
STRING: Large-scale data and text mining
Ad

Viewers also liked (10)

PDF
12th PenCHORD Seminar, Showcase and Workshop Event
PPTX
Digestion protein, absorption amino acid and amino acid pool
PPT
Electron Transport Chain
PPTX
GLYCOGENESIS
PPTX
Protein digestion
PPTX
Electron transport chain and Oxidative phosphorylation
PPTX
Human genome project
PPT
TCA cycle- steps, regulation and significance
PPTX
Glycolysis- An over view
PPTX
Electron Transport Chain ETC
12th PenCHORD Seminar, Showcase and Workshop Event
Digestion protein, absorption amino acid and amino acid pool
Electron Transport Chain
GLYCOGENESIS
Protein digestion
Electron transport chain and Oxidative phosphorylation
Human genome project
TCA cycle- steps, regulation and significance
Glycolysis- An over view
Electron Transport Chain ETC
Ad

Similar to Making gene networks through data integration (15)

PPT
Cellular network biology: Proteome-wide analysis of heterogeneous data
PPT
Unraveling cellular phosphorylation networks using computational biology
PPT
Network biology: Large-scale data integration and text mining
PPT
Network biology
PPT
Network biology: Large-scale data integration and text mining
PPT
Networks of proteins and diseases
PPT
Integration of heterogeneous data
PPT
Network biology
PPT
Networks of proteins and diseases
PPT
The STRING database and related tools
PPT
Large-scale integration of data and text
PPT
Data Integration and Systems Biology
PPT
Turning big data and text collections into web resrouces
PPT
Protein interaction networks
PPT
Unraveling signal transduction networks through data integration
Cellular network biology: Proteome-wide analysis of heterogeneous data
Unraveling cellular phosphorylation networks using computational biology
Network biology: Large-scale data integration and text mining
Network biology
Network biology: Large-scale data integration and text mining
Networks of proteins and diseases
Integration of heterogeneous data
Network biology
Networks of proteins and diseases
The STRING database and related tools
Large-scale integration of data and text
Data Integration and Systems Biology
Turning big data and text collections into web resrouces
Protein interaction networks
Unraveling signal transduction networks through data integration

More from Lars Juhl Jensen (20)

PPT
One tagger, many uses: Illustrating the power of dictionary-based named entit...
PPT
One tagger, many uses: Simple text-mining strategies for biomedicine
PPT
Extract 2.0: Text-mining-assisted interactive annotation
PPT
Network visualization: A crash course on using Cytoscape
PPT
Biomedical text mining: Automatic processing of unstructured text
PPT
Medical network analysis: Linking diseases and genes through data and text mi...
PPT
Cellular networks
PPT
Cellular Network Biology: Large-scale integration of data and text
PPT
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
PPT
Tagger: Rapid dictionary-based named entity recognition
PPT
Medical text mining: Linking diseases, drugs, and adverse reactions
PPT
Network biology: Large-scale integration of data and text
PPT
Medical data and text mining: Linking diseases, drugs, and adverse reactions
PPT
Cellular Network Biology
PPT
Network biology: Large-scale integration of data and text
PPT
The Art of Counting: Scoring and ranking co-occurrences in literature
PPT
Text-mining-based retrieval of protein networks
PPT
Medical data and text mining: Linking diseases, drugs, and adverse reactions
PPT
Medical data and text mining: Linking diseases, drugs, and adverse reactions
PPT
Medical data and text mining: Linking diseases, drugs, and adverse reactions
One tagger, many uses: Illustrating the power of dictionary-based named entit...
One tagger, many uses: Simple text-mining strategies for biomedicine
Extract 2.0: Text-mining-assisted interactive annotation
Network visualization: A crash course on using Cytoscape
Biomedical text mining: Automatic processing of unstructured text
Medical network analysis: Linking diseases and genes through data and text mi...
Cellular networks
Cellular Network Biology: Large-scale integration of data and text
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
Tagger: Rapid dictionary-based named entity recognition
Medical text mining: Linking diseases, drugs, and adverse reactions
Network biology: Large-scale integration of data and text
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Cellular Network Biology
Network biology: Large-scale integration of data and text
The Art of Counting: Scoring and ranking co-occurrences in literature
Text-mining-based retrieval of protein networks
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Medical data and text mining: Linking diseases, drugs, and adverse reactions

Recently uploaded (20)

PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PPTX
famous lake in india and its disturibution and importance
PPTX
Comparative Structure of Integument in Vertebrates.pptx
DOCX
Viruses (History, structure and composition, classification, Bacteriophage Re...
PDF
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PPTX
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
PPTX
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
PPTX
Cell Membrane: Structure, Composition & Functions
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PDF
Placing the Near-Earth Object Impact Probability in Context
PDF
Phytochemical Investigation of Miliusa longipes.pdf
PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
PPTX
Derivatives of integument scales, beaks, horns,.pptx
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PDF
AlphaEarth Foundations and the Satellite Embedding dataset
PPTX
Introduction to Fisheries Biotechnology_Lesson 1.pptx
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
ECG_Course_Presentation د.محمد صقران ppt
famous lake in india and its disturibution and importance
Comparative Structure of Integument in Vertebrates.pptx
Viruses (History, structure and composition, classification, Bacteriophage Re...
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
Cell Membrane: Structure, Composition & Functions
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
Placing the Near-Earth Object Impact Probability in Context
Phytochemical Investigation of Miliusa longipes.pdf
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
Derivatives of integument scales, beaks, horns,.pptx
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
AlphaEarth Foundations and the Satellite Embedding dataset
Introduction to Fisheries Biotechnology_Lesson 1.pptx
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...

Making gene networks through data integration