In Silico and Text-Based Analysis
of Cellular Networks
Lars Juhl Jensen
association networks
guilt by association
protein networks
STRING
2000+ genomes
computational predictions
gene fusion
Korbel et al., Nature Biotechnology, 2004
phylogenetic profiles
Korbel et al., Nature Biotechnology, 2004
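The phylogenetic-profile idea can be illustrated with a minimal sketch: genes that are present and absent in the same genomes are candidates for a functional association. The gene names, genome names, and the choice of Jaccard similarity are assumptions made for illustration and are not taken from the slides or from STRING itself.

```python
def jaccard(a, b):
    """Jaccard similarity between two sets of genomes."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Hypothetical presence profiles: gene -> genomes in which an ortholog is found.
profiles = {
    "geneA": {"g1", "g2", "g5"},
    "geneB": {"g1", "g2", "g5"},
    "geneC": {"g3", "g4"},
}


def profile_similarity(profiles):
    """Score every gene pair by how similar their presence profiles are."""
    genes = sorted(profiles)
    return {
        (x, y): jaccard(profiles[x], profiles[y])
        for i, x in enumerate(genes)
        for y in genes[i + 1:]
    }


scores = profile_similarity(profiles)
```

Pairs with identical profiles (here geneA and geneB) receive the highest score, while pairs that never occur in the same genome score zero.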
experimental data
gene coexpression
physical interactions
Jensen & Bork, Science, 2008
curated knowledge
pathways
Letunic & Bork, Trends in Biochemical Sciences, 2008
many databases
different formats
different identifiers
variable quality
not comparable
hard work
parsers
mapping files
quality scores
von Mering et al., Nucleic Acids Research, 2005
score calibration
gold standard
von Mering et al., Nucleic Acids Research, 2005
common scale
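One simple way to picture score calibration is to bin the raw scores of an evidence type and ask what fraction of pairs in each bin is confirmed by the gold standard. The sketch below does this with a lookup table; the binning approach, the function names, and the toy data are assumptions for illustration, not the procedure described by von Mering et al.

```python
from bisect import bisect_right


def calibrate(scored_pairs, gold_positive, bin_edges):
    """Per raw-score bin, return the fraction of pairs found in the gold standard."""
    hits = [0] * (len(bin_edges) + 1)
    totals = [0] * (len(bin_edges) + 1)
    for pair, raw in scored_pairs.items():
        b = bisect_right(bin_edges, raw)  # index of the bin the raw score falls in
        totals[b] += 1
        if pair in gold_positive:
            hits[b] += 1
    # Empty bins have no estimate, so they are reported as None.
    return [h / t if t else None for h, t in zip(hits, totals)]


def calibrated_score(raw, bin_edges, table):
    """Translate a raw score into the calibrated value of its bin."""
    return table[bisect_right(bin_edges, raw)]


# Toy data: raw scores from one hypothetical evidence source.
scored_pairs = {("a", "b"): 0.9, ("a", "c"): 0.8, ("b", "c"): 0.2, ("c", "d"): 0.1}
gold_positive = {("a", "b")}
bin_edges = [0.5]

table = calibrate(scored_pairs, gold_positive, bin_edges)
```

Because every evidence type is translated into the same kind of quantity (a fraction of confirmed pairs), scores from different sources end up on a common scale and can be compared.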
missing most of the data
>10 km
too much to read
computer
as smart as a dog
teach it specific tricks
named entity recognition
comprehensive lexicon
cyclin dependent kinase 1
CDC2
orthographic variation
spaces and hyphens
cyclin dependent kinase 1
cyclin-dependent kinase 1
prefixes and suffixes
CDC2
hCdc2
“black list”
SDS
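The ingredients listed above (a lexicon, tolerance for spaces and hyphens, an optional prefix, and a black list) can be combined in a small dictionary-based tagger. This is only a sketch: the lexicon entries, the identifier "CDK1" used as a label, and the regular-expression approach are illustrative assumptions and not the implementation behind the resources in this talk.

```python
import re

# Hypothetical lexicon: name -> identifier (the identifiers are placeholders).
LEXICON = {"cyclin dependent kinase 1": "CDK1", "CDC2": "CDK1", "SDS": "SDS"}

# Names that usually do not refer to the protein, e.g. SDS the detergent.
BLACKLIST = {"sds"}


def name_pattern(name):
    """Build a case-insensitive pattern that tolerates orthographic variation."""
    tokens = re.split(r"[ -]+", name)
    # Allow a space, a hyphen, or nothing between the tokens of a name.
    body = r"[ -]?".join(re.escape(t) for t in tokens)
    # Allow an optional "h" prefix, as in hCdc2.
    return re.compile(r"\bh?" + body + r"\b", re.IGNORECASE)


def tag(text):
    """Return (start, end, matched text, identifier) for each recognized name."""
    found = []
    for name, ident in LEXICON.items():
        if name.lower() in BLACKLIST:
            continue  # skip names on the black list
        for m in name_pattern(name).finditer(text):
            found.append((m.start(), m.end(), m.group(0), ident))
    return sorted(found)


hits = tag("hCdc2, also called cyclin-dependent kinase 1, was run on an SDS gel.")
```

In this example both spellings of the kinase are mapped to the same identifier, while the black-listed name is ignored.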
co-mentioning
within documents
within paragraphs
within sentences
quality score
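Co-mentioning at the three levels named above can be turned into a score by counting pairs of tagged entities and giving more weight to closer co-mentions. The weights, the function names, and the nested-list corpus format below are assumptions chosen for illustration; the slides do not specify the actual scoring scheme.

```python
from collections import Counter
from itertools import combinations

# Hypothetical weights: a closer co-mention contributes more to the score.
W_DOC, W_PAR, W_SENT = 1.0, 2.0, 4.0


def pairs(entities):
    """All unordered pairs of entities, as sorted tuples."""
    return set(combinations(sorted(entities), 2))


def comention_scores(corpus):
    """corpus: list of documents; document: list of paragraphs;
    paragraph: list of sentences; sentence: set of tagged entity identifiers."""
    scores = Counter()
    for doc in corpus:
        doc_ents, par_pairs, sent_pairs = set(), set(), set()
        for paragraph in doc:
            par_ents = set()
            for sentence in paragraph:
                sent_pairs |= pairs(sentence)
                par_ents |= sentence
            par_pairs |= pairs(par_ents)
            doc_ents |= par_ents
        for p in pairs(doc_ents):
            # Each level counts at most once per document.
            scores[p] += W_DOC + W_PAR * (p in par_pairs) + W_SENT * (p in sent_pairs)
    return scores


corpus = [[[{"A", "B"}, {"C"}], [{"D"}]]]
scores = comention_scores(corpus)
```

Here A and B share a sentence, A and C share only a paragraph, and A and D share only the document, so their scores decrease in that order.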
protein networks
Szklarczyk et al., Nucleic Acids Research, 2015
string-db.org
general approach
chemical networks
Kuhn et al., Nucleic Acids Research, 2014
stitch-db.org
space
subcellular localization
Binder et al., Database, 2014
compartments.jensenlab.org
tissue expression
Santos et al., submitted, 2015
tissues.jensenlab.org
time
cell-cycle expression
Santos et al., Nucleic Acids Research, 2015
cyclebase.org
disease associations
Frankild et al., Methods, 2015
diseases.jensenlab.org
Acknowledgments
Molecular networks
Michael Kuhn
Damian Szklarczyk
Andrea Franceschini
Milan Simonovic
Alexander Roth
Sune Pletscher-Frankild
Jianyi Lin
Pablo Minguez
Christian von Mering
Peer Bork
Time and space
Alberto Santos
Sune Pletscher-Frankild
Janos Binder
Kalliopi Tsafou
Christian Stolte
Albert Palleja
Heiko Horn
Rasmus Wernersson
Reinhard Schneider
Sean O’Donoghue
