The STRING database
Lars Juhl Jensen, EMBL Heidelberg
data integration
 
functional interactions
 
179 proteomes
Ensembl
SWISS-PROT
genomic context methods
phylogenetic profiles
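A phylogenetic profile can be read as a presence/absence vector for one gene across the genomes considered; genes with similar profiles are candidates for a functional link. The sketch below is only an illustration: the Jaccard measure, the function name `profile_similarity`, and the toy profiles are assumptions, not the scoring the slides describe.

```python
from typing import Sequence


def profile_similarity(a: Sequence[int], b: Sequence[int]) -> float:
    """Jaccard similarity between two presence/absence profiles.

    Each profile holds one entry per genome: 1 if the gene is present, 0 if absent.
    The choice of Jaccard is an assumption made for this sketch.
    """
    if len(a) != len(b):
        raise ValueError("profiles must cover the same genomes")
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return both / either if either else 0.0


# Hypothetical profiles over five genomes.
profiles = {
    "geneA": [1, 1, 0, 0, 1],
    "geneB": [1, 1, 0, 0, 1],
    "geneC": [0, 0, 1, 1, 0],
}
same = profile_similarity(profiles["geneA"], profiles["geneB"])       # identical profiles
disjoint = profile_similarity(profiles["geneA"], profiles["geneC"])   # never co-occur
```

Identical profiles give 1.0 and profiles that never co-occur give 0.0; real data falls in between, which is why a calibrated score is needed.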
 
 
 
 
[Figure labels: Cell, Cellulosomes, Cellulose]
gene fusion
 
gene neighborhood
 
questionable reliability
raw quality scores
gene neighborhood
sum of intergenic distances
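One way to read "sum of intergenic distances": for two genes in the same run on a chromosome, add up the gaps between consecutive genes lying between them, so that a smaller sum means closer neighbors. A minimal sketch, assuming genes are given as hypothetical (name, start, end) tuples on one strand and that overlapping genes contribute a gap of zero; the slides do not say how strands, overlaps, or multiple genomes are handled.

```python
from typing import List, Tuple

Gene = Tuple[str, int, int]  # (name, start, end); a hypothetical layout for this sketch


def sum_intergenic_distance(genes: List[Gene], a: str, b: str) -> int:
    """Sum the gaps between consecutive genes on the path from gene a to gene b."""
    ordered = sorted(genes, key=lambda g: g[1])
    names = [g[0] for g in ordered]
    i, j = sorted((names.index(a), names.index(b)))
    total = 0
    for left, right in zip(ordered[i:j], ordered[i + 1:j + 1]):
        # Overlapping neighbors are treated as having no gap (assumption).
        total += max(0, right[1] - left[2])
    return total


# Toy gene run; coordinates are invented.
run = [("g1", 100, 400), ("g2", 450, 900), ("g3", 880, 1500), ("g4", 1700, 2000)]
near = sum_intergenic_distance(run, "g1", "g3")  # gaps: 50 + 0
far = sum_intergenic_distance(run, "g1", "g4")   # gaps: 50 + 0 + 200
```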
 
many types of evidence
raw quality scores
not directly comparable
benchmarking
calibrate against KEGG
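Benchmarking can be sketched as follows: sort gene pairs by raw score, split them into bins, and record for each bin the fraction of pairs whose two genes share a KEGG pathway. The result is a curve that maps a raw score to an estimated reliability, which makes different evidence types comparable. The binning scheme, the function name `calibration_curve`, and the toy data are assumptions for illustration.

```python
from typing import List, Tuple


def calibration_curve(
    pairs: List[Tuple[float, bool]], n_bins: int
) -> List[Tuple[float, float]]:
    """Return (mean raw score, fraction in same pathway) for each bin.

    pairs holds (raw_score, same_pathway) for gene pairs where both genes
    have a pathway annotation. Bins have equal size; if the number of pairs
    is not divisible by n_bins, a smaller extra bin holds the remainder.
    """
    ordered = sorted(pairs, key=lambda p: p[0])
    size = max(1, len(ordered) // n_bins)
    curve = []
    for k in range(0, len(ordered), size):
        chunk = ordered[k:k + size]
        mean_score = sum(score for score, _ in chunk) / len(chunk)
        fraction = sum(1 for _, same in chunk if same) / len(chunk)
        curve.append((mean_score, fraction))
    return curve


# Invented example: raw score is a summed intergenic distance, so low is good.
toy_pairs = [
    (10, True), (20, True), (30, True), (40, False),
    (50, False), (60, True), (70, False), (80, False),
]
curve = calibration_curve(toy_pairs, n_bins=2)
```

In practice one would fit a smooth function through such points so that any raw score can be converted, but the fitting step is left out here.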
 
curated knowledge
KEGG (Kyoto Encyclopedia of Genes and Genomes)
Reactome
MIPS (Munich Information center for Protein Sequences)
STKE (Signal Transduction Knowledge Environment)
primary experimental data
many sources
parsers
co-expression
GEO (Gene Expression Omnibus)
SMD (Stanford Microarray Database)
physical protein interactions
BIND (Biomolecular Interaction Network Database)
MINT (Molecular Interactions Database)
GRID (General Repository for Interaction Datasets)
DIP (Database of Interacting Proteins)
HPRD (Human Protein Reference Database)
literature mining
different gene identifiers
synonyms lists
MEDLINE
SGD (Saccharomyces Genome Database)
The Interactive Fly
OMIM (Online Mendelian Inheritance in Man)
co-mentioning
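Co-mentioning can be illustrated by counting how often two genes are named in the same abstract, after mapping synonyms to one identifier. This is a rough sketch only: the tokenizer, the function name `comention_counts`, and the example abstracts and synonym lists are assumptions, and turning counts into a reliability score is not shown.

```python
import re
from collections import Counter
from itertools import combinations
from typing import Dict, Iterable, Set, Tuple


def comention_counts(
    abstracts: Iterable[str], synonyms: Dict[str, Set[str]]
) -> Counter:
    """Count abstracts in which each pair of genes is mentioned together."""
    # Map every synonym (lowercased) to its canonical gene identifier.
    lookup = {
        name.lower(): gene for gene, names in synonyms.items() for name in names
    }
    counts: Counter = Counter()
    for text in abstracts:
        tokens = re.findall(r"[A-Za-z0-9]+", text.lower())
        found = {lookup[t] for t in tokens if t in lookup}
        for pair in combinations(sorted(found), 2):
            counts[pair] += 1
    return counts


# Invented abstracts and synonym lists.
synonym_lists = {
    "HAP1": {"HAP1", "Hap1p"},
    "CYC1": {"CYC1"},
    "CYC7": {"CYC7"},
    "GAL4": {"GAL4"},
}
texts = [
    "HAP1 controls expression of CYC1 and CYC7.",
    "CYC1 is regulated by Hap1p.",
    "GAL4 activates transcription.",
]
counts = comention_counts(texts, synonym_lists)
```

Matching single tokens against a synonym list misses multi-word names and ambiguous symbols, which is part of why identifier mapping is listed as a problem in its own right.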
NLP (Natural Language Processing)
Gene and protein names
Cue words for entity recognition
Verbs for relation extraction
[nxgene The GAL4 gene]
[nxexpr The expression of [nxgene the cytochrome genes [nxpg CYC1 and CYC7]]] is controlled by [nxpg HAP1]
 
combine all evidence
spread over many species
transfer by orthology
 
orthologous groups
 
fuzzy orthology
[Figure labels: Source species, Target species, ?]
Bayesian scoring scheme
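The slides do not give the formula behind the scoring scheme. One common way to combine calibrated scores from independent evidence types, assumed here purely for illustration, is to treat each score as a probability and compute the chance that at least one piece of evidence is right: 1 minus the product of (1 - score). The function name `combine_scores` and the example values are hypothetical.

```python
from typing import Iterable


def combine_scores(scores: Iterable[float]) -> float:
    """Combine per-evidence probabilities, assuming they are independent."""
    p_all_wrong = 1.0
    for s in scores:
        if not 0.0 <= s <= 1.0:
            raise ValueError("scores must be probabilities in [0, 1]")
        p_all_wrong *= 1.0 - s
    return 1.0 - p_all_wrong


# Invented scores for one protein pair: neighborhood, co-expression, text mining.
combined = combine_scores([0.6, 0.7, 0.2])
```

Under the independence assumption the combined score is never lower than the best single score, so several weak hints can add up to a confident link.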
 
Acknowledgments
The STRING team (EMBL): Christian von Mering, Berend Snel, Martijn Huynen, Sean Hooper, Samuel Chaffron, Julien Lagarde, Mathilde Foglierini, Peer Bork
Literature mining project (EML Research): Jasmin Saric, Rossitza Ouzounova, Isabel Rojas
