SlideShare a Scribd company logo
Mining large-scale data sets on the eukaryotic cell cycle Lars Juhl Jensen EMBL Heidelberg
the cell cycle
grow and divide
one cell
two cells
four phases
G 1  phase
growth
S phase
DNA replication
G 2  phase
growth
M phase
cell division
 
regulation
gene expression
phosphorylation
targeted degradation
protein interactions
molecular biology
one gene
one postdoc
many types of data
a single gene
high-throughput biology
one lab
one technology
all the relevant genes
a single type of data
systems biology
many types of data
all the relevant genes
data integration
data mining
expression data
cell cultures
 
synchronization
microarrays
 
time courses
 
expression profiles
 
list of genes
periodically expressed
peak times
S. cerevisiae
expression data
Cho et al.
Spellman et al.
computational methods
Zhao et al.
Langmead et al.
Johansson et al.
Wichert et al.
Luan and Li
Lu et al.
Ahdesm äki et al.
Willbrand et al.
Chen et al.
Qiu et al.
Ahnert et al.
Andersson et al.
no benchmarking
reanalysis
benchmarking
 
no progress
no benchmarking
 
S. pombe
Rustici et al.
Peng et al.
Oliva et al.
no benchmarking
no integration
reanalysis
integration
benchmarking
 
no progress
no benchmarking
no integration
 
H. sapiens
Whitfield et al.
reanalysis
benchmarking
 
A. thaliana
Menges et al.
reanalysis
benchmarking
 
four organisms
list of genes
periodically expressed
peak times
protein interactions
S. cerevisiae
yeast two-hybrid
Uetz et al.
Ito et al.
complex pull-down
Gavin et al.
Ho et al.
 
30–50% false positives
topology-based scoring
yeast two-hybrid
-log((N 1 +1) · (N 2 +1))
complex pull-down
log[(N 12 · N)/((N 1 +1) · (N 2 +1))]
calibrate against KEGG
 
quality threshold
subcellular localization
 
expression data
temporal network
 
benchmarking
 
 
 
30–50% false positives
 
3–5% false positives
detailed function prediction
uncharacterized proteins
who
whom
when
global statements
dynamic and static
 
 
CDK–cyclin complexes
 
consistent timing
 
 
pre-replication complex
 
just-in-time assembly
dynamic and static
partial protein complexes
last missing subunits
phosphorylation
Übersax et al.
 
27% of dynamic proteins
8% of static proteins
targeted degradation
PEST regions
 
44% of dynamic proteins
29% of static proteins
data mining
undescribed link
transcriptional regulation
post-translational regulation
 
how can we test this?
cross-species comparison
evolutionary conservation
orthology detection
sequence similarity
 
not conserved
individual genes
just-in-time assembly
protein complexes
peak times
not comparable
time warping
 
same color = same phase
DNA replication
DNA polymerases
 
deoxynucleotide synthesis
 
phosphorylation
Übersax et al.
Loog et al.
Phospho.ELM
NetPhosK
correlation
 
 
cell cycle vs. non-cell cycle
co-evolution
 
 
transcriptional regulation
post-translational regulation
co-evolution
summary
reanalysis
integration
high-throughput data
biological discoveries
challenge
data mining
do this automatically
beware of the noise
benchmark!
Acknowledgments Thomas Skøt Jensen Ulrik de Lichtenberg Søren Brunak Peer Bork

More Related Content

PPTX
Bioc4700 2014 Guest Lecture
PPTX
Transposable elements
PDF
Comparative Genomics and Visualisation - Part 1
PDF
Protein Evolution: Structure, Function, and Human Health
PPTX
Production of transgenic farm animals
PPT
Gene order
PPTX
Horizontal gene transfer
PPTX
Human genetics evolutionary genetics
Bioc4700 2014 Guest Lecture
Transposable elements
Comparative Genomics and Visualisation - Part 1
Protein Evolution: Structure, Function, and Human Health
Production of transgenic farm animals
Gene order
Horizontal gene transfer
Human genetics evolutionary genetics

What's hot (20)

PPTX
Molecular evolution
PPTX
TRANSGENIC TECHNIQUES AND GENE THERAPY
PPTX
Presentation1population neutral theory
PPT
Plang functional genome
PPTX
Bacterial conjugation and its application
PPTX
Genetic engineering in animal cells
PDF
Transgenenics animals
PPTX
Microbial Genetics: Transformation, Transduction, Conjugation, Plasmids, Tran...
PPTX
Conjugation
PDF
Genetic engineering in animal
PPTX
Horizontal gene transfer
PDF
Molecular clock, Neutral hypothesis
PPTX
TRANSGENIC MICE
PPT
KnockOut mouse technology By Bikash karki
PPTX
Trends in evolution by faunafondness
PPTX
Transgenic technology
PPTX
Mutation
PPTX
Comparative genomics presentation
PPT
Comparative genomics
Molecular evolution
TRANSGENIC TECHNIQUES AND GENE THERAPY
Presentation1population neutral theory
Plang functional genome
Bacterial conjugation and its application
Genetic engineering in animal cells
Transgenenics animals
Microbial Genetics: Transformation, Transduction, Conjugation, Plasmids, Tran...
Conjugation
Genetic engineering in animal
Horizontal gene transfer
Molecular clock, Neutral hypothesis
TRANSGENIC MICE
KnockOut mouse technology By Bikash karki
Trends in evolution by faunafondness
Transgenic technology
Mutation
Comparative genomics presentation
Comparative genomics
Ad

Similar to Mining large-scale data sets on the eukaryotic cell cycle (20)

PPT
Just-in-time assembly - the evolution of transcriptional and post-translation...
PPT
Just-in-time assembly - Co-evolution of transcriptional and post-translationa...
PPT
Just-in-time assembly - Co-evolution of transcriptional and post-translationa...
PPT
Protein networks as a scaffold for structuring other data
PPT
Just-in-time assembly - Co-evolution of transcriptional and post-translationa...
PPT
Mining heterogeneous data: Understanding systems at the level of complexes an...
PPT
Integration of diverse large-scale datasets
PPT
Systems biology - Understanding biology at the systems level
PPT
Mining heterogeneous data: Understanding systems at the level of complexes an...
PPT
Literature mining and large-scale data integration
PPT
Cross-species data integration
PPT
Computational approaches to cell cycle analysis: Current research topics (tho...
PPT
Protein interaction networks from yeast to human
PDF
Resolving transcriptional dynamics of the epithelial-mesenchymal transition u...
PPT
Network integration of heterogeneous data
PDF
Grindberg - PNAS
PPTX
scRNA-Seq Lecture - Stem Cell Network RNA-Seq Workshop 2017
PPT
Integration of biomedical literature and databases
PDF
Shamilova nn 2013
PPT
Integration of biomedical literature and databases
Just-in-time assembly - the evolution of transcriptional and post-translation...
Just-in-time assembly - Co-evolution of transcriptional and post-translationa...
Just-in-time assembly - Co-evolution of transcriptional and post-translationa...
Protein networks as a scaffold for structuring other data
Just-in-time assembly - Co-evolution of transcriptional and post-translationa...
Mining heterogeneous data: Understanding systems at the level of complexes an...
Integration of diverse large-scale datasets
Systems biology - Understanding biology at the systems level
Mining heterogeneous data: Understanding systems at the level of complexes an...
Literature mining and large-scale data integration
Cross-species data integration
Computational approaches to cell cycle analysis: Current research topics (tho...
Protein interaction networks from yeast to human
Resolving transcriptional dynamics of the epithelial-mesenchymal transition u...
Network integration of heterogeneous data
Grindberg - PNAS
scRNA-Seq Lecture - Stem Cell Network RNA-Seq Workshop 2017
Integration of biomedical literature and databases
Shamilova nn 2013
Integration of biomedical literature and databases
Ad

More from Lars Juhl Jensen (20)

PPT
One tagger, many uses: Illustrating the power of dictionary-based named entit...
PPT
One tagger, many uses: Simple text-mining strategies for biomedicine
PPT
Extract 2.0: Text-mining-assisted interactive annotation
PPT
Network visualization: A crash course on using Cytoscape
PPT
STRING & STITCH : Network integration of heterogeneous data
PPT
Biomedical text mining: Automatic processing of unstructured text
PPT
Medical network analysis: Linking diseases and genes through data and text mi...
PPT
Network Biology: A crash course on STRING and Cytoscape
PPT
Cellular networks
PPT
Cellular Network Biology: Large-scale integration of data and text
PPT
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
PPT
STRING & related databases: Large-scale integration of heterogeneous data
PPT
Tagger: Rapid dictionary-based named entity recognition
PPT
Network Biology: Large-scale integration of data and text
PPT
Medical text mining: Linking diseases, drugs, and adverse reactions
PPT
Network biology: Large-scale integration of data and text
PPT
Medical data and text mining: Linking diseases, drugs, and adverse reactions
PPT
Cellular Network Biology
PPT
Network biology: Large-scale integration of data and text
PPT
Biomarker bioinformatics: Network-based candidate prioritization
One tagger, many uses: Illustrating the power of dictionary-based named entit...
One tagger, many uses: Simple text-mining strategies for biomedicine
Extract 2.0: Text-mining-assisted interactive annotation
Network visualization: A crash course on using Cytoscape
STRING & STITCH : Network integration of heterogeneous data
Biomedical text mining: Automatic processing of unstructured text
Medical network analysis: Linking diseases and genes through data and text mi...
Network Biology: A crash course on STRING and Cytoscape
Cellular networks
Cellular Network Biology: Large-scale integration of data and text
Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...
STRING & related databases: Large-scale integration of heterogeneous data
Tagger: Rapid dictionary-based named entity recognition
Network Biology: Large-scale integration of data and text
Medical text mining: Linking diseases, drugs, and adverse reactions
Network biology: Large-scale integration of data and text
Medical data and text mining: Linking diseases, drugs, and adverse reactions
Cellular Network Biology
Network biology: Large-scale integration of data and text
Biomarker bioinformatics: Network-based candidate prioritization

Recently uploaded (20)

PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Approach and Philosophy of On baking technology
PDF
cuic standard and advanced reporting.pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPT
Teaching material agriculture food technology
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Spectral efficient network and resource selection model in 5G networks
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Approach and Philosophy of On baking technology
cuic standard and advanced reporting.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Reach Out and Touch Someone: Haptics and Empathic Computing
Per capita expenditure prediction using model stacking based on satellite ima...
Chapter 3 Spatial Domain Image Processing.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
20250228 LYD VKU AI Blended-Learning.pptx
Building Integrated photovoltaic BIPV_UPV.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
Review of recent advances in non-invasive hemoglobin estimation
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Teaching material agriculture food technology
Agricultural_Statistics_at_a_Glance_2022_0.pdf

Mining large-scale data sets on the eukaryotic cell cycle