In silico and Text-Based Analysis of Cellular Networks

Download as PPT, PDF

1 like285 views

Lars Juhl Jensen

This document discusses in silico and text-based analysis of cellular networks. It describes using computational predictions from over 2000 genomes and experimental data to build protein interaction networks in databases like STRING. It also discusses challenges like different data sources using different formats and identifiers. The document outlines using natural language processing techniques like named entity recognition to extract and normalize biomolecular identifiers. It proposes using co-mentioning of entities within texts to assign confidence scores to interactions for building integrated interaction networks. Finally it acknowledges contributions to building networks describing protein interactions, chemical interactions, subcellular localization, tissue expression, cell cycle expression, and disease associations.

In Silico and Text-Based Analysis
of Cellular Networks
Lars Juhl Jensen

association networks

guilt by association

In silico and Text-Based Analysis of Cellular Networks

protein networks

STRING

2000+ genomes

computational predictions

gene fusion

Korbel et al., Nature Biotechnology, 2004

phylogenetic profiles

Korbel et al., Nature Biotechnology, 2004

experimental data

gene coexpression

In silico and Text-Based Analysis of Cellular Networks

physical interactions

Jensen & Bork, Science, 2008

curated knowledge

pathways

Letunic & Bork, Trends in Biochemical Sciences, 2008

many databases

different formats

different identifiers

variable quality

not comparable

hard work

parsers

mapping files

quality scores

von Mering et al., Nucleic Acids Research, 2005

score calibration

gold standard

von Mering et al., Nucleic Acids Research, 2005

common scale

missing most of the data

>10 km

too much to read

computer

as smart as a dog

teach it specific tricks

In silico and Text-Based Analysis of Cellular Networks

In silico and Text-Based Analysis of Cellular Networks

named entity recognition

comprehensive lexicon

cyclin dependent kinase 1

CDC2

orthographic variation

spaces and hyphens

cyclin dependent kinase 1

cyclin-dependent kinase 1

prefixes and suffixes

CDC2

hCdc2

“black list”

SDS

co-mentioning

within documents

within paragraphs

within sentences

quality score

protein networks

Szklarczyk et al., Nucleic Acids Research, 2015string-db.org

general approach

chemical networks

Kuhn et al., Nucleic Acids Research, 2014stitch-db.org

space

subcellular localization

Binder et al., Database, 2014compartments.jensenlab.org

tissue expression

tissues.jensenlab.org Santos et al., submitted, 2015

time

cell-cycle expression

Santos et al., Nucleic Acids Research, 2015cyclebase.org

disease associations

diseases.jensenlab.org Frankild et al., Methods, 2015

Acknowledgments
Molecular networks
Michael Kuhn
Damian Szklarczyk
Andrea Franceschini
Milan Simonovic
Alexander Roth
Sune Pletscher-Frankild
Jianyi Lin
Pablo Minguez
Christian von Mering
Peer Bork
Time and space
Alberto Santos
Sune Pletscher-Frankild
Janos Binder
Kalliopi Tsafou
Christian Stolte
Albert Palleja
Heiko Horn
Rasmus Wernersson
Reinhardt Schneider
Sean O’ Donoghue

Ad

Recommended

PPT

Gene association networks - Large-scale integration of data and text

Lars Juhl Jensen

PPT

STRING: Protein networks from data and text mining

Lars Juhl Jensen

PPT

Network Biology: A crash course on STRING and Cytoscape

Lars Juhl Jensen

PPT

STRING - Large-scale integration of data and text

Lars Juhl Jensen

PPT

Gene association networks - Large-scale integration of data and text

Lars Juhl Jensen

PPT

Gene association networks - Large-scale integration of data and text

Lars Juhl Jensen

PPT

STRING - Protein networks from data and text mining

Lars Juhl Jensen

PPT

Gene association networks: Large-scale integration of data and text

Lars Juhl Jensen

PPT

Gene association networks: Large-scale integration of data and text

Lars Juhl Jensen

PPT

Gene association networks: Large-scale integration of data and text

Lars Juhl Jensen

PPT

Introduction to STRING

Lars Juhl Jensen

PPT

Networks of proteins and diseases

Lars Juhl Jensen

PPT

Gene association networks - Large-scale integration of data and text

Lars Juhl Jensen

PPT

Network biology - Large-scale integration of data and text

Lars Juhl Jensen

PPT

Protein association networks: Large-scale integration of data and text

Lars Juhl Jensen

PPT

Network biology: Large-scale data and text mining

Lars Juhl Jensen

PPT

One tagger, many uses - Illustrating the power of ontologies in named entity ...

Lars Juhl Jensen

PPT

Data integration with STRING

Lars Juhl Jensen

PPT

Network Biology: Large-scale integration of data and text

Lars Juhl Jensen

PPT

Making gene networks through data integration

Lars Juhl Jensen

PPT

Network biology: Large-scale data and text mining

Lars Juhl Jensen

PPT

Large-scale integration of data and text

Lars Juhl Jensen

PPT

Large-scale data and text mining

Lars Juhl Jensen

PPT

Advanced bioinformaticsof proteomics datasets

Lars Juhl Jensen

PPT

Gene Association Networks: Large-scale integration of data and text

Lars Juhl Jensen

PPT

Data and Text Mining

Lars Juhl Jensen

PPT

Network biology: Large-scale data integration and text mining

Lars Juhl Jensen

KEY

STRING/STITCH tutorial

PPSX

Robertson immemxi final March 2016

IRIDA_community

PPTX

In-Silico Modelling of Tumour Growth

More Related Content

PPT

Gene association networks - Large-scale integration of data and text

Lars Juhl Jensen

PPT

STRING: Protein networks from data and text mining

Lars Juhl Jensen

PPT

Network Biology: A crash course on STRING and Cytoscape

Lars Juhl Jensen

PPT

STRING - Large-scale integration of data and text

Lars Juhl Jensen

PPT

Gene association networks - Large-scale integration of data and text

Lars Juhl Jensen

PPT

Gene association networks - Large-scale integration of data and text

Lars Juhl Jensen

PPT

STRING - Protein networks from data and text mining

Lars Juhl Jensen

PPT

Gene association networks: Large-scale integration of data and text

Lars Juhl Jensen

Gene association networks - Large-scale integration of data and text

Lars Juhl Jensen

STRING: Protein networks from data and text mining

Lars Juhl Jensen

Network Biology: A crash course on STRING and Cytoscape

Lars Juhl Jensen

STRING - Large-scale integration of data and text

Lars Juhl Jensen

Gene association networks - Large-scale integration of data and text

Lars Juhl Jensen

Gene association networks - Large-scale integration of data and text

Lars Juhl Jensen

STRING - Protein networks from data and text mining

Lars Juhl Jensen

Gene association networks: Large-scale integration of data and text

Lars Juhl Jensen

What's hot (20)

PPT

Gene association networks: Large-scale integration of data and text

Lars Juhl Jensen

PPT

Gene association networks: Large-scale integration of data and text

Lars Juhl Jensen

PPT

Introduction to STRING

Lars Juhl Jensen

PPT

Networks of proteins and diseases

Lars Juhl Jensen

PPT

Gene association networks - Large-scale integration of data and text

Lars Juhl Jensen

PPT

Network biology - Large-scale integration of data and text

Lars Juhl Jensen

PPT

Protein association networks: Large-scale integration of data and text

Lars Juhl Jensen

PPT

Network biology: Large-scale data and text mining

Lars Juhl Jensen

PPT

One tagger, many uses - Illustrating the power of ontologies in named entity ...

Lars Juhl Jensen

PPT

Data integration with STRING

Lars Juhl Jensen

PPT

Network Biology: Large-scale integration of data and text

Lars Juhl Jensen

PPT

Making gene networks through data integration

Lars Juhl Jensen

PPT

Network biology: Large-scale data and text mining

Lars Juhl Jensen

PPT

Large-scale integration of data and text

Lars Juhl Jensen

PPT

Large-scale data and text mining

Lars Juhl Jensen

PPT

Advanced bioinformaticsof proteomics datasets

Lars Juhl Jensen

PPT

Gene Association Networks: Large-scale integration of data and text

Lars Juhl Jensen

PPT

Data and Text Mining

Lars Juhl Jensen

PPT

Network biology: Large-scale data integration and text mining

Lars Juhl Jensen

KEY

STRING/STITCH tutorial

Gene association networks: Large-scale integration of data and text

Lars Juhl Jensen

Gene association networks: Large-scale integration of data and text

Lars Juhl Jensen

Introduction to STRING

Lars Juhl Jensen

Networks of proteins and diseases

Lars Juhl Jensen

Gene association networks - Large-scale integration of data and text

Lars Juhl Jensen

Network biology - Large-scale integration of data and text

Lars Juhl Jensen

Protein association networks: Large-scale integration of data and text

Lars Juhl Jensen

Network biology: Large-scale data and text mining

Lars Juhl Jensen

One tagger, many uses - Illustrating the power of ontologies in named entity ...

Lars Juhl Jensen

Data integration with STRING

Lars Juhl Jensen

Network Biology: Large-scale integration of data and text

Lars Juhl Jensen

Making gene networks through data integration

Lars Juhl Jensen

Network biology: Large-scale data and text mining

Lars Juhl Jensen

Large-scale integration of data and text

Lars Juhl Jensen

Large-scale data and text mining

Lars Juhl Jensen

Advanced bioinformaticsof proteomics datasets

Lars Juhl Jensen

Gene Association Networks: Large-scale integration of data and text

Lars Juhl Jensen

Data and Text Mining

Lars Juhl Jensen

Network biology: Large-scale data integration and text mining

Lars Juhl Jensen

STRING/STITCH tutorial

Ad

Viewers also liked (18)

PPSX

Robertson immemxi final March 2016

IRIDA_community

PPTX

In-Silico Modelling of Tumour Growth

PDF

Evaluation of the impact of error correction algorithms on SNP calling.

PPT

Tetra Arm PCR

Dr Dinesh Kumar

PDF

Big Data and Genomic Medicine by Corey Nislow

PDF

WGS in public health microbiology - MDU/VIDRL Seminar - wed 17 jun 2015

Torsten Seemann

PPT

Molecular modelling for in silico drug discovery

PPTX

Pharmacogenomics

PPT

Protein Modeling And In-Silico Drug Designing Approach

PPT

Intro to in silico drug discovery 2014

PPTX

SNp mining in crops

saurabh Pandey.Saurabh784

PPT

Identification of disease genes

Prasanthperceptron

PPTX

Snp

PDF

SNP Genotyping Technologies

SivamaniBalasubramaniam

PPTX

SNP

PPTX

Single nucleotide polymorphism

PPTX

Next generation sequencing

Dayananda Salam

PPT

Snp

Robertson immemxi final March 2016

IRIDA_community

In-Silico Modelling of Tumour Growth

Evaluation of the impact of error correction algorithms on SNP calling.

Tetra Arm PCR

Dr Dinesh Kumar

Big Data and Genomic Medicine by Corey Nislow

WGS in public health microbiology - MDU/VIDRL Seminar - wed 17 jun 2015

Torsten Seemann

Molecular modelling for in silico drug discovery

Pharmacogenomics

Protein Modeling And In-Silico Drug Designing Approach

Intro to in silico drug discovery 2014

SNp mining in crops

saurabh Pandey.Saurabh784

Identification of disease genes

Prasanthperceptron

Snp

SNP Genotyping Technologies

SivamaniBalasubramaniam

SNP

Single nucleotide polymorphism

Next generation sequencing

Dayananda Salam

Snp

Ad

Similar to In silico and Text-Based Analysis of Cellular Networks (20)

PPT

Network biology: Large-scale data integration and text mining

Lars Juhl Jensen

PPT

Large-scale integration of data and text

Lars Juhl Jensen

PPT

Protein networks: A basis for large-scale data mining

Lars Juhl Jensen

PPT

Protein networks: A basis for large-scale data mining

Lars Juhl Jensen

PPT

Information integration

Lars Juhl Jensen

PPT

Cellular Network Biology: Large-scale integration of data and text

Lars Juhl Jensen

PPT

Large-scale integration of data and text

Lars Juhl Jensen

PPT

Cellular network biology: Proteome-wide analysis of heterogeneous data

Lars Juhl Jensen

PPT

Cellular Network Biology

Lars Juhl Jensen

PPT

Protein interaction networks

Lars Juhl Jensen

PPT

STRING & related databases: Large-scale integration of heterogeneous data

Lars Juhl Jensen

PPT

Network biology

Lars Juhl Jensen

PPT

Systems biology - Bioinformatics on complete biological systems

Lars Juhl Jensen

PPT

Systems biology: Bioinformatics on complete biological system

Lars Juhl Jensen

PPT

Protein networks: A basis for large-scale data mining

Lars Juhl Jensen

PPT

Systems biology - Understanding biology at the systems level

Lars Juhl Jensen

PPT

Protein networks: A basis for large-scale data mining

Lars Juhl Jensen

PPT

Systems biology: Bioinformatics on complete biological systems

Lars Juhl Jensen

PPT

Systems biology: Large-scale biomedical data mining

Lars Juhl Jensen

PPT

STRING: Large-scale data and text mining

Lars Juhl Jensen

Network biology: Large-scale data integration and text mining

Lars Juhl Jensen

Large-scale integration of data and text

Lars Juhl Jensen

Protein networks: A basis for large-scale data mining

Lars Juhl Jensen

Protein networks: A basis for large-scale data mining

Lars Juhl Jensen

Information integration

Lars Juhl Jensen

Cellular Network Biology: Large-scale integration of data and text

Lars Juhl Jensen

Large-scale integration of data and text

Lars Juhl Jensen

Cellular network biology: Proteome-wide analysis of heterogeneous data

Lars Juhl Jensen

Cellular Network Biology

Lars Juhl Jensen

Protein interaction networks

Lars Juhl Jensen

STRING & related databases: Large-scale integration of heterogeneous data

Lars Juhl Jensen

Network biology

Lars Juhl Jensen

Systems biology - Bioinformatics on complete biological systems

Lars Juhl Jensen

Systems biology: Bioinformatics on complete biological system

Lars Juhl Jensen

Protein networks: A basis for large-scale data mining

Lars Juhl Jensen

Systems biology - Understanding biology at the systems level

Lars Juhl Jensen

Protein networks: A basis for large-scale data mining

Lars Juhl Jensen

Systems biology: Bioinformatics on complete biological systems

Lars Juhl Jensen

Systems biology: Large-scale biomedical data mining

Lars Juhl Jensen

STRING: Large-scale data and text mining

Lars Juhl Jensen

More from Lars Juhl Jensen (20)

PPT

One tagger, many uses: Illustrating the power of dictionary-based named entit...

Lars Juhl Jensen

PPT

One tagger, many uses: Simple text-mining strategies for biomedicine

Lars Juhl Jensen

PPT

Extract 2.0: Text-mining-assisted interactive annotation

Lars Juhl Jensen

PPT

Network visualization: A crash course on using Cytoscape

Lars Juhl Jensen

PPT

STRING & STITCH: Network integration of heterogeneous data

Lars Juhl Jensen

PPT

Biomedical text mining: Automatic processing of unstructured text

Lars Juhl Jensen

PPT

Medical network analysis: Linking diseases and genes through data and text mi...

Lars Juhl Jensen

PPT

Cellular networks

Lars Juhl Jensen

PPT

Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...

Lars Juhl Jensen

PPT

Tagger: Rapid dictionary-based named entity recognition

Lars Juhl Jensen

PPT

Medical text mining: Linking diseases, drugs, and adverse reactions

Lars Juhl Jensen

PPT

Network biology: Large-scale integration of data and text

Lars Juhl Jensen

PPT

Medical data and text mining: Linking diseases, drugs, and adverse reactions

Lars Juhl Jensen

PPT

Network biology: Large-scale integration of data and text

Lars Juhl Jensen

PPT

Biomarker bioinformatics: Network-based candidate prioritization

Lars Juhl Jensen

PPT

The Art of Counting: Scoring and ranking co-occurrences in literature

Lars Juhl Jensen

PPT

Text-mining-based retrieval of protein networks

Lars Juhl Jensen

PPT

Medical data and text mining: Linking diseases, drugs, and adverse reactions

Lars Juhl Jensen

PPT

Medical data and text mining: Linking diseases, drugs, and adverse reactions

Lars Juhl Jensen

PPT

Medical data and text mining: Linking diseases, drugs, and adverse reactions

Lars Juhl Jensen

One tagger, many uses: Illustrating the power of dictionary-based named entit...

Lars Juhl Jensen

One tagger, many uses: Simple text-mining strategies for biomedicine

Lars Juhl Jensen

Extract 2.0: Text-mining-assisted interactive annotation

Lars Juhl Jensen

Network visualization: A crash course on using Cytoscape

Lars Juhl Jensen

STRING & STITCH: Network integration of heterogeneous data

Lars Juhl Jensen

Biomedical text mining: Automatic processing of unstructured text

Lars Juhl Jensen

Medical network analysis: Linking diseases and genes through data and text mi...

Lars Juhl Jensen

Cellular networks

Lars Juhl Jensen

Statistics on big biomedical data: Methods and pitfalls when analyzing high-t...

Lars Juhl Jensen

Tagger: Rapid dictionary-based named entity recognition

Lars Juhl Jensen

Medical text mining: Linking diseases, drugs, and adverse reactions

Lars Juhl Jensen

Network biology: Large-scale integration of data and text

Lars Juhl Jensen

Medical data and text mining: Linking diseases, drugs, and adverse reactions

Lars Juhl Jensen

Network biology: Large-scale integration of data and text

Lars Juhl Jensen

Biomarker bioinformatics: Network-based candidate prioritization

Lars Juhl Jensen

The Art of Counting: Scoring and ranking co-occurrences in literature

Lars Juhl Jensen

Text-mining-based retrieval of protein networks

Lars Juhl Jensen

Medical data and text mining: Linking diseases, drugs, and adverse reactions

Lars Juhl Jensen

Medical data and text mining: Linking diseases, drugs, and adverse reactions

Lars Juhl Jensen

Medical data and text mining: Linking diseases, drugs, and adverse reactions

Lars Juhl Jensen

Recently uploaded (20)

PPTX

Cell Membrane: Structure, Composition & Functions

Muhammad Sajid Afridi

PPTX

BIOMOLECULES PPT........................

vachieagrawal1221

PPTX

Taita Taveta Laboratory Technician Workshop Presentation.pptx

PPTX

ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg

PDF

Formation of Supersonic Turbulence in the Primordial Star-forming Cloud

PDF

. Radiology Case Scenariosssssssssssssss

PDF

CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...

PDF

An interstellar mission to test astrophysical black holes

PPTX

ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...

PPTX

Comparative Structure of Integument in Vertebrates.pptx

Dr Showkat Ahmad Wani

PPTX

EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx

PPTX

2. Earth - The Living Planet earth and life

markjustinebarolobau

PPTX

Classification Systems_TAXONOMY_SCIENCE8.pptx

PPTX

Introduction to Cardiovascular system_structure and functions-1

PDF

Placing the Near-Earth Object Impact Probability in Context

PDF

Phytochemical Investigation of Miliusa longipes.pdf

IrfanShahirSharafi

PPTX

7. General Toxicologyfor clinical phrmacy.pptx

DOCX

Q1_LE_Mathematics 8_Lesson 5_Week 5.docx

marcusaviso1101

PDF

VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS

PPT

The World of Physical Science, • Labs: Safety Simulation, Measurement Practice

Cell Membrane: Structure, Composition & Functions

Muhammad Sajid Afridi

BIOMOLECULES PPT........................

vachieagrawal1221

Taita Taveta Laboratory Technician Workshop Presentation.pptx

ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg

Formation of Supersonic Turbulence in the Primordial Star-forming Cloud

. Radiology Case Scenariosssssssssssssss

CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...

An interstellar mission to test astrophysical black holes

ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...

Comparative Structure of Integument in Vertebrates.pptx

Dr Showkat Ahmad Wani

EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx

2. Earth - The Living Planet earth and life

markjustinebarolobau

Classification Systems_TAXONOMY_SCIENCE8.pptx

Introduction to Cardiovascular system_structure and functions-1

Placing the Near-Earth Object Impact Probability in Context

Phytochemical Investigation of Miliusa longipes.pdf

IrfanShahirSharafi

7. General Toxicologyfor clinical phrmacy.pptx

Q1_LE_Mathematics 8_Lesson 5_Week 5.docx

marcusaviso1101

VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS

The World of Physical Science, • Labs: Safety Simulation, Measurement Practice

In silico and Text-Based Analysis of Cellular Networks

1. In Silico and Text-Based Analysis of Cellular Networks Lars Juhl Jensen

2. association networks

3. guilt by association

5. protein networks

7. 2000+ genomes

8. computational predictions

10. Korbel et al., Nature Biotechnology, 2004

11. phylogenetic profiles

12. Korbel et al., Nature Biotechnology, 2004

13. experimental data

14. gene coexpression

16. physical interactions

17. Jensen & Bork, Science, 2008

18. curated knowledge

20. Letunic & Bork, Trends in Biochemical Sciences, 2008

21. many databases

22. different formats

23. different identifiers

24. variable quality

25. not comparable

28. mapping files

29. quality scores

30. von Mering et al., Nucleic Acids Research, 2005

31. score calibration

32. gold standard

33. von Mering et al., Nucleic Acids Research, 2005

34. common scale

35. missing most of the data

37. too much to read

39. as smart as a dog

40. teach it specific tricks

43. named entity recognition

44. comprehensive lexicon

45. cyclin dependent kinase 1

47. orthographic variation

48. spaces and hyphens

49. cyclin dependent kinase 1

50. cyclin-dependent kinase 1

51. prefixes and suffixes

54. “black list”

56. co-mentioning

57. within documents

58. within paragraphs

59. within sentences

60. quality score

61. protein networks

62. Szklarczyk et al., Nucleic Acids Research, 2015string-db.org

63. general approach

64. chemical networks

65. Kuhn et al., Nucleic Acids Research, 2014stitch-db.org

67. subcellular localization

68. Binder et al., Database, 2014compartments.jensenlab.org

69. tissue expression

70. tissues.jensenlab.org Santos et al., submitted, 2015

72. cell-cycle expression

73. Santos et al., Nucleic Acids Research, 2015cyclebase.org

74. disease associations

75. diseases.jensenlab.org Frankild et al., Methods, 2015

76. Acknowledgments Molecular networks Michael Kuhn Damian Szklarczyk Andrea Franceschini Milan Simonovic Alexander Roth Sune Pletscher-Frankild Jianyi Lin Pablo Minguez Christian von Mering Peer Bork Time and space Alberto Santos Sune Pletscher-Frankild Janos Binder Kalliopi Tsafou Christian Stolte Albert Palleja Heiko Horn Rasmus Wernersson Reinhardt Schneider Sean O’ Donoghue