SlideShare a Scribd company logo
2
Most read
3
Most read
9
Most read
Algorithmic approach to Computational
Biology using Graphs
Submitted by
S P Sajjan
Research Guide
Dr. Ishwar BaidariMCA,Ph. D.
Dept. of Computer Science
Karnatak University, Dharwad.
What is Computational Biology?
"Computational biology is not a "field", but an "approach" involving
the use of computers to study biological processes and hence it is an area as
diverse as biology itself."
• Biological data
Biological data are data or measurements collected from
biological sources,
which are often stored or exchanged in a digital form.
Biological data are commonly stored in files or databases.
Ex : DNA sequences, and population data used in ecology.
• Functional molecules
In organic chemistry, functional groups are specific groups
of atoms or bonds within molecules that are responsible for the
characteristic chemical reactions of those molecules.
• Mining in molecular biology
Text-mining in molecular biology is defined as the
automatic extraction of information about genes, proteins and
their functional relationships from text documents.
Ex: Information science, Bioinformatics and Computational
linguistics.
• Defining Metabolism
The term, 'Metabolism' refers to biochemical processes
that happen within a person or living organism.
Metabolism is something that consists of both,’
Catabolism,' and, 'Anabolism;' which are the buildup and
breakdown of substances.
Cellular networks
• Interacting molecular sets
within cells.
• It includes mainly p-p
interactions, metabolism, gene
transcriptional regulatory
networks and signal
transduction pathways.
• All of them are different subsets
of a single large-scale cellular
network, since they are
eventually cross-linked.
Purpose of Computational Biology
• Computational Biology can be summarized as the field
utilizing high throughput technology and computation to study
complex organizational patterns of biological systems and
how they contribute to the normal physiology and disease.
• Experimental systems biology uses various
genomics/proteomics.
• Large number of genes or proteins at a genome scale, which
naturally yields a large volume of data to be interpreted and
put within the context of real biology.
• There are several nation-wide large projects aiming at
characterizing the genome and proteome of different (e.g
cancer) cells.
• Billions of dollars are spending into this research that spans
many of the top institutions across the nation.
• Classical molecular biology has mainly focused on gene or
molecular centric research,
• 30-40 years of this research led to our realization of the
incredible complexity of biological systems.
• we need more global experimental approaches and equally as
importantly.
Relevance of the study and present status
Issues Related to Computational Biology
• ~22,000 noted Human genes in Sequence
• ~60,000 known protein-protein interactions in human
• Millions of indirect relationships between genes
• Typical genomic experiment: millions of data points
Statement of Research Problem
• The theory of complex networks plays an important role in a
wide variety of disciplines, ranging from communication to
molecular and population biology.
• The focus of this Research is on graph theory methods for
computational biology.
• We will survey methods and approaches in graph theory,
along with current applications in biomedical informatics.
• Within the fields of Biology and Medicine, potential
applications of network analysis by using graph theory
including identifying drug targets, determining the role of
proteins or genes of unknown function.
• There are several biological domains where graph theory
techniques are applied for knowledge extraction from data.
We have classified these problems as follows.
• Modeling methods of bio-molecular networks such as protein
interaction networks, metabolic networks, as well as
transcriptional regulatory networks.
• Measurement of centrality and importance in bio-molecular
networks. To identify the most important nodes in a large
complex network is of fundamental importance in
computational biology.
• We will introduce several researches that applied centrality
measures to identify structurally important genes or proteins
identified in this way.
• Mining new pathways from bio-molecular networks.
• Experimental validation of identification of the pathway in
different organisms is requires huge amounts of time and effort.
• Thus, there is a need for Graph theory tools help scientists predict
pathways in bio-molecular networks.
• Our primary goal in the present Research is to provide as broad a
survey as possible of the major advances made in this field.
Moreover, we also highlight what has been achieved as well as
some of the most significant open issues that need to be addressed.
• Finally, we hope that this Research will serve as a useful
introduction to the field for those unfamiliar with the literature.
The concept of Graph theory
• Graph: A graph G consists of a set of vertices V(G) and set of
edges E(G).
• Simple Graph: In simple graph, two of the vertices in G are
linked if there exits an edge (𝑉𝑖, 𝑉𝑗) ∈E(G). connecting the
vertices and in graph G such that 𝑉𝑖 ∈V(G) and 𝑉𝑗 ∈V(G).
• Undirected Graph : An undirected graph is graph, i.e., a set of
objects (called vertices or nodes) that are connected together,
where all the edges are bidirectional. An undirected graph is
sometimes called an undirected network.
• Directed Graph: A directed graph is graph, i.e., a set of objects
(called vertices or nodes) that are connected together, where all
the edges are directed from one vertex to another. A directed
graph is sometimes called a digraph or a directed network.
Modeling of Bio-molecular networks in
Graph
• In Biology, Transcriptional regulatory networks and metabolic
networks would usually be modeled as directed graphs.
• For instance, in a Transcriptional regulatory network, nodes
represent genes with edges denoting the Transcriptional
relationship between them.
• In recent years, attentions have been focused on the protein-
protein interaction networks of various simple organisms. These
networks describe the direct physical interaction between the
proteins in an organism’s proteome and there is no direction
associated with the interactions in such networks.
• Hence, PPI networks are typically modeled as undirected
graphs, in which nodes represent protein and edges represent
interaction.
Computational Limitations
• The challenges of computational biology are enormous, and may exceed
the expected increases in computing capability. Several years ago the
computational power of “state-of-the-art parallel supercomputers”
allowed highly predictive calculations treating only hundreds of atoms for
time scales of picoseconds, while molecular dynamics calculations of tens
of thousands of atoms for nanoseconds were becoming common, although
they were some what less predictive.
• A straightforward application of Moore’s Law would predict an increase
of about three – four doublings in capability in the intervening five or six
years.
• Using current methodologies, achieving the desired level of computation
would represent an increase of greater than ~109 times in computing
power.
• It must be noted that even an increase of ~109 in computing power would
only provide the ability to simulate certain cellular systems, and may not
provide a means to predictively model whole cells, organs or organisms.
Algorithmic approach to computational biology using graphs

More Related Content

PDF
Structural genomics
PPT
Chemoinformatic
PPTX
Comparative genomics in eukaryotes, organelles
PPTX
Functional proteomics, methods and tools
PPTX
Cheminformatics
PPTX
Protein microarray Preparation of protein microarray Different methods of arr...
PPTX
Databases pathways of genomics and proteomics
PPTX
ProCheck
Structural genomics
Chemoinformatic
Comparative genomics in eukaryotes, organelles
Functional proteomics, methods and tools
Cheminformatics
Protein microarray Preparation of protein microarray Different methods of arr...
Databases pathways of genomics and proteomics
ProCheck

What's hot (20)

PDF
Mass Spectrometry Based Proteomics Analysis
PPTX
The Gene Ontology & Gene Ontology Annotation resources
PPTX
Example of force fields
PPTX
Structural Genomics
PPTX
Role of ensembl in genome browsing
PPTX
Protein array, types and application
PPTX
Human genome project - Decoding the codes of life
PPTX
DNA microarray ppt
PPTX
Kegg
PPTX
Methods of Protein structure determination
PPTX
gene prediction programs
PPTX
Functional proteomics, and tools
PPTX
Protein microarray
PPT
EST Clustering.ppt
PPTX
Introduction to proteomics
PPTX
Computational biology
PPTX
Chemical File Formats for storing chemical data
PPT
Rna Folding
PPTX
Databases short nucletide polymorphism
Mass Spectrometry Based Proteomics Analysis
The Gene Ontology & Gene Ontology Annotation resources
Example of force fields
Structural Genomics
Role of ensembl in genome browsing
Protein array, types and application
Human genome project - Decoding the codes of life
DNA microarray ppt
Kegg
Methods of Protein structure determination
gene prediction programs
Functional proteomics, and tools
Protein microarray
EST Clustering.ppt
Introduction to proteomics
Computational biology
Chemical File Formats for storing chemical data
Rna Folding
Databases short nucletide polymorphism
Ad

Similar to Algorithmic approach to computational biology using graphs (20)

PDF
Functional Coherence Of Molecular Networks In Bioinformatics 1st Edition Mehm...
PPTX
Top Network Biology Research Titles for PhD & Master’s – Tutors India
PDF
Biological networks 1st Edition Frantois Kopos
PDF
Computational Systems Biology 1st Edition Leonardo Marioramrez
PDF
Protein-protein interactions-graph-theoretic-modeling
PPTX
2016 davis-plantbio
PPTX
2015 genome-center
PDF
Network components and biological network construction methods
PDF
Biological networks 1st Edition Frantois Kopos
PPTX
2016 davis-biotech
PDF
Network Biology Methods And Applications 1st Edition Seesandra V Rajagopala
PDF
Biological networks 1st Edition Frantois Kopos all chapter instant download
PDF
Network Biology: A paradigm for modeling biological complex systems
PPTX
Leveraging Multiple GPUs and CPUs for Graphlet Counting in Large Networks
PDF
When Biology Meets Computer Science
PDF
Computational of Bioinformatics
PPTX
2022_Meetup_Mazza-Marzo.pptx
PDF
Technology R&D Theme 3: Multi-scale Network Representations
PDF
Data Mining for Systems Biology Methods and Protocols 1st Edition Koji Tsuda
Functional Coherence Of Molecular Networks In Bioinformatics 1st Edition Mehm...
Top Network Biology Research Titles for PhD & Master’s – Tutors India
Biological networks 1st Edition Frantois Kopos
Computational Systems Biology 1st Edition Leonardo Marioramrez
Protein-protein interactions-graph-theoretic-modeling
2016 davis-plantbio
2015 genome-center
Network components and biological network construction methods
Biological networks 1st Edition Frantois Kopos
2016 davis-biotech
Network Biology Methods And Applications 1st Edition Seesandra V Rajagopala
Biological networks 1st Edition Frantois Kopos all chapter instant download
Network Biology: A paradigm for modeling biological complex systems
Leveraging Multiple GPUs and CPUs for Graphlet Counting in Large Networks
When Biology Meets Computer Science
Computational of Bioinformatics
2022_Meetup_Mazza-Marzo.pptx
Technology R&D Theme 3: Multi-scale Network Representations
Data Mining for Systems Biology Methods and Protocols 1st Edition Koji Tsuda
Ad

Recently uploaded (20)

PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
cuic standard and advanced reporting.pdf
PDF
Machine learning based COVID-19 study performance prediction
PDF
Approach and Philosophy of On baking technology
PPTX
Spectroscopy.pptx food analysis technology
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
Big Data Technologies - Introduction.pptx
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Empathic Computing: Creating Shared Understanding
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
Unlocking AI with Model Context Protocol (MCP)
Per capita expenditure prediction using model stacking based on satellite ima...
Chapter 3 Spatial Domain Image Processing.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
cuic standard and advanced reporting.pdf
Machine learning based COVID-19 study performance prediction
Approach and Philosophy of On baking technology
Spectroscopy.pptx food analysis technology
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Big Data Technologies - Introduction.pptx
MIND Revenue Release Quarter 2 2025 Press Release
20250228 LYD VKU AI Blended-Learning.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Programs and apps: productivity, graphics, security and other tools
Empathic Computing: Creating Shared Understanding
Digital-Transformation-Roadmap-for-Companies.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf

Algorithmic approach to computational biology using graphs

  • 1. Algorithmic approach to Computational Biology using Graphs Submitted by S P Sajjan Research Guide Dr. Ishwar BaidariMCA,Ph. D. Dept. of Computer Science Karnatak University, Dharwad.
  • 2. What is Computational Biology? "Computational biology is not a "field", but an "approach" involving the use of computers to study biological processes and hence it is an area as diverse as biology itself."
  • 3. • Biological data Biological data are data or measurements collected from biological sources, which are often stored or exchanged in a digital form. Biological data are commonly stored in files or databases. Ex : DNA sequences, and population data used in ecology. • Functional molecules In organic chemistry, functional groups are specific groups of atoms or bonds within molecules that are responsible for the characteristic chemical reactions of those molecules.
  • 4. • Mining in molecular biology Text-mining in molecular biology is defined as the automatic extraction of information about genes, proteins and their functional relationships from text documents. Ex: Information science, Bioinformatics and Computational linguistics. • Defining Metabolism The term, 'Metabolism' refers to biochemical processes that happen within a person or living organism. Metabolism is something that consists of both,’ Catabolism,' and, 'Anabolism;' which are the buildup and breakdown of substances.
  • 5. Cellular networks • Interacting molecular sets within cells. • It includes mainly p-p interactions, metabolism, gene transcriptional regulatory networks and signal transduction pathways. • All of them are different subsets of a single large-scale cellular network, since they are eventually cross-linked.
  • 6. Purpose of Computational Biology • Computational Biology can be summarized as the field utilizing high throughput technology and computation to study complex organizational patterns of biological systems and how they contribute to the normal physiology and disease. • Experimental systems biology uses various genomics/proteomics. • Large number of genes or proteins at a genome scale, which naturally yields a large volume of data to be interpreted and put within the context of real biology.
  • 7. • There are several nation-wide large projects aiming at characterizing the genome and proteome of different (e.g cancer) cells. • Billions of dollars are spending into this research that spans many of the top institutions across the nation. • Classical molecular biology has mainly focused on gene or molecular centric research, • 30-40 years of this research led to our realization of the incredible complexity of biological systems. • we need more global experimental approaches and equally as importantly. Relevance of the study and present status
  • 8. Issues Related to Computational Biology • ~22,000 noted Human genes in Sequence • ~60,000 known protein-protein interactions in human • Millions of indirect relationships between genes • Typical genomic experiment: millions of data points
  • 9. Statement of Research Problem • The theory of complex networks plays an important role in a wide variety of disciplines, ranging from communication to molecular and population biology. • The focus of this Research is on graph theory methods for computational biology. • We will survey methods and approaches in graph theory, along with current applications in biomedical informatics. • Within the fields of Biology and Medicine, potential applications of network analysis by using graph theory including identifying drug targets, determining the role of proteins or genes of unknown function.
  • 10. • There are several biological domains where graph theory techniques are applied for knowledge extraction from data. We have classified these problems as follows. • Modeling methods of bio-molecular networks such as protein interaction networks, metabolic networks, as well as transcriptional regulatory networks. • Measurement of centrality and importance in bio-molecular networks. To identify the most important nodes in a large complex network is of fundamental importance in computational biology. • We will introduce several researches that applied centrality measures to identify structurally important genes or proteins identified in this way.
  • 11. • Mining new pathways from bio-molecular networks. • Experimental validation of identification of the pathway in different organisms is requires huge amounts of time and effort. • Thus, there is a need for Graph theory tools help scientists predict pathways in bio-molecular networks. • Our primary goal in the present Research is to provide as broad a survey as possible of the major advances made in this field. Moreover, we also highlight what has been achieved as well as some of the most significant open issues that need to be addressed. • Finally, we hope that this Research will serve as a useful introduction to the field for those unfamiliar with the literature.
  • 12. The concept of Graph theory • Graph: A graph G consists of a set of vertices V(G) and set of edges E(G). • Simple Graph: In simple graph, two of the vertices in G are linked if there exits an edge (𝑉𝑖, 𝑉𝑗) ∈E(G). connecting the vertices and in graph G such that 𝑉𝑖 ∈V(G) and 𝑉𝑗 ∈V(G). • Undirected Graph : An undirected graph is graph, i.e., a set of objects (called vertices or nodes) that are connected together, where all the edges are bidirectional. An undirected graph is sometimes called an undirected network. • Directed Graph: A directed graph is graph, i.e., a set of objects (called vertices or nodes) that are connected together, where all the edges are directed from one vertex to another. A directed graph is sometimes called a digraph or a directed network.
  • 13. Modeling of Bio-molecular networks in Graph • In Biology, Transcriptional regulatory networks and metabolic networks would usually be modeled as directed graphs. • For instance, in a Transcriptional regulatory network, nodes represent genes with edges denoting the Transcriptional relationship between them. • In recent years, attentions have been focused on the protein- protein interaction networks of various simple organisms. These networks describe the direct physical interaction between the proteins in an organism’s proteome and there is no direction associated with the interactions in such networks. • Hence, PPI networks are typically modeled as undirected graphs, in which nodes represent protein and edges represent interaction.
  • 14. Computational Limitations • The challenges of computational biology are enormous, and may exceed the expected increases in computing capability. Several years ago the computational power of “state-of-the-art parallel supercomputers” allowed highly predictive calculations treating only hundreds of atoms for time scales of picoseconds, while molecular dynamics calculations of tens of thousands of atoms for nanoseconds were becoming common, although they were some what less predictive. • A straightforward application of Moore’s Law would predict an increase of about three – four doublings in capability in the intervening five or six years. • Using current methodologies, achieving the desired level of computation would represent an increase of greater than ~109 times in computing power. • It must be noted that even an increase of ~109 in computing power would only provide the ability to simulate certain cellular systems, and may not provide a means to predictively model whole cells, organs or organisms.