Bioinformatics
• What is bioinformatics?
• Why bioinformatics?
• The major molecular biology facts
• Brief history of bioinformatics
• Typical problems of bioinformatics:
collection and retrieval of data
alignment and similarity search
prediction and classification
• Expectations and the level of requirements
Lecture 1
What is Bioinformatics?
The intersection of Biology, Computer Science, and Mathematics and Statistics
What is bioinformatics?
A working definition is that of the House of Representatives Standing Committee on Primary Industries and Regional Services inquiry:
"All aspects of gathering, storing, handling,
analyzing, interpreting and spreading vast amounts
of biological information in databases. The
information involved includes gene sequences,
biological activity/function, pharmacological
activity, biological structure, molecular structure,
protein-protein interactions, and gene expression.
Bioinformatics uses powerful computers and
statistical techniques to accomplish research
objectives, for example, to discover a new
pharmaceutical or herbicide."
Areas of current and future development of bioinformatics
• Molecular biology and genetics
• Phylogenetic and evolutionary sciences
• Different aspects of biotechnology including pharmaceutical and microbiological industries
• Medicine
• Agriculture
• Eco-management
Why bioinformatics?
• Exponential growth of investments
• Constant deficit of trained professionals
• Diversification of bioinformatics applications
• Need for different types of bioinformaticians
Central Dogma of Molecular Biology
GENOTYPE (e.g. Aa) → PHENOTYPE (pink)
GENE (DNA) → MESSENGER (RNA) → PROTEIN → TRAIT
DNA: ATGCAAGTCCACTGTATTCCA
RNA: UACGUUCAGGUGACAUAAGGG
Processes: replication (DNA → DNA), transcription (DNA → RNA), reverse transcription (RNA → DNA), translation (RNA → protein)
DNA
Symbol   Meaning         Explanation
G        G               Guanine
A        A               Adenine
T        T               Thymine
C        C               Cytosine
R        A or G          puRine
Y        C or T          pYrimidine
N        A, C, G or T    Any base
Double helix
5’ A C G T C A T G 3’
3’ T G C A G T A C 5’ (template)
RNA
Symbol   Meaning         Explanation
U        U               Uracil
5’ A C G U C A U G 3’
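The base-pairing rules and symbol table on this slide can be sketched in a few lines of Python. This is a minimal illustration, not a library API: the function names (`complement`, `reverse_complement`, `transcribe`, `iupac_to_regex`) are hypothetical, and only the ambiguity symbols listed on the slide (R, Y, N) are handled.

```python
import re

# Watson-Crick pairing for DNA bases: A-T and G-C.
DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Symbols from the slide's table, written as regex character classes.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "R": "[AG]", "Y": "[CT]", "N": "[ACGT]"}


def complement(dna: str) -> str:
    """Base-by-base complement; the result reads 3'->5' if the input is 5'->3'."""
    return dna.translate(DNA_COMPLEMENT)


def reverse_complement(dna: str) -> str:
    """The complementary strand written in the conventional 5'->3' direction."""
    return complement(dna)[::-1]


def transcribe(coding_strand: str) -> str:
    """The RNA carries the coding strand's sequence with U in place of T."""
    return coding_strand.replace("T", "U")


def iupac_to_regex(pattern: str) -> re.Pattern:
    """Turn a pattern with ambiguity symbols into a compiled regular expression."""
    return re.compile("".join(IUPAC[symbol] for symbol in pattern))


top_strand = "ACGTCATG"  # the 5'->3' strand shown on the slide
print(complement(top_strand))          # TGCAGTAC
print(reverse_complement(top_strand))  # CATGACGT
print(transcribe(top_strand))          # ACGUCAUG
print(bool(iupac_to_regex("RYN").fullmatch("GTA")))  # True
```

The three printed sequences reproduce the slide's double helix and RNA lines; the last line checks that purine-pyrimidine-any matches G, T, A.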
Genetic Code
1. Amino acids are coded by codons: triplets of nucleotides, e.g. |ACG|TAT|….
2. There are 4^3 = 64 codons for ~20 amino acids, so the code is degenerate
3. Codons do not overlap
4. Deletions or insertions of one or a few nucleotides (any number not divisible by 3) usually destroy a message by shifting the reading frame
5. Three specific codons (stop codons) do not code for any amino acid and are always located at the very end of the protein coding part of a gene
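The five points above can be checked with a short sketch. It assumes the standard genetic code (the slide does not say which code table is meant) and uses a hypothetical helper named `translate`. The example gene is the DNA sequence shown on the Central Dogma slide.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
# "*" marks the three stop codons (TAA, TAG, TGA).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Read non-overlapping triplets from position 0 and stop at a stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(len(CODON_TABLE))              # 64 codons (4^3)
print(len(set(AMINO) - {"*"}))       # 20 amino acids
print(AMINO.count("*"))              # 3 stop codons

gene = "ATGCAAGTCCACTGTATTCCA"
print(translate(gene))                   # MQVHCIP
print(translate(gene[:3] + gene[4:]))    # one base deleted: MKSTVF
print(translate(gene[:3] + gene[6:]))    # three bases deleted: MVHCIP
```

Deleting one base changes every downstream amino acid, while deleting a whole codon removes a single amino acid and leaves the rest of the message intact.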
The genetic code
The 20 amino acids common in living
organisms
PROTEINS
Green Fluorescent Protein (GFP)
1 mcgkkfelki dnvrfvghpt llqpphtiqa sktdpspkre lptmilfsvv falranadas
61 viscmhnlsr riaialqhee rrcqyltrea klmlamqdev ttiidsdgsp qspfrqilpk
121 cklardlkea ydslcttgvv rlhinnwlev sfclphkihr vggkhiplea lerslkairp
Genomic Hierarchy in Eukaryotes
Nuclear genome (1)
Chromosomes (23 x 2)
DNA molecules (23 x 2)
Genes (~30,000); only a small fraction of the genome
Nucleotides (~3 x 10^9)
Eukaryotic genes are complex
Promoter, Exon 1 (start codon), Intron 1, Exon 2, Intron 2, Exon 3, Intron 3, Exon 4 (stop codon)
Protein coding regions: the exons
Brief history of bioinformatics: Databases
• The first biological database, the Protein Identification Resource, was established in 1972 by Margaret Dayhoff
• Dayhoff and co-workers organized the proteins into families and superfamilies based on the degree of sequence similarity
• The idea of sequence alignment was introduced, as well as special tables that reflected the frequency of changes observed in the sequences of a group of closely related proteins
• Currently there are several huge protein databases: Swiss-Prot, PIR International, etc.
• The first DNA database was established in 1979. Currently there are several powerful databases: GenBank, EMBL, DDBJ, etc.
Brief history of bioinformatics: evolutionary reconstructions
Brief history of bioinformatics: other
important steps
• Development of sequence retrieval methods (1970-80s)
• Development of principles of sequence alignment (1980s)
• Prediction of RNA secondary structure (1980s)
• Prediction of protein secondary and 3D structure (1980-90s)
• The FASTA and BLAST methods for DB search (1980-90s)
• Prediction of genes (1990s)
• Studies of complete genome sequences (late 1990s–2000s)
Collection and retrieval of data.
Alignment methods.
• Sequencing (DNA, proteins)
• Submission of sequences to the databases
• Computer storage of sequences
• Development of sequence formats
• Conversion of one sequence format to another
• Development of retrieval and alignment methods
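As one small illustration of converting one sequence format to another, the sketch below turns a numbered, space-grouped sequence block (the layout used for the protein sequence on the PROTEINS slide) into FASTA text. The function name `numbered_block_to_fasta` and the header `example_protein` are hypothetical, and real database records carry more fields than this handles.

```python
def numbered_block_to_fasta(lines, header, width=60):
    """Strip position numbers and spaces, then emit a FASTA record."""
    residues = []
    for line in lines:
        for token in line.split():
            # Tokens made only of digits are position counters, not sequence.
            if not token.isdigit():
                residues.append(token)
    sequence = "".join(residues).upper()
    # Wrap the sequence into fixed-width lines under a ">" header line.
    wrapped = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return "\n".join([">" + header] + wrapped)


block = [
    "  1 mcgkkfelki dnvrfvghpt llqpphtiqa sktdpspkre lptmilfsvv falranadas",
    " 61 viscmhnlsr riaialqhee rrcqyltrea klmlamqdev ttiidsdgsp qspfrqilpk",
    "121 cklardlkea ydslcttgvv rlhinnwlev sfclphkihr vggkhiplea lerslkairp",
]
fasta = numbered_block_to_fasta(block, "example_protein")
print(fasta)
```

The same function works for nucleotide blocks, because it only removes the counters and whitespace and does not interpret the letters.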
Prediction, reconstruction and
classification
• Prediction of secondary and 3D structure of RNA and proteins
• Gene prediction in prokaryotes and eukaryotes
• Prediction of promoters and other functional sites
• Reconstruction of phylogeny
• Genome analysis
• Classification of proteins and genes
Prediction of RNA secondary structure:
an example
A. Single-stranded RNA, drawn 5’ to 3’
B. Stem and loop, or hairpin loop, with the 5’ and 3’ ends paired in the stem
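The stem-and-loop idea can be illustrated with a deliberately simple sketch. This is a toy check and not a real prediction method: it only tests whether the two ends of a sequence can pair with each other, using Watson-Crick pairs alone and an assumed minimum loop size of three nucleotides. The function names and the example sequence are hypothetical.

```python
# Watson-Crick pairs in RNA; wobble pairs such as G-U are ignored here.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def hairpin_stem_length(rna: str, min_loop: int = 3) -> int:
    """Count consecutive base pairs formed between the 5' end and the 3' end."""
    stem = 0
    i, j = 0, len(rna) - 1
    # Keep pairing inward while at least min_loop unpaired bases would remain.
    while j - i - 1 >= min_loop and (rna[i], rna[j]) in PAIRS:
        stem += 1
        i += 1
        j -= 1
    return stem


def dot_bracket(rna: str) -> str:
    """Write the hairpin in dot-bracket notation: '(' and ')' pair, '.' is unpaired."""
    stem = hairpin_stem_length(rna)
    return "(" * stem + "." * (len(rna) - 2 * stem) + ")" * stem


rna = "GGCACUUCGGUGCC"
print(hairpin_stem_length(rna))  # 5
print(dot_bracket(rna))          # (((((....)))))
print(dot_bracket("ACGUCAUG"))   # ........
```

The first sequence folds into a five-pair stem closing a four-base loop, while the second has ends that cannot pair and stays single-stranded in this model.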
Expectations of students’ performance
• Basic understanding of general principles of molecular biology
• Some mathematical and computer science background
• Focus on using computational methods and understanding
general ideas of analysis used in bioinformatics
• Formal description of algorithms and complex methodology
will not be the core elements of this unit
• The core requirement is an understanding of the foundations of
bioinformatics and a hands-on approach

BIOINFORMATICS.ppt History and applications
