Genome Sequencing Projects, Genome Size, Application of sequence information for identification of disease genes
Complete Genome Sequencing
• Whole genome shotgun sequencing
• BAC end sequencing
• Chromosome walking
• End sealing
Reference: http://en.wikipedia.org/wiki/File:Genome_Sizes.png
Cost of Genome Sequencing
Nextgen sequencing methods
• 454 sequencing methods (2006)
    • Principles of pyrophosphate detection (1985, 1988)
• Illumina (Solexa) genome sequencing methods (2007)
• Applied Biosystems ABI SOLiD System (2007)
• Helicos single-molecule sequencing (HeliScope, 2007)
• Pacific Biosciences single-molecule real-time (SMRT) technology (2010)
• Sequenom for nanotechnology-based sequencing
• BioNanomatrix nanofluidics
• RNAP technology
http://www.ncbi.nlm.nih.gov/books/NBK20261/
Sequencing methods
Ref: http://www.wellcome.ac.uk/Education-resources/Teaching-and-education/Animations/DNA/WTDV026689.htm
Ref: http://www.wellcome.ac.uk/Education-resources/Teaching-and-education/Animations/DNA/WTX056046.htm
Ref: http://www.wellcome.ac.uk/Education-resources/Teaching-and-education/Animations/DNA/WTX056051.htm
Ion Torrent
SOLiD Sequencing
http://www.genomesonline.org/cgi-bin/GOLD/index.cgi
http://www.insdc.org/
http://www.ebi.ac.uk/embl/Contact/collaboration.html
Microbial Genome Sequencing
•   JGI – IMG [http://img.jgi.doe.gov/]
•   Broad [http://www.broadinstitute.org/]
•   TIGR (now JCVI) [http://www.jcvi.org/]
•   WashU [http://genome.wustl.edu/]
•   VBI at Virginia Tech [www.vbi.vt.edu]
Human Genome Project
•   October 1990: Human Genome Project started
•   2000: First publication
•   2003: Finished paper
•   NHGRI solicited pilot proposals for ENCODE
•   2005: GWAS (90% lies outside coding regions)
•   RFAs were sought for full ENCODE
•   2007: First report on ENCODE published
•   2012: ENCODE published
What happens next?
• You have 10 million characters: what do you do with them?
    • Locate genes
    • Determine the function of each gene
        • By similarity search
        • By domain search
        • By predicting signal peptides
        • By locating transmembrane regions

Ref: http://www.nature.com/nature/journal/v406/n6797/pdf/406799a0.pdf
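The "locate genes" step can be illustrated with a naive open reading frame (ORF) scan. This is only a sketch under simple assumptions: an ORF is taken to be an ATG followed in the same frame by the first stop codon, both strands are scanned, and the function name `find_orfs` and the `min_codons` threshold are hypothetical choices for illustration. Real gene finders use statistical models rather than this rule.

```python
# Naive ORF scan: an illustrative sketch, not a real gene finder.
START = "ATG"
STOPS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq, min_codons=3):
    """Return (strand, frame, start, end, orf) tuples for ATG...stop runs.

    Coordinates are 0-based and, for the "-" strand, refer to the
    reverse-complemented sequence. min_codons counts the stop codon too.
    """
    seq = seq.upper()
    hits = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == START:
                    start = i  # first ATG opens a candidate ORF
                elif start is not None and codon in STOPS:
                    if (i + 3 - start) // 3 >= min_codons:
                        hits.append((strand, frame, start, i + 3, s[start:i + 3]))
                    start = None  # look for the next ATG in this frame
    return hits


if __name__ == "__main__":
    for hit in find_orfs("ATGAAATAG"):
        print(hit)
```

Candidate ORFs found this way would then go into the function-determination steps listed above (similarity search, domain search, and so on).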
Genome Annotation
Input sequence (example):
  ATGAAGATAGACAG
  CATACTAGCAGCAT
  AGAATAGATAAGAG
  ATAGAAATAGAATA
  AATATAAGAGAGA
• Run 6-frame translation
• Run Blastp with nr
    • Match found: product found, then pathway analysis and other analysis
    • No match: make an hmmsearch
        • Match found: pathway analysis and other analysis
        • No match: unknown genes, then hypothesis
• Also run on the sequence: repeat finding, miRNA finding, tRNAscan, etc.
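The first box of the annotation flow, six-frame translation, can be sketched as follows. This is a minimal illustration that assumes the standard genetic code and an input containing only A, C, G and T; the function names are hypothetical, and unrecognised codons are written as "X". The BLAST and hmmsearch steps are not shown.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def translate(seq):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translation(seq):
    """Return the three forward and three reverse-strand translations."""
    seq = seq.upper()
    rev = seq.translate(COMPLEMENT)[::-1]
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(seq[offset:])
        frames[f"-{offset + 1}"] = translate(rev[offset:])
    return frames


if __name__ == "__main__":
    # First line of the example sequence shown on the slide.
    for name, protein in six_frame_translation("ATGAAGATAGACAG").items():
        print(name, protein)
```

Each of the six protein strings would then be searched against a protein database, which is why translation comes before the Blastp step in the flow.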
Genome Sizes
• Gametic nuclear DNA content
• Represented as mass in pg (picograms) or length in megabases (Mb)

                 1 pg = 10^-12 g
                 1 Mb = 10^6 bases
                 1 pg = 978 Mb

Ref: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1669731/
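The unit relation above can be turned into a small conversion helper. This is a sketch that uses only the factor from the slide (1 pg = 978 Mb); the function names and the 3.5 pg example value are hypothetical and chosen purely for illustration.

```python
# Conversion factor taken from the slide: 1 pg of DNA = 978 Mb.
MB_PER_PG = 978.0


def pg_to_mb(picograms):
    """Convert DNA mass in picograms to length in megabases."""
    return picograms * MB_PER_PG


def mb_to_pg(megabases):
    """Convert DNA length in megabases to mass in picograms."""
    return megabases / MB_PER_PG


if __name__ == "__main__":
    # Hypothetical C-value used only to show the arithmetic.
    print(pg_to_mb(3.5))    # 3.5 * 978 = 3423.0 Mb
    print(mb_to_pg(978.0))  # 1.0 pg
```

Genome size databases report values in either unit, so a helper like this makes entries from different sources comparable.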
Genome Sizes
• Database of Genome Sizes
    • http://www.cbs.dtu.dk/databases/DOGS/
• Plant genome size database
    • http://www.kew.org/genomesize/homepage.html
• Mammalian genome size database
    • http://www.unipv.it/webbio/dbagsdb.htm
• Animal genome size database
    • www.genomesize.com
• Fungal genome size database
    • www.zbi.ee/fungal-genomesize
Lecture 3,4
Ref: http://www.kew.org/genomesize/homepage.html
Ref: http://www.genomesize.com/
Ref: http://www-3.unipv.it/webbio/dbagsh.htm
Ref: http://www.zbi.ee/fungal-genomesize/
Identifying Human Disease Genes
Ref: http://www.ncbi.nlm.nih.gov/books/NBK7561/

• Before 1980, very few genes were recognized
    • Reverse genetics: start from a known gene product, work back to the gene, and do positional cloning
    • Genetic redundancy: multiple genes have the same function
Identification of genes through protein product
1000 Genomes Project
• 1092 genomes of different individuals sequenced
    • 14 populations
    • Low-coverage whole-genome and exome sequencing

• 38 million SNPs
• 1.4 million short insertions and deletions
• More than 14,000 larger deletions

Ref: http://www.nature.com/nature/journal/v491/n7422/full/nature11632.html
