SlideShare a Scribd company logo
2
Most read
6
Most read
9
Most read
[Pink Sherbet Photography]RNAseq analysis: Differential gene expression (2/2)Hopscotch and isoformsAugust 25, 2011
Reads->alignment to reference genome->transcript assemblyResulting file type: BAM, gff/bed“What transcripts are in my samples?”August 25, 2011Transcript assemblyProjectsFastqMappingQuick recap: Mapping and transcript assemblyGarber M, Grabherr MG, Guttman M, Trapnell C. Computational methods for transcriptome annotation and quantification using RNA-seq. Nat Methods. 2011 PMID: 21623353.
RNAseq analysis questionIs there a difference in the transcriptome of two different conditions ?Quantify expressionQuantify differenceAugust 25, 2011Condition1                             Condition2
RNAseqvsExpression ArrayRNAseq can capture a larger dynamic rangeRNAseq can handle degraded samplesGain additional informationNew transcripts(New) isoformsVariantsAugust 25, 2011FlatteningoutArray RNA-seqWang Z, Gerstein M, Snyder M. RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet. 2009 PMID: 19015660
ChallengesStrand-specific methods still biasedNumber of reads not necessarily correlate with transcript abundanceLonger transcripts have more reads (fragmentation). Technical variability between runs causes different number of total reads.Lowly abundant does not mean non-functionalHow to quantify expression of isoformsAugust 25, 2011Ozsolak F, Milos PM. RNA sequencing: advances, challenges and opportunities. Nat Rev Genet. 2011 PMID: 21191423Garber M, Grabherr MG, Guttman M, Trapnell C. Computational methods for transcriptome annotation and quantification using RNA-seq. Nat Methods. 2011 PMID: 21623353.
Production Informatics and BioinformaticsAugust 25, 2011Produce raw sequence readsBasic ProductionInformaticsMap to genome and generate raw genomic features (e.g. SNPs)Advanced Production Inform.Analyze the data; Uncover the biological meaningBioinformaticsResearchPer one-flowcell project
Quantifying expression in RNAseqLong genes get more readsNormalize: fragments per kilobase of transcript per million mapped reads (FPKM) FPKM accounts for the dependency between paired-end readsAugust 25, 2011Garber M, Grabherr MG, Guttman M, Trapnell C. Computational methods for transcriptome annotation and quantification using RNA-seq. Nat Methods. 2011 PMID: 21623353.Oshlack A, Wakefield MJ. Transcript length bias in RNA-seq data confounds systems biology. Biol Direct. 2009 PMID: 19371405
Quantifying expression of overlapping isoformsWe do not know where reads of overlapping isoformsacutally belongAlexa-Seqcounting only the reads that map uniquely to a single isoformisoform-expression methods (cufflinks) likelihood function modeling the sequencing process (not very accurate for lowly expressed transcripts)'exon intersection method’ (analogous to expression microarrays)counts reads mapped to its constitutive exons (reduce power for differential expression analysis)'exon union method’counts all reads mapped to any exon in any of the gene's isoforms (underestimates expression for alternatively spliced genes). August 25, 2011Garber M, Grabherr MG, Guttman M, Trapnell C. Computational methods for transcriptome annotation and quantification using RNA-seq. Nat Methods. 2011 PMID: 21623353.
Differentially expressionWhat is a statistically significant difference between a set of measurements (expression of a gene) of two populations (conditions)First, estimate variabilityObserve biological variability (needs large numbers of replicates to sample the population). model biological variabilitymodel the count variance across replicates as a nonlinear function of the mean counts using various different parametric approaches (such as the normal and negative binomial distributions) (EdgeR, DESeq, Cuffdiff)August 25, 2011Garber M, Grabherr MG, Guttman M, Trapnell C. Computational methods for transcriptome annotation and quantification using RNA-seq. Nat Methods. 2011 PMID: 21623353.
Three things to rememberRNAseq captures larger dynamic range (more sensitive)Additional information compared to arrays (e.g. isoforms)Need to make assumptions/compromises (quantification, few replicates) August 25, 2011[cabbit]
Next Weeks: NGS Discussion group Jake’s topic August 25, 2011Two Weeks:Abstract: This session will focus on identifying SNPs from whole genome, exome capture or targeted resequencing data. The approaches of mapping, local realigment, recalibration, SNP calling, and SNP recalibration will be introduced and quality metrics discussed.

More Related Content

PPTX
Functional genomics, and tools
PPTX
Functional genomics
PPTX
Introduction to sequence alignment partii
PPTX
Applications of genomics and proteomics ppt
PPTX
Transcriptome analysis
PPTX
Genomics
PPTX
Cath
PPTX
Electrophoretic mobility shift assay
Functional genomics, and tools
Functional genomics
Introduction to sequence alignment partii
Applications of genomics and proteomics ppt
Transcriptome analysis
Genomics
Cath
Electrophoretic mobility shift assay

What's hot (20)

PPTX
PPTX
Types of genomics ppt
PPTX
Comparative genomics
PPT
Primer design
PPTX
Structural genomics
PPTX
Microarray technique
PPTX
Comparative genomics
PPTX
Alternative splicing : mechanism and regulation
PPTX
Single nucleotide polymorphism
PPTX
Ribozyme technology
PPTX
History of Genomics
PPT
Maximum parsimony
PPTX
Genome mapping
PPTX
blast bioinformatics
PPTX
Genome editing techniques
PPTX
MULTIPLE SEQUENCE ALIGNMENT
PPTX
Comparative genomics in eukaryotes, organelles
PPT
Microsatellite
PPTX
Whole genome sequencing
Types of genomics ppt
Comparative genomics
Primer design
Structural genomics
Microarray technique
Comparative genomics
Alternative splicing : mechanism and regulation
Single nucleotide polymorphism
Ribozyme technology
History of Genomics
Maximum parsimony
Genome mapping
blast bioinformatics
Genome editing techniques
MULTIPLE SEQUENCE ALIGNMENT
Comparative genomics in eukaryotes, organelles
Microsatellite
Whole genome sequencing
Ad

Similar to Differential gene expression (20)

PPTX
Transcript detection in RNAseq
PPT
Next generation seqencing tecnologies and application vegetable crops
PPTX
Rnaseq forgenefinding
PDF
High-throughput RNA sequencing with Thermostable Group II Intron Reverse Tran...
PPT
MAGIC :Multiparent advanced generation intercross and QTL discovery
PPTX
Marker devt. workshop 27022012
PDF
Visual Exploration of Clinical and Genomic Data for Patient Stratification
PDF
High throughput sequencing with Thermostable Group II Intron Reverse Transcri...
PPTX
Variant (SNPs/Indels) calling in DNA sequences, Part 2
PDF
SBVRLDNACOMP:AN EFFECTIVE DNA SEQUENCE COMPRESSION ALGORITHM
PDF
BIOL335: Functional genomics
PPTX
SAGE- Serial Analysis of Gene Expression
PDF
2015 12-09 nmdd
PPTX
Use of SNP-HapMaps in plant breeding
PDF
An analogy of algorithms for tagging of single nucleotide polymorphism and ev
PDF
Use of TGIRT for ssDNA-seq
PPTX
RNA Sequencing Research
PPTX
ASHG 2015 - Redundant Annotations in Tertiary Analysis
PDF
A decade into Next Generation Sequencing on marine non-model organisms: curre...
Transcript detection in RNAseq
Next generation seqencing tecnologies and application vegetable crops
Rnaseq forgenefinding
High-throughput RNA sequencing with Thermostable Group II Intron Reverse Tran...
MAGIC :Multiparent advanced generation intercross and QTL discovery
Marker devt. workshop 27022012
Visual Exploration of Clinical and Genomic Data for Patient Stratification
High throughput sequencing with Thermostable Group II Intron Reverse Transcri...
Variant (SNPs/Indels) calling in DNA sequences, Part 2
SBVRLDNACOMP:AN EFFECTIVE DNA SEQUENCE COMPRESSION ALGORITHM
BIOL335: Functional genomics
SAGE- Serial Analysis of Gene Expression
2015 12-09 nmdd
Use of SNP-HapMaps in plant breeding
An analogy of algorithms for tagging of single nucleotide polymorphism and ev
Use of TGIRT for ssDNA-seq
RNA Sequencing Research
ASHG 2015 - Redundant Annotations in Tertiary Analysis
A decade into Next Generation Sequencing on marine non-model organisms: curre...
Ad

More from Denis C. Bauer (20)

PPTX
Cloud-native machine learning - Transforming bioinformatics research
PPTX
Translating genomics into clinical practice - 2018 AWS summit keynote
PPTX
Going Server-less for Web-Services that need to Crunch Large Volumes of Data
PPTX
How novel compute technology transforms life science research
PPTX
How novel compute technology transforms life science research
PPTX
VariantSpark: applying Spark-based machine learning methods to genomic inform...
PPTX
Population-scale high-throughput sequencing data analysis
PPTX
Trip Report Seattle
PPTX
Allelic Imbalance for Pre-capture Whole Exome Sequencing
PPTX
Centralizing sequence analysis
PPTX
Qbi Centre for Brain genomics (Informatics side)
PPTX
Functionally annotate genomic variants
PPTX
Variant (SNPs/Indels) calling in DNA sequences, Part 1
PPTX
Introduction to second generation sequencing
PPTX
Introduction to Bioinformatics
PPTX
The missing data issue for HiSeq runs
PDF
Deciphering the regulatory code in the genome
PPT
PPT
STAR: Recombination site prediction
PPT
SUMOylation site prediction
Cloud-native machine learning - Transforming bioinformatics research
Translating genomics into clinical practice - 2018 AWS summit keynote
Going Server-less for Web-Services that need to Crunch Large Volumes of Data
How novel compute technology transforms life science research
How novel compute technology transforms life science research
VariantSpark: applying Spark-based machine learning methods to genomic inform...
Population-scale high-throughput sequencing data analysis
Trip Report Seattle
Allelic Imbalance for Pre-capture Whole Exome Sequencing
Centralizing sequence analysis
Qbi Centre for Brain genomics (Informatics side)
Functionally annotate genomic variants
Variant (SNPs/Indels) calling in DNA sequences, Part 1
Introduction to second generation sequencing
Introduction to Bioinformatics
The missing data issue for HiSeq runs
Deciphering the regulatory code in the genome
STAR: Recombination site prediction
SUMOylation site prediction

Recently uploaded (20)

PDF
Why Quality Childcare in Howell Matters for Your Child’s Future.pdf
PPTX
Long Sleeve Tops for Women - 4 Girls Boutique
PDF
The Lifestyle Benefits and Challenges of Living Alone
PDF
Uniqaya Night Cream For All Skin Type With Natural Ingredients
PDF
PrayerPetals- Where Faith and Womanhood Flourish Together.pdf
PDF
Yoga for life ...........................
PDF
Intro to Culture, Society, and Politics.
PDF
Lifestyle and the Experience of Living a Full Day Without the Use of a Smartp...
PPTX
Faridabad The City That’s Redefining Urban Living.pptx
PDF
What Is Intimate Partner Violence - Understanding the Crisis, Data and Solutions
PPTX
Fashion Jewellery and Clothing Business.pptx
PDF
Home science Resource management project
PPTX
wepik-unveiling-the-influence-of-self-efficacy-on-entrepreneurial-mindset-202...
PPTX
Untitled presentation.pptxjkljlkjlkjlkjlkjlkjlkjlkj
PPTX
Saraf Furniture Reviews – A Story of Trust, Craftsmanship, and Happy Homes.pptx
PPTX
Participation in physical education to promote fitness
DOC
AAMU毕业证学历认证,爱默生学院毕业证ps毕业证
PDF
150 Unique Baby Names for 2025 (With Meanings & Origins)
PPTX
Life is Long about a life and to increase it.pptx
PDF
Equivalent Mass and Its Applications practical ppt
Why Quality Childcare in Howell Matters for Your Child’s Future.pdf
Long Sleeve Tops for Women - 4 Girls Boutique
The Lifestyle Benefits and Challenges of Living Alone
Uniqaya Night Cream For All Skin Type With Natural Ingredients
PrayerPetals- Where Faith and Womanhood Flourish Together.pdf
Yoga for life ...........................
Intro to Culture, Society, and Politics.
Lifestyle and the Experience of Living a Full Day Without the Use of a Smartp...
Faridabad The City That’s Redefining Urban Living.pptx
What Is Intimate Partner Violence - Understanding the Crisis, Data and Solutions
Fashion Jewellery and Clothing Business.pptx
Home science Resource management project
wepik-unveiling-the-influence-of-self-efficacy-on-entrepreneurial-mindset-202...
Untitled presentation.pptxjkljlkjlkjlkjlkjlkjlkjlkj
Saraf Furniture Reviews – A Story of Trust, Craftsmanship, and Happy Homes.pptx
Participation in physical education to promote fitness
AAMU毕业证学历认证,爱默生学院毕业证ps毕业证
150 Unique Baby Names for 2025 (With Meanings & Origins)
Life is Long about a life and to increase it.pptx
Equivalent Mass and Its Applications practical ppt

Differential gene expression

  • 1. [Pink Sherbet Photography]RNAseq analysis: Differential gene expression (2/2)Hopscotch and isoformsAugust 25, 2011
  • 2. Reads->alignment to reference genome->transcript assemblyResulting file type: BAM, gff/bed“What transcripts are in my samples?”August 25, 2011Transcript assemblyProjectsFastqMappingQuick recap: Mapping and transcript assemblyGarber M, Grabherr MG, Guttman M, Trapnell C. Computational methods for transcriptome annotation and quantification using RNA-seq. Nat Methods. 2011 PMID: 21623353.
  • 3. RNAseq analysis questionIs there a difference in the transcriptome of two different conditions ?Quantify expressionQuantify differenceAugust 25, 2011Condition1 Condition2
  • 4. RNAseqvsExpression ArrayRNAseq can capture a larger dynamic rangeRNAseq can handle degraded samplesGain additional informationNew transcripts(New) isoformsVariantsAugust 25, 2011FlatteningoutArray RNA-seqWang Z, Gerstein M, Snyder M. RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet. 2009 PMID: 19015660
  • 5. ChallengesStrand-specific methods still biasedNumber of reads not necessarily correlate with transcript abundanceLonger transcripts have more reads (fragmentation). Technical variability between runs causes different number of total reads.Lowly abundant does not mean non-functionalHow to quantify expression of isoformsAugust 25, 2011Ozsolak F, Milos PM. RNA sequencing: advances, challenges and opportunities. Nat Rev Genet. 2011 PMID: 21191423Garber M, Grabherr MG, Guttman M, Trapnell C. Computational methods for transcriptome annotation and quantification using RNA-seq. Nat Methods. 2011 PMID: 21623353.
  • 6. Production Informatics and BioinformaticsAugust 25, 2011Produce raw sequence readsBasic ProductionInformaticsMap to genome and generate raw genomic features (e.g. SNPs)Advanced Production Inform.Analyze the data; Uncover the biological meaningBioinformaticsResearchPer one-flowcell project
  • 7. Quantifying expression in RNAseqLong genes get more readsNormalize: fragments per kilobase of transcript per million mapped reads (FPKM) FPKM accounts for the dependency between paired-end readsAugust 25, 2011Garber M, Grabherr MG, Guttman M, Trapnell C. Computational methods for transcriptome annotation and quantification using RNA-seq. Nat Methods. 2011 PMID: 21623353.Oshlack A, Wakefield MJ. Transcript length bias in RNA-seq data confounds systems biology. Biol Direct. 2009 PMID: 19371405
  • 8. Quantifying expression of overlapping isoformsWe do not know where reads of overlapping isoformsacutally belongAlexa-Seqcounting only the reads that map uniquely to a single isoformisoform-expression methods (cufflinks) likelihood function modeling the sequencing process (not very accurate for lowly expressed transcripts)'exon intersection method’ (analogous to expression microarrays)counts reads mapped to its constitutive exons (reduce power for differential expression analysis)'exon union method’counts all reads mapped to any exon in any of the gene's isoforms (underestimates expression for alternatively spliced genes). August 25, 2011Garber M, Grabherr MG, Guttman M, Trapnell C. Computational methods for transcriptome annotation and quantification using RNA-seq. Nat Methods. 2011 PMID: 21623353.
  • 9. Differentially expressionWhat is a statistically significant difference between a set of measurements (expression of a gene) of two populations (conditions)First, estimate variabilityObserve biological variability (needs large numbers of replicates to sample the population). model biological variabilitymodel the count variance across replicates as a nonlinear function of the mean counts using various different parametric approaches (such as the normal and negative binomial distributions) (EdgeR, DESeq, Cuffdiff)August 25, 2011Garber M, Grabherr MG, Guttman M, Trapnell C. Computational methods for transcriptome annotation and quantification using RNA-seq. Nat Methods. 2011 PMID: 21623353.
  • 10. Three things to rememberRNAseq captures larger dynamic range (more sensitive)Additional information compared to arrays (e.g. isoforms)Need to make assumptions/compromises (quantification, few replicates) August 25, 2011[cabbit]
  • 11. Next Weeks: NGS Discussion group Jake’s topic August 25, 2011Two Weeks:Abstract: This session will focus on identifying SNPs from whole genome, exome capture or targeted resequencing data. The approaches of mapping, local realigment, recalibration, SNP calling, and SNP recalibration will be introduced and quality metrics discussed.

Editor's Notes

  • #2: http://2.bp.blogspot.com/_BPr6hpMG0tg/TSZdkYDcRvI/AAAAAAAAAjY/ReScIkWNySg/s1600/drink.jpghttp://www.sciencemag.org/content/291/5507/1260.full?sid=23d07e07-ccc5-4b15-8e6d-934a02e9580chttp://biostar.stackexchange.com/questions/6638/rna-seq-analysis