SlideShare a Scribd company logo
RNA/DNA Sequencing
and Genotyping
Analysis
Bryce Christensen, PhD
Director of Services and Statistical Geneticist
Genetic Data Visualization and Analysis
with Golden Helix
Use the Questions pane in
your GoToWebinar window
Questions during
the presentation
Core Features Packages
Core Features
 Powerful Data Management
 Rich Visualizations
 Robust Statistics
 Flexible
 Easy to use
Applications
 Genotype Analysis
 DNA sequence analysis
 CNV Analysis
 RNA-seq differential expression
 Family Based Association
SNP & Variation Suite (SVS)
RNA-seq expression profiling analysis
DNA-seq variant validation Genomic interpretation
GenomeBrowse
Today’s Agenda
RNA-Seq differential expression example using SVS and
GenomeBrowse
2
3 Somatic mutation analysis example
Getting started with GenomeBrowse and new features1
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS
Acknowledgments
 NA12878 WGS data is from Illumina Genome Network
 RNA-seq example data provided by EA, same data is available to
all SVS and GenomeBrowse users.
 Gastric cancer sample pair described here:
- Zang, et al., Exome Sequencing of gastric adenocarcinoma
identifies recurrent somatic mutations in cell adhesion and
chromatin remodeling genes. Nat. Genet. 44, 570-574 (2012).
A big thank you to the Golden Helix product
development team!
Questions?
Use the Questions pane in
your GoToWebinar window
END
Visualization Experience
 Natural zooming and navigation controls that mimic familiar panning and
scrolling actions.
 Coverage and pile-up views with different modes to highlight mismatches
and look for strand bias.
 Deep, stable stacking algorithms to look at all reads in a pile-up, not just
the first 10 or 20.
 Easily generate exportable tables of any data in the viewable window.
 Context-sensitive information by clicking on any feature.
 A dynamic labeling system which gives optimal detail on annotation
features without cluttering the view.
Data Streaming
 Cloud-based repository of public annotations including dbSNP, 1000
Genomes, NHLBI 6500 Exomes, UCSC Known Genes, Ensembl, the
OMIM catalog, and much more.
 All public annotation tracks are hosted on the cloud with optimized on-
demand streaming so that you don’t have to download them to start
viewing data.
 Annotations are updated frequently and automatically by Golden Helix so
that you can have immediate access to the most up-to-date information.
 Additional species are also available including cattle, sheep, major food
crops, model organisms, etc.
Flexible Data Storage Options
 Easily manage repositories of BAM files, whether they are on local hard
drives or network attached storage with an integrated download manager
that can be used to create local copies of cloud-based public or private
data files.
 GenomeBrowse is tightly integrated with the EA Pipeline so that EA
customers have immediate streaming access to their RNA-Seq analysis
outputs, saving terabytes of data download.
 GenomeBrowse users are required to create an account, enabling secure
connections to cloud-based data sources.
- Illumina BaseSpace integration coming soon.
SVS Example: Ogden Syndrome Data
 Lethal X-linked recessive disease affecting males in multiple
generations of a family in Ogden, Utah
 Sequenced 5 members of the family, including 3 carriers, to identify
the causal mutation.
 Dr. Lyon graciously shared this data with GHI.
Study Design
 Five family members sequenced using X-chromosome
exome capture
The Human Reference Sequence
 Genome Reference Consortium (GRCh37)
- Feb 2009, previous was NCBI36 March 2006
- 9 alt loci and 187 patches (11 patch releases)
 Supercontigs: Large unplaced contigs
- Some localized to chr level and some unknown
 Does not include a Mitochondrial reference
- UCSC hg19 includes older NCBI 36 MT
- 1000 genomes project using revised Cambridge
Reference Sequence (rCRS)
- Provide “g1k” reference: includes rCRS, Human
herpesvirus 4 type 1, supercontigs and “decoy”
sequence
 v38 genome coming this summer:
- Incorporate all patches into the reference
- Some allele fixes to have reference match major
Single Nucleotide Variants (i.e. SNVs or SNPs)
 Single base substitution from reference
 Note that “reference” is not always the “major”
allele
 “Multi-allelic” sites have more than 2 cataloged
alleles
Gholson Lyon, 2012
Small Insertions/Deletions
 Generally defined as being < 150bp (often much shorter)
 Frameshift insertions/deletions important “loss of function” class of
variants
- Although InDels divisible by three are “in-frame” when in coding region
 Hard to call consistently. Poor concordance between algorithms.
 Where to call an InDel in a homopolymer?
- GTTTAC
- GTTTTAC
- 01234567
- How do you describe the insertion? Ins of T at 5? Or ins of T at 1?
- CGI in their v1 pipeline preferred calling insertion at end, others at beginning, now
always at beginning
 MNP – Can also be called differently
Copy Number Variants
 Best results with WGS
- CNVs > 10kb pretty accurate.
- Under 10kb problematic.
 Detecting Deletions
- Can see coverage drop to near zero
- Harder to pinpoint breakpoint
- Possible false positives in low-
mapability regions
 Amplifications
- Can see coverage jump
- False positives due sample prep or
sequence artifacts
 Need “baseline,” look at Log
Ratio
- Somatic detection uses normal tissues
- Can have control population
Venter vs Watson WGS CNV-seq
64kbp gain in DDAH1 Gene of NA12878
Structural Variants
 Looking for:
- Balanced rearrangements
- Inversions
- Translocations
- Complex
 Signals to detect SV:
- Paired-end mappings / insert length
- Depth of coverage
- Split-read mapping
 Translocations can result in
“fusion” genes.
- For example BCR-ABL fusion gene
central in pathogenesis certain
leukemias.
Example 1kb Inversion (intron of APP)
Let’s take a look at this one…
Golden Helix
Leaders in Genetic Analytics
 Founded in 1998
 Multi-disciplinary: computer science,
bioinformatics, statistics, genetics
 Software and analytic services
In Everything We Do…
 Empowerment
 Simplicity
 Responsiveness
 Excellence
About Golden Helix
Hundreds of Customers World-Wide
-
Over 750 Published Citations
Visualization
 Genome browsers:
- Validate variant calls
- Look at gene annotations,
problematic regions, population
catalogs
- Compare samples where no
variant called
 Free Genome Browsers:
- IGV
- Popular desktop by Broad
- UCSC
- Web-based, extensive annotations
- GenomeBrowse
- Designed to be publication ready
- Smooth zoom and navigation

More Related Content

PDF
Odyssey Of The IWGSC Reference Genome Sequence: 12 Years 1 Month 28 Days 11 ...
PDF
Toward A Better Understanding Of Plant Genome Structure: Combining NGS, Optic...
PDF
Jan2016 bio nano han cao
PPTX
Jan2016 dnanexus giab uses andrew carroll
PDF
Viral biodiversity in rodents
PPTX
Abrf 2017 hadfield j
PPTX
Sept2016 sv 10_x
PDF
Aug2015 analysis team spiral genetics
Odyssey Of The IWGSC Reference Genome Sequence: 12 Years 1 Month 28 Days 11 ...
Toward A Better Understanding Of Plant Genome Structure: Combining NGS, Optic...
Jan2016 bio nano han cao
Jan2016 dnanexus giab uses andrew carroll
Viral biodiversity in rodents
Abrf 2017 hadfield j
Sept2016 sv 10_x
Aug2015 analysis team spiral genetics

What's hot (20)

PPTX
True Single Molecule Sequencing
PPTX
2011 jeroen vanhoudt_ngs
PPTX
Knowing Your NGS Upstream: Alignment and Variants
PDF
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
PDF
RNASeq Experiment Design
PPTX
Rare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
PPTX
Whole exome sequencing(wes)
PDF
Long read sequencing - LSCC lab talk - fri 5 june 2015
PPTX
Recent advances in CRISPR-CAS9 technology: an alternative to transgenic breeding
PPTX
RNASeq - Analysis Pipeline for Differential Expression
PPTX
Aug2015 analysis team 04 10x genomics
PDF
Overview of methods for variant calling from next-generation sequence data
PPTX
Mason abrf single_cell_2017
PPTX
Bioinfo ngs data format visualization v2
PDF
Data analysis pipelines for NGS applications
PDF
Jan2016 pac bio giab
PPTX
vectorQC: 'A pipeline for assembling and annotation of vectors'
PPTX
NGx Sequencing 101-platforms
PDF
A Tovchigrechko - MGTAXA: a toolkit and webserver for predicting taxonomy of ...
True Single Molecule Sequencing
2011 jeroen vanhoudt_ngs
Knowing Your NGS Upstream: Alignment and Variants
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
RNASeq Experiment Design
Rare Variant Analysis Workflows: Analyzing NGS Data in Large Cohorts
Whole exome sequencing(wes)
Long read sequencing - LSCC lab talk - fri 5 june 2015
Recent advances in CRISPR-CAS9 technology: an alternative to transgenic breeding
RNASeq - Analysis Pipeline for Differential Expression
Aug2015 analysis team 04 10x genomics
Overview of methods for variant calling from next-generation sequence data
Mason abrf single_cell_2017
Bioinfo ngs data format visualization v2
Data analysis pipelines for NGS applications
Jan2016 pac bio giab
vectorQC: 'A pipeline for assembling and annotation of vectors'
NGx Sequencing 101-platforms
A Tovchigrechko - MGTAXA: a toolkit and webserver for predicting taxonomy of ...
Ad

Viewers also liked (20)

PPT
Using Genetic Sequencing to Unravel the Dynamics of Your Superorganism Body
PPT
Dna sequencing
PPT
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
PPTX
Chapter 11 Introduction to Genetics
PPT
SCI 9 Lesson 1 Mar 15 - Introduction to Genetics
PPT
04 DNA Sequencing and Genetic Fingerprinting
PDF
Introduction to Bioinformatics
PPT
B.sc biochem i bobi u-1 introduction to bioinformatics
PPTX
Introduction to genetics for beginners
PPTX
2015 bioinformatics personal_genomics_wim_vancriekinge
PPT
Introduction to DNA and Genetics
PDF
Bioinformatics and BioPerl
PPT
Bioinformatics Project Training for 2,4,6 month
PPTX
Bioinformatics Analysis of Nucleotide Sequences
PPTX
Molecular genetics ppt
PPT
Biotechnology in agriculture and BioInformatics in Agriculture
PDF
Bioinformatics
PDF
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
PPTX
MM - KBAC: Using mixed models to adjust for population structure in a rare-va...
PPTX
opdracht3crisis
Using Genetic Sequencing to Unravel the Dynamics of Your Superorganism Body
Dna sequencing
Bioinformatics n bio-bio-1_uoda_workshop_4_july_2013_v1.0
Chapter 11 Introduction to Genetics
SCI 9 Lesson 1 Mar 15 - Introduction to Genetics
04 DNA Sequencing and Genetic Fingerprinting
Introduction to Bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformatics
Introduction to genetics for beginners
2015 bioinformatics personal_genomics_wim_vancriekinge
Introduction to DNA and Genetics
Bioinformatics and BioPerl
Bioinformatics Project Training for 2,4,6 month
Bioinformatics Analysis of Nucleotide Sequences
Molecular genetics ppt
Biotechnology in agriculture and BioInformatics in Agriculture
Bioinformatics
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
MM - KBAC: Using mixed models to adjust for population structure in a rare-va...
opdracht3crisis
Ad

Similar to Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS (20)

PDF
Using VarSeq to Improve Variant Analysis Research Workflows
PDF
Using VarSeq to Improve Variant Analysis Research Workflows
PPTX
Using the GRCh38 reference assembly for clinical interpretation in VSClinical
PPTX
CS Guest Lecture 2015 10-05 advanced databases
PPTX
CS Lecture 2017 04-11 from Data to Precision Medicine
PDF
Introduction to Apollo for i5k
PDF
The UCSC genome browser: A Neuroscience focused overview
PPTX
2015 functional genomics variant annotation and interpretation- tools and p...
PPTX
Understanding Genome
PPTX
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
PDF
Biological databases
PDF
Beagle Imputation in SVS
PDF
BEAGLE Imputation in SVS for Human & Animal SNP Data
PPTX
Next Generation Sequencing (NGS)
PDF
Exploring new frontiers with next-generation sequencing
PDF
Population-Based DNA Variant Analysis
PPTX
Microarray: A high-throughput tool used to analyze gene expression
PDF
Alignment Approaches II: Long Reads
PPTX
Bioinformaatics for M.Sc. Biotecchnology.pptx
PDF
Apollo Introduction for the Chestnut Research Community
Using VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research Workflows
Using the GRCh38 reference assembly for clinical interpretation in VSClinical
CS Guest Lecture 2015 10-05 advanced databases
CS Lecture 2017 04-11 from Data to Precision Medicine
Introduction to Apollo for i5k
The UCSC genome browser: A Neuroscience focused overview
2015 functional genomics variant annotation and interpretation- tools and p...
Understanding Genome
Genome in a Bottle - Towards new benchmarks for the “dark matter” of the huma...
Biological databases
Beagle Imputation in SVS
BEAGLE Imputation in SVS for Human & Animal SNP Data
Next Generation Sequencing (NGS)
Exploring new frontiers with next-generation sequencing
Population-Based DNA Variant Analysis
Microarray: A high-throughput tool used to analyze gene expression
Alignment Approaches II: Long Reads
Bioinformaatics for M.Sc. Biotecchnology.pptx
Apollo Introduction for the Chestnut Research Community

More from Golden Helix Inc (20)

PDF
Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
PDF
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
PDF
Authoring Clinical Reports in VarSeq
PDF
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
PDF
Two Clinical Workflows - From Unfiltered Variants to a Clinical Report
PDF
The Molecular Sciences Made Personal
PDF
Prediction and Meta-Analysis
PDF
A Walk Through GWAS
PDF
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
PDF
MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...
PDF
Cancer Workflows in VarSeq
PDF
Getting Started with VSWarehouse - The User Experience
PDF
Pharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
PDF
Custom Family Workflows
PDF
Using WES in Distant Relationships to Identify Cardiomyopathy Genes
PDF
Using Clinical Reports as a part of a Gene Panel Pipeline
PDF
Investigating Shared Additive Genetic Variation for Alcohol Dependence
PDF
Personalized Medicine through Tumor Sequencing
PDF
CNV Analysis in VarSeq
PDF
Clinical Reporting Made Easy
Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Authoring Clinical Reports in VarSeq
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
Two Clinical Workflows - From Unfiltered Variants to a Clinical Report
The Molecular Sciences Made Personal
Prediction and Meta-Analysis
A Walk Through GWAS
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...
Cancer Workflows in VarSeq
Getting Started with VSWarehouse - The User Experience
Pharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
Custom Family Workflows
Using WES in Distant Relationships to Identify Cardiomyopathy Genes
Using Clinical Reports as a part of a Gene Panel Pipeline
Investigating Shared Additive Genetic Variation for Alcohol Dependence
Personalized Medicine through Tumor Sequencing
CNV Analysis in VarSeq
Clinical Reporting Made Easy

Recently uploaded (20)

PDF
Placing the Near-Earth Object Impact Probability in Context
PDF
The Land of Punt — A research by Dhani Irwanto
PPTX
Introcution to Microbes Burton's Biology for the Health
PDF
GROUP 2 ORIGINAL PPT. pdf Hhfiwhwifhww0ojuwoadwsfjofjwsofjw
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PDF
Sciences of Europe No 170 (2025)
PPT
Heredity-grade-9 Heredity-grade-9. Heredity-grade-9.
PPTX
Overview of calcium in human muscles.pptx
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PDF
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
PPTX
Science Quipper for lesson in grade 8 Matatag Curriculum
PDF
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
PDF
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
PPTX
Biomechanics of the Hip - Basic Science.pptx
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PPT
6.1 High Risk New Born. Padetric health ppt
PPTX
C1 cut-Methane and it's Derivatives.pptx
PPTX
Seminar Hypertension and Kidney diseases.pptx
Placing the Near-Earth Object Impact Probability in Context
The Land of Punt — A research by Dhani Irwanto
Introcution to Microbes Burton's Biology for the Health
GROUP 2 ORIGINAL PPT. pdf Hhfiwhwifhww0ojuwoadwsfjofjwsofjw
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
Sciences of Europe No 170 (2025)
Heredity-grade-9 Heredity-grade-9. Heredity-grade-9.
Overview of calcium in human muscles.pptx
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
Science Quipper for lesson in grade 8 Matatag Curriculum
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
Biomechanics of the Hip - Basic Science.pptx
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
6.1 High Risk New Born. Padetric health ppt
C1 cut-Methane and it's Derivatives.pptx
Seminar Hypertension and Kidney diseases.pptx

Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS

  • 1. RNA/DNA Sequencing and Genotyping Analysis Bryce Christensen, PhD Director of Services and Statistical Geneticist Genetic Data Visualization and Analysis with Golden Helix
  • 2. Use the Questions pane in your GoToWebinar window Questions during the presentation
  • 3. Core Features Packages Core Features  Powerful Data Management  Rich Visualizations  Robust Statistics  Flexible  Easy to use Applications  Genotype Analysis  DNA sequence analysis  CNV Analysis  RNA-seq differential expression  Family Based Association SNP & Variation Suite (SVS)
  • 4. RNA-seq expression profiling analysis DNA-seq variant validation Genomic interpretation GenomeBrowse
  • 5. Today’s Agenda RNA-Seq differential expression example using SVS and GenomeBrowse 2 3 Somatic mutation analysis example Getting started with GenomeBrowse and new features1
  • 7. Acknowledgments  NA12878 WGS data is from Illumina Genome Network  RNA-seq example data provided by EA, same data is available to all SVS and GenomeBrowse users.  Gastric cancer sample pair described here: - Zang, et al., Exome Sequencing of gastric adenocarcinoma identifies recurrent somatic mutations in cell adhesion and chromatin remodeling genes. Nat. Genet. 44, 570-574 (2012). A big thank you to the Golden Helix product development team!
  • 8. Questions? Use the Questions pane in your GoToWebinar window
  • 9. END
  • 10. Visualization Experience  Natural zooming and navigation controls that mimic familiar panning and scrolling actions.  Coverage and pile-up views with different modes to highlight mismatches and look for strand bias.  Deep, stable stacking algorithms to look at all reads in a pile-up, not just the first 10 or 20.  Easily generate exportable tables of any data in the viewable window.  Context-sensitive information by clicking on any feature.  A dynamic labeling system which gives optimal detail on annotation features without cluttering the view.
  • 11. Data Streaming  Cloud-based repository of public annotations including dbSNP, 1000 Genomes, NHLBI 6500 Exomes, UCSC Known Genes, Ensembl, the OMIM catalog, and much more.  All public annotation tracks are hosted on the cloud with optimized on- demand streaming so that you don’t have to download them to start viewing data.  Annotations are updated frequently and automatically by Golden Helix so that you can have immediate access to the most up-to-date information.  Additional species are also available including cattle, sheep, major food crops, model organisms, etc.
  • 12. Flexible Data Storage Options  Easily manage repositories of BAM files, whether they are on local hard drives or network attached storage with an integrated download manager that can be used to create local copies of cloud-based public or private data files.  GenomeBrowse is tightly integrated with the EA Pipeline so that EA customers have immediate streaming access to their RNA-Seq analysis outputs, saving terabytes of data download.  GenomeBrowse users are required to create an account, enabling secure connections to cloud-based data sources. - Illumina BaseSpace integration coming soon.
  • 13. SVS Example: Ogden Syndrome Data  Lethal X-linked recessive disease affecting males in multiple generations of a family in Ogden, Utah  Sequenced 5 members of the family, including 3 carriers, to identify the causal mutation.  Dr. Lyon graciously shared this data with GHI.
  • 14. Study Design  Five family members sequenced using X-chromosome exome capture
  • 15. The Human Reference Sequence  Genome Reference Consortium (GRCh37) - Feb 2009, previous was NCBI36 March 2006 - 9 alt loci and 187 patches (11 patch releases)  Supercontigs: Large unplaced contigs - Some localized to chr level and some unknown  Does not include a Mitochondrial reference - UCSC hg19 includes older NCBI 36 MT - 1000 genomes project using revised Cambridge Reference Sequence (rCRS) - Provide “g1k” reference: includes rCRS, Human herpesvirus 4 type 1, supercontigs and “decoy” sequence  v38 genome coming this summer: - Incorporate all patches into the reference - Some allele fixes to have reference match major
  • 16. Single Nucleotide Variants (i.e. SNVs or SNPs)  Single base substitution from reference  Note that “reference” is not always the “major” allele  “Multi-allelic” sites have more than 2 cataloged alleles Gholson Lyon, 2012
  • 17. Small Insertions/Deletions  Generally defined as being < 150bp (often much shorter)  Frameshift insertions/deletions important “loss of function” class of variants - Although InDels divisible by three are “in-frame” when in coding region  Hard to call consistently. Poor concordance between algorithms.  Where to call an InDel in a homopolymer? - GTTTAC - GTTTTAC - 01234567 - How do you describe the insertion? Ins of T at 5? Or ins of T at 1? - CGI in their v1 pipeline preferred calling insertion at end, others at beginning, now always at beginning  MNP – Can also be called differently
  • 18. Copy Number Variants  Best results with WGS - CNVs > 10kb pretty accurate. - Under 10kb problematic.  Detecting Deletions - Can see coverage drop to near zero - Harder to pinpoint breakpoint - Possible false positives in low- mapability regions  Amplifications - Can see coverage jump - False positives due sample prep or sequence artifacts  Need “baseline,” look at Log Ratio - Somatic detection uses normal tissues - Can have control population Venter vs Watson WGS CNV-seq 64kbp gain in DDAH1 Gene of NA12878
  • 19. Structural Variants  Looking for: - Balanced rearrangements - Inversions - Translocations - Complex  Signals to detect SV: - Paired-end mappings / insert length - Depth of coverage - Split-read mapping  Translocations can result in “fusion” genes. - For example BCR-ABL fusion gene central in pathogenesis certain leukemias.
  • 20. Example 1kb Inversion (intron of APP) Let’s take a look at this one…
  • 21. Golden Helix Leaders in Genetic Analytics  Founded in 1998  Multi-disciplinary: computer science, bioinformatics, statistics, genetics  Software and analytic services In Everything We Do…  Empowerment  Simplicity  Responsiveness  Excellence About Golden Helix
  • 22. Hundreds of Customers World-Wide
  • 23. - Over 750 Published Citations
  • 24. Visualization  Genome browsers: - Validate variant calls - Look at gene annotations, problematic regions, population catalogs - Compare samples where no variant called  Free Genome Browsers: - IGV - Popular desktop by Broad - UCSC - Web-based, extensive annotations - GenomeBrowse - Designed to be publication ready - Smooth zoom and navigation