Advantages of VarSeq’s Annotation Capabilities
Darby Kammeraad - Field Application Scientist
20 Most Promising Biotech Technology Providers
Top 10 Analytics Solution Providers
Hype Cycle for Life Sciences
Golden Helix – Who We Are
Golden Helix is a global bioinformatics company founded in 1998.
GWAS
Genomic Prediction
Large-N-Population Studies
RNA-Seq
Large-N CNV-Analysis
Variant Warehouse
Centralized Annotations
Hosted Reports
Sharing and Integration
Variant Calling
Filtering and Annotation
Clinical Reports
CNV Analysis
Pipeline: Run Workflows
Cited in over 1100 peer-reviewed publications
Over 350 customers globally
Golden Helix – Who We Are
When you choose a Golden Helix solution, you get more than just software
▪ REPUTATION
▪ TRUST
▪ EXPERIENCE
▪ INDUSTRY FOCUS
▪ THOUGHT LEADERSHIP
▪ COMMUNITY
▪ TRAINING
▪ SUPPORT
▪ RESPONSIVENESS
▪ INNOVATION and SPEED
▪ CUSTOMIZATIONS
Annotation capabilities
Annotation Options in VarSeq
Types | Description | Popular Examples in VarSeq
Gene Tracks | Gene and effect on transcript(s) | RefSeq, Ensembl, dbNSFP
Assemblies | Reference sequence and alignment review | GRCh 37 hg19, GRCh 38/37 g1k, Low complexity regions
Microarray Probe Maps | Matching variant with microarray probe location | Affymetrix Cytogenetic/500K/SNP
Variant/Function | Allele frequencies and functional predictions | gnomAD, ExAC, ICGC, CADD, OMIM, dbSNP, dbNSFP, OncoMD, ClinVar, COSMIC (cancer)
Targeted Panels | Disease specific regions | TruSight (Cancer/Cardio/Autism), Ion AmpliSeq Disease Panel
Data Curation – A Peek Under The Hood
▪ Frequently update annotations – monthly for most (ClinVar, OncoMD, & others)
▪ Annotations come from many disparate sources; we research the best representation of the raw data for each
▪ Variant normalization and transformation ensure precision and sensitivity when matching against genomic data sources
▪ We work with the creators of annotation sources, providing feedback
▪ Substantial savings for clients – multiple Full Time Equivalents
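As a rough illustration of the normalization bullet above, the sketch below trims bases shared by the reference and alternate alleles so that two different spellings of the same variant compare equal. The function name `trim_variant` is hypothetical, and this is only the trimming half of normalization (full left-alignment also needs the reference sequence); it is not a description of how VarSeq itself implements the step.

```python
def trim_variant(pos, ref, alt):
    """Trim bases shared by REF and ALT, keeping at least one base in each.

    pos is a 1-based coordinate. Shared trailing bases are removed first,
    then shared leading bases (advancing pos for each base removed).
    """
    # Remove shared trailing bases.
    while len(ref) > 1 and len(alt) > 1 and ref[-1] == alt[-1]:
        ref, alt = ref[:-1], alt[:-1]
    # Remove shared leading bases and shift the position accordingly.
    while len(ref) > 1 and len(alt) > 1 and ref[0] == alt[0]:
        ref, alt = ref[1:], alt[1:]
        pos += 1
    return pos, ref, alt


# Two spellings of the same deletion reduce to the same record.
same = trim_variant(100, "CTCC", "CCC") == trim_variant(100, "CT", "C")
```

Trimming before lookup is what lets a variant called by one pipeline match the same variant stored differently in an annotation source.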
Clinical assessment - ClinVar
▪ ClinVar – features 414,708 variants
- Public archive from NCBI
- Collaboration of many clinical labs (both commercial and academic)
- Reports the relationship among human variations and phenotypes (supporting evidence from dbSNP)
- Variants found in patient samples, their clinical significance, submitter information, and other supporting data
- Alleles are mapped to reference sequences and reported using HGVS standards
- Submissions can be reviewed by an expert panel.
Clinical assessment - OMIM
▪ OMIM (updated monthly)
- Contains information on all known Mendelian disorders
- Variants (features 20,527 variants): specific variant assertions with clinical annotations and references
- Genes (features 14,825 variants): includes linked phenotypes and their inheritance pattern, with full HTML descriptions
- Phenotypes (features 4,370 variants): linked genes, alternative phenotype names, descriptions, and references
Annotations for Cancer
▪ CIViC (updated monthly) – features 634 variants
- Variant Clinical Evidence Summaries & Region Clinical Evidence Summaries (exon and gene deletions/gains).
- CIViC accepts public knowledge contributions but requires that experts review these submissions.
- Evidence statements & records (response to therapy, prognostic, diagnostic, or predisposing for cancer).
▪ COSMIC Mutations Left Aligned 71 – features 2,151,007 variants
- Catalogs somatic variants discovered in cancer samples.
- Provides details about the frequency, tumor types and histology
- Provides gene level annotations with relevant summary and curated oncology details
- COSMIC breaks out each sample-variant pair into a record
- VarSeq provides the fields in COSMIC with relevant hyperlinks.
Annotations for Cancer
▪ ICGC Simple Somatic Mutations 22 – features 47,879,813 variants
- Currently collects data from 89 committed projects
- Goals related to quality:
  - Ensure that most cancer genes with frequency of >3% are discovered
  - High sequence level resolution
  - High quality standards
  - Control based data (tumor/normal pairs)
- Somatic mutations in 21 primary cancer sites in 21k donors
- Primary site and affected donor frequency.
Annotations for Cancer
▪ OncoMD (updated monthly)
- Variant and Gene Summaries
- Cancer related genes (oncogenes and tumor suppressor genes)
- Effect on protein
- Publications/studies associated with the variant
- Drug Targeting Mutations
- List of open clinical trials
Frequency Tracks – From ExAC to gnomAD
▪ ExAC – features 10,324,246 variants
▪ gnomAD – features 17,439,605 variants
- Major changes from ExAC:
  - Genome (15,496) and exome (123,136) samples
  - gnomAD is a new product (from a data processing perspective)
  - Cohort includes a wider selection of ethnicities (e.g., Ashkenazi Jewish)
  - New/novel ways of flagging low quality variants
Frequency Tracks cont… – NHLBI and 1kGenome
▪ NHLBI – features 2,029,948 variants
- Current release is taken from 6,503 samples
- Focus on heart, lung, and blood disorders
▪ 1kGenome – features 85,823,495 variants
- Project ran from 2008 to 2015. One of the largest catalogs
- Goal: identify variants with frequencies of at least 1%
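One common use of frequency tracks like these is flagging variants that are rare across every catalog that reports them. The sketch below is an illustrative assumption, not VarSeq's filter logic: the function name `is_rare`, the dictionary layout, and the 1% default threshold are all hypothetical choices made for the example.

```python
def is_rare(frequencies, threshold=0.01):
    """Return True when every reported allele frequency is below threshold.

    frequencies maps a catalog name (e.g. "gnomAD") to an allele frequency,
    or to None when the variant is absent from that catalog. A variant
    absent from every catalog is treated as rare.
    """
    observed = [f for f in frequencies.values() if f is not None]
    return all(f < threshold for f in observed)


# Hypothetical variant: seen at 0.2% in one catalog, absent from another.
example = is_rare({"gnomAD": 0.002, "ExAC": None})
```

Treating an absent variant as rare is a design choice; a stricter workflow might instead require that at least one catalog has coverage at the site.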
Functional Prediction Annotations
▪ dbNSFP Functional Predictions and Scores 3.0 – features 82,832,027 variants
- 14 classifier/prediction algorithms: SIFT, Polyphen2, LRT, MutationTaster, MutationAssessor, FATHMM, MetaSVM, MetaLR, VEST, PROVEAN, FATHMM-MKL coding and fitCons
- 8 conservation scores (phyloP46way_primate, phyloP46way_placental, phyloP100way_vertebrate, phastCons46way_primate, phastCons46way_placental, phastCons100way_vertebrate, GERP++ and SiPhy)
▪ dbscSNV Splice Altering Predictions 1.1 – features 15,030,435 variants
- Provides predictions for all SNPs from −3 to +8 at the 5’ splice site and −12 to +2 at the 3’ splice site
- Two ensemble prediction scores, with cut-offs for 95% specificity in calling splice-altering mutations
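The splice-site windows quoted above can be written down as a small lookup. This is a sketch under stated assumptions: the names `SPLICE_WINDOWS` and `in_splice_consensus` are hypothetical, and offsets are assumed to follow the usual convention in which positions run ..., −2, −1, +1, +2, ... with no position 0.

```python
# Consensus windows as quoted on the slide: (lowest offset, highest offset).
SPLICE_WINDOWS = {
    "5prime": (-3, 8),   # 5' splice site: -3 to +8
    "3prime": (-12, 2),  # 3' splice site: -12 to +2
}


def in_splice_consensus(site, offset):
    """Return True if offset falls inside the consensus window for site.

    site is "5prime" or "3prime"; offset is relative to the splice
    junction and must be non-zero (there is no position 0).
    """
    if offset == 0:
        raise ValueError("offset 0 is not defined; positions skip from -1 to +1")
    low, high = SPLICE_WINDOWS[site]
    return low <= offset <= high
```

A variant outside both windows would simply have no splice prediction from this track.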
Functional Prediction Annotations cont…
▪ GWAS Catalog 2015-12-29 – features 22,373 variants
- Identifies location of SNPs
- Lists the associated publication where the SNP was reported (studies assaying at least 100,000 SNPs)
▪ CADD – Interpreting Variants of Clinical Significance
- Provides C-scores of “deleteriousness” for SNVs and indels in the human genome.
- Scores both coding and non-coding regions
- Score based on multiple annotation types:
  - Conservation, population frequency, regulatory, functional/structural
Transcript Annotations
▪ RefSeq – features 84,950 variants
- Includes genomic DNA, transcripts, and proteins
- Effect on transcripts
- HGVS notation
- Sequence ontology of variant in all transcripts in database
▪ Ensembl – features 215,170 variants
- Joint effort from EBI and WTSI
- Annotate, analyze, and display
VarSeq Demonstration
