SlideShare a Scribd company logo
British Columbia Cancer AgencyGenome Sciences CentreVancouver . British Columbia . CanadaComplementing Computation with Visualization in GenomicsMarch 11, 2010EBI Interfaces Interest ForumCydney Nielsen
Discovery pathBiological SampleGenomic DataScientific Insight
Discovery pathBiological SampleGenomic DataScientific Insight
Components of Data AnalysisAutomationAnalysisGenomic DataScientific InsightHuman Judgment
OutlineGenome Assembly VisualizationABySS-ExplorerComplement to genome browsing Using clustering and interactive data exploration
OutlineGenome Assembly VisualizationABySS-ExplorerComplement to genome browsing Using clustering and interactive data exploration
Genome Sequencingcell populationextracted DNAShotgun approachsheared DNAsequencing readsAGCGGATTGCATGACAGTGTACAGCCTGACAGAAGCGCGCTACGATCAGATCAACATGACAGTCCGAGTACATTCAGAATGGTACAGCAG
ABySS – Assembly ByShort SequencesSimpson et al. Genome Res 2009Sequencing read set (read length = 7 nt):GGACATCGGACAGACorresponding de Bruijn graph (k = 5 nt):
ABySS – Assembly ByShort SequencesSimpson et al. Genome Res 2009Sequencing read set (read length = 7 nt):GGACATCGGACAGACorresponding de Bruijn graph (k = 5 nt):ABySS merges unambiguously connected vertices to form contigs
Assembly AmbiguitiesTrue genome sequenceGGATTGAAAAAAAAAAAAAAAAGTAGCACGAATATACATAGAAAAAAAAAAAAAAAAATTACG
Assembly AmbiguitiesTrue genome sequenceGGATTGAAAAAAAAAAAAAAAAGTAGCACGAATATACATAGAAAAAAAAAAAAAAAAATTACGAssembled sequence de Bruijn graph representation
Starting PointShaun Jackman
Example of existing tools: Consed
Example of existing tools: Consed
Complementing Computation with Visualization in Genomics
Complementing Computation with Visualization in Genomics
Complementing Computation with Visualization in Genomics
Properties of DNA
Capture sequence strandAAAAAT2+1+
Capture sequence strandAAAAAT2+1+TTTTTA2-1-
Capture sequence strandAAAAAT1+2+TTTTTA
Capture sequence strandAAAAAT1-2-TTTTTA
Complementing Computation with Visualization in Genomics
Capture sequence lengthone oscillation = 100 nt
Genome Sequencingcell populationextracted DNAread pair informationreadsheared DNAdsDNAfragment(known size)sequencing reads(typically produce millions)AGCGGATTGCATGACAGTreadGTACAGCCTGACAGAAGCGCGCTACGATCAGATCAACATGACAGTCCGAGTACATTCAGAATGGTACAGCAG
Capture read pair informationAfter building the initial single-end (SE) contigs from k-mer sequences, ABySS uses paired-end reads to resolve ambiguities.
Capture read pair informationPaired end read information is used the construct paired end (PE) contigs… 13+  44-  46+  4+  79+  70+ …blue gradient = paired end contigorange = selected single end contig
ABySS-Explorer Visual representation of:
 contig adjacency information
 contig strand
 contig length
 paired-end relationships
 paired-end contigs
 Implemented using the Java Universal Network/Graph Framework (JUNG)
 Applied the Kamada-Kawai layout algorithm (JUNG implementation)
 Use ABySS files as input (version 1.1.0 and higher)
http://www.bcgsc.ca/platform/bioinfo/software/abyss-explorer
Part 1: Conclusions and Future Work Graph encoding provides a integrated display of genome assemblies and associated meta-data
 This representation is particularly powerful for revealing high-level genome assembly structure, not readily viewable in any other interactive tool
 Future work includes:
 support for other assembly algorithm outputs
enable flexible annotation display
 integrate with existing assembly editing toolsOutlineGenome Assembly VisualizationABySS-ExplorerComplement to genome browsing Using clustering and interactive data exploration
Genome Sequencingcell populationextracted DNAsheared DNAsequencing reads(typically produce millions)AGCGGATTGCATGACAGTGTACAGCCTGACAGAAGCGCGCTACGATCAGATCAACATGACAGTCCGAGTACATTCAGAATGGTACAGCAG
Genome Sequencingcell populationextracted DNAsheared DNAsequencing reads(typically produce millions)AGCGGATTGCATGACAGTGTACAGCCTGACAGAAGCGCGCTACGATCAGATCAACATGACAGTCCGAGTACATTCAGAATGGTACAGCAG
Genome Sequencingcell populationChromatin Immunoprecipitationand Sequencing (ChIP-Seq)extracted DNAselectionsheared DNAsequencing reads(typically produce millions)AGCGGATTGCATGACAGTGTACAGCCTGACAGAAGCGCGCTACGATCAGATCAAGTACAGCCTGACAGAAGCCATGACAGTCCGAGTACATTCAGAATGGTACAGCAGTTCAGAATGGTACAGCAG
Align sequences to the genomeCCGAGTACAGCCTGACAGAGCATGACAGTCCGAGTACTTGCATGACAGTCCGAGTAGCGGATTGCATGACAGTAGCGGATTGCATGACAGTAGCGGATTGCATGACAGTReference GenomeAGCGGATTGCATGACAGTCCGAGTACAGCCTGACAGARead coverageGenomic coordinate
Genome browser can reveal local patternsH3K4me3H3K36me3H3K27me3H3K9me3H3K9AcMRE
Difficult to get global overview
Focus on regions of interest1. For example, transcriptional start sites (TSS +/- 3000 nt)H3K4me3H3K9AcH3K4me1H3K36me3MeDIPMRE2. Extract data matricesNormalization for bin i, sample h:3. Cluster matrices (k-means clustering with Euclidean distance)

More Related Content

DOCX
IEEE 2014 JAVA NETWORKING PROJECTS Snapshot and continuous data collection in...
PDF
M.Phil Computer Science Remote Sensing Projects
PDF
M phil-computer-science-remote-sensing-projects
PDF
M.E Computer Science Remote Sensing Projects
PDF
Robust foreground modelling to segment and detect multiple moving objects in ...
PDF
A Novel Penalized and Compensated Constraints Based Modified Fuzzy Possibilis...
PDF
Optimal buffer allocation in
PDF
ME Synopsis
IEEE 2014 JAVA NETWORKING PROJECTS Snapshot and continuous data collection in...
M.Phil Computer Science Remote Sensing Projects
M phil-computer-science-remote-sensing-projects
M.E Computer Science Remote Sensing Projects
Robust foreground modelling to segment and detect multiple moving objects in ...
A Novel Penalized and Compensated Constraints Based Modified Fuzzy Possibilis...
Optimal buffer allocation in
ME Synopsis

Viewers also liked (6)

PPT
Usability Testing is Easy!
DOC
Manoocher's portfolio
PPT
Ensembl Redesign
PDF
PES Vitamin Series - Module 5 - How to Select a Premium Multivitamin
PPT
Usability Testing is Easy! (redux)
PDF
Cocoa for Scientists
Usability Testing is Easy!
Manoocher's portfolio
Ensembl Redesign
PES Vitamin Series - Module 5 - How to Select a Premium Multivitamin
Usability Testing is Easy! (redux)
Cocoa for Scientists
Ad

Similar to Complementing Computation with Visualization in Genomics (20)

PPTX
Genome Assembly copy
PDF
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
PPTX
Rnaseq forgenefinding
PPTX
The Transformation of Systems Biology Into A Large Data Science
PPTX
Kulakova sbb2014
PDF
Integration of single molecule, genome mapping data in a web-based genome bro...
PDF
Report-de Bruijn Graph
PDF
sb400161v
PPTX
Dgaston dec-06-2012
PDF
Accelerating GWAS epistatic interaction analysis methods
PPT
Bioinformatica 08-12-2011-t8-go-hmm
PPTX
Tools for Transcriptome Data Analysis
PPTX
R Analytics in the Cloud
PDF
Computational approaches to the regulatory genomics of neurogenesis
PPT
PPTX
Understanding Genome
PDF
Fpc A Software Package For Physical Maps Fred Engier And Cari Soderlund
PPTX
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS
PDF
Cytoscape Talk 2010
PDF
Apollo Collaborative genome annotation editing
Genome Assembly copy
Interactive Analysis of Large-Scale Sequencing Genomics Data Sets using a Rea...
Rnaseq forgenefinding
The Transformation of Systems Biology Into A Large Data Science
Kulakova sbb2014
Integration of single molecule, genome mapping data in a web-based genome bro...
Report-de Bruijn Graph
sb400161v
Dgaston dec-06-2012
Accelerating GWAS epistatic interaction analysis methods
Bioinformatica 08-12-2011-t8-go-hmm
Tools for Transcriptome Data Analysis
R Analytics in the Cloud
Computational approaches to the regulatory genomics of neurogenesis
Understanding Genome
Fpc A Software Package For Physical Maps Fred Engier And Cari Soderlund
Exploring DNA/RNA-Seq Analysis Results with Golden Helix GenomeBrowse and SVS
Cytoscape Talk 2010
Apollo Collaborative genome annotation editing
Ad

More from Francis Rowland (20)

PPTX
Sabotage
PDF
Visual note-taking: listening, learning, remembering
PDF
A UX Journey into the World of Early Drug Discovery - UX Cambridge 2015
PDF
Les super pouvoirs du sketching
PDF
Le Design Studio
PPTX
Useful questions to ask when designing data visualisations
PDF
Jeux d'Innovation (FLUPA UX Day 2013)
PDF
What the heck are sketchnotes?
PDF
VIZBI 2013 - UX design tutorial
PDF
User research: the gentle art of not asking users what they want
KEY
Design for Society
PPT
Why usability problems go unfixed - UX Bristol 2012
PDF
Vizbi 2012 Takeaway
KEY
The user experience of EGA data access
KEY
Speed sketching UX Cambridge 2011
KEY
Drupal at the EBI
PPT
Reactome: Usability testing - is it useful?
PPT
Caroline Jarrett: Forms and their Users
PPT
Design Prototyping
PPTX
Gene Expression Atlas user interface
Sabotage
Visual note-taking: listening, learning, remembering
A UX Journey into the World of Early Drug Discovery - UX Cambridge 2015
Les super pouvoirs du sketching
Le Design Studio
Useful questions to ask when designing data visualisations
Jeux d'Innovation (FLUPA UX Day 2013)
What the heck are sketchnotes?
VIZBI 2013 - UX design tutorial
User research: the gentle art of not asking users what they want
Design for Society
Why usability problems go unfixed - UX Bristol 2012
Vizbi 2012 Takeaway
The user experience of EGA data access
Speed sketching UX Cambridge 2011
Drupal at the EBI
Reactome: Usability testing - is it useful?
Caroline Jarrett: Forms and their Users
Design Prototyping
Gene Expression Atlas user interface

Recently uploaded (20)

PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
cuic standard and advanced reporting.pdf
PPTX
Cloud computing and distributed systems.
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Approach and Philosophy of On baking technology
PDF
Machine learning based COVID-19 study performance prediction
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Encapsulation theory and applications.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Electronic commerce courselecture one. Pdf
PDF
A comparative analysis of optical character recognition models for extracting...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Dropbox Q2 2025 Financial Results & Investor Presentation
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
cuic standard and advanced reporting.pdf
Cloud computing and distributed systems.
The Rise and Fall of 3GPP – Time for a Sabbatical?
Approach and Philosophy of On baking technology
Machine learning based COVID-19 study performance prediction
Network Security Unit 5.pdf for BCA BBA.
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Mobile App Security Testing_ A Comprehensive Guide.pdf
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Encapsulation theory and applications.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Programs and apps: productivity, graphics, security and other tools
Reach Out and Touch Someone: Haptics and Empathic Computing
Assigned Numbers - 2025 - Bluetooth® Document
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Electronic commerce courselecture one. Pdf
A comparative analysis of optical character recognition models for extracting...

Complementing Computation with Visualization in Genomics