JBrowse &
Inter-mine
communication
Vivek Krishnakumar
J. Craig Venter Institute
vkrishna@jcvi.org
2017 InterMine Workshop and
Hackathon
Thursday, March 30th @ 11:10am
Overview
➔InterMine <-> JBrowse Integration
➔Extending FriendlyMines
➔MedicMine & ThaleMine
➔Summary
➔Acknowledgements
➔FIN
InterMine <->
JBrowse integration
➔JBrowse & REST API spec
➔Components in InterMine <->
JBrowse integration
➔Issues with initial
implementation
➔Fixes implemented in InterMine
core and JBrowse
JBrowse & REST API spec
• JBrowse (http://jbrowse.org)
• Javascript based portable genome browser tool developed by GMOD community
• Ships with two REST Store adapters: Features and Names
• REST Features Store handles genomic features within specified region:
• :base/stats/global
• :base/stats/region/:refseq?start=:start&end=:end
• :base/stats/regionFeatureDensities/:refseq?start=:start&end=:end&basesPerBin=20000
• :base/features/:refseq?start=:start&end=:end
• REST Names Store handles search and retrieval of features by identifier:
• :base/names/?startswith=:searchterm
• :base/names/?equals=:searchterm
• Accepted response type: application/json
http://gmod.org/wiki/JBrowse_Configuration_Guide#Writing_JBrowse-compatible_Web_Services
Components in InterMine <-> JBrowse
integration
• populate-child-features postprocessor task
• Processes SequenceFeature entities to ensure that feature hierarchy is correctly structured (based on SO)
• Genomic feature Engine (API)
• Returns genomic feature data in JBrowse compatible JSON format
• Config generator service (API)
• Returns feature tracks (trackList.json) & reference sequence listing (refSeqs.json)
• Names service (API)
• Returns list of genomic features matching the input search term
• Embedded ReportDisplayer
• Injects (via iframe) JBrowse visualization into InterMine report page
http://intermine.readthedocs.io/en/latest/webapp/third-party-tools/jbrowse/
Issues with initial
implementation
• populate-child-features postprocessor
• Failed to correctly construct Gene model
JSON object
|- gene:1
|----- mRNA:1
|----- exon:1
|----- exon:2
|----- mRNA:2
|----- exon:3
• Feature Engine & Config generator
• Failed to properly convert InterMine Class
names to SO terms
e.g. AntisenseLncRNA → antisense_lncRNA`
• Failed to specify transcript type in track
configuration
• Data issues
• Failed to represent Gene model CDS entity as
disjointed parts due to CDS loading
mechanism
e.g. FlyBaseCDSFastaLoader,
AraportCDSFastaLoader,
ChadoSequenceProcessor
• populate-child-features postprocessor
• Improved logic populates the childFeatures
collection appropriately
|- gene:1
|----- mRNA:1
|----- exon:1
|----- exon:2
|----- mRNA:2
|----- exon:3
• Feature Engine & Config generator
• Automatically resolves IM Class to SO type
• Ability to specify which tracks to display and
define extra parameters via web.properties
config
• Fixes to ProcessedTranscript glyph (in JBrowse)
• Automatically infers disjointed CDS parts from
exon parts and total CDS span
Fixes made to InterMine
& JBrowse core
intermine/#1454
intermine/#1426
jbrowse/#872
@sergiocontrino
@justinccdev
@vivekkrish
Extending
FriendlyMine
Functionality
➔Current state of Inter-mine
communication based on
standard Homology data model
➔FriendlyMines extended to
support alternate Homology
data model(s)
➔Extending FriendlyMines to
support new Entity types (e.g.
SyntenyBlocks)
Inter-mine communication using Homology data
1
2 3
Homologue data model
Friendly mine config in web.properties
Extending FriendlyMines to support alternate
Homology model(s)
Standard Model
Class Name Homologue
Primary Entity gene
Related Entity homologue
Relationship type (orthologue,
paralogue, LDO, etc.)
PhytoMine Data Model
Homolog
gene1
gene2
type (NO VALUE)
Method NA method (one-to-one, one-to-
many, many-to-many)
FriendlyMines extended to support alternate
Homology model(s)
1
2
3
stable/e93905e
@vivekkrish
Homolog data modelFriendly mine config in web.properties
stable/7800633
@vivekkrish
Extend FriendlyMines to support new Entity type
(Synteny Block)
3
@sammyjav
a
1
2
Medicago Genome
& Arabidopsis
Information Portal
➔MedicagoGenome.org &
MedicMine
➔Araport.org & ThaleMine
➔Additions to ThaleMine
MedicagoGenome.org
Communit
y
Orthologs
(Phytozome, LIS)
eFP Browser
(BAR)
Real-
time
Data
Centers
Communit
y
RNA-Seq
expression
(NCBI SRA)
Publications
(NCBI)
Pathways
(KEGG)
Mt4.0
• Genome sequence
• Genome
annotationMedicMin
e
Gene List
Analysis
Gene
Report
Query, Web
Services
MedicMine - Data Warehouse
Warehous
e
Data
Sources
Mt Gene Indices
(DFCI)
Krishnakumar et al. 2014, Plant Cell
Physiol
Araport.org
Community
Orthologs
(PhytoMine)
Co-expression
(ATTED)
Real-time
Data
Centers
Community
Protein
Interactions
(InAct, BioGrid)
Gene expression
(Araport, BAR)
Publications
(NCBI, UniProt)
Pathways
(KEGG)
Araport11
ThaleMine
Gene List
Analysis
Gene
Report
Query,
Data Tables
Web
Services
ThaleMine - Data Warehouse
Warehous
e
Araport
Data SourcesKrishnakumar et al. 2016, Plant Cell
Physiol
Reannotation of
Col-0 genome
(Araport)
Additions to ThaleMine
Heatmap for 113 RNA-Seq Studies
Gene Report
Gene List Analysis
Gene Report (e.g. dicer-like 1/AT1G01040)
Medicag
o
JGI
PhytoMine
Huma
nRNA-seq
study
RNA-seq
Gen
e
Genes
Summary
• Issues plaguing the functionality of InterMine’s zero-config JBrowse
integration have been resolved
• DISCUSSION: Fold-in changes made to JBrowse code base into InterMine (for
maintainability)
• FriendlyMine functionality extended by implementing link classes to support
alternative Homology data model(s)
• FriendlyMine functionality extended further to support Synteny based
interlinking of InterMine instances
• Araport and MedicagoGenome.org implement InterMine warehouses,
ThaleMine and MedicMine, respectively.
Acknowledgments
• JCVI
• Christopher D. Town (PI: Araport, LegFed)
• Agnes P. Chan (co-PI: Araport, LegFed)
• Cambridge
• Sergio Contrino (Araport Software Dev)
• Gos Micklem (co-PI: Araport)
• Justin Clark-Casey
• Julie Sullivan
• NCGR
• Samuel Hokin (LIS Lead Software Dev)
• Andrew Farmer (co-PI: LegFed)
• JGI Phytozome
• Joe Carlson (Lead Software Dev)
• InterMine and GMOD community
Thank you!

More Related Content

PPTX
Quick Intro to InterMine within AIP and MTGD - JCVI Research Works-in-Progres...
PPTX
Interoperation between InterMines
ODP
BiVi Intermine Slides
PPTX
InterMine Infrastructure LF Meeting 20150428
PDF
InterMine Data Browser - Google Summer of Code 2018
PPTX
Designing a community resource - Sandra Orchard
PDF
Adapt to2012 oak - the new repository
ODP
B03-GenomeContent-Intermine
Quick Intro to InterMine within AIP and MTGD - JCVI Research Works-in-Progres...
Interoperation between InterMines
BiVi Intermine Slides
InterMine Infrastructure LF Meeting 20150428
InterMine Data Browser - Google Summer of Code 2018
Designing a community resource - Sandra Orchard
Adapt to2012 oak - the new repository
B03-GenomeContent-Intermine

More from Vivek Krishnakumar (7)

PPTX
What's New at Araport - ICAR 2017
PPTX
Integrate JBrowse REST API Framework with Adama Federation Architecture
PDF
Teaching Bioinformatics data analysis using Medicago truncatula as a model - ...
PDF
Araport Data Integration - 2015 UMD Minisymposium
PPTX
JBrowse within the Arabidopsis Information Portal - PAG XXIII
PDF
Tripal within the Arabidopsis Information Portal - PAG XXIII
PPTX
Tutorial 1: Your First Science App - Araport Developer Workshop
What's New at Araport - ICAR 2017
Integrate JBrowse REST API Framework with Adama Federation Architecture
Teaching Bioinformatics data analysis using Medicago truncatula as a model - ...
Araport Data Integration - 2015 UMD Minisymposium
JBrowse within the Arabidopsis Information Portal - PAG XXIII
Tripal within the Arabidopsis Information Portal - PAG XXIII
Tutorial 1: Your First Science App - Araport Developer Workshop
Ad

Recently uploaded (20)

PDF
S2 SOIL BY TR. OKION.pdf based on the new lower secondary curriculum
PPTX
Introcution to Microbes Burton's Biology for the Health
PPTX
limit test definition and all limit tests
PPTX
Substance Disorders- part different drugs change body
PDF
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
PPTX
Microbes in human welfare class 12 .pptx
PPT
Animal tissues, epithelial, muscle, connective, nervous tissue
PDF
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
PPTX
PMR- PPT.pptx for students and doctors tt
PPTX
INTRODUCTION TO PAEDIATRICS AND PAEDIATRIC HISTORY TAKING-1.pptx
PPTX
POULTRY PRODUCTION AND MANAGEMENTNNN.pptx
PPT
THE CELL THEORY AND ITS FUNDAMENTALS AND USE
PPT
Presentation of a Romanian Institutee 2.
PPTX
perinatal infections 2-171220190027.pptx
PDF
Science Form five needed shit SCIENEce so
PPTX
BODY FLUIDS AND CIRCULATION class 11 .pptx
PPT
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
PPT
Mutation in dna of bacteria and repairss
PPTX
gene cloning powerpoint for general biology 2
PDF
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
S2 SOIL BY TR. OKION.pdf based on the new lower secondary curriculum
Introcution to Microbes Burton's Biology for the Health
limit test definition and all limit tests
Substance Disorders- part different drugs change body
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
Microbes in human welfare class 12 .pptx
Animal tissues, epithelial, muscle, connective, nervous tissue
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
PMR- PPT.pptx for students and doctors tt
INTRODUCTION TO PAEDIATRICS AND PAEDIATRIC HISTORY TAKING-1.pptx
POULTRY PRODUCTION AND MANAGEMENTNNN.pptx
THE CELL THEORY AND ITS FUNDAMENTALS AND USE
Presentation of a Romanian Institutee 2.
perinatal infections 2-171220190027.pptx
Science Form five needed shit SCIENEce so
BODY FLUIDS AND CIRCULATION class 11 .pptx
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
Mutation in dna of bacteria and repairss
gene cloning powerpoint for general biology 2
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
Ad

JBrowse and Inter-"Mine" Communication - IMDEV 2017

  • 1. JBrowse & Inter-mine communication Vivek Krishnakumar J. Craig Venter Institute vkrishna@jcvi.org 2017 InterMine Workshop and Hackathon Thursday, March 30th @ 11:10am
  • 2. Overview ➔InterMine <-> JBrowse Integration ➔Extending FriendlyMines ➔MedicMine & ThaleMine ➔Summary ➔Acknowledgements ➔FIN
  • 3. InterMine <-> JBrowse integration ➔JBrowse & REST API spec ➔Components in InterMine <-> JBrowse integration ➔Issues with initial implementation ➔Fixes implemented in InterMine core and JBrowse
  • 4. JBrowse & REST API spec • JBrowse (http://jbrowse.org) • Javascript based portable genome browser tool developed by GMOD community • Ships with two REST Store adapters: Features and Names • REST Features Store handles genomic features within specified region: • :base/stats/global • :base/stats/region/:refseq?start=:start&end=:end • :base/stats/regionFeatureDensities/:refseq?start=:start&end=:end&basesPerBin=20000 • :base/features/:refseq?start=:start&end=:end • REST Names Store handles search and retrieval of features by identifier: • :base/names/?startswith=:searchterm • :base/names/?equals=:searchterm • Accepted response type: application/json http://gmod.org/wiki/JBrowse_Configuration_Guide#Writing_JBrowse-compatible_Web_Services
  • 5. Components in InterMine <-> JBrowse integration • populate-child-features postprocessor task • Processes SequenceFeature entities to ensure that feature hierarchy is correctly structured (based on SO) • Genomic feature Engine (API) • Returns genomic feature data in JBrowse compatible JSON format • Config generator service (API) • Returns feature tracks (trackList.json) & reference sequence listing (refSeqs.json) • Names service (API) • Returns list of genomic features matching the input search term • Embedded ReportDisplayer • Injects (via iframe) JBrowse visualization into InterMine report page http://intermine.readthedocs.io/en/latest/webapp/third-party-tools/jbrowse/
  • 6. Issues with initial implementation • populate-child-features postprocessor • Failed to correctly construct Gene model JSON object |- gene:1 |----- mRNA:1 |----- exon:1 |----- exon:2 |----- mRNA:2 |----- exon:3 • Feature Engine & Config generator • Failed to properly convert InterMine Class names to SO terms e.g. AntisenseLncRNA → antisense_lncRNA` • Failed to specify transcript type in track configuration • Data issues • Failed to represent Gene model CDS entity as disjointed parts due to CDS loading mechanism e.g. FlyBaseCDSFastaLoader, AraportCDSFastaLoader, ChadoSequenceProcessor • populate-child-features postprocessor • Improved logic populates the childFeatures collection appropriately |- gene:1 |----- mRNA:1 |----- exon:1 |----- exon:2 |----- mRNA:2 |----- exon:3 • Feature Engine & Config generator • Automatically resolves IM Class to SO type • Ability to specify which tracks to display and define extra parameters via web.properties config • Fixes to ProcessedTranscript glyph (in JBrowse) • Automatically infers disjointed CDS parts from exon parts and total CDS span Fixes made to InterMine & JBrowse core intermine/#1454 intermine/#1426 jbrowse/#872 @sergiocontrino @justinccdev @vivekkrish
  • 7. Extending FriendlyMine Functionality ➔Current state of Inter-mine communication based on standard Homology data model ➔FriendlyMines extended to support alternate Homology data model(s) ➔Extending FriendlyMines to support new Entity types (e.g. SyntenyBlocks)
  • 8. Inter-mine communication using Homology data 1 2 3 Homologue data model Friendly mine config in web.properties
  • 9. Extending FriendlyMines to support alternate Homology model(s) Standard Model Class Name Homologue Primary Entity gene Related Entity homologue Relationship type (orthologue, paralogue, LDO, etc.) PhytoMine Data Model Homolog gene1 gene2 type (NO VALUE) Method NA method (one-to-one, one-to- many, many-to-many)
  • 10. FriendlyMines extended to support alternate Homology model(s) 1 2 3 stable/e93905e @vivekkrish Homolog data modelFriendly mine config in web.properties
  • 11. stable/7800633 @vivekkrish Extend FriendlyMines to support new Entity type (Synteny Block) 3 @sammyjav a 1 2
  • 12. Medicago Genome & Arabidopsis Information Portal ➔MedicagoGenome.org & MedicMine ➔Araport.org & ThaleMine ➔Additions to ThaleMine
  • 14. Communit y Orthologs (Phytozome, LIS) eFP Browser (BAR) Real- time Data Centers Communit y RNA-Seq expression (NCBI SRA) Publications (NCBI) Pathways (KEGG) Mt4.0 • Genome sequence • Genome annotationMedicMin e Gene List Analysis Gene Report Query, Web Services MedicMine - Data Warehouse Warehous e Data Sources Mt Gene Indices (DFCI) Krishnakumar et al. 2014, Plant Cell Physiol
  • 16. Community Orthologs (PhytoMine) Co-expression (ATTED) Real-time Data Centers Community Protein Interactions (InAct, BioGrid) Gene expression (Araport, BAR) Publications (NCBI, UniProt) Pathways (KEGG) Araport11 ThaleMine Gene List Analysis Gene Report Query, Data Tables Web Services ThaleMine - Data Warehouse Warehous e Araport Data SourcesKrishnakumar et al. 2016, Plant Cell Physiol Reannotation of Col-0 genome (Araport)
  • 17. Additions to ThaleMine Heatmap for 113 RNA-Seq Studies Gene Report Gene List Analysis Gene Report (e.g. dicer-like 1/AT1G01040) Medicag o JGI PhytoMine Huma nRNA-seq study RNA-seq Gen e Genes
  • 18. Summary • Issues plaguing the functionality of InterMine’s zero-config JBrowse integration have been resolved • DISCUSSION: Fold-in changes made to JBrowse code base into InterMine (for maintainability) • FriendlyMine functionality extended by implementing link classes to support alternative Homology data model(s) • FriendlyMine functionality extended further to support Synteny based interlinking of InterMine instances • Araport and MedicagoGenome.org implement InterMine warehouses, ThaleMine and MedicMine, respectively.
  • 19. Acknowledgments • JCVI • Christopher D. Town (PI: Araport, LegFed) • Agnes P. Chan (co-PI: Araport, LegFed) • Cambridge • Sergio Contrino (Araport Software Dev) • Gos Micklem (co-PI: Araport) • Justin Clark-Casey • Julie Sullivan • NCGR • Samuel Hokin (LIS Lead Software Dev) • Andrew Farmer (co-PI: LegFed) • JGI Phytozome • Joe Carlson (Lead Software Dev) • InterMine and GMOD community