Lowering barriers to
publishing biological data on
          the web
           Brad Chapman
        Department of Molecular Biology
        Massachusetts General Hospital
              Boston, MA USA
           chapmanb@50mail.com
      http://friendfeed.com/chapmanb


            27 June 2009
Motivation

    Web accessible
    Interoperable in standard formats
    Displays for browsing
    Analyses
    Scale
Current state: Reusable libraries
      Parse file formats
      Run programs
      Build analysis pipelines
      Communities

  Python examples
     Biopython                   pygr
     bx-python                   PyCogent
Current state: Database schemas


     Represent biological data
     Expand analyses beyond flat files
     Interoperate with standards

BioSQL                Chado
Current state: Web applications
Faster and Bigger
Proposal
    Provide
           Reusable presentation components
           Quickly deployable frameworks

    Integrate
           Bioinformatics libraries
           Database schemas
           Web development frameworks
Proposal
http://biosqlweb.appspot.com/
Challenges: Design
     Reusable
         Components: avoid large framework
         Multi-language: javascript front end
     Accessible
         Automated data retrieval (REST)
         Standard formats (GFF, RDF)
     Available
         Creative Commons
         http://creativecommons.org/about/licenses
         Open Data Commons
         http://www.opendatacommons.org/licenses/
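
The "Accessible" bullets above (automated retrieval over REST, output in a standard format such as GFF) can be sketched roughly as follows. This is a minimal illustration, not code from the talk or from any existing project: the `/features/<seqid>.gff` path layout, the in-memory `FEATURES` data, and all function names are hypothetical. A real deployment would likely pull features from a schema such as BioSQL and sit behind a web framework.

```python
# Characters that GFF3 requires to be percent-encoded in column 9.
_ESCAPES = {
    "%": "%25", ";": "%3B", "=": "%3D", "&": "%26", ",": "%2C",
    "\t": "%09", "\n": "%0A", "\r": "%0D",
}


def escape_gff3(value):
    """Percent-encode only the characters reserved in GFF3 attributes."""
    return "".join(_ESCAPES.get(ch, ch) for ch in str(value))


def gff3_line(seqid, source, ftype, start, end, strand="+",
              score=".", phase=".", attributes=None):
    """Format one feature as a nine-column, tab-delimited GFF3 line.

    Coordinates are 1-based and inclusive, as GFF3 expects.
    """
    attrs = attributes or {}
    column9 = ";".join(
        f"{escape_gff3(key)}={escape_gff3(val)}" for key, val in attrs.items()
    ) or "."
    fields = [seqid, source, ftype, str(start), str(end),
              str(score), strand, str(phase), column9]
    return "\t".join(fields)


# Hypothetical stand-in for features stored in a database.
FEATURES = {
    "example_seq": [
        {"source": "demo", "ftype": "gene", "start": 100, "end": 900,
         "strand": "+",
         "attributes": {"ID": "gene1", "Name": "demo gene; test"}},
    ],
}


def handle_get(path):
    """Map a REST-style path such as /features/<seqid>.gff to (status, body)."""
    parts = path.strip("/").split("/")
    if len(parts) == 2 and parts[0] == "features" and parts[1].endswith(".gff"):
        seqid = parts[1][:-len(".gff")]
        if seqid in FEATURES:
            lines = ["##gff-version 3"]
            lines += [gff3_line(seqid, **feat) for feat in FEATURES[seqid]]
            return 200, "\n".join(lines) + "\n"
    return 404, "not found\n"


if __name__ == "__main__":
    status, body = handle_get("/features/example_seq.gff")
    print(status)
    print(body, end="")
```

Keeping the formatter separate from the path handling is one way to follow the "components, not a large framework" bullet: the same `gff3_line` function could be reused behind any web framework or from a command-line script.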
Challenges: Community questions

 How do we...
    provide plug-in components?
    leverage existing code?
    make reuse easier?
    communicate about these issues?
