SlideShare a Scribd company logo
Constellation Technologies
          & GeneStack
       Development of Sequence
      Services 2 in the Constellation
               Framework

1
Constellation
Experts in big data and bioinformatics
• Spin out from STFC (Science and Technology Facilities Council)
   – Largest research facility in UK specialising in large data computing
        • CERN, European physics and astronomy science
        • Supporting all UK disciplines in computing
• Strong IT & Bioinformatics expertise
   – Strong Bioinformatics delivery expertise
   – Strong connections into European academia
   – Excellent access to newly developed applications, tools and algorithms
• Supplier of cloud computing services to large Pharma.
• Partners for Pistoia SS2
   – Microsoft Azure
   – STFC
2
Constellation’s “Roadmap”
                                                       Text                   Genome
                                                  Mining/Search               Analysis
  Core
                                                        Data
                                                                             “AppMarket”
                                                     Integration


         Service                                 Service           Service         Service




                                                       “Workflow Management”




 API
                   Seamless Integration with Client systems
Bioinformatics
                                   IT




• Bioinformatics                       • IT
    –   Novel Algorithms                      –   Platform Design
    –   Research                              –   Support
    –   Scientific support                    –   Maintenance
    –   Discovery                             –   Testing
    –   Analysis                              –   Stability / Scalability
    –   Value Added                           –   Security
4
Hosting
                                Cloud




• Hosted                             • Cloud
    –   Single Vendor                    –   Vendor Agnostic
    –   Hardware limitations             –   As required
    –   Restricted storage               –   Selectable storage
    –   Limited cost models              –   Best model available
    –   “Lock in”                        –   “Flexible”


5
Cloud Vision
                                    Flexible                     Flexible
                                    Compute                      Storage

                                               Vendor Agnostic



    Academic or                                                                                 Client
     bespoke                                                                                   Business
     solutions                                                                                  Logic




        Client                                                                                Minimise
      Applications                                                                            Support




                        Virtual                                             “Bioinformatics
                     Organisation                                            Marketplace”




                                                   True
                                                   Cloud



6
High Level Architecture

                       Portal


Bioinformatics                           Deployed
                     Workflow UI
      UIs                              Workflow (Apps)

Bioinformatics             Workflow     Tools
   Systems            Bioinformatics   Applications

                 Distributed Compute

                 Distributed Storage
Our goal for SS2
• We believed the end goal was a flexible platform where ALL the
  application described in SS2 scope could be deployed for individual clients
  as required.
• Platform should be scalable where security, support and maintenance
  can be easily managed.
    – Reducing support costs allows for more focus on research
• Bioinformatics applications added as required:
    – GeneStack (Analysis Portal)
    – VIB (Arctix) (Workflow) (in discussion)
    – EBI (Services) (in discussion)
• Workflow delivered as a fundamental development principle
• Development of the “AppMarket” for Bioinformatics

8
Secure
Scalable
Storage



               Company
Workflow        Specific
 Core



               Integrating
                3rd Party
                Systems
Integration     Future
With other    Development
 Systems

9
Deliverables achieved
•    Portal with access to all the “Must Have” Web Services described in the SS2
     documentation
      –   Constellation Managed Administration Interface to allow organisational mapping of users to
          Programs / Projects / Applications
• “Tool Box” of Integrated Applications
      –   Galaxy
      –   Secure Ensembl
      –   Secure CellProfiler
      –   Content Search (New development)
•    Galaxy workflow engine with integrating applications deployed as a secure web
     application to cover “Must Have” tools
      –   Restricted set of apps based on feedback from “testing pool” (Restrictions based on Need/Security)
      –   Tools can be added on request
•    Scalable storage and compute (dependant on need and security)
      –   Structured Program - Project – User mapping
      –   Cost effective data storage and compute
•    Initial Integration with another Bioinformatics Vendor (GeneStack)


10
Other Available SAAS tools
• Secure EnsEMBL
     – Private copy of EnsEMBL (Rackspace)
     – Secure UI and API Access
     – Ability to map DAS (secure or Public)
• Parallelised CellProfiler
     – Private scalable version of CellProfiler on Azure


11

More Related Content

PPTX
Sequence Services Phase 2--Hewlett-Packard
PDF
Infosys sequence services proof of concept
PPTX
Secure Big Data Analytics - Hadoop & Intel
PPTX
Oracle Data Warehouse
PDF
Proclarity
PDF
Ugif 12 2011-informix iwa
PDF
Accel Partners New Data Workshop 7-14-10
PDF
The End of Appliances
Sequence Services Phase 2--Hewlett-Packard
Infosys sequence services proof of concept
Secure Big Data Analytics - Hadoop & Intel
Oracle Data Warehouse
Proclarity
Ugif 12 2011-informix iwa
Accel Partners New Data Workshop 7-14-10
The End of Appliances

Similar to Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack (20)

PDF
Healthcare and Life Sciences - Cloud Computing
PDF
IBM Cloud Strategy
PPTX
Bb3061 bess systems of record sv
PPT
Innovate 2012 ls 1439 linked data oslc
PDF
A Tour of Research Computing at Genentech
PDF
Considering the Cloud? 5 Points to Consider
PPT
Micro Strategies Overview
PPT
Ippeis Cloud Computing Presentation(Tokyo2.0)
PDF
Cloud Computing: da curiosidade para casos reais
PDF
Federal Cloud Computing Initiative
PDF
IBM BP Kickoff 2013 VDI Solutions
PDF
Aras Vision and Roadmap with Aras Innovator PLM Software
PDF
Data Center and System Optimization
PPTX
Cloud foundry elastic architecture and deploy based on openstack
PDF
Big Data Beyond Hadoop*: Research Directions for the Future
PDF
Thoughts on Utility, Grid, on demand, cloud computing and appliances
PPT
Konsolider, optimer og automatiser dit servermiljø med IBM PureApplications S...
PDF
Mach Technology
PDF
Marlabs- ISMNY Deck
PDF
Cloud Computing and System z
Healthcare and Life Sciences - Cloud Computing
IBM Cloud Strategy
Bb3061 bess systems of record sv
Innovate 2012 ls 1439 linked data oslc
A Tour of Research Computing at Genentech
Considering the Cloud? 5 Points to Consider
Micro Strategies Overview
Ippeis Cloud Computing Presentation(Tokyo2.0)
Cloud Computing: da curiosidade para casos reais
Federal Cloud Computing Initiative
IBM BP Kickoff 2013 VDI Solutions
Aras Vision and Roadmap with Aras Innovator PLM Software
Data Center and System Optimization
Cloud foundry elastic architecture and deploy based on openstack
Big Data Beyond Hadoop*: Research Directions for the Future
Thoughts on Utility, Grid, on demand, cloud computing and appliances
Konsolider, optimer og automatiser dit servermiljø med IBM PureApplications S...
Mach Technology
Marlabs- ISMNY Deck
Cloud Computing and System z
Ad

More from Pistoia Alliance (20)

PDF
Fairification experience clarifying the semantics of data matrices
PPTX
MPS webinar master deck
PPTX
Digital webinar master deck final
PDF
Heartificial intelligence - claudio-mirti
PDF
Fair by design
PDF
Knowledge graphs ilaria maresi the hyve 23apr2020
PPTX
2020.04.07 automated molecular design and the bradshaw platform webinar
PDF
Data market evolution, a future shaped by FAIR
PPTX
AI in translational medicine webinar
PDF
CEDAR work bench for metadata management
PDF
Open interoperability standards, tools and services at EMBL-EBI
PDF
Fair webinar, Ted slater: progress towards commercial fair data products and ...
PDF
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
PPTX
Implementing Blockchain applications in healthcare
PPTX
Building trust and accountability - the role User Experience design can play ...
PPTX
Pistoia Alliance-Elsevier Datathon
PDF
Data for AI models, the past, the present, the future
PDF
PA webinar on benefits & costs of FAIR implementation in life sciences
PDF
AI & ML in Drug Design: Pistoia Alliance CoE
PDF
Ai in drug design webinar 26 feb 2019
Fairification experience clarifying the semantics of data matrices
MPS webinar master deck
Digital webinar master deck final
Heartificial intelligence - claudio-mirti
Fair by design
Knowledge graphs ilaria maresi the hyve 23apr2020
2020.04.07 automated molecular design and the bradshaw platform webinar
Data market evolution, a future shaped by FAIR
AI in translational medicine webinar
CEDAR work bench for metadata management
Open interoperability standards, tools and services at EMBL-EBI
Fair webinar, Ted slater: progress towards commercial fair data products and ...
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
Implementing Blockchain applications in healthcare
Building trust and accountability - the role User Experience design can play ...
Pistoia Alliance-Elsevier Datathon
Data for AI models, the past, the present, the future
PA webinar on benefits & costs of FAIR implementation in life sciences
AI & ML in Drug Design: Pistoia Alliance CoE
Ai in drug design webinar 26 feb 2019
Ad

Recently uploaded (20)

PDF
NewMind AI Monthly Chronicles - July 2025
PDF
cuic standard and advanced reporting.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Electronic commerce courselecture one. Pdf
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Advanced IT Governance
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Approach and Philosophy of On baking technology
PDF
Machine learning based COVID-19 study performance prediction
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPT
Teaching material agriculture food technology
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Cloud computing and distributed systems.
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
NewMind AI Monthly Chronicles - July 2025
cuic standard and advanced reporting.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Electronic commerce courselecture one. Pdf
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Understanding_Digital_Forensics_Presentation.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Reach Out and Touch Someone: Haptics and Empathic Computing
Advanced IT Governance
Unlocking AI with Model Context Protocol (MCP)
Approach and Philosophy of On baking technology
Machine learning based COVID-19 study performance prediction
Dropbox Q2 2025 Financial Results & Investor Presentation
Teaching material agriculture food technology
20250228 LYD VKU AI Blended-Learning.pptx
The AUB Centre for AI in Media Proposal.docx
Cloud computing and distributed systems.
Advanced methodologies resolving dimensionality complications for autism neur...

Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack

  • 1. Constellation Technologies & GeneStack Development of Sequence Services 2 in the Constellation Framework 1
  • 2. Constellation Experts in big data and bioinformatics • Spin out from STFC (Science and Technology Facilities Council) – Largest research facility in UK specialising in large data computing • CERN, European physics and astronomy science • Supporting all UK disciplines in computing • Strong IT & Bioinformatics expertise – Strong Bioinformatics delivery expertise – Strong connections into European academia – Excellent access to newly developed applications, tools and algorithms • Supplier of cloud computing services to large Pharma. • Partners for Pistoia SS2 – Microsoft Azure – STFC 2
  • 3. Constellation’s “Roadmap” Text Genome Mining/Search Analysis Core Data “AppMarket” Integration Service Service Service Service “Workflow Management” API Seamless Integration with Client systems
  • 4. Bioinformatics IT • Bioinformatics • IT – Novel Algorithms – Platform Design – Research – Support – Scientific support – Maintenance – Discovery – Testing – Analysis – Stability / Scalability – Value Added – Security 4
  • 5. Hosting Cloud • Hosted • Cloud – Single Vendor – Vendor Agnostic – Hardware limitations – As required – Restricted storage – Selectable storage – Limited cost models – Best model available – “Lock in” – “Flexible” 5
  • 6. Cloud Vision Flexible Flexible Compute Storage Vendor Agnostic Academic or Client bespoke Business solutions Logic Client Minimise Applications Support Virtual “Bioinformatics Organisation Marketplace” True Cloud 6
  • 7. High Level Architecture Portal Bioinformatics Deployed Workflow UI UIs Workflow (Apps) Bioinformatics Workflow Tools Systems Bioinformatics Applications Distributed Compute Distributed Storage
  • 8. Our goal for SS2 • We believed the end goal was a flexible platform where ALL the application described in SS2 scope could be deployed for individual clients as required. • Platform should be scalable where security, support and maintenance can be easily managed. – Reducing support costs allows for more focus on research • Bioinformatics applications added as required: – GeneStack (Analysis Portal) – VIB (Arctix) (Workflow) (in discussion) – EBI (Services) (in discussion) • Workflow delivered as a fundamental development principle • Development of the “AppMarket” for Bioinformatics 8
  • 9. Secure Scalable Storage Company Workflow Specific Core Integrating 3rd Party Systems Integration Future With other Development Systems 9
  • 10. Deliverables achieved • Portal with access to all the “Must Have” Web Services described in the SS2 documentation – Constellation Managed Administration Interface to allow organisational mapping of users to Programs / Projects / Applications • “Tool Box” of Integrated Applications – Galaxy – Secure Ensembl – Secure CellProfiler – Content Search (New development) • Galaxy workflow engine with integrating applications deployed as a secure web application to cover “Must Have” tools – Restricted set of apps based on feedback from “testing pool” (Restrictions based on Need/Security) – Tools can be added on request • Scalable storage and compute (dependant on need and security) – Structured Program - Project – User mapping – Cost effective data storage and compute • Initial Integration with another Bioinformatics Vendor (GeneStack) 10
  • 11. Other Available SAAS tools • Secure EnsEMBL – Private copy of EnsEMBL (Rackspace) – Secure UI and API Access – Ability to map DAS (secure or Public) • Parallelised CellProfiler – Private scalable version of CellProfiler on Azure 11