1
Jim Silva – Director
Kimberly Butler – Lead Data Engineer
3 October 2019, Pfizer, Inc.
Elastic as a Fundamental Core to Pfizer’s Scientific Data Cloud
2
Our Purpose
Our Company
• $53.6 billion in revenue in 2018
• 40 manufacturing sites worldwide
• More than 125 countries in which Pfizer sells products
• 8 products with sales greater than $1 billion in 2018
• More than 180 new R&D collaborations in 2018
• 90,000 colleagues around the world
3
Worldwide Research and Development
Research and Development is at the heart of fulfilling Pfizer’s purpose
as we work to translate advanced science and technologies into
therapies that matter for patients in need.
We focus our resources in disease areas with the best chance of
scientific and commercial success:
• Oncology
• Internal Medicine
• Inflammation & Immunology
• Vaccines
• Rare Disease
4
Introduction to the Scientific Data Cloud (SDC)
Goals
• Accelerate timelines through exploratory analysis and predictive modeling
  • Modeling helped to accelerate a previous discovery and development project
  • Physical experiments can be avoided through data reuse, enabling faster decisions
• Satisfy regulatory requirements for secure data retention
  • Regulators expect an immediate response to data requests and a demonstrable audit trail
5
SDC’s Elastic Core
• File Metadata
• Line-Specific Metadata
• Line-Specific Configuration
• Search
• Usage Reports
• Audit Reports
• Files and Metadata to Populate Analytics Layer
• File Download Logs
• Agent Downtime Logs
• Search Logs
• S3 Logs
[Diagram labels: Config, Search, Amazon Redshift, Amazon S3]
6
Search Interface
[Screenshot: search interface with three numbered callouts (1, 2, 3) labeled App Menu, File Details and File Download, and Filter on Metadata]
7
Configuration of Search Interface
PUT sdc-config/masterconfig/invivo
{
  "metadata": [{
    "metadataName": "experimentID",
    "caption": "Experiment ID",
    "searchable": true,
    "filterable": true,
    "filterType": "CheckBoxList"
  }],
  "resultsPage": [{
    "metadataName": "experimentID"
  }],
  "apps": [{
    "link": "https://sdctickets.pfizer.com",
    "display": "Submit Ticket"
  }]
}
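The slide does not show how the search application consumes this config document. As a minimal sketch, assuming the application reads the "searchable" and "filterable" flags to decide which fields go into the query and which may be used as filters, the logic could look like the following. The helper name build_search_body and the sample inputs are hypothetical; the "apps" entry is left out because it does not affect the query.

```python
import json

# Config body from the slide, trimmed to the parts that drive the query.
CONFIG_JSON = """
{
  "metadata": [{
    "metadataName": "experimentID",
    "caption": "Experiment ID",
    "searchable": true,
    "filterable": true,
    "filterType": "CheckBoxList"
  }],
  "resultsPage": [{"metadataName": "experimentID"}]
}
"""


def build_search_body(config, text, selected_filters):
    """Build an Elasticsearch query body from the config (hypothetical helper)."""
    fields = config["metadata"]
    searchable = [f["metadataName"] for f in fields if f.get("searchable")]
    filterable = {f["metadataName"] for f in fields if f.get("filterable")}
    # Ignore any requested filter that the config does not flag as filterable.
    filters = [
        {"terms": {name: values}}
        for name, values in selected_filters.items()
        if name in filterable
    ]
    return {
        "query": {
            "bool": {
                "must": [{"multi_match": {"query": text, "fields": searchable}}],
                "filter": filters,
            }
        },
        # Return only the fields listed for the results page.
        "_source": [c["metadataName"] for c in config["resultsPage"]],
    }


config = json.loads(CONFIG_JSON)
body = build_search_body(
    config, "EXP-001", {"experimentID": ["EXP-001"], "unknown": ["x"]}
)
print(json.dumps(body, indent=2))
```

Keeping the field list in a config document rather than in application code means a new research line could, under this assumption, be given its own search page by adding one document to the config index.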
8
Design Principles
"dynamic": "strict",
"analysis": {
  "filter": {
    "id_word_delimiter": {
      "catenate_all": "false",
      "preserve_original": "true",
      "split_on_numerics": "false",
      "catenate_numbers": "false",
      "type": "word_delimiter_graph"
    }
  }
}
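The slide shows only two fragments: a mapping option ("dynamic": "strict") and a custom token filter. The sketch below places them in a complete index-creation body to show where each one lives. The analyzer name id_analyzer, the whitespace tokenizer, the lowercase filter, the mapping type name "file", and the single experimentID field are all assumptions made for illustration, not details from the deck. Mapping types are used because the deck refers to Elasticsearch 5.3 and 6.4. The small unknown_fields helper only imitates locally what a strict mapping does: it reports fields that are not declared in the mapping, which Elasticsearch would reject at index time.

```python
# Token filter settings as shown on the slide.
ID_WORD_DELIMITER = {
    "type": "word_delimiter_graph",
    "catenate_all": "false",
    "catenate_numbers": "false",
    "preserve_original": "true",
    "split_on_numerics": "false",
}

# Assumed surrounding structure; only the filter and "dynamic" come from the slide.
INDEX_BODY = {
    "settings": {
        "analysis": {
            "filter": {"id_word_delimiter": ID_WORD_DELIMITER},
            "analyzer": {
                "id_analyzer": {  # hypothetical analyzer name
                    "type": "custom",
                    "tokenizer": "whitespace",
                    "filter": ["id_word_delimiter", "lowercase"],
                }
            },
        }
    },
    "mappings": {
        "file": {  # hypothetical mapping type (5.x/6.x style)
            "dynamic": "strict",
            "properties": {
                "experimentID": {"type": "text", "analyzer": "id_analyzer"},
            },
        }
    },
}


def unknown_fields(doc, index_body, doc_type="file"):
    """Return the top-level fields that a strict mapping would reject."""
    declared = index_body["mappings"][doc_type]["properties"]
    return sorted(set(doc) - set(declared))


print(unknown_fields({"experimentID": "EXP-001", "notes": "x"}, INDEX_BODY))
# prints ['notes']
```

With "preserve_original" enabled, the filter keeps the full identifier as a token alongside its parts, which is one plausible reason to use it for experiment IDs that users search both whole and in pieces.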
9
Audit & Usage Reports
[Diagram: two reporting pipelines]
• Audit Reports (Custom Plugin): SDC S3 Bucket, Amazon CloudWatch, Pfizer Logging S3 Bucket, File Activity Logs
• Usage Reports (Native Kibana): Tomcat Webserver, File Download Logs + Search Logs
• Beats, Logstash, Elasticsearch, Kibana
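The deck does not show the queries behind the usage reports. As a rough sketch of the kind of aggregation a Kibana usage visualization might issue against the download logs, the function below builds a request body that counts downloads per user and per day. The field names event_type, user, and @timestamp are hypothetical, as is the function name. The "interval" parameter matches the 6.x versions mentioned in the deck; later Elasticsearch releases replace it with calendar_interval.

```python
import json


def usage_report_query(start, end, top_n=10):
    """Build an aggregation body for download counts (field names are hypothetical)."""
    return {
        "size": 0,  # only aggregations are needed, not individual hits
        "query": {
            "bool": {
                "filter": [
                    {"term": {"event_type": "file_download"}},
                    {"range": {"@timestamp": {"gte": start, "lt": end}}},
                ]
            }
        },
        "aggs": {
            # Most active users in the period.
            "per_user": {"terms": {"field": "user", "size": top_n}},
            # Download volume by day ("interval" is the 6.x parameter name).
            "per_day": {
                "date_histogram": {"field": "@timestamp", "interval": "day"}
            },
        },
    }


query = usage_report_query("2019-09-01", "2019-10-01", top_n=5)
print(json.dumps(query, indent=2))
```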
10
Elastic Upgrade Remediation Plan
Upgrade from ELK version 5.3 to 6.4
1. Build ELK 6.4 Sandbox
2. Rebuild Indices
3. Run Queries
4. Install Custom Kibana Plugin and Import Visualizations
5. Point Applications to Sandbox
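The deck does not say how the "Run Queries" step was checked. One possible approach, sketched here under that assumption, is to run the same query against the existing cluster and the 6.4 sandbox and compare the returned document IDs. The function name and the sample hits are hypothetical; each hit is reduced to the "_id" key that Elasticsearch search responses include.

```python
def compare_results(old_hits, new_hits):
    """Compare document IDs returned by the same query on both clusters."""
    old_ids = [h["_id"] for h in old_hits]
    new_ids = [h["_id"] for h in new_hits]
    return {
        "missing_in_new": sorted(set(old_ids) - set(new_ids)),
        "extra_in_new": sorted(set(new_ids) - set(old_ids)),
        "same_order": old_ids == new_ids,
    }


# Hypothetical hits from the 5.3 cluster and the 6.4 sandbox.
old = [{"_id": "a"}, {"_id": "b"}, {"_id": "c"}]
new = [{"_id": "a"}, {"_id": "c"}, {"_id": "d"}]
report = compare_results(old, new)
print(report)
# prints {'missing_in_new': ['b'], 'extra_in_new': ['d'], 'same_order': False}
```

A difference in ordering alone may reflect scoring changes between versions rather than missing data, so it is reported separately from missing or extra documents.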
