SlideShare a Scribd company logo
© 2017 Delphix. All Rights Reserved. Private and Confidential.© 2017 Delphix. All Rights Reserved. Private and Confidential.
Kellyn Pot’Vin-Gorman | Technical Intelligence Manager| May, 2018
The Rise of DataOps
Making Big Data Bite-Size With DataOps
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Kellyn Pot’Vin-Gorman
Technical Intelligence Manager, Delphix
• Multi-platform DBA, (Oracle, MSSQL, MySQL, Sybase,
PostgreSQL, Informix…)
• Oracle ACE Director, (Alumni)
• Oak Table Network Member
• Idera ACE 2018
• APEX Women in Technology Award, CTA
• STEM education with Raspberry Pi and Python,
including DevOxx4Kids, Oracle Education Foundation
and TechGirls
• President, Rocky Mtn Oracle User Group
• President, Denver SQL Server User Group
• DevOps author, instructor and presenter.
• Author, blogger, (http://dbakevlar.com)
© 2017 Delphix. All Rights Reserved. Private and Confidential.
DevOps + Data = DataOps
• Data Experts heavily influence decisions.
• Adverse to bleeding edge or high risk.
• Introduction of the cloud, especially and SaaS, (Software
as a Service).
• Demand for decreased development cycles introduce risk,
older relational processes and systems are seen as a
roadblock to this.
© 2017 Delphix. All Rights Reserved. Private and Confidential.
DataOps- Origin
DataOps takes DevOps to the next
level, recognizing that many
DevOps projects have data
integrated into them and requires
that data to move at the same
speed the rest of development and
testing.
© 2017 Delphix. All Rights Reserved. Private and Confidential.
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Dave is a Data
Scientist who is
expected to do
more at a faster
pace with DevOps.
© 2017 Delphix. All Rights Reserved. Private and Confidential.
The Company has hired
George as the new Scrum
Master
© 2017 Delphix. All Rights Reserved. Private and Confidential.
The Periodic Table of DevOps Tools
https://xebialabs.com/periodic-table-of-devops-tools/
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Scrum
https://www.scrumalliance.org/community/articles/2014/april/devops-and-agile
Team George Dave
© 2017 Delphix. All Rights Reserved. Private and Confidential.
George is heading up the
daily scrum meetings with
the team…
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Problem is, the team all
needs their own
development and testing
environments.
© 2017 Delphix. All Rights Reserved. Private and Confidential.
And Dave Needs to Test His New Script Before Running it in
Production…
- name: Transfer and process data script.
hosts: server
remote_user: test_user
sudo: yes
tasks:
- name: Transfer the script
copy: src=test.sh dest=/home/test_user mode=0777
- name: Execute the script
command: sh /home/test_user/test.sh
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Data is Holding Them Back
What they can do currently
January February March April
What they need to do
January February March April
Database Refresh
Development Production Release
Testing Cycle
How can they
get here?
Refreshes and deployments
are taking too long
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Without a self-service
portal, Dave and the
team are also chained to
Operations…
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Where is George,
Dave and the rest of
the team going to get
what they need?
© 2017 Delphix. All Rights Reserved. Private and Confidential.
.
Segway: the Cloud Backlash
Go to the Cloud?
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Data Causes Friction
• If those managing the data sources aren’t included and working well with
those needing that data to produce features and products, friction is the
result.
• Data experts are working with robust, but archaic utilities that guarantee
outcome, rarely speed of results.
The pain for friction is felt by everyone.
© 2017 Delphix. All Rights Reserved. Private and Confidential.
The Result are the
Developers, Testers and
Data Scientists are
Spending Upwards of 80%
of Their Time NOT
Performing their Primary
Function…
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Luckily, their DBA,
Marc, is keen on
Database
Virtualization and
Containers…
© 2017 Delphix. All Rights Reserved. Private and Confidential.
1st- Marc Embraces Virtualization…
A technical approach in which users and applications do not use physical
machines, but simulated systems running on actual, “real” hardware.
Virtualization can be used to eliminate resource usage and enable savings
for databases, network, file and application management, along with server
infrastructure.
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Older Cloning Methods
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Virtualizes the Data Sources
Compress and Deduplicate
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Database Virtualization Tools
• Rubrik SQL Mount
• Redgate SQL Clone
• Windocks SQL Clone
• Actifio
• Delphix
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Database Virtualization Tools
• Veritas Velocity
• Redgate SQL Clone
• Oracle Thin Clone Features
• Actifio
• Delphix
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Why Data Virtualization is Different
• DBA is “only as good as their last backup”.
• Many database tools take considerable time to recover.
• DevOps is often about automation- automating an “undo” for
development and testing that includes data rewind.
• Include a self-service tool that will allow for rewind without Operations
intervention.
• Allows for data version control and DataOps, the next step in DevOps
© 2017 Delphix. All Rights Reserved. Private and Confidential.
▶▶▶
Virtualize and Deployed▶ ▶ ▶
Example of Virtualized Environment- Delphix
Storage Pool for Delphix
QA DEV PATCH TEST
PRODUCTION
Database/App Tier
1 TB
1 TB
600GB
Read From Production
TEST
Each Virtual Database takes up around 5-10Gb upon creation, (dependent upon parameters)
Read AND Write
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Virtualized Database, Independence
SGA
PGA
Buffer Cache
Library Cache
Java Pool Large Pool
Redo Logs
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Flat File Virtualization
/Pointers to source/file1
/Pointers to source/file2
/Pointers to source/file3
/Pointers to source/file4
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Marc Automates Database Deployments
• Creates Python and Shell Scripts to perform tasks he and his
team used to perform manually.
• If any pieces are missing in his logic/scripting, able to use
github repository/community examples to build out what is
needed.
• AsVirtual environments take little resources and almost no
storage, creating a DBA specific environment for
development, testing and maintenance makes the DBA
team less invasive to the Development/TestingTeam’s
scrum deadlines.
© 2017 Delphix. All Rights Reserved. Private and Confidential.
2nd- Marc Creates Containers/Data Pods
Containers offer the ability to isolate application code and/or the whole
infrastructure stack into a package able entity to ease deployment, even
from the same kernel. This is a powerful tool for DevOps to ease
deployment for complex tiers, applications and multiple data stores. Data
Pods is the next step, where a pod is created from virtualized environments.
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Why Create Containers/Data Pods
• Development is done by project or feature vs. tier or
product.
• With the introduction to the cloud, the business
requires those that can do more at a global scale.
• Ease of management and maintenance
• Provide more value to the business
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Data Pods for Ease of Delivery
Create “Container”
Rsync (UNIX/Linux)
Robocopy (Windows)
Reporting
NFS
iSCSI
Development
NFS
iSCSI
Testing
NFS
iSCSI
Delphix Virtualization Engine
Same Storage Size as Production
Deployed using Jenkins
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Now the
Developers and
Data Scientists
can work at the
speed the
business needs
them to…
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Self Service Portal
Release 1.2 Release 1.3 Branch 1.3
Branch 1.1
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Sprint Goals are Now
Possible to Meet!
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Confidential Data
http://ec.europa.eu/justice/data-protection/article-29/documentation/opinion-recommendation/files/2014/wp216_en.pdf
• GDPR, (General Data Protection Regulation States):
• Must have lawful basis to post process data
• Subject must have consented
• Must have contract that needs data
• Necessary and in order:
• Compliance
• Protect vital interests
• In public interest
• In legitimate interest of the owner/other party and doesn’t
violate the freedom and rights of the subject.
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Confidential data
Exposure
Production
Non-production
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Confidential data
• Encryption is reversible data obfuscation, which is essential for
production access and MUST be DONE.
• Data masking is non-reversible.
• It solves the issue at the data level.
• Is authentication and authorization in non-production in compliance with
security goals?
• All organizations will soon need to review if critical data in non-
production environments be accessible to developers, testers and
users.
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Data Virtualization, On-Prem with Masking
Data Source
8 TB Server
Delphix Masking Engine
Delphix Virtualization Engine
8 TB storage
Data Target
8 TB Server
Application Server
File Server
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Confidential Data with Masking
Exposure
Production
Non-production
Encryption
Mask
Solution
© 2017 Delphix. All Rights Reserved. Private and Confidential.© 2017 Delphix. All Rights Reserved. Private and Confidential.
DataOps Summary
© 2017 Delphix. All Rights Reserved. Private and Confidential.
The Goals Achieved?
• DevOps skills, (scripting, release builds, security, advanced
optimization.
• Automate or phase out tedious tasks with platform automation,
(monitoring, backups, maintenance jobs, etc.)
• Escape data gravity through features such as advanced self-
service portals, virtualization and container technology.
• Which encompasses advance features of DataOps, removing
the data friction and allowing data to move as fast as the rest of
development and testing.
© 2017 Delphix. All Rights Reserved. Private and Confidential.
With the Enhancement to DataOps
Dave’s company is experiencing an
increase of 10% in data accessibility
which should result in more than $65
million in additional net income. You can
see why DataOps was important.
© 2017 Delphix. All Rights Reserved. Private and Confidential.
Embracing DataOps
• Removes the last piece holding developers and testers
back.
• Uses virtualization and containers to simplify.
• Self-service portals for developers and testers to refresh
and develop the way they do in an agile environment.
• Tools with an interface made for Agile and Development
data at its focus.
© 2017 Delphix. All Rights Reserved. Private and Confidential.
References
Blog Posts-
• The DBA and DevOps: The Last Frontier
• The DBA and DevOps: Automation and
Configuration Management
• The DBA and DevOps - Orchestration and
Monitoring
• DBA and DevOps: The Ghost in the Machine
Webinar Recordings
• The DBA and DevOps- the Last Frontier
• Read- The Phoenix Project and the DevOps
Handbook!
DBA
DevOpsSkills
© 2017 Delphix. All Rights Reserved. Private and Confidential.© 2017 Delphix. All Rights Reserved. Private and Confidential.
Kellyn Pot’Vin-Gorman
Technical Intelligence Manager
kellyn@delphix.com
http://dbakevlar.com

More Related Content

PPTX
Screw DevOps, Let's Talk DataOps
PDF
DataOps, DevOps and the Developer: Treating Database Code Just Like App Code
PDF
Kelly O'Briant - DataOps in the Cloud: How To Supercharge Data Science with a...
PDF
451 Research: Data Is the Key to Friction in DevOps
PPTX
DevOps + DataOps = Digital Transformation
PPTX
Redis rise of Dataops
PPTX
DevOps and DBA- Delphix
PDF
A Continuously Deployed Hadoop Analytics Platform?
Screw DevOps, Let's Talk DataOps
DataOps, DevOps and the Developer: Treating Database Code Just Like App Code
Kelly O'Briant - DataOps in the Cloud: How To Supercharge Data Science with a...
451 Research: Data Is the Key to Friction in DevOps
DevOps + DataOps = Digital Transformation
Redis rise of Dataops
DevOps and DBA- Delphix
A Continuously Deployed Hadoop Analytics Platform?

What's hot (20)

PDF
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
PPTX
NYC Data Amp - Microsoft Azure and Data Services Overview
PDF
Big SQL: Powerful SQL Optimization - Re-Imagined for open source
PPTX
How Big Data and Hadoop Integrated into BMC ControlM at CARFAX
PDF
5 Reasons not to use Dita from a CCMS Perspective
PDF
Enterprise Data Warehouse Optimization: 7 Keys to Success
PDF
Hadoop Trends
PPTX
Deploying Big Data Platforms
PDF
Offload, Transform, and Present - The New World of Data Integration
PDF
How to Automate Offloading ETL Processes to Hadoop
PPTX
Virtualization & the Cloud for Collaborate 2017
PPTX
Oracle: Building Cloud Native Applications
PDF
Lessons learned from over 25 Data Virtualization implementations
PPTX
Pivotal HAWQ and Hortonworks Data Platform: Modern Data Architecture for IT T...
PDF
2014.07.11 biginsights data2014
PDF
e-IT exec lunch - "It's all about data" - 25 May '16
PDF
OpenPOWER Update
PPTX
InterVision-Overview.January-2016
PPTX
Hadoop Hadoop & Spark meetup - Altiscale
PPTX
Big Data Maturity Scorecard
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
NYC Data Amp - Microsoft Azure and Data Services Overview
Big SQL: Powerful SQL Optimization - Re-Imagined for open source
How Big Data and Hadoop Integrated into BMC ControlM at CARFAX
5 Reasons not to use Dita from a CCMS Perspective
Enterprise Data Warehouse Optimization: 7 Keys to Success
Hadoop Trends
Deploying Big Data Platforms
Offload, Transform, and Present - The New World of Data Integration
How to Automate Offloading ETL Processes to Hadoop
Virtualization & the Cloud for Collaborate 2017
Oracle: Building Cloud Native Applications
Lessons learned from over 25 Data Virtualization implementations
Pivotal HAWQ and Hortonworks Data Platform: Modern Data Architecture for IT T...
2014.07.11 biginsights data2014
e-IT exec lunch - "It's all about data" - 25 May '16
OpenPOWER Update
InterVision-Overview.January-2016
Hadoop Hadoop & Spark meetup - Altiscale
Big Data Maturity Scorecard
Ad

Similar to The Rise of DataOps: Making Big Data Bite Size with DataOps (20)

PPTX
New DevOps for the DBA
PPTX
Empowering Agile Development with Containers
PPTX
Oracle Open World 2017 Delphix and DBVisit
PPTX
DevOps for the DBA- Jax Style!
PPTX
Data platforms 2017
PDF
From DBA to DevOps to DataOps- The Revolution
PPTX
Cloudy with a Chance of Databases
PPTX
Confessions of the AppDev VP
PPTX
Confessions of the AppDev VP Webinar (Delphix)
PPTX
Virtualization and Containers
PPTX
DevOps and the DBA
PPTX
There's More to Docker than the Container: The Docker Platform - Kendrick Col...
PPTX
Managing ScaleIO as Software on Mesos - David vonThenen - Dell EMC World 2017
PPTX
Data Analytics Using Container Persistence Through SMACK - Manny Rodriguez-Pe...
PPTX
Webinar: End-to-End CI/CD with GitLab and DC/OS
PPTX
The Power of DataOps for Cloud and Digital Transformation
PPTX
Using MySQL Containers
PPTX
Managing ScaleIO as Software on Mesos
PPTX
Scaling Data Science on Big Data
PDF
Database as code in Devops - DBを10分間で1000個構築するDB仮想化テクノロジーとは?(Adam)
New DevOps for the DBA
Empowering Agile Development with Containers
Oracle Open World 2017 Delphix and DBVisit
DevOps for the DBA- Jax Style!
Data platforms 2017
From DBA to DevOps to DataOps- The Revolution
Cloudy with a Chance of Databases
Confessions of the AppDev VP
Confessions of the AppDev VP Webinar (Delphix)
Virtualization and Containers
DevOps and the DBA
There's More to Docker than the Container: The Docker Platform - Kendrick Col...
Managing ScaleIO as Software on Mesos - David vonThenen - Dell EMC World 2017
Data Analytics Using Container Persistence Through SMACK - Manny Rodriguez-Pe...
Webinar: End-to-End CI/CD with GitLab and DC/OS
The Power of DataOps for Cloud and Digital Transformation
Using MySQL Containers
Managing ScaleIO as Software on Mesos
Scaling Data Science on Big Data
Database as code in Devops - DBを10分間で1000個構築するDB仮想化テクノロジーとは?(Adam)
Ad

More from Delphix (20)

PPTX
Fast Data Flow Is the Secret to Accelerating Digital Transformation
PPTX
Data Agility for Enterprise DevOps Adoption
PPTX
Secure Your Enterprise Data Now and Be Ready for CCPA in 2020
PPTX
Accelerating Secure SAP Application Delivery
PPTX
90% of Enterprises are Using DataOps. Why Aren’t You?
PPTX
Simplify and Accelerate SQL Server Migration to Azure
PPTX
Move and Secure Your Data
PPTX
Confessions of a CIO
PPTX
Let Data Flow: Removing the Latest DevOps Constraints with DataOps
PPTX
Confessions of an IT Director
PPTX
Confessions of the Tester
PPTX
Confessions of a Developer
PPTX
Confessions of a DBA: “I always avoid requests from DevOps” and Other Admissions
PPTX
Solving the Data Management Challenge for Healthcare
PPTX
Accelerate Design and Development of Data Projects Using AWS
PPTX
GDPR Fast Start
PPTX
Data Masking With The Delphix Dynamic Data Platform
PPTX
The GDPR and What It Means to You
PPTX
Why Your Approach To Data Governance Needs a Major Update
PPTX
A Data Privacy & Security Year in Review: Top 10 Trends and Predictions
Fast Data Flow Is the Secret to Accelerating Digital Transformation
Data Agility for Enterprise DevOps Adoption
Secure Your Enterprise Data Now and Be Ready for CCPA in 2020
Accelerating Secure SAP Application Delivery
90% of Enterprises are Using DataOps. Why Aren’t You?
Simplify and Accelerate SQL Server Migration to Azure
Move and Secure Your Data
Confessions of a CIO
Let Data Flow: Removing the Latest DevOps Constraints with DataOps
Confessions of an IT Director
Confessions of the Tester
Confessions of a Developer
Confessions of a DBA: “I always avoid requests from DevOps” and Other Admissions
Solving the Data Management Challenge for Healthcare
Accelerate Design and Development of Data Projects Using AWS
GDPR Fast Start
Data Masking With The Delphix Dynamic Data Platform
The GDPR and What It Means to You
Why Your Approach To Data Governance Needs a Major Update
A Data Privacy & Security Year in Review: Top 10 Trends and Predictions

Recently uploaded (20)

PDF
Approach and Philosophy of On baking technology
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
A Presentation on Artificial Intelligence
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Empathic Computing: Creating Shared Understanding
PDF
Encapsulation theory and applications.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
cuic standard and advanced reporting.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Approach and Philosophy of On baking technology
20250228 LYD VKU AI Blended-Learning.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Per capita expenditure prediction using model stacking based on satellite ima...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Network Security Unit 5.pdf for BCA BBA.
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
A Presentation on Artificial Intelligence
Unlocking AI with Model Context Protocol (MCP)
MYSQL Presentation for SQL database connectivity
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Empathic Computing: Creating Shared Understanding
Encapsulation theory and applications.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
cuic standard and advanced reporting.pdf
Encapsulation_ Review paper, used for researhc scholars
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...

The Rise of DataOps: Making Big Data Bite Size with DataOps

  • 1. © 2017 Delphix. All Rights Reserved. Private and Confidential.© 2017 Delphix. All Rights Reserved. Private and Confidential. Kellyn Pot’Vin-Gorman | Technical Intelligence Manager| May, 2018 The Rise of DataOps Making Big Data Bite-Size With DataOps
  • 2. © 2017 Delphix. All Rights Reserved. Private and Confidential. Kellyn Pot’Vin-Gorman Technical Intelligence Manager, Delphix • Multi-platform DBA, (Oracle, MSSQL, MySQL, Sybase, PostgreSQL, Informix…) • Oracle ACE Director, (Alumni) • Oak Table Network Member • Idera ACE 2018 • APEX Women in Technology Award, CTA • STEM education with Raspberry Pi and Python, including DevOxx4Kids, Oracle Education Foundation and TechGirls • President, Rocky Mtn Oracle User Group • President, Denver SQL Server User Group • DevOps author, instructor and presenter. • Author, blogger, (http://dbakevlar.com)
  • 3. © 2017 Delphix. All Rights Reserved. Private and Confidential. DevOps + Data = DataOps • Data Experts heavily influence decisions. • Adverse to bleeding edge or high risk. • Introduction of the cloud, especially and SaaS, (Software as a Service). • Demand for decreased development cycles introduce risk, older relational processes and systems are seen as a roadblock to this.
  • 4. © 2017 Delphix. All Rights Reserved. Private and Confidential. DataOps- Origin DataOps takes DevOps to the next level, recognizing that many DevOps projects have data integrated into them and requires that data to move at the same speed the rest of development and testing.
  • 5. © 2017 Delphix. All Rights Reserved. Private and Confidential.
  • 6. © 2017 Delphix. All Rights Reserved. Private and Confidential. Dave is a Data Scientist who is expected to do more at a faster pace with DevOps.
  • 7. © 2017 Delphix. All Rights Reserved. Private and Confidential. The Company has hired George as the new Scrum Master
  • 8. © 2017 Delphix. All Rights Reserved. Private and Confidential. The Periodic Table of DevOps Tools https://xebialabs.com/periodic-table-of-devops-tools/
  • 9. © 2017 Delphix. All Rights Reserved. Private and Confidential. Scrum https://www.scrumalliance.org/community/articles/2014/april/devops-and-agile Team George Dave
  • 10. © 2017 Delphix. All Rights Reserved. Private and Confidential. George is heading up the daily scrum meetings with the team…
  • 11. © 2017 Delphix. All Rights Reserved. Private and Confidential. Problem is, the team all needs their own development and testing environments.
  • 12. © 2017 Delphix. All Rights Reserved. Private and Confidential. And Dave Needs to Test His New Script Before Running it in Production… - name: Transfer and process data script. hosts: server remote_user: test_user sudo: yes tasks: - name: Transfer the script copy: src=test.sh dest=/home/test_user mode=0777 - name: Execute the script command: sh /home/test_user/test.sh
  • 13. © 2017 Delphix. All Rights Reserved. Private and Confidential. Data is Holding Them Back What they can do currently January February March April What they need to do January February March April Database Refresh Development Production Release Testing Cycle How can they get here? Refreshes and deployments are taking too long
  • 14. © 2017 Delphix. All Rights Reserved. Private and Confidential. Without a self-service portal, Dave and the team are also chained to Operations…
  • 15. © 2017 Delphix. All Rights Reserved. Private and Confidential. Where is George, Dave and the rest of the team going to get what they need?
  • 16. © 2017 Delphix. All Rights Reserved. Private and Confidential. . Segway: the Cloud Backlash Go to the Cloud?
  • 17. © 2017 Delphix. All Rights Reserved. Private and Confidential. Data Causes Friction • If those managing the data sources aren’t included and working well with those needing that data to produce features and products, friction is the result. • Data experts are working with robust, but archaic utilities that guarantee outcome, rarely speed of results. The pain for friction is felt by everyone.
  • 18. © 2017 Delphix. All Rights Reserved. Private and Confidential. The Result are the Developers, Testers and Data Scientists are Spending Upwards of 80% of Their Time NOT Performing their Primary Function…
  • 19. © 2017 Delphix. All Rights Reserved. Private and Confidential. Luckily, their DBA, Marc, is keen on Database Virtualization and Containers…
  • 20. © 2017 Delphix. All Rights Reserved. Private and Confidential. 1st- Marc Embraces Virtualization… A technical approach in which users and applications do not use physical machines, but simulated systems running on actual, “real” hardware. Virtualization can be used to eliminate resource usage and enable savings for databases, network, file and application management, along with server infrastructure.
  • 21. © 2017 Delphix. All Rights Reserved. Private and Confidential. Older Cloning Methods
  • 22. © 2017 Delphix. All Rights Reserved. Private and Confidential. Virtualizes the Data Sources Compress and Deduplicate
  • 23. © 2017 Delphix. All Rights Reserved. Private and Confidential. Database Virtualization Tools • Rubrik SQL Mount • Redgate SQL Clone • Windocks SQL Clone • Actifio • Delphix
  • 24. © 2017 Delphix. All Rights Reserved. Private and Confidential. Database Virtualization Tools • Veritas Velocity • Redgate SQL Clone • Oracle Thin Clone Features • Actifio • Delphix
  • 25. © 2017 Delphix. All Rights Reserved. Private and Confidential. Why Data Virtualization is Different • DBA is “only as good as their last backup”. • Many database tools take considerable time to recover. • DevOps is often about automation- automating an “undo” for development and testing that includes data rewind. • Include a self-service tool that will allow for rewind without Operations intervention. • Allows for data version control and DataOps, the next step in DevOps
  • 26. © 2017 Delphix. All Rights Reserved. Private and Confidential. ▶▶▶ Virtualize and Deployed▶ ▶ ▶ Example of Virtualized Environment- Delphix Storage Pool for Delphix QA DEV PATCH TEST PRODUCTION Database/App Tier 1 TB 1 TB 600GB Read From Production TEST Each Virtual Database takes up around 5-10Gb upon creation, (dependent upon parameters) Read AND Write
  • 27. © 2017 Delphix. All Rights Reserved. Private and Confidential. Virtualized Database, Independence SGA PGA Buffer Cache Library Cache Java Pool Large Pool Redo Logs
  • 28. © 2017 Delphix. All Rights Reserved. Private and Confidential. Flat File Virtualization /Pointers to source/file1 /Pointers to source/file2 /Pointers to source/file3 /Pointers to source/file4
  • 29. © 2017 Delphix. All Rights Reserved. Private and Confidential. Marc Automates Database Deployments • Creates Python and Shell Scripts to perform tasks he and his team used to perform manually. • If any pieces are missing in his logic/scripting, able to use github repository/community examples to build out what is needed. • AsVirtual environments take little resources and almost no storage, creating a DBA specific environment for development, testing and maintenance makes the DBA team less invasive to the Development/TestingTeam’s scrum deadlines.
  • 30. © 2017 Delphix. All Rights Reserved. Private and Confidential. 2nd- Marc Creates Containers/Data Pods Containers offer the ability to isolate application code and/or the whole infrastructure stack into a package able entity to ease deployment, even from the same kernel. This is a powerful tool for DevOps to ease deployment for complex tiers, applications and multiple data stores. Data Pods is the next step, where a pod is created from virtualized environments.
  • 31. © 2017 Delphix. All Rights Reserved. Private and Confidential. Why Create Containers/Data Pods • Development is done by project or feature vs. tier or product. • With the introduction to the cloud, the business requires those that can do more at a global scale. • Ease of management and maintenance • Provide more value to the business
  • 32. © 2017 Delphix. All Rights Reserved. Private and Confidential. Data Pods for Ease of Delivery Create “Container” Rsync (UNIX/Linux) Robocopy (Windows) Reporting NFS iSCSI Development NFS iSCSI Testing NFS iSCSI Delphix Virtualization Engine Same Storage Size as Production Deployed using Jenkins
  • 33. © 2017 Delphix. All Rights Reserved. Private and Confidential. Now the Developers and Data Scientists can work at the speed the business needs them to…
  • 34. © 2017 Delphix. All Rights Reserved. Private and Confidential. Self Service Portal Release 1.2 Release 1.3 Branch 1.3 Branch 1.1
  • 35. © 2017 Delphix. All Rights Reserved. Private and Confidential. Sprint Goals are Now Possible to Meet!
  • 36. © 2017 Delphix. All Rights Reserved. Private and Confidential. Confidential Data http://ec.europa.eu/justice/data-protection/article-29/documentation/opinion-recommendation/files/2014/wp216_en.pdf • GDPR, (General Data Protection Regulation States): • Must have lawful basis to post process data • Subject must have consented • Must have contract that needs data • Necessary and in order: • Compliance • Protect vital interests • In public interest • In legitimate interest of the owner/other party and doesn’t violate the freedom and rights of the subject.
  • 37. © 2017 Delphix. All Rights Reserved. Private and Confidential. Confidential data Exposure Production Non-production
  • 38. © 2017 Delphix. All Rights Reserved. Private and Confidential. Confidential data • Encryption is reversible data obfuscation, which is essential for production access and MUST be DONE. • Data masking is non-reversible. • It solves the issue at the data level. • Is authentication and authorization in non-production in compliance with security goals? • All organizations will soon need to review if critical data in non- production environments be accessible to developers, testers and users.
  • 39. © 2017 Delphix. All Rights Reserved. Private and Confidential. Data Virtualization, On-Prem with Masking Data Source 8 TB Server Delphix Masking Engine Delphix Virtualization Engine 8 TB storage Data Target 8 TB Server Application Server File Server
  • 40. © 2017 Delphix. All Rights Reserved. Private and Confidential. Confidential Data with Masking Exposure Production Non-production Encryption Mask Solution
  • 41. © 2017 Delphix. All Rights Reserved. Private and Confidential.© 2017 Delphix. All Rights Reserved. Private and Confidential. DataOps Summary
  • 42. © 2017 Delphix. All Rights Reserved. Private and Confidential. The Goals Achieved? • DevOps skills, (scripting, release builds, security, advanced optimization. • Automate or phase out tedious tasks with platform automation, (monitoring, backups, maintenance jobs, etc.) • Escape data gravity through features such as advanced self- service portals, virtualization and container technology. • Which encompasses advance features of DataOps, removing the data friction and allowing data to move as fast as the rest of development and testing.
  • 43. © 2017 Delphix. All Rights Reserved. Private and Confidential. With the Enhancement to DataOps Dave’s company is experiencing an increase of 10% in data accessibility which should result in more than $65 million in additional net income. You can see why DataOps was important.
  • 44. © 2017 Delphix. All Rights Reserved. Private and Confidential. Embracing DataOps • Removes the last piece holding developers and testers back. • Uses virtualization and containers to simplify. • Self-service portals for developers and testers to refresh and develop the way they do in an agile environment. • Tools with an interface made for Agile and Development data at its focus.
  • 45. © 2017 Delphix. All Rights Reserved. Private and Confidential. References Blog Posts- • The DBA and DevOps: The Last Frontier • The DBA and DevOps: Automation and Configuration Management • The DBA and DevOps - Orchestration and Monitoring • DBA and DevOps: The Ghost in the Machine Webinar Recordings • The DBA and DevOps- the Last Frontier • Read- The Phoenix Project and the DevOps Handbook! DBA DevOpsSkills
  • 46. © 2017 Delphix. All Rights Reserved. Private and Confidential.© 2017 Delphix. All Rights Reserved. Private and Confidential. Kellyn Pot’Vin-Gorman Technical Intelligence Manager kellyn@delphix.com http://dbakevlar.com

Editor's Notes

  • #5: Data Gravity is the ability for data to attract applications, services, etc. As with the laws of physical gravity, data, due to its mass, will attract all else that has less mass.
  • #6: There are larger data sources every day. Databases are at the center of this friction and the natural life of a database is growth. By 2020, a third of all data will be on the cloud and 58% of data will be comprised in big data. By 2020, we’ll grow from today’s 4.4 zettabyets to an approximate, but staggering 44 zettabytes, or 44 trillion gigabytes. And by 2020, a third of that data will pass through the cloud. More data has been created in just the last two years than the previous history of humanity 1.7MB of new information created every second per human on the planet. Introduction of big data often has same development pain points.
  • #7: Agile 2008 conference, Andrew Clay Shafer and Patrick Debois discussed "Agile Infrastructure” The term DevOps was popularized through a series of "devopsdays" starting in 2009 in Belgium I made an attempt to introduce it to my local user group in 2012 and it failed miserably. Last year, made a second attempt to great success.
  • #8: . It will be their first Scrum “sprint” Looking to increase productivity 20% 25% increase in collaboration. Gains in revenue are predicted. And they’re looking forward to succeeding…
  • #9: George has worked with the team to pick out the right tools tools for their environment, including Git for their repository, Jenkins for collaboration and deployment, Juju for Security and they’re even using Ansible for some of the automation. They’re well on their way with a great introduction of tools.
  • #10: The team builds out the sprint backlog and plans out what they will do and have to accomplish in the two weeks they have for the sprint. Everyone is assigned their tasks and ready to begin.
  • #11: Each morning starts out great, as they have their daily scrum standups, taking just a few minutes to get everyone on board, tasks assigned and goals on what will be accomplished for the day off the burndown list.
  • #12: The developers are starting to trip over each other in their traditional waterfall data environment. DBAs are busy and having difficulty providing them the data they need. They all want to succeed, but without the data, they start to miss the deadlines for the daily scrum burndown list.
  • #13: Our Developer, Dave, built out a new test script that has been automated, but needs to be tested against fresh copy of production to development and test. The problem is, the DBAs can’t get him the data fast enough through traditional methods, even when they use DevOps methodologies, It can’t fix the current technologies the DBAs are employing.
  • #14: Over 80% of time is waiting for RDBMS, (relational databases) to be refreshed. Developers and Testers are waiting for data to do their primary functions. This allows for faster and less costly migrations to the cloud, too.
  • #15: The developers are feeling under more pressure as they can’t get the data they need and the DBAs are pressured to get space, time and resources.
  • #16: This has become an oxy-moron. Databases are becoming larger and DBAs are slowing down the process, so we’ll remove those that understand how to manage it best?
  • #17: George is at his wits end trying to figure out how they will ever succeed at DevOps and at Scrum if they can’t get through a simple two week sprint.
  • #18: Where we had that one department with money to throw at an odd project, buying a server, developing what they needed and then it was our problem, now we have companies doing 30% or more business auditing cloud projects to deem if they are viable or not. BUT WHAT HAPPENS WHEN THEY DON’T or we don’t address the problem??
  • #19: For a typical Fortune 1000 company, just a 10% increase in data accessibility will result in more than $65 million additional net income. Leveraging data coupld increase revenue by as much as 60%
  • #21: Marc gets it. He sees how much he and his team is in demand and knows that something needs to change.
  • #22: In computing, virtualization means to create a virtual version of a device or resource, such as a server, storage device, network or even a database. The framework divides the resource into one or more execution environments. For data, this can result in a golden copy or source that is used for a centralized location and removal of duplicated data. For read and writes, having unique data for that given copy, while duplicates are kept to singular.
  • #23: Oracle: RMAN duplicates, cold backup to restores, datapump and other archaic data transfer processes are time consuming. SQL Server: Import/Export, Recover from Backup, Replication with log shipping,
  • #24: By virtualizing, we remove the “weight” of the data. We know that 80% of the data won’t change between copies, so why do we need individual copies of it. Our source is then deduped and compressed to conserve more space.
  • #25: I work with Delphix, so you would think I know our virtualization the best, but the truth is, I also know many other virtualization tools at a very detailed level. The amount of information I know on Oracle virtualization tools is pretty insane, in fact.
  • #26: I work with Delphix, so you would think I know our virtualization the best, but the truth is, I also know many other virtualization tools at a very detailed level. The amount of information I know on Oracle virtualization tools is pretty insane, in fact.
  • #27: How do we “rewind” data and code changes now? Why should the DBA rewind changes made in dev and test? Why should you be the one to do this in test? Virtualization removes this. The Virtual databases are read and write, so even maintenance tasks, like DBCC’s can be offloaded to one. Ability to version control, not just the meta data, but the user data!
  • #28: Point out the engine and size after we’ve compressed and de-duplicated. Note that each of the VDBs will take approximately 5-10G vs. 1TB to offer a FULL read/write copy of the production system It will do so in just a matter of minutes. That this can also be done for the application tier!
  • #29: Each Virtual Database, (VDB) will no longer require space, (only background and foreground memory for SGA/PGA, etc.) and local redo logs. This is a considerable savings, but… If we take this a step further by embracing write changes only on blocks changed from the source, then we’ll experience 10-20 copies of a database in about the same space that one database requires.
  • #32: Package software into standardized units for development, shipment and deployment. A container image is a lightweight, stand-alone, executable package of a piece of software that includes everything needed to run it: code, runtime, system tools, system libraries, settings.
  • #33: The next step is the ability to migrate to the cloud or from one cloud to another. Right now, 60% of customers are using 2-5 clouds on average. The ability to move a Data Pod from one cloud to another is incredibly powerful. Companies are spending increased time now just migrating to the cloud, but to other clouds and if it would be as simple as migrating a Data pod with a few changes to the new storage location, (i.e. cloud) that could save companies millions of dollars.
  • #34: We refer to a container as a template in our product. Note that a data pod can be moved here or to the cloud…
  • #35: Agile 2008 conference, Andrew Clay Shafer and Patrick Debois discussed "Agile Infrastructure” The term DevOps was popularized through a series of "devopsdays" starting in 2009 in Belgium I made an attempt to introduce it to my local user group in 2012 and it failed miserably. Last year, made a second attempt to great success.
  • #36: This may appear to be a traffic disaster of changes, but for developers with Agile experience, a “sprint” looks just like this. You have different sprints that are quick runs and merges where developers are working separately on code that must merge successfully at the correct intersection and be deployed. Versioning with source control is displayed at the top, using Virtual images. You can see each iteration of the sprints. In the middle section is the branches of that occur during the development process. A virtual can be spun from a virtual, which means that it’s easier for developers to work from the work another developer has produced. Stopping points and release via a clone is simply minutes vs. hours or days.
  • #37: This is the interface for Developers and testers- they can bookmark before important tasks or rewind to any point in the process. They can bookmark and branch for full development/testing needs.
  • #38: Agile 2008 conference, Andrew Clay Shafer and Patrick Debois discussed "Agile Infrastructure” The term DevOps was popularized through a series of "devopsdays" starting in 2009 in Belgium I made an attempt to introduce it to my local user group in 2012 and it failed miserably. Last year, made a second attempt to great success.
  • #39: Data can only be processed if there is at least one lawful basis to do so[14]. The lawful basis for processing data are: the data subject has given consent to the processing of his or her personal data for one or more specific purposes. processing is necessary for the performance of a contract to which the data subject is party or in order to take steps at the request of the data subject prior to entering into a contract. processing is necessary for compliance with a legal obligation to which the controller is subject. processing is necessary in order to protect the vital interests of the data subject or of another natural person. processing is necessary for the performance of a task carried out in the public interest or in the exercise of official authority vested in the controller. processing is necessary for the purposes of the legitimate interests pursued by the controller or by a third party, except where such interests are overridden by the interests or fundamental rights and freedoms of the data subject which require protection of personal data, in particular where the data subject is a child.
  • #41: Or does it shift the problem toward authentication and authorization? Masking personally-identifiable, (PII, HIPPA, PCI, etc.) information renders it useless from a security standpoint Resolves both the technical and personal responsibility issue. The data can be masked before it moves to non-production, removing unnecessary risk. As we discussed, on average, 80% of data is non-production. GDPR is on the horizon and US must be compliant by May, 2018
  • #46: And yet we state that we won’t need DBAs? That data isn’t the center of challenge?
  • #47: The business is able to provision new environments or refresh existing ones in a matter of minutes. Developers and testers who’ve worked with bookmarks and branching of their code changes can now do the same with database changes, rewinding and refreshing as they need without impacting the DBAs day. This allows the DBA to do more with their time. Having tools that includes the database in the Agile development cycle makes a pivotal change in how the DBA is capable of being part of DevOps.