SlideShare a Scribd company logo
Confidential and Proprietary1
Optimize Your Vertica Data
Management Infrastructure
Srinivas Vadlamani, Chief Architect
January 2017
Confidential and Proprietary2
My background
Co-founder and Chief
Architect at Talena.
Prior to Talena, I was an
early architect at
Couchbase and Aster
Data, helping design
some of their key data
management
capabilities.
Confidential and Proprietary3
ing
Data Management Drivers
Application
Iteration
Compliance
70% of businesses
lost data over the past
two years
40% of businesses hit
by ransomware in 2016
Robust testing requires
up-to-date data
90% of enterprises delay
application rollouts waiting
for production data
Average Global 2000
company has seven copies
of prod data
Storage costs for archival
growing by 35% yearly
$900K
average cost of
a data loss incident
$1.08M
cost to implement
manual test data efforts
$300K
average yearly cost of
managing archives
BUSINESSCHALLENGESFINANCIALIMPACT
Data Loss
Source: EMC, CA, Ponemon
Confidential and Proprietary4
Key Big Data Protection Principles
Replication and
backup are not
the same
You need an
incremental
forever
architecture
It really is about
recovery, not
backup
Even commodity
storage is
expensive
Confidential and Proprietary5
Replication vs Backup
Replication: ideal for hardware failures
Backup: ideal to protect against human errors and
application corruption
Need both as part of your Vertica data protection strategy
Confidential and Proprietary6
Why Incremental-Forever?
Data volumes are getting too large for traditional backup
methods. Backing up hundreds of terabytes on a weekly
basis is not feasible as a backup policy
Confidential and Proprietary7
A Recovery-centric Architecture
How quickly you recover impacts your business and brand
Your recovery architecture needs to handle changes to
the production topology over time
Your recovery flexibility (whole database or at a schema
level) will influence your recovery point and recovery
time objectives
Confidential and Proprietary8
Speeding Up Application Delivery
Real versus
Synthetic
Data
Supporting
Compliance
Initiatives
Minimizing
Network
Overhead
Confidential and Proprietary9
What’s Different In The Cloud
Metadata
management
is made more
complex
PROBLEM
Storage
optimization
becomes that
much more
difficult
PROBLEM
Relying on
traditional
backup
mechanisms
does not scale
PROBLEM
Confidential and Proprietary10
The Talena Architecture
• Deep de-duplication and compression with app-aware architecture
• Incremental-forever backup architecture
• High availability via erasure coding in distributed cluster architecture
Smart Storage Optimizer
Confidential and Proprietary11
The Talena Architecture
Native querying and analytics
via active compute layer
Unbounded scale with a
Hadoop-native architecture
Smart Storage Optimizer
Active Compute Services Distributed File System
Confidential and Proprietary12
The Talena Architecture
• Google-like catalog
shortens data recovery
time
• Automatic schema
generation for mirroring
and backups
• Granular recovery at an
object level
• Recovery to multiple
topologies
• Native integration with
LDAP and Kerberos for
authentication
• Role-based access control
defines specific privileges
• Transparent data encryption
• Masking for PII data
Smart Storage Optimizer
Active Compute Services Distributed File System
Metadata Catalog Data Orchestration ServicesSecurity Services
Confidential and Proprietary13
Smart Storage Optimizer
The Talena Architecture
GUI CLI API
Active Compute Services Distributed File System
• ‘Single pane of glass’ for multiple use cases and data platforms
• Agentless architecture minimizes management overhead
• GUI, CLI, REST-based Talena API options
Metadata Catalog Data Orchestration ServicesSecurity Services
Confidential and Proprietary14
Talena and vbr.py
vbr.py Talena
Recovery to different
Vertica version
No Yes
Recovery to different
Vertica topology
No Yes
Google-like metadata
catalog for rapid discovery
No Yes
Built-in storage optimization No Yes
UI for automated policy and
workflow creation
No Yes
Ability to support test data
management
No Yes
Inherent scalable
infrastructure
No Yes
Data masking support No Yes
Sampling support No Yes
Confidential and Proprietary15
Q&A
 We’ll send you a link to our
eBook “The Vertica Backup
Guide”
 Additional resources: talena-
inc.com/resources and
talena-inc.com/blog
 Ping us with any additional
questions: info@talena-
inc.com
Confidential and Proprietary16
Q and A

More Related Content

PPTX
Debunking Common Myths of Hadoop Backup & Test Data Management
PPTX
Hp vertica certification guide
PPTX
Debunking Common Myths of Cassandra Backup
PPT
Migrating legacy ERP data into Hadoop
PDF
Die 10 besten PostgreSQL-Replikationsstrategien für Ihr Unternehmen
 
PDF
The Future of Postgres Sharding / Bruce Momjian (PostgreSQL)
PDF
Cloud Migration Paths: Kubernetes, IaaS, or DBaaS
 
PPTX
Achieving cloud scale with microservices based applications on azure
Debunking Common Myths of Hadoop Backup & Test Data Management
Hp vertica certification guide
Debunking Common Myths of Cassandra Backup
Migrating legacy ERP data into Hadoop
Die 10 besten PostgreSQL-Replikationsstrategien für Ihr Unternehmen
 
The Future of Postgres Sharding / Bruce Momjian (PostgreSQL)
Cloud Migration Paths: Kubernetes, IaaS, or DBaaS
 
Achieving cloud scale with microservices based applications on azure

What's hot (20)

PPTX
Bootstrapping state in Apache Flink
PPTX
Db2 analytics accelerator on ibm integrated analytics system technical over...
PPTX
HPE Keynote Hadoop Summit San Jose 2016
PPTX
Big Data Case Study: Fortune 100 Telco
PDF
Key trends in Big Data and new reference architecture from Hewlett Packard En...
PPTX
Understanding the IBM Power Systems Advantage
PPTX
Light-weighted HDFS disaster recovery
PDF
Apache Spark Workshop at Hadoop Summit
PDF
IMCSummit 2015 - Day 2 General Session - Flash-Extending In-Memory Computing
PPTX
Insights into Real-world Data Management Challenges
PDF
EDB Postgres Platform
 
PDF
Architecting a Heterogeneous Data Platform Across Clusters, Regions, and Clouds
PDF
Protect your Private Data in your Hadoop Clusters with ORC Column Encryption
PDF
From limited Hadoop compute capacity to increased data scientist efficiency
PPTX
From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...
PPTX
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
PPTX
Apache Ignite vs Alluxio: Memory Speed Big Data Analytics
PPTX
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
PPTX
In Memory Data Grids, Demystified!
PPTX
Containerized Hadoop beyond Kubernetes
Bootstrapping state in Apache Flink
Db2 analytics accelerator on ibm integrated analytics system technical over...
HPE Keynote Hadoop Summit San Jose 2016
Big Data Case Study: Fortune 100 Telco
Key trends in Big Data and new reference architecture from Hewlett Packard En...
Understanding the IBM Power Systems Advantage
Light-weighted HDFS disaster recovery
Apache Spark Workshop at Hadoop Summit
IMCSummit 2015 - Day 2 General Session - Flash-Extending In-Memory Computing
Insights into Real-world Data Management Challenges
EDB Postgres Platform
 
Architecting a Heterogeneous Data Platform Across Clusters, Regions, and Clouds
Protect your Private Data in your Hadoop Clusters with ORC Column Encryption
From limited Hadoop compute capacity to increased data scientist efficiency
From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
Apache Ignite vs Alluxio: Memory Speed Big Data Analytics
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
In Memory Data Grids, Demystified!
Containerized Hadoop beyond Kubernetes
Ad

Viewers also liked (18)

PDF
Vertica mpp columnar dbms
PPTX
Vertica finalist interview
PPTX
Vertica the convertro way
PDF
Vertica 7.0 Architecture Overview
PPTX
Bridging Structured and Unstructred Data with Apache Hadoop and Vertica
PDF
Vertica loading best practices
ODP
Vertica
PDF
HP Vertica basics
PPTX
Vertica
PDF
A short introduction to Vertica
PPTX
HPE Vertica Chile Desayuno Oct 2016
PPTX
Vertica-Database
PPTX
Big Data Day LA 2015 - Scalable and High-Performance Analytics with Distribut...
PPT
Hadoop World Vertica
PDF
Hortonworks and HP Vertica Webinar
KEY
Hadoop Summit 2012 - Hadoop and Vertica: The Data Analytics Platform at Twitter
PPSX
Introduction to Vertica (Architecture & More)
Vertica mpp columnar dbms
Vertica finalist interview
Vertica the convertro way
Vertica 7.0 Architecture Overview
Bridging Structured and Unstructred Data with Apache Hadoop and Vertica
Vertica loading best practices
Vertica
HP Vertica basics
Vertica
A short introduction to Vertica
HPE Vertica Chile Desayuno Oct 2016
Vertica-Database
Big Data Day LA 2015 - Scalable and High-Performance Analytics with Distribut...
Hadoop World Vertica
Hortonworks and HP Vertica Webinar
Hadoop Summit 2012 - Hadoop and Vertica: The Data Analytics Platform at Twitter
Introduction to Vertica (Architecture & More)
Ad

Similar to Optimize Your Vertica Data Management Infrastructure (20)

PDF
PROACT SYNC 2013 - Breakout - CommVault IntelliSnap Recovery Manager de inzet...
PPTX
Key Architecture and Performance Principles to Optimize Data Management
PDF
Houd controle over uw data
PDF
Data Architecture Best Practices for Advanced Analytics
PPTX
Optimizing Data Management for MongoDB
PPTX
4 Ways To Save Big Money in Your Data Center and Private Cloud
PDF
Data Warehouse or Data Lake, Which Do I Choose?
PDF
Storage simplicity value_110810
PPTX
Enterprise data management for microsoft hd insight
PDF
Symantec Appliances Strategy Launch
PDF
Oracle Storage Cloud Conference
PPTX
Webinar | Introducing DataStax Enterprise 4.6
PDF
2022 Trends in Enterprise Analytics
PPTX
Building Confidence in Big Data - IBM Smarter Business 2013
PDF
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
PPTX
Data Mesh using Microsoft Fabric
PPTX
Veritas 360 data management
PDF
Logicalis Backup as a Service: Re-defining Data Protection
PDF
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
PDF
Oracle - Next Generation Datacenter - Alan Hartwell
PROACT SYNC 2013 - Breakout - CommVault IntelliSnap Recovery Manager de inzet...
Key Architecture and Performance Principles to Optimize Data Management
Houd controle over uw data
Data Architecture Best Practices for Advanced Analytics
Optimizing Data Management for MongoDB
4 Ways To Save Big Money in Your Data Center and Private Cloud
Data Warehouse or Data Lake, Which Do I Choose?
Storage simplicity value_110810
Enterprise data management for microsoft hd insight
Symantec Appliances Strategy Launch
Oracle Storage Cloud Conference
Webinar | Introducing DataStax Enterprise 4.6
2022 Trends in Enterprise Analytics
Building Confidence in Big Data - IBM Smarter Business 2013
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
Data Mesh using Microsoft Fabric
Veritas 360 data management
Logicalis Backup as a Service: Re-defining Data Protection
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
Oracle - Next Generation Datacenter - Alan Hartwell

Recently uploaded (20)

PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
Programs and apps: productivity, graphics, security and other tools
PPT
Teaching material agriculture food technology
PDF
Empathic Computing: Creating Shared Understanding
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Machine learning based COVID-19 study performance prediction
PDF
Encapsulation theory and applications.pdf
PPTX
sap open course for s4hana steps from ECC to s4
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Electronic commerce courselecture one. Pdf
20250228 LYD VKU AI Blended-Learning.pptx
NewMind AI Weekly Chronicles - August'25 Week I
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Understanding_Digital_Forensics_Presentation.pptx
Unlocking AI with Model Context Protocol (MCP)
Programs and apps: productivity, graphics, security and other tools
Teaching material agriculture food technology
Empathic Computing: Creating Shared Understanding
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Per capita expenditure prediction using model stacking based on satellite ima...
Machine learning based COVID-19 study performance prediction
Encapsulation theory and applications.pdf
sap open course for s4hana steps from ECC to s4
Digital-Transformation-Roadmap-for-Companies.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
MYSQL Presentation for SQL database connectivity
Encapsulation_ Review paper, used for researhc scholars
MIND Revenue Release Quarter 2 2025 Press Release
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Electronic commerce courselecture one. Pdf

Optimize Your Vertica Data Management Infrastructure

  • 1. Confidential and Proprietary1 Optimize Your Vertica Data Management Infrastructure Srinivas Vadlamani, Chief Architect January 2017
  • 2. Confidential and Proprietary2 My background Co-founder and Chief Architect at Talena. Prior to Talena, I was an early architect at Couchbase and Aster Data, helping design some of their key data management capabilities.
  • 3. Confidential and Proprietary3 ing Data Management Drivers Application Iteration Compliance 70% of businesses lost data over the past two years 40% of businesses hit by ransomware in 2016 Robust testing requires up-to-date data 90% of enterprises delay application rollouts waiting for production data Average Global 2000 company has seven copies of prod data Storage costs for archival growing by 35% yearly $900K average cost of a data loss incident $1.08M cost to implement manual test data efforts $300K average yearly cost of managing archives BUSINESSCHALLENGESFINANCIALIMPACT Data Loss Source: EMC, CA, Ponemon
  • 4. Confidential and Proprietary4 Key Big Data Protection Principles Replication and backup are not the same You need an incremental forever architecture It really is about recovery, not backup Even commodity storage is expensive
  • 5. Confidential and Proprietary5 Replication vs Backup Replication: ideal for hardware failures Backup: ideal to protect against human errors and application corruption Need both as part of your Vertica data protection strategy
  • 6. Confidential and Proprietary6 Why Incremental-Forever? Data volumes are getting too large for traditional backup methods. Backing up hundreds of terabytes on a weekly basis is not feasible as a backup policy
  • 7. Confidential and Proprietary7 A Recovery-centric Architecture How quickly you recover impacts your business and brand Your recovery architecture needs to handle changes to the production topology over time Your recovery flexibility (whole database or at a schema level) will influence your recovery point and recovery time objectives
  • 8. Confidential and Proprietary8 Speeding Up Application Delivery Real versus Synthetic Data Supporting Compliance Initiatives Minimizing Network Overhead
  • 9. Confidential and Proprietary9 What’s Different In The Cloud Metadata management is made more complex PROBLEM Storage optimization becomes that much more difficult PROBLEM Relying on traditional backup mechanisms does not scale PROBLEM
  • 10. Confidential and Proprietary10 The Talena Architecture • Deep de-duplication and compression with app-aware architecture • Incremental-forever backup architecture • High availability via erasure coding in distributed cluster architecture Smart Storage Optimizer
  • 11. Confidential and Proprietary11 The Talena Architecture Native querying and analytics via active compute layer Unbounded scale with a Hadoop-native architecture Smart Storage Optimizer Active Compute Services Distributed File System
  • 12. Confidential and Proprietary12 The Talena Architecture • Google-like catalog shortens data recovery time • Automatic schema generation for mirroring and backups • Granular recovery at an object level • Recovery to multiple topologies • Native integration with LDAP and Kerberos for authentication • Role-based access control defines specific privileges • Transparent data encryption • Masking for PII data Smart Storage Optimizer Active Compute Services Distributed File System Metadata Catalog Data Orchestration ServicesSecurity Services
  • 13. Confidential and Proprietary13 Smart Storage Optimizer The Talena Architecture GUI CLI API Active Compute Services Distributed File System • ‘Single pane of glass’ for multiple use cases and data platforms • Agentless architecture minimizes management overhead • GUI, CLI, REST-based Talena API options Metadata Catalog Data Orchestration ServicesSecurity Services
  • 14. Confidential and Proprietary14 Talena and vbr.py vbr.py Talena Recovery to different Vertica version No Yes Recovery to different Vertica topology No Yes Google-like metadata catalog for rapid discovery No Yes Built-in storage optimization No Yes UI for automated policy and workflow creation No Yes Ability to support test data management No Yes Inherent scalable infrastructure No Yes Data masking support No Yes Sampling support No Yes
  • 15. Confidential and Proprietary15 Q&A  We’ll send you a link to our eBook “The Vertica Backup Guide”  Additional resources: talena- inc.com/resources and talena-inc.com/blog  Ping us with any additional questions: info@talena- inc.com

Editor's Notes

  • #2: .
  • #4: Data Loss Source: https://www.emc.com/collateral/presentation/emc-dpi-key-findings-global.pdf Test Data Management Source: http://www.ca.com/content/dam/ca/us/files/industry-analyst-report/the-total-economic-impact-of-the-ca-technologies-test-data-manager-solution.pdf Compliance Source: http://www.ponemon.org/blog/the-true-cost-of-compliance-a-benchmark-study-of-multinational-organizations
  • #11: The next few slides will introduce the unique Talena architecture and highlight how this architecture delivers on these core business benefits. One of the most significant components of our architecture is our Smart Storage Optimizer. By integrating compute and storage management into our storage optimizer, we’re able to deliver significant cost savings. Our application-aware architecture enables us to do deep de-duplication and compression. Our backup process is incremental-forever, saving on storage costs, and by incorporating erasure coding we also ensure high availability no matter how large a Talena cluster you choose to deploy.
  • #13: Supports transparent data encryption in the security services section
  • #14: Our agentless architecture makes Talena an ideal solution for big data architectures and minimizes your operational overhead. Furthermore, Talena can support multiple data platforms, versions, and use cases in a single deployment of Talena, thereby providing a “single pane of glass” for all your big data management needs. While most of our clients work within our user interface, we also provide a REST-based API to accomplish the same tasks.