SlideShare a Scribd company logo
Get More Out of 
MySQL with TokuDB 
Tim Callaghan 
VP/Engineering, Tokutek 
tim@tokutek.com 
@tmcallaghan
Tokutek: Database Performance Engines 
What is Tokutek? 
Tokutek® offers high performance and scalability for MySQL, 
MariaDB and MongoDB. Our easy-to-use open source solutions 
are compatible with your existing code and application 
infrastructure. 
Tokutek Performance Engines Remove Limitations 
• Improve insertion performance by 20X 
• Reduce HDD and flash storage requirements up to 90% 
• No need to rewrite code 
Tokutek Mission: 
Empower your database to handle the Big Data 
requirements of today’s applications
3 
A Global Customer Base
Housekeeping 
• This presentation will be available for replay 
following the event 
• We welcome your questions; please use the console 
on the right of your screen and we will answer 
following the presentation 
• A copy of the presentation is available upon request
Agenda 
Lets answer the following questions, “How can you…?” 
• Easily install and configure TokuDB. 
• Dramatically increase performance without rewriting 
code. 
• Reduce the total cost of your servers and storage. 
• Simply perform online schema changes. 
• Avoid becoming the support staff for your 
application. 
• And Q+A
How easy is it to install and 
configure TokuDB for 
MySQL or MariaDB?
What is TokuDB? 
• TokuDB = MySQL* Storage Engine + Patches** 
– * MySQL, MariaDB, Percona Server 
– ** Patches are required for full functionality 
– TokuDB is more than a plugin 
• Transactional, ACID + MVCC 
– Like InnoDB 
• Drop-in replacement for MySQL 
• Open Source 
– http://github.com/Tokutek/ft-engine
Where can I get TokuDB? 
• Tokutek offers MySQL 5.5 and MariaDB 5.5 builds 
– www.tokutek.com 
• MariaDB 5.5 and 10 
– www.mariadb.org 
– Also in MariaDB 5.5 from various package repositories 
• Experimental Percona Server 5.6 builds 
– www.percona.com
Is it truly a “drop in replacement”? 
• No Foreign Key support 
– you’ll need to drop them 
• No Windows or OSX binaries 
– Virtual machines are helpful in evaluations 
• No 32-bit builds 
• Otherwise, yes
How do I get started? 
• Start Fresh 
– create table <table> engine=tokudb; 
– mysqldump / load data infile 
• Use your existing MySQL data folder 
– alter table <table-to-convert> engine=tokudb; 
• Measure the differences 
– compression : load/convert your tables 
– performance : run your workload 
– online schema changes : add a column
Before you dive in – check you’re my.cnf 
• TokuDB uses sensible server parameter defaults, but 
• Be mindful of your memory 
– Reduce innodb_buffer_pool_size (InnoDB) and 
key_cache_size (MyISAM) 
– Especially if converting tables 
– tokudb_cache_size=?G 
– Defaults to 50% of RAM, I recommend 80% 
– tokudb_directio=1 
• Leave everything else alone
How can I dramatically 
increase performance without 
having to rewrite code?
Where does the performance come from? 
• Tokutek’s Fractal Tree® indexes 
– Much faster than B-trees in > RAM workloads 
– InnoDB and MyISAM use B-trees 
– Significant IO reduction 
– Messages defer IO on add/update/delete 
– All reads and writes are compressed 
– Enables users to add more indexes 
– Queries go faster 
• Lots of good webinar content on our website 
– www.tokutek.com/resources/webinars
How much can I reduce my IO? 
Converted from 
InnoDB to TokuDB
How fast can I insert data into TokuDB? 
• InnoDB’s B-trees 
– Fast until the index not longer fits in RAM 
• TokuDB’s Fractal Tree indexes 
– Start fast, stay fast! 
• iiBench benchmark 
– Insert 1 billion rows 
– 1000 inserts per batch 
– Auto-increment PK 
– 3 secondary indexes
How fast can I insert data into TokuDB?
How fast are mixed workloads? 
• Fast, since > RAM mixed workloads generally contain… 
– Index maintenance (insert, update, delete) 
– Fractal Tree indexes FTW! 
– Queries 
– TokuDB enables richer indexing (more indexes) 
• Sysbench benchmark 
– 16 tables, 50 million rows per table 
– Each Sysbench transaction contains 
– 1 of each query : point, range, aggregation 
– indexed update, unindexed update, delete, insert
How fast are mixed workloads?
How do secondary indexes work? 
• InnoDB and TokuDB “cluster” the primary key index 
– The key (PK) and all other columns are co-located in 
memory and on disk 
• Secondary indexes co-locate the “index key” and PK 
– When a candidate row is found a second lookup 
occurs into the PK index 
– This means an additional IO is required 
– MySQL’s “hidden join”
What is a clustered secondary index? 
• “Covering” indexes remove this second lookup, but 
require putting the right columns into the index 
– create index idx_1 on t1 (c1, c2, c3, c4, c5, c6); 
– If c1/c2 are queried, only c3/c4/c5/c6 are covered 
– No additional IO, but c7 isn’t covered 
• TokuDB supports clustered secondary indexes 
– create clustering index idx_1 on t1 (c1, c2); 
– All columns in t1 are covered, forever 
– Even if new columns are added to the table
What are clustered secondary indexes good at? 
• Two words, “RANGE SCANS” 
• Several rows (maybe thousands) are scanned without 
requiring additional lookups on the PK index 
• Also, TokuDB blocks are much larger than InnoDB 
– TokuDB = 4MB blocks = sequential IO 
– InnoDB = 16KB blocks = random IO 
• Can be orders of magnitude faster for range queries
Can SQL be optimized? 
• Fractal Tree indexes support message injection 
– The actual work (and IO) can be deferred 
• Example: 
– update t1 set k = k + 1 where pk = 5; 
– InnoDB follows read-modify-write pattern 
– If field “k” is not indexed, TokuDB avoids IO entirely 
– An “increment” message is injected 
• Current optimizations 
– “replace into”, “insert ignore”, “update”, “insert on 
duplicate key update”
How can I reduce the total cost 
of my servers and storage?
How can I use less storage? 
• Compression, compression, compression! 
• All IO in TokuDB is compressed 
– Reads and writes 
– Usually ~5x compression (but I’ve seen 25x or more) 
• TokuDB [currently] supports 3 compression algorithms 
– lzma = highest compression (and high CPU) 
– zlib = high compression (and much less CPU) 
– quicklz = medium compression (even less CPU) 
– pluggable architecture, lz4 and snappy “in the lab”
But doesn’t InnoDB support compression? 
• Yes, but the compression achieved is far lower 
– InnoDB compresses 16K blocks, TokuDB is 64K or 128K 
– InnoDB requires fixed on-disk size, TokuDB is flexible 
*log style data
But doesn’t InnoDB support compression? 
• And InnoDB performance is severely impacted by it 
– Compression “misses” are costly 
*iiBench workload
How do I compress my data in TokuDB? 
create table t1 (c1 bigint not null primary key) 
engine=tokudb 
row_format=[tokudb_lzma | tokudb_zlib | tokudb_quicklz]; 
NOTE: Compression is not optional in TokuDB, we use 
compression to provide performance advantages as well as save 
space.
How can I perform online 
schema changes?
What is an “online” schema change? 
My definition 
“An online schema change is the ability to add or drop a column 
on an existing table without blocking further changes to the 
table or requiring substantial server resources (CPU, RAM, IO, 
disk) to accomplish the operation.” 
P.S., I’d like for it to be instantaneous!
What do blocking schema changes look like?
How have online schema changes evolved? 
• MySQL 5.5 
– Table is read-only while entire table is re-created 
• “Manual” process 
– Take slave offline, apply to slave, catch up to master, 
switch places, repeat 
• MySQL 5.6 (and ~ Percona’s pt-online-schema-change-tool) 
– Table is rebuilt “in the background” 
– Changes are captured, and replayed on new table 
– Uses significant RAM, CPU, IO, and disk space 
• TokuDB 
– alter table t1 add column new_column bigint; 
– Done!
What online schema changes can TokuDB handle? 
• Add column 
• Drop column 
• Expand column 
– integer types 
– varchar, char, varbinary 
• Index creation
How can I avoid becoming the 
support staff for my 
application?
34 
Where can I get TokuDB support? 
TokuDB is offered in 2 editions 
• Community 
– Community support (Google Groups “tokudb-user”) 
• Enterprise subscription 
– Commercial support 
– Wouldn’t you rather be developing another application? 
– Extra features 
–Hot backup, more on the way 
– Access to TokuDB experts 
– Input to the product roadmap
35 
Tokutek: Database Performance Engines 
Any Questions? 
Download TokuDB at www.tokutek.com/products/downloads 
Register for product updates, access to premium content, and 
invitations at www.tokutek.com 
Join the Conversation

More Related Content

PPTX
Get More Out of MySQL with TokuDB
PDF
TokuDB 高科扩展性 MySQL 和 MariaDB 数据库
PPTX
Introduction to TokuDB v7.5 and Read Free Replication
PPTX
Percona FT / TokuDB
PDF
TokuDB - What You Need to Know
PDF
MySQL in the Hosted Cloud - Percona Live 2015
PDF
Bootstrapping Using Free Software
PDF
MHA: Getting started & moving past quirks percona live santa clara 2013
Get More Out of MySQL with TokuDB
TokuDB 高科扩展性 MySQL 和 MariaDB 数据库
Introduction to TokuDB v7.5 and Read Free Replication
Percona FT / TokuDB
TokuDB - What You Need to Know
MySQL in the Hosted Cloud - Percona Live 2015
Bootstrapping Using Free Software
MHA: Getting started & moving past quirks percona live santa clara 2013

What's hot (20)

PPTX
Get More Out of MongoDB with TokuMX
PDF
Databases in the hosted cloud
PDF
MariaDB with SphinxSE
PDF
Best practices for MySQL/MariaDB Server/Percona Server High Availability
PPTX
An introduction to SQL Server in-memory OLTP Engine
PDF
InnoDB Architecture and Performance Optimization, Peter Zaitsev
PDF
Databases in the Hosted Cloud
PPTX
Is It Fast? : Measuring MongoDB Performance
PDF
Capacity planning for your data stores
PDF
MariaDB 5.5 and what comes next - Percona Live NYC 2012
PDF
MySQL High Availability Solutions
PDF
MariaDB: The New M In LAMP - SCALE10x
PDF
Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest
PDF
Fractal Tree Indexes : From Theory to Practice
PDF
MariaDB: The 2012 Edition
PDF
Remote DBA Experts SQL Server 2008 New Features
PDF
Tuning Linux for your database FLOSSUK 2016
PDF
Distributions from the view a package
PDF
Lessons PostgreSQL learned from commercial databases, and didn’t
PDF
My first moments with MongoDB
Get More Out of MongoDB with TokuMX
Databases in the hosted cloud
MariaDB with SphinxSE
Best practices for MySQL/MariaDB Server/Percona Server High Availability
An introduction to SQL Server in-memory OLTP Engine
InnoDB Architecture and Performance Optimization, Peter Zaitsev
Databases in the Hosted Cloud
Is It Fast? : Measuring MongoDB Performance
Capacity planning for your data stores
MariaDB 5.5 and what comes next - Percona Live NYC 2012
MySQL High Availability Solutions
MariaDB: The New M In LAMP - SCALE10x
Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest
Fractal Tree Indexes : From Theory to Practice
MariaDB: The 2012 Edition
Remote DBA Experts SQL Server 2008 New Features
Tuning Linux for your database FLOSSUK 2016
Distributions from the view a package
Lessons PostgreSQL learned from commercial databases, and didn’t
My first moments with MongoDB
Ad

Viewers also liked (7)

PDF
O que é o SecondLife? Sabia que este pode catalisar a colaboração na sua empr...
PDF
Porque a-burocracia-deve-morrer
PDF
Ulteo virtual desktop system
ODP
softwares livres - open source
PDF
Open Source Vantagens E Beneficios - By Softelabs
PPT
Apresentacao Suse
PDF
Software Livre (Conceitos, contextualização histórica, licenças, sistemas ope...
O que é o SecondLife? Sabia que este pode catalisar a colaboração na sua empr...
Porque a-burocracia-deve-morrer
Ulteo virtual desktop system
softwares livres - open source
Open Source Vantagens E Beneficios - By Softelabs
Apresentacao Suse
Software Livre (Conceitos, contextualização histórica, licenças, sistemas ope...
Ad

Similar to 20140128 webinar-get-more-out-of-mysql-with-tokudb-140319063324-phpapp02 (20)

PDF
Toku DB by Aswin
PDF
Big challenges
PDF
MySQL Storage Engines - which do you use? TokuDB? MyRocks? InnoDB?
PPTX
Best storage engine for MySQL
PPTX
Handling Massive Writes
PDF
MySQL 8 Server Optimization Swanseacon 2018
PDF
MySQL 8 Tips and Tricks from Symfony USA 2018, San Francisco
PDF
TokuDB internals / Лесин Владислав (Percona)
PDF
30334823 my sql-cluster-performance-tuning-best-practices
PDF
MySQL Tokudb engine benchmark
PDF
Bogdan Kecman INIT Presentation
PDF
Introduction of MariaDB AX / TX
PDF
MariaDB - a MySQL Replacement #SELF2014
ODP
Bogdan Kecman Advanced Databasing
PDF
The History and Future of the MySQL ecosystem
PDF
Using ScyllaDB for Real-Time Write-Heavy Workloads
PDF
Understanding the architecture of MariaDB ColumnStore
PDF
Scaling, Tuning and Maintaining the Monolith
PDF
Storage Methods for Nonstandard Data Patterns
PDF
Introducing TiDB [Delivered: 09/27/18 at NYC SQL Meetup]
Toku DB by Aswin
Big challenges
MySQL Storage Engines - which do you use? TokuDB? MyRocks? InnoDB?
Best storage engine for MySQL
Handling Massive Writes
MySQL 8 Server Optimization Swanseacon 2018
MySQL 8 Tips and Tricks from Symfony USA 2018, San Francisco
TokuDB internals / Лесин Владислав (Percona)
30334823 my sql-cluster-performance-tuning-best-practices
MySQL Tokudb engine benchmark
Bogdan Kecman INIT Presentation
Introduction of MariaDB AX / TX
MariaDB - a MySQL Replacement #SELF2014
Bogdan Kecman Advanced Databasing
The History and Future of the MySQL ecosystem
Using ScyllaDB for Real-Time Write-Heavy Workloads
Understanding the architecture of MariaDB ColumnStore
Scaling, Tuning and Maintaining the Monolith
Storage Methods for Nonstandard Data Patterns
Introducing TiDB [Delivered: 09/27/18 at NYC SQL Meetup]

More from Francisco Gonçalves (20)

PDF
A automatização e virtualização do seu negócio
PDF
O rad da wave maker developing for the cloud
PDF
Scale out database apps através de galera cluster e maria db
PDF
Re pensando-virtualização-através-linux containers
PDF
O docker vai mudar tudo na sua infra estrutura-ti
PPTX
Teamwork Web Application
PPTX
Hypervisor "versus" Linux Containers with Docker !
PDF
MariaDB Galera Cluster presentation
PDF
Linux capacity planning
PPTX
Wavemaker RAD for the Cloud with CloudJee - Future Direction 2014
PDF
Juju on ubuntu cloud
PDF
Cloud foundry and openstackcloud
PDF
Ubuntu cloud infrastructures
PDF
Openstack deployment-with ubuntu
PDF
Desk top virtual_gbanif
PPT
Consolidação winintel
PPT
Legacy mainframe soa&workflow&documentcontentmgr fgon2007
PPT
Proj storage&backups&consolidaservidores&as400&pcov3
PPT
Softelab it strategies for 2010 and beyond
A automatização e virtualização do seu negócio
O rad da wave maker developing for the cloud
Scale out database apps através de galera cluster e maria db
Re pensando-virtualização-através-linux containers
O docker vai mudar tudo na sua infra estrutura-ti
Teamwork Web Application
Hypervisor "versus" Linux Containers with Docker !
MariaDB Galera Cluster presentation
Linux capacity planning
Wavemaker RAD for the Cloud with CloudJee - Future Direction 2014
Juju on ubuntu cloud
Cloud foundry and openstackcloud
Ubuntu cloud infrastructures
Openstack deployment-with ubuntu
Desk top virtual_gbanif
Consolidação winintel
Legacy mainframe soa&workflow&documentcontentmgr fgon2007
Proj storage&backups&consolidaservidores&as400&pcov3
Softelab it strategies for 2010 and beyond

Recently uploaded (20)

PDF
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PPTX
Reimagine Home Health with the Power of Agentic AI​
PDF
Odoo Companies in India – Driving Business Transformation.pdf
PDF
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PDF
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
PPTX
CHAPTER 2 - PM Management and IT Context
PPTX
Operating system designcfffgfgggggggvggggggggg
PDF
How to Migrate SBCGlobal Email to Yahoo Easily
PDF
wealthsignaloriginal-com-DS-text-... (1).pdf
PDF
Softaken Excel to vCard Converter Software.pdf
PDF
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
PDF
top salesforce developer skills in 2025.pdf
PDF
2025 Textile ERP Trends: SAP, Odoo & Oracle
PDF
medical staffing services at VALiNTRY
PPTX
Introduction to Artificial Intelligence
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PPTX
L1 - Introduction to python Backend.pptx
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
Upgrade and Innovation Strategies for SAP ERP Customers
Reimagine Home Health with the Power of Agentic AI​
Odoo Companies in India – Driving Business Transformation.pdf
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
Design an Analysis of Algorithms II-SECS-1021-03
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
CHAPTER 2 - PM Management and IT Context
Operating system designcfffgfgggggggvggggggggg
How to Migrate SBCGlobal Email to Yahoo Easily
wealthsignaloriginal-com-DS-text-... (1).pdf
Softaken Excel to vCard Converter Software.pdf
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
top salesforce developer skills in 2025.pdf
2025 Textile ERP Trends: SAP, Odoo & Oracle
medical staffing services at VALiNTRY
Introduction to Artificial Intelligence
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
L1 - Introduction to python Backend.pptx

20140128 webinar-get-more-out-of-mysql-with-tokudb-140319063324-phpapp02

  • 1. Get More Out of MySQL with TokuDB Tim Callaghan VP/Engineering, Tokutek tim@tokutek.com @tmcallaghan
  • 2. Tokutek: Database Performance Engines What is Tokutek? Tokutek® offers high performance and scalability for MySQL, MariaDB and MongoDB. Our easy-to-use open source solutions are compatible with your existing code and application infrastructure. Tokutek Performance Engines Remove Limitations • Improve insertion performance by 20X • Reduce HDD and flash storage requirements up to 90% • No need to rewrite code Tokutek Mission: Empower your database to handle the Big Data requirements of today’s applications
  • 3. 3 A Global Customer Base
  • 4. Housekeeping • This presentation will be available for replay following the event • We welcome your questions; please use the console on the right of your screen and we will answer following the presentation • A copy of the presentation is available upon request
  • 5. Agenda Lets answer the following questions, “How can you…?” • Easily install and configure TokuDB. • Dramatically increase performance without rewriting code. • Reduce the total cost of your servers and storage. • Simply perform online schema changes. • Avoid becoming the support staff for your application. • And Q+A
  • 6. How easy is it to install and configure TokuDB for MySQL or MariaDB?
  • 7. What is TokuDB? • TokuDB = MySQL* Storage Engine + Patches** – * MySQL, MariaDB, Percona Server – ** Patches are required for full functionality – TokuDB is more than a plugin • Transactional, ACID + MVCC – Like InnoDB • Drop-in replacement for MySQL • Open Source – http://github.com/Tokutek/ft-engine
  • 8. Where can I get TokuDB? • Tokutek offers MySQL 5.5 and MariaDB 5.5 builds – www.tokutek.com • MariaDB 5.5 and 10 – www.mariadb.org – Also in MariaDB 5.5 from various package repositories • Experimental Percona Server 5.6 builds – www.percona.com
  • 9. Is it truly a “drop in replacement”? • No Foreign Key support – you’ll need to drop them • No Windows or OSX binaries – Virtual machines are helpful in evaluations • No 32-bit builds • Otherwise, yes
  • 10. How do I get started? • Start Fresh – create table <table> engine=tokudb; – mysqldump / load data infile • Use your existing MySQL data folder – alter table <table-to-convert> engine=tokudb; • Measure the differences – compression : load/convert your tables – performance : run your workload – online schema changes : add a column
  • 11. Before you dive in – check you’re my.cnf • TokuDB uses sensible server parameter defaults, but • Be mindful of your memory – Reduce innodb_buffer_pool_size (InnoDB) and key_cache_size (MyISAM) – Especially if converting tables – tokudb_cache_size=?G – Defaults to 50% of RAM, I recommend 80% – tokudb_directio=1 • Leave everything else alone
  • 12. How can I dramatically increase performance without having to rewrite code?
  • 13. Where does the performance come from? • Tokutek’s Fractal Tree® indexes – Much faster than B-trees in > RAM workloads – InnoDB and MyISAM use B-trees – Significant IO reduction – Messages defer IO on add/update/delete – All reads and writes are compressed – Enables users to add more indexes – Queries go faster • Lots of good webinar content on our website – www.tokutek.com/resources/webinars
  • 14. How much can I reduce my IO? Converted from InnoDB to TokuDB
  • 15. How fast can I insert data into TokuDB? • InnoDB’s B-trees – Fast until the index not longer fits in RAM • TokuDB’s Fractal Tree indexes – Start fast, stay fast! • iiBench benchmark – Insert 1 billion rows – 1000 inserts per batch – Auto-increment PK – 3 secondary indexes
  • 16. How fast can I insert data into TokuDB?
  • 17. How fast are mixed workloads? • Fast, since > RAM mixed workloads generally contain… – Index maintenance (insert, update, delete) – Fractal Tree indexes FTW! – Queries – TokuDB enables richer indexing (more indexes) • Sysbench benchmark – 16 tables, 50 million rows per table – Each Sysbench transaction contains – 1 of each query : point, range, aggregation – indexed update, unindexed update, delete, insert
  • 18. How fast are mixed workloads?
  • 19. How do secondary indexes work? • InnoDB and TokuDB “cluster” the primary key index – The key (PK) and all other columns are co-located in memory and on disk • Secondary indexes co-locate the “index key” and PK – When a candidate row is found a second lookup occurs into the PK index – This means an additional IO is required – MySQL’s “hidden join”
  • 20. What is a clustered secondary index? • “Covering” indexes remove this second lookup, but require putting the right columns into the index – create index idx_1 on t1 (c1, c2, c3, c4, c5, c6); – If c1/c2 are queried, only c3/c4/c5/c6 are covered – No additional IO, but c7 isn’t covered • TokuDB supports clustered secondary indexes – create clustering index idx_1 on t1 (c1, c2); – All columns in t1 are covered, forever – Even if new columns are added to the table
  • 21. What are clustered secondary indexes good at? • Two words, “RANGE SCANS” • Several rows (maybe thousands) are scanned without requiring additional lookups on the PK index • Also, TokuDB blocks are much larger than InnoDB – TokuDB = 4MB blocks = sequential IO – InnoDB = 16KB blocks = random IO • Can be orders of magnitude faster for range queries
  • 22. Can SQL be optimized? • Fractal Tree indexes support message injection – The actual work (and IO) can be deferred • Example: – update t1 set k = k + 1 where pk = 5; – InnoDB follows read-modify-write pattern – If field “k” is not indexed, TokuDB avoids IO entirely – An “increment” message is injected • Current optimizations – “replace into”, “insert ignore”, “update”, “insert on duplicate key update”
  • 23. How can I reduce the total cost of my servers and storage?
  • 24. How can I use less storage? • Compression, compression, compression! • All IO in TokuDB is compressed – Reads and writes – Usually ~5x compression (but I’ve seen 25x or more) • TokuDB [currently] supports 3 compression algorithms – lzma = highest compression (and high CPU) – zlib = high compression (and much less CPU) – quicklz = medium compression (even less CPU) – pluggable architecture, lz4 and snappy “in the lab”
  • 25. But doesn’t InnoDB support compression? • Yes, but the compression achieved is far lower – InnoDB compresses 16K blocks, TokuDB is 64K or 128K – InnoDB requires fixed on-disk size, TokuDB is flexible *log style data
  • 26. But doesn’t InnoDB support compression? • And InnoDB performance is severely impacted by it – Compression “misses” are costly *iiBench workload
  • 27. How do I compress my data in TokuDB? create table t1 (c1 bigint not null primary key) engine=tokudb row_format=[tokudb_lzma | tokudb_zlib | tokudb_quicklz]; NOTE: Compression is not optional in TokuDB, we use compression to provide performance advantages as well as save space.
  • 28. How can I perform online schema changes?
  • 29. What is an “online” schema change? My definition “An online schema change is the ability to add or drop a column on an existing table without blocking further changes to the table or requiring substantial server resources (CPU, RAM, IO, disk) to accomplish the operation.” P.S., I’d like for it to be instantaneous!
  • 30. What do blocking schema changes look like?
  • 31. How have online schema changes evolved? • MySQL 5.5 – Table is read-only while entire table is re-created • “Manual” process – Take slave offline, apply to slave, catch up to master, switch places, repeat • MySQL 5.6 (and ~ Percona’s pt-online-schema-change-tool) – Table is rebuilt “in the background” – Changes are captured, and replayed on new table – Uses significant RAM, CPU, IO, and disk space • TokuDB – alter table t1 add column new_column bigint; – Done!
  • 32. What online schema changes can TokuDB handle? • Add column • Drop column • Expand column – integer types – varchar, char, varbinary • Index creation
  • 33. How can I avoid becoming the support staff for my application?
  • 34. 34 Where can I get TokuDB support? TokuDB is offered in 2 editions • Community – Community support (Google Groups “tokudb-user”) • Enterprise subscription – Commercial support – Wouldn’t you rather be developing another application? – Extra features –Hot backup, more on the way – Access to TokuDB experts – Input to the product roadmap
  • 35. 35 Tokutek: Database Performance Engines Any Questions? Download TokuDB at www.tokutek.com/products/downloads Register for product updates, access to premium content, and invitations at www.tokutek.com Join the Conversation

Editor's Notes