SlideShare a Scribd company logo
Tungsten Replicator for Elasticsearch - Real-time data
loading from Oracle and MySQL into Elasticsearch
Topics
In today’s webinar, we will discuss:
• Why Elasticsearch?
• How replication into Elasticsearch works
• Customizations and configurations
• Future direction
2
Why Elasticsearch
• Elasticsearch is a REST-based search and analytics engine
• Provides a quick method to store/search data and link back to source
• Very fast
• Great at doing very large scale analytics and aggregations
• Able to combine different forms and data (structured, unstructured, metrics)
• Easily scalable
• Provides different buckets and different types to allow multiple data repos
Data Collection and Organization with Elasticsearch
Tungsten
Replicator
Index
Index
Index
How Replication Works
DBMS
Logs
Download transactions
via network
Elasticsearch Applier
THL = Events + Metadata
Redo Logging
Master Replicator:
Extractor
THL
Slave Replicator:
Applier
THL
Redo ReaderGenerated
PLOG
Extractor gets event
data by reading the
PLOG files generated by
the Redo Reader (PLOG
generation requires the
DBMS service to be
online).
What Tungsten Replicator Does to Apply into Elasticsearch
• Takes an incoming row and converts it to a JSON document
• By default:
– Incoming schema/database name is used as the index name
– Incoming table name is used as the type
• Can also be explicitly set to an index/type to concentrate data (or use filters)
• Key is taken from the primary key information
– Can be converted/merged from multiple keys and formats
• Document is constructed with metadata and record information
• INSERTS create the data
• UPDATES update the existing data using the key information
• DELETE delete the record (but this can be disabled)
Message Structure
{
"_index" : "test",
"_type" : "messages",
"_id" : "99999",
"_version" : 1,
"found" : true,
"_source" : {
"msg" : "Hello Elasticsearch",
"committime" : "2017-06-23 19:02:22.0",
"id" : "99999",
"source_table" : "messages",
"source_schema" : "test"
}
}
Customizable Elements
• Documentation ID format
– pkey, pkeyus, tspkey, tspkeyus
• Embedded information
– Commit time
– Schema/table source information
• Ignore delete or update errors
• Whether to use schema and table names as index and type
• Custom index name and type names
• Whether to use a self generated ID or one derived from primary key
Demo
Future Direction
• Further customization of the structures and stored information
• Integration with the data combination and concentration tools
• DDL integration and control of the structures
– Currently TRUNCATE and CREATE/DROP TABLE are not replicated
• Improved data structures and record metadata
Next Steps
• If you are interested in knowing more about Tungsten Replicator and would like to try it out for
yourself, please contact our sales team who will be able to take you through the details and
setup a POC – sales@continuent.com
• Read the documentation at http://docs.continuent.com
• Follow us on Twitter @ continuent or www.facebook.com/Continuent
• Subscribe to our Tungsten University YouTube channel! http://tinyurl.com/TungstenUni
14
For more information, contact us:
Eero Teerikorpi
Founder, CEO
eero.teerikorpi@continuent.com
+1 (408) 431-3305
Eric Stone
COO
eric.stone@continuent.com
MC Brown
VP Products
mc.brown@continuent.com
Petri Virsunen
VP Marketing
petri.virsunen@continuent.com
+1 (408) 806-9860

More Related Content

PDF
Using ElasticSearch as a fast, flexible, and scalable solution to search occu...
PPTX
Your data layer - Choosing the right database solutions for the future
PPTX
Big Data Overview Part 1
PPTX
Survey on NoSQL integration
PDF
Análisis del roadmap del Elastic Stack
PDF
"TextMining with ElasticSearch", Saskia Vola, CEO at textminers.io
PPTX
BigData, NoSQL & ElasticSearch
PDF
Elk - An introduction
Using ElasticSearch as a fast, flexible, and scalable solution to search occu...
Your data layer - Choosing the right database solutions for the future
Big Data Overview Part 1
Survey on NoSQL integration
Análisis del roadmap del Elastic Stack
"TextMining with ElasticSearch", Saskia Vola, CEO at textminers.io
BigData, NoSQL & ElasticSearch
Elk - An introduction

What's hot (20)

PDF
Design of Experiments on Federator Polystore Architecture
PPTX
An Approach for RDF-based Semantic Access to NoSQL Repositories
PPTX
ElasticSearch for data mining
PDF
Why Elastic? @ 50th Vinitaly 2016
PPTX
Elasticsearch Arcihtecture & What's New in Version 5
PPTX
Log analysis using elk
PDF
Elastic Stack Roadmap
PDF
What Is ELK Stack | ELK Tutorial For Beginners | Elasticsearch Kibana | ELK S...
PPTX
Elastic search
PPTX
Elasticsearch - Zero to Hero
PPTX
Visualizing Austin's data with Elasticsearch and Kibana
PPTX
Elastic Stack Introduction
PPTX
Elasticsearch
PDF
Au cœur de la roadmap de la Suite Elastic
PPTX
Elastic meetup june16
PPT
Configuring elasticsearch for performance and scale
PDF
Introduction to elasticsearch
PPTX
NIH Data Commons Architecture Ideas
PPTX
ELK - Stack - Munich .net UG
PPTX
Db presentation google_megastore
Design of Experiments on Federator Polystore Architecture
An Approach for RDF-based Semantic Access to NoSQL Repositories
ElasticSearch for data mining
Why Elastic? @ 50th Vinitaly 2016
Elasticsearch Arcihtecture & What's New in Version 5
Log analysis using elk
Elastic Stack Roadmap
What Is ELK Stack | ELK Tutorial For Beginners | Elasticsearch Kibana | ELK S...
Elastic search
Elasticsearch - Zero to Hero
Visualizing Austin's data with Elasticsearch and Kibana
Elastic Stack Introduction
Elasticsearch
Au cœur de la roadmap de la Suite Elastic
Elastic meetup june16
Configuring elasticsearch for performance and scale
Introduction to elasticsearch
NIH Data Commons Architecture Ideas
ELK - Stack - Munich .net UG
Db presentation google_megastore
Ad

Similar to Webinar Slides: Tungsten Replicator for Elasticsearch - Real-time data loading from Oracle and MySQL into Elasticsearch (20)

PPTX
Elasticsearch - Scalability and Multitenancy
PDF
Elasticsearch Introduction at BigData meetup
PPT
Elk presentation1#3
PPTX
Structured Query Language powerpoint presentation
PPTX
PDF
Overview of data analytics service: Treasure Data Service
PPSX
Elasticsearch - basics and beyond
PDF
Roaring with elastic search sangam2018
PDF
(ATS6-PLAT02) Accelrys Catalog and Protocol Validation
PPTX
ElasticSearch as (only) datastore
PDF
Analyzing Semi-Structured Data At Volume In The Cloud
PDF
Basic Introduction to Crate @ ViennaDB Meetup
PDF
World2016_T5_S5_SQLServerFunctionalOverview
PPT
Oracle by Muhammad Iqbal
PPTX
Centralized log-management-with-elastic-stack
PPTX
How Clean is your Database? Data Scrubbing for all Skill Sets
PDF
2021 04-20 apache arrow and its impact on the database industry.pptx
PPTX
Dev nexus 2017
PPTX
Devnexus 2018
PDF
AWS CLOUD 2017 - Amazon Athena 및 Glue를 통한 빠른 데이터 질의 및 처리 기능 소개 (김상필 솔루션즈 아키텍트)
Elasticsearch - Scalability and Multitenancy
Elasticsearch Introduction at BigData meetup
Elk presentation1#3
Structured Query Language powerpoint presentation
Overview of data analytics service: Treasure Data Service
Elasticsearch - basics and beyond
Roaring with elastic search sangam2018
(ATS6-PLAT02) Accelrys Catalog and Protocol Validation
ElasticSearch as (only) datastore
Analyzing Semi-Structured Data At Volume In The Cloud
Basic Introduction to Crate @ ViennaDB Meetup
World2016_T5_S5_SQLServerFunctionalOverview
Oracle by Muhammad Iqbal
Centralized log-management-with-elastic-stack
How Clean is your Database? Data Scrubbing for all Skill Sets
2021 04-20 apache arrow and its impact on the database industry.pptx
Dev nexus 2017
Devnexus 2018
AWS CLOUD 2017 - Amazon Athena 및 Glue를 통한 빠른 데이터 질의 및 처리 기능 소개 (김상필 솔루션즈 아키텍트)
Ad

More from Continuent (20)

PDF
Tungsten Webinar: v6 & v7 Release Recap, and Beyond
PDF
Continuent Tungsten Value Proposition Webinar
PDF
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #7: ClusterControl
PDF
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #5: Oracle’s InnoDB Cluster
PDF
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #4: MS Azure Database MySQL
PDF
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #2: Galera Cluster
PDF
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #1: AWS Aurora
PDF
Webinar Slides: AWS Aurora MySQL Replacement: Break Away From Geo-Limitations...
PDF
Webinar Slides: No Data Loss MySQL: Guaranteed Credit Card Transaction Availa...
PDF
Webinar Slides: Intelligent Database Proxies: Routing & Transparent Failover
PPTX
Webinar Slides: High Volume MySQL HA: SaaS Continuous Operations with Terabyt...
PDF
Training Slides: 205 - Installing and Configuring Tungsten Dashboard
PDF
Training Slides: 352 - Tungsten Replicator for MongoDB & Kafka
PDF
Training Slides: 351 - Tungsten Replicator for Data Warehouses
PDF
Training Slides: 303 - Replicating out of a Cluster
PDF
Training Slides: 206 - Using the Tungsten Cluster AMI
PDF
Training Slides: 254 - Using the Tungsten Replicator AMI
PDF
Training Slides: 253 - Filter like a Pro
PDF
Training Slides: 252 - Monitoring & Troubleshooting
PDF
Training Slides: 302 - Securing Your Cluster With SSL
Tungsten Webinar: v6 & v7 Release Recap, and Beyond
Continuent Tungsten Value Proposition Webinar
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #7: ClusterControl
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #5: Oracle’s InnoDB Cluster
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #4: MS Azure Database MySQL
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #2: Galera Cluster
Webinar Slides: MySQL HA/DR/Geo-Scale - High Noon #1: AWS Aurora
Webinar Slides: AWS Aurora MySQL Replacement: Break Away From Geo-Limitations...
Webinar Slides: No Data Loss MySQL: Guaranteed Credit Card Transaction Availa...
Webinar Slides: Intelligent Database Proxies: Routing & Transparent Failover
Webinar Slides: High Volume MySQL HA: SaaS Continuous Operations with Terabyt...
Training Slides: 205 - Installing and Configuring Tungsten Dashboard
Training Slides: 352 - Tungsten Replicator for MongoDB & Kafka
Training Slides: 351 - Tungsten Replicator for Data Warehouses
Training Slides: 303 - Replicating out of a Cluster
Training Slides: 206 - Using the Tungsten Cluster AMI
Training Slides: 254 - Using the Tungsten Replicator AMI
Training Slides: 253 - Filter like a Pro
Training Slides: 252 - Monitoring & Troubleshooting
Training Slides: 302 - Securing Your Cluster With SSL

Recently uploaded (20)

PDF
RPKI Status Update, presented by Makito Lay at IDNOG 10
PDF
Decoding a Decade: 10 Years of Applied CTI Discipline
PPTX
Job_Card_System_Styled_lorem_ipsum_.pptx
PDF
Cloud-Scale Log Monitoring _ Datadog.pdf
PDF
Unit-1 introduction to cyber security discuss about how to secure a system
PDF
The Internet -By the Numbers, Sri Lanka Edition
PPTX
Module 1 - Cyber Law and Ethics 101.pptx
PDF
WebRTC in SignalWire - troubleshooting media negotiation
PPTX
June-4-Sermon-Powerpoint.pptx USE THIS FOR YOUR MOTIVATION
PDF
The New Creative Director: How AI Tools for Social Media Content Creation Are...
PDF
Introduction to the IoT system, how the IoT system works
DOCX
Unit-3 cyber security network security of internet system
PDF
Testing WebRTC applications at scale.pdf
PDF
An introduction to the IFRS (ISSB) Stndards.pdf
PDF
FINAL CALL-6th International Conference on Networks & IOT (NeTIOT 2025)
PDF
APNIC Update, presented at PHNOG 2025 by Shane Hermoso
PPT
Design_with_Watersergyerge45hrbgre4top (1).ppt
PPTX
INTERNET------BASICS-------UPDATED PPT PRESENTATION
PPT
isotopes_sddsadsaadasdasdasdasdsa1213.ppt
PPTX
Slides PPTX World Game (s) Eco Economic Epochs.pptx
RPKI Status Update, presented by Makito Lay at IDNOG 10
Decoding a Decade: 10 Years of Applied CTI Discipline
Job_Card_System_Styled_lorem_ipsum_.pptx
Cloud-Scale Log Monitoring _ Datadog.pdf
Unit-1 introduction to cyber security discuss about how to secure a system
The Internet -By the Numbers, Sri Lanka Edition
Module 1 - Cyber Law and Ethics 101.pptx
WebRTC in SignalWire - troubleshooting media negotiation
June-4-Sermon-Powerpoint.pptx USE THIS FOR YOUR MOTIVATION
The New Creative Director: How AI Tools for Social Media Content Creation Are...
Introduction to the IoT system, how the IoT system works
Unit-3 cyber security network security of internet system
Testing WebRTC applications at scale.pdf
An introduction to the IFRS (ISSB) Stndards.pdf
FINAL CALL-6th International Conference on Networks & IOT (NeTIOT 2025)
APNIC Update, presented at PHNOG 2025 by Shane Hermoso
Design_with_Watersergyerge45hrbgre4top (1).ppt
INTERNET------BASICS-------UPDATED PPT PRESENTATION
isotopes_sddsadsaadasdasdasdasdsa1213.ppt
Slides PPTX World Game (s) Eco Economic Epochs.pptx

Webinar Slides: Tungsten Replicator for Elasticsearch - Real-time data loading from Oracle and MySQL into Elasticsearch

  • 1. Tungsten Replicator for Elasticsearch - Real-time data loading from Oracle and MySQL into Elasticsearch
  • 2. Topics In today’s webinar, we will discuss: • Why Elasticsearch? • How replication into Elasticsearch works • Customizations and configurations • Future direction 2
  • 3. Why Elasticsearch • Elasticsearch is a REST-based search and analytics engine • Provides a quick method to store/search data and link back to source • Very fast • Great at doing very large scale analytics and aggregations • Able to combine different forms and data (structured, unstructured, metrics) • Easily scalable • Provides different buckets and different types to allow multiple data repos
  • 4. Data Collection and Organization with Elasticsearch Tungsten Replicator Index Index Index
  • 5. How Replication Works DBMS Logs Download transactions via network Elasticsearch Applier THL = Events + Metadata Redo Logging Master Replicator: Extractor THL Slave Replicator: Applier THL Redo ReaderGenerated PLOG Extractor gets event data by reading the PLOG files generated by the Redo Reader (PLOG generation requires the DBMS service to be online).
  • 6. What Tungsten Replicator Does to Apply into Elasticsearch • Takes an incoming row and converts it to a JSON document • By default: – Incoming schema/database name is used as the index name – Incoming table name is used as the type • Can also be explicitly set to an index/type to concentrate data (or use filters) • Key is taken from the primary key information – Can be converted/merged from multiple keys and formats • Document is constructed with metadata and record information • INSERTS create the data • UPDATES update the existing data using the key information • DELETE delete the record (but this can be disabled)
  • 7. Message Structure { "_index" : "test", "_type" : "messages", "_id" : "99999", "_version" : 1, "found" : true, "_source" : { "msg" : "Hello Elasticsearch", "committime" : "2017-06-23 19:02:22.0", "id" : "99999", "source_table" : "messages", "source_schema" : "test" } }
  • 8. Customizable Elements • Documentation ID format – pkey, pkeyus, tspkey, tspkeyus • Embedded information – Commit time – Schema/table source information • Ignore delete or update errors • Whether to use schema and table names as index and type • Custom index name and type names • Whether to use a self generated ID or one derived from primary key
  • 10. Future Direction • Further customization of the structures and stored information • Integration with the data combination and concentration tools • DDL integration and control of the structures – Currently TRUNCATE and CREATE/DROP TABLE are not replicated • Improved data structures and record metadata
  • 11. Next Steps • If you are interested in knowing more about Tungsten Replicator and would like to try it out for yourself, please contact our sales team who will be able to take you through the details and setup a POC – sales@continuent.com • Read the documentation at http://docs.continuent.com • Follow us on Twitter @ continuent or www.facebook.com/Continuent • Subscribe to our Tungsten University YouTube channel! http://tinyurl.com/TungstenUni 14
  • 12. For more information, contact us: Eero Teerikorpi Founder, CEO eero.teerikorpi@continuent.com +1 (408) 431-3305 Eric Stone COO eric.stone@continuent.com MC Brown VP Products mc.brown@continuent.com Petri Virsunen VP Marketing petri.virsunen@continuent.com +1 (408) 806-9860