SlideShare a Scribd company logo
Greg Grubbs, Product Manager | Big Data
Jorge A. Lopez, Director Marketing | Big Data
DMX-h Release 8
Syncsort Confidential and Proprietary - do not copy or distribute
2
Introducing Syncsort DMX-h Release 8
 Intelligent Execution Layer isolates users from
underlying complexities of Hadoop
 Design once. Deploy anywhere - with or without
Hadoop; on-premises or in the Cloud
 Provides the most complete, end-to-end solution for
offloading heavy legacy workloads to Hadoop
 Delivers best-in-class, one-step data ingestion
capabilities for Hadoop: mainframes, RDBMS, MPP,
JSON, Avro/Parquet, NoSQL, and more
 Facilitates metadata management and data lineage
by automatically updating HCatalog when loading to
Hive, Avro and Parquet
Syncsort Confidential and Proprietary - do not copy or distribute
3
A Complete Solution to Harness the Power of Hadoop
Syncsort Confidential and Proprietary - do not copy or distribute
4
Collect
Build Your Enterprise Data Hub
Hadoop + DMX-h
Avro
Parquet
Cassandra
MongoDB
Mainframe
Vertica
Oracle
Teradata
Netezza
JSON HBaseFiles
Cloud
• Collect virtually any data from mainframe to Big Data and NoSQL sources
• Access, re-format and load data directly into Avro & Parquet. No staging
required
• Access & translate mainframe data using Sqoop and Spark
• Load more data into Hadoop in less time. Let DMX-h dynamically split the data
and load it to HDFS in parallel
Syncsort Confidential and Proprietary - do not copy or distribute
5
Hive and HCatalog
Read & write to Hive
Support for multiple file formats including
text, Avro, Parquet
Metadata using HCatalog (Hive Meta
Store)
Dynamically parallelize load to Hive
0.00
2.00
4.00
6.00
8.00
10.00
12.00
1.01 2.03 4.50
Hour
Hive Write Throughput (TB/hour)
DMX ODBC Parallel
Hive Command
Syncsort Confidential and Proprietary - do not copy or distribute
6
Prepare
Get Your Data Ready for Analytics
• Sort
• Cleanse
• Partition
• Translate
• Reformat
• Compress
• Validate
• Prepare your data on-the-fly at lightning speeds before
loading into Hadoop
• Increase data compression ratios by up to 10x
• Achieve significant storage savings
Hadoop + DMX-h
Syncsort Confidential and Proprietary - do not copy or distribute
7
Blend
Find Bigger Insights by Combining New
and Legacy Data
• Fastest, most efficient data joins
• Best-in-class mainframe data access & translation
• Common user experience with or without Hadoop!
• No need to worry about mappers, reducers, big side, small side and so on
• No code to generate, compile, maintain or tune!
Mainframe
JSON
RDBMS
Syncsort Confidential and Proprietary - do not copy or distribute
8
Transform
Design Once, Deploy Anywhere!
• Free your users from the underlying complexities of Hadoop
• Visually design data transformations once, and run anywhere
• No changes or tuning required
• Intelligent Execution Layer dynamically optimizes the job for each platform:
Hadoop, Windows, Unix, Linux or Cloud
• Future-proof your applications!
Intelligent
ExecutionLayer
Windows, Linux, Unix
Hadoop
Cloud
Syncsort Confidential and Proprietary - do not copy or distribute
9
DEMO
DEMO
Syncsort Confidential and Proprietary - do not copy or distribute
10
Distribute
Achieve the Fastest Path from Raw Data to Insight
• Create Tableau & Qlikview files with one click
• Achieve the fastest data loads without tuning hassles:
• Fastest parallel loads to Greenplum, Netezza, Teradata & Vertica
• High-performance connectivity to Big Data & NoSQL databases such as
Cassandra, Hbase & MongoDB
Hadoop + DMX-h
NoSQL
Syncsort Confidential and Proprietary - do not copy or distribute
11
Not Using Hadoop?
 Single Design Experience =
ETL Anywhere!
 Best-in-class Data Visualizations,
Just a Click Away
 Complete Access to All Your Data,
Big or Small
 Web Based Monitoring &
Administration
 Secure Mainframe Data Access
Intelligent Execution Layer
Windows, Linux,
Unix, Cloud, and
more… when
you’re ready
Syncsort Confidential and Proprietary - do not copy or distribute
12
DEMO
DEMO
Syncsort Confidential and Proprietary - do not copy or distribute
13
Plus… The Only Tool Specifically Designed for EDW Offload
Now with automatic DTL
generation!
• Web-based utility
• Takes SQL as an input
• Provides visual analysis of SQL
ELT jobs
• Generates metadata and data
migration with DMX jobs
• Supports ANSI-SQL 2011, BTEQ,
Netezza, Oracle PL/SQL
Syncsort Confidential and Proprietary - do not copy or distribute
14
 Save users from underlying Hadoop complexities
 Future-proof your applications. Design once, deploy anywhere!
 Offload heavy ELT workloads to Hadoop
 Secure, monitor, manage and scale with minimum effort
Sign up for a Free
Trial!
Break Free from ETL Complexity
Experience DMX & DMX-h Release 8
Syncsort.com/dmxh8
Watch this webcast on demand – including the product demos!
http://bit.ly/1wI1SRN
Big Data Education Webcast: Introducing DMX and DMX-h Release 8

More Related Content

PPTX
How Experian increased insights with Hadoop
PDF
How to Succeed in Hadoop: comScore’s Deceptively Simple Secrets to Deploying ...
PDF
Simplifying Big Data Integration with Syncsort DMX and DMX-h
PDF
Big Data Customer Education Webcast: The Latest Advancements in Syncsort DMX ...
PPTX
How do spark_kafka_and_syncsort_dmx-h
PPTX
Data Engineer's Lunch #55: Get Started in Data Engineering
PPTX
SQL Server on Linux - march 2017
PDF
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
How Experian increased insights with Hadoop
How to Succeed in Hadoop: comScore’s Deceptively Simple Secrets to Deploying ...
Simplifying Big Data Integration with Syncsort DMX and DMX-h
Big Data Customer Education Webcast: The Latest Advancements in Syncsort DMX ...
How do spark_kafka_and_syncsort_dmx-h
Data Engineer's Lunch #55: Get Started in Data Engineering
SQL Server on Linux - march 2017
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...

What's hot (20)

PDF
Spark meetup - Zoomdata Streaming
PPTX
Big Data at your Desk with KNIME
PDF
Family data sheet HP Virtual Connect(May 2013)
PDF
IBM Power8 announce
PPTX
Seamless, Real-Time Data Integration with Connect
PPTX
Solr + Hadoop: Interactive Search for Hadoop
PDF
Key trends in Big Data and new reference architecture from Hewlett Packard En...
PPTX
The DAP - Where YARN, HBase, Kafka and Spark go to Production
PDF
Startup Case Study: Leveraging the Broad Hadoop Ecosystem to Develop World-Fi...
PPTX
Preventative Maintenance of Robots in Automotive Industry
PPTX
Big Data Case Study: Fortune 100 Telco
PPTX
Extending Twitter's Data Platform to Google Cloud
PPTX
LLAP: Sub-Second Analytical Queries in Hive
PDF
Open Innovation with Power Systems
PDF
Big Data Q2 Customer Education Webcast: New DMX Change Data Capture for Hadoo...
PPTX
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
PPTX
Scaling Deep Learning on Hadoop at LinkedIn
PPTX
Built-In Security for the Cloud
PPTX
Achieving cloud scale with microservices based applications on azure
PPTX
DEVNET-1166 Open SDN Controller APIs
Spark meetup - Zoomdata Streaming
Big Data at your Desk with KNIME
Family data sheet HP Virtual Connect(May 2013)
IBM Power8 announce
Seamless, Real-Time Data Integration with Connect
Solr + Hadoop: Interactive Search for Hadoop
Key trends in Big Data and new reference architecture from Hewlett Packard En...
The DAP - Where YARN, HBase, Kafka and Spark go to Production
Startup Case Study: Leveraging the Broad Hadoop Ecosystem to Develop World-Fi...
Preventative Maintenance of Robots in Automotive Industry
Big Data Case Study: Fortune 100 Telco
Extending Twitter's Data Platform to Google Cloud
LLAP: Sub-Second Analytical Queries in Hive
Open Innovation with Power Systems
Big Data Q2 Customer Education Webcast: New DMX Change Data Capture for Hadoo...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Scaling Deep Learning on Hadoop at LinkedIn
Built-In Security for the Cloud
Achieving cloud scale with microservices based applications on azure
DEVNET-1166 Open SDN Controller APIs
Ad

Similar to Big Data Education Webcast: Introducing DMX and DMX-h Release 8 (20)

PDF
End-to-End, Source to Analytics, Data Lineage with Syncsort DMX-h
PDF
Customer Education Webcast: New Features in Data Integration and Streaming CDC
PDF
Keeping Data in Sync with Syncsort
PDF
What’s New in Syncsort Integrate? New User Experience for Fast Data Onboarding
PDF
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
PDF
Fueling AI & Machine Learning: Legacy Data as a Competitive Advantage
PPTX
Why Hadoop is important to Syncsort
PDF
Syncsort et le retour d'expérience ComScore
PPTX
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
PPTX
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
PPTX
Hug syncsort etl hadoop big data
PPTX
Simplifying and Future-Proofing Hadoop
PDF
Hadoop is Happening
PDF
Performance advantages of Hadoop ETL offload with the Intel processor-powered...
PPTX
Data Con LA 2018 - Populating your Enterprise Data Hub for Next Gen Analytics...
PPTX
Syncsort, Tableau, & Cloudera present: Break the Barriers to Big Data Insight
PPTX
How to Leverage Mainframe Data with Hadoop: Bridging the Gap Between Big Iron...
PPTX
Syncsort, Tableau, & Cloudera present: Break the Barriers to Big Data Insight
PPTX
Syncsort, Tableau, & Cloudera present: Break the Barriers to Big Data Insight
PDF
Design advantages of Hadoop ETL offload with the Intel processor-powered Dell...
End-to-End, Source to Analytics, Data Lineage with Syncsort DMX-h
Customer Education Webcast: New Features in Data Integration and Streaming CDC
Keeping Data in Sync with Syncsort
What’s New in Syncsort Integrate? New User Experience for Fast Data Onboarding
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Fueling AI & Machine Learning: Legacy Data as a Competitive Advantage
Why Hadoop is important to Syncsort
Syncsort et le retour d'expérience ComScore
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Use Cases from Batch to Streaming, MapReduce to Spark, Mainframe to Cloud: To...
Hug syncsort etl hadoop big data
Simplifying and Future-Proofing Hadoop
Hadoop is Happening
Performance advantages of Hadoop ETL offload with the Intel processor-powered...
Data Con LA 2018 - Populating your Enterprise Data Hub for Next Gen Analytics...
Syncsort, Tableau, & Cloudera present: Break the Barriers to Big Data Insight
How to Leverage Mainframe Data with Hadoop: Bridging the Gap Between Big Iron...
Syncsort, Tableau, & Cloudera present: Break the Barriers to Big Data Insight
Syncsort, Tableau, & Cloudera present: Break the Barriers to Big Data Insight
Design advantages of Hadoop ETL offload with the Intel processor-powered Dell...
Ad

More from Precisely (20)

PDF
The Future of Automation: AI, APIs, and Cloud Modernization.pdf
PDF
Unlock new opportunities with location data.pdf
PDF
Reimagining Insurance: Connected Data for Confident Decisions.pdf
PDF
Introducing Syncsort™ Storage Management.pdf
PDF
Enable Enterprise-Ready Security on IBM i Systems.pdf
PDF
A Day in the Life of Location Data - Turning Where into How.pdf
PDF
Get More from Fiori Automation - What’s New, What Works, and What’s Next.pdf
PDF
Solving the CIO’s Dilemma: Speed, Scale, and Smarter SAP Modernization.pdf
PDF
Solving the Data Disconnect: Why Success Hinges on Pre-Linked Data.pdf
PDF
Cooking Up Clean Addresses - 3 Ways to Whip Messy Data into Shape.pdf
PDF
Building Confidence in AI & Analytics with High-Integrity Location Data.pdf
PDF
SAP Modernization Strategies for a Successful S/4HANA Journey.pdf
PDF
Precisely Demo Showcase: Powering ServiceNow Discovery with Precisely Ironstr...
PDF
The 2025 Guide on What's Next for Automation.pdf
PDF
Outdated Tech, Invisible Expenses – How Data Silos Undermine Operational Effi...
PDF
Modernización de SAP: Maximizando el Valor de su Migración a SAP S/4HANA.pdf
PDF
Outdated Tech, Invisible Expenses – The Hidden Cost of Disconnected Data Syst...
PDF
Migration vers SAP S/4HANA: Un levier stratégique pour votre transformation d...
PDF
Outdated Tech, Invisible Expenses: The Hidden Cost of Poor Data Integration o...
PDF
The Changing Compliance Landscape in 2025.pdf
The Future of Automation: AI, APIs, and Cloud Modernization.pdf
Unlock new opportunities with location data.pdf
Reimagining Insurance: Connected Data for Confident Decisions.pdf
Introducing Syncsort™ Storage Management.pdf
Enable Enterprise-Ready Security on IBM i Systems.pdf
A Day in the Life of Location Data - Turning Where into How.pdf
Get More from Fiori Automation - What’s New, What Works, and What’s Next.pdf
Solving the CIO’s Dilemma: Speed, Scale, and Smarter SAP Modernization.pdf
Solving the Data Disconnect: Why Success Hinges on Pre-Linked Data.pdf
Cooking Up Clean Addresses - 3 Ways to Whip Messy Data into Shape.pdf
Building Confidence in AI & Analytics with High-Integrity Location Data.pdf
SAP Modernization Strategies for a Successful S/4HANA Journey.pdf
Precisely Demo Showcase: Powering ServiceNow Discovery with Precisely Ironstr...
The 2025 Guide on What's Next for Automation.pdf
Outdated Tech, Invisible Expenses – How Data Silos Undermine Operational Effi...
Modernización de SAP: Maximizando el Valor de su Migración a SAP S/4HANA.pdf
Outdated Tech, Invisible Expenses – The Hidden Cost of Disconnected Data Syst...
Migration vers SAP S/4HANA: Un levier stratégique pour votre transformation d...
Outdated Tech, Invisible Expenses: The Hidden Cost of Poor Data Integration o...
The Changing Compliance Landscape in 2025.pdf

Recently uploaded (20)

PPTX
Transform Your Business with a Software ERP System
PPTX
L1 - Introduction to python Backend.pptx
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PPTX
Operating system designcfffgfgggggggvggggggggg
PDF
System and Network Administraation Chapter 3
PDF
PTS Company Brochure 2025 (1).pdf.......
PPTX
ManageIQ - Sprint 268 Review - Slide Deck
PDF
Wondershare Filmora 15 Crack With Activation Key [2025
PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PDF
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
PDF
2025 Textile ERP Trends: SAP, Odoo & Oracle
PDF
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
PDF
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
PPTX
Introduction to Artificial Intelligence
PDF
Digital Strategies for Manufacturing Companies
PDF
How to Migrate SBCGlobal Email to Yahoo Easily
PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PPTX
CHAPTER 2 - PM Management and IT Context
PDF
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
Transform Your Business with a Software ERP System
L1 - Introduction to python Backend.pptx
Upgrade and Innovation Strategies for SAP ERP Customers
Operating system designcfffgfgggggggvggggggggg
System and Network Administraation Chapter 3
PTS Company Brochure 2025 (1).pdf.......
ManageIQ - Sprint 268 Review - Slide Deck
Wondershare Filmora 15 Crack With Activation Key [2025
VVF-Customer-Presentation2025-Ver1.9.pptx
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
2025 Textile ERP Trends: SAP, Odoo & Oracle
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
Introduction to Artificial Intelligence
Digital Strategies for Manufacturing Companies
How to Migrate SBCGlobal Email to Yahoo Easily
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
CHAPTER 2 - PM Management and IT Context
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...

Big Data Education Webcast: Introducing DMX and DMX-h Release 8

  • 1. Greg Grubbs, Product Manager | Big Data Jorge A. Lopez, Director Marketing | Big Data DMX-h Release 8
  • 2. Syncsort Confidential and Proprietary - do not copy or distribute 2 Introducing Syncsort DMX-h Release 8  Intelligent Execution Layer isolates users from underlying complexities of Hadoop  Design once. Deploy anywhere - with or without Hadoop; on-premises or in the Cloud  Provides the most complete, end-to-end solution for offloading heavy legacy workloads to Hadoop  Delivers best-in-class, one-step data ingestion capabilities for Hadoop: mainframes, RDBMS, MPP, JSON, Avro/Parquet, NoSQL, and more  Facilitates metadata management and data lineage by automatically updating HCatalog when loading to Hive, Avro and Parquet
  • 3. Syncsort Confidential and Proprietary - do not copy or distribute 3 A Complete Solution to Harness the Power of Hadoop
  • 4. Syncsort Confidential and Proprietary - do not copy or distribute 4 Collect Build Your Enterprise Data Hub Hadoop + DMX-h Avro Parquet Cassandra MongoDB Mainframe Vertica Oracle Teradata Netezza JSON HBaseFiles Cloud • Collect virtually any data from mainframe to Big Data and NoSQL sources • Access, re-format and load data directly into Avro & Parquet. No staging required • Access & translate mainframe data using Sqoop and Spark • Load more data into Hadoop in less time. Let DMX-h dynamically split the data and load it to HDFS in parallel
  • 5. Syncsort Confidential and Proprietary - do not copy or distribute 5 Hive and HCatalog Read & write to Hive Support for multiple file formats including text, Avro, Parquet Metadata using HCatalog (Hive Meta Store) Dynamically parallelize load to Hive 0.00 2.00 4.00 6.00 8.00 10.00 12.00 1.01 2.03 4.50 Hour Hive Write Throughput (TB/hour) DMX ODBC Parallel Hive Command
  • 6. Syncsort Confidential and Proprietary - do not copy or distribute 6 Prepare Get Your Data Ready for Analytics • Sort • Cleanse • Partition • Translate • Reformat • Compress • Validate • Prepare your data on-the-fly at lightning speeds before loading into Hadoop • Increase data compression ratios by up to 10x • Achieve significant storage savings Hadoop + DMX-h
  • 7. Syncsort Confidential and Proprietary - do not copy or distribute 7 Blend Find Bigger Insights by Combining New and Legacy Data • Fastest, most efficient data joins • Best-in-class mainframe data access & translation • Common user experience with or without Hadoop! • No need to worry about mappers, reducers, big side, small side and so on • No code to generate, compile, maintain or tune! Mainframe JSON RDBMS
  • 8. Syncsort Confidential and Proprietary - do not copy or distribute 8 Transform Design Once, Deploy Anywhere! • Free your users from the underlying complexities of Hadoop • Visually design data transformations once, and run anywhere • No changes or tuning required • Intelligent Execution Layer dynamically optimizes the job for each platform: Hadoop, Windows, Unix, Linux or Cloud • Future-proof your applications! Intelligent ExecutionLayer Windows, Linux, Unix Hadoop Cloud
  • 9. Syncsort Confidential and Proprietary - do not copy or distribute 9 DEMO DEMO
  • 10. Syncsort Confidential and Proprietary - do not copy or distribute 10 Distribute Achieve the Fastest Path from Raw Data to Insight • Create Tableau & Qlikview files with one click • Achieve the fastest data loads without tuning hassles: • Fastest parallel loads to Greenplum, Netezza, Teradata & Vertica • High-performance connectivity to Big Data & NoSQL databases such as Cassandra, Hbase & MongoDB Hadoop + DMX-h NoSQL
  • 11. Syncsort Confidential and Proprietary - do not copy or distribute 11 Not Using Hadoop?  Single Design Experience = ETL Anywhere!  Best-in-class Data Visualizations, Just a Click Away  Complete Access to All Your Data, Big or Small  Web Based Monitoring & Administration  Secure Mainframe Data Access Intelligent Execution Layer Windows, Linux, Unix, Cloud, and more… when you’re ready
  • 12. Syncsort Confidential and Proprietary - do not copy or distribute 12 DEMO DEMO
  • 13. Syncsort Confidential and Proprietary - do not copy or distribute 13 Plus… The Only Tool Specifically Designed for EDW Offload Now with automatic DTL generation! • Web-based utility • Takes SQL as an input • Provides visual analysis of SQL ELT jobs • Generates metadata and data migration with DMX jobs • Supports ANSI-SQL 2011, BTEQ, Netezza, Oracle PL/SQL
  • 14. Syncsort Confidential and Proprietary - do not copy or distribute 14  Save users from underlying Hadoop complexities  Future-proof your applications. Design once, deploy anywhere!  Offload heavy ELT workloads to Hadoop  Secure, monitor, manage and scale with minimum effort Sign up for a Free Trial! Break Free from ETL Complexity Experience DMX & DMX-h Release 8 Syncsort.com/dmxh8 Watch this webcast on demand – including the product demos! http://bit.ly/1wI1SRN

Editor's Notes

  • #4: So when you take a look at what we do as company, you will notice we no longer use the term ETL. Big Data requires a new approach. ETL tools that failed in the past are even less suitable for Hadoop. For most legacy ETL tools, Hadoop is not the cornerstone, it is an after-thought: They have heavy stacks with software that you don’t need and dependencies on lots of 3rd party components They are a result of patchwork that now generates highly inefficient code On the other hand, newer tools do not provide a complete solution, require lots of hand coding and most of them lack enterprise-grade capabilities   Syncsort DMX-h was designed specifically for Hadoop – combining a long history of innovation with significant contributions Syncsort has made to improve Apache Hadoop. DMX-h is deployed as part of the Hadoop Cluster Delivers everything you need to collect, prepare, blend, transform and distribute data with Hadoop – nothing more, nothing less Facilitates adoption and development of Hadoop jobs with a design once, deploy anywhere approach Packages industrial-grade capabilities to deploy, manage, monitor and secure your Hadoop environment in fact, we are much more than just ETL – we are a complete solution to help organizations collect, prepare, blend, transform and distribute your data. Now that’s a mouthful, so let me explain what this mean…
  • #9: Removes the complexity of Hadoop Makes it more accessible to more people within your organization Future-proofs your applications, that means, when the new execution framework comes, you can deploy there with minimum or no changes to your applications So if you look at it from the perspective of the business it removes a tremendous amount of risk, because you ensure your investments will continue to pay back even as the Hadoop ecosystem continues to evolve