SlideShare a Scribd company logo
Seamless, real-time
data integration
Precisely Connect Overview
Liz Thompson | Sales Engineer
The global leader in data integrity
Trust your data. Build your possibilities.
Our data integrity software and data enrichment products
deliver accuracy and consistency to power confident
business decisions.
Brands you trust, trust us
Data leaders partner with us
of the Fortune 100
90
Customers in more than
100
2,000
employees
customers
12,000
countries
Modern Data Integration + Legacy Data =
Easier Said Than Done!
Integrate data seamlessly from legacy systems into next-gen
cloud and data platforms with one solution.
Batch data transformation Data replication via CDC
High-performance, lightweight batch
data processing for distributed
platforms
Real-time data replication to streaming
platforms, cloud, databases and data
warehouses
Connect
Connect is the best
solution for accessing
and integrating
mainframe and IBM i
data with cloud data
platforms
Quickly and efficiently integrate
ALL enterprise data – including
mainframe and IBM i
Design-once, deploy anywhere
approach to building data
pipelines
Minimal development costs and
time
Unrivaled scalability and
performance
Connect speaks mainframe
Data Translation
• Connect can translate mainframe
on the fly
• No mainframe expertise required
• No need to stage extra copies
Complex Copybooks
• Connect leverages copybooks as-is - no
matter how complex
• Multiple redefines
• Directives, SKIP, EJECT, etc.
• Compiler tags
• ODO’s, even nested
• No padding/exploding the data
• Smaller files = faster execution
Variable Length Records
• Record descriptor words make it
hard to split variable length data
• Connect handles distributing
variable length data with ease.
• Retains the integrity of data
• If you choose, data can remain
unchanged on object storage,
HDFS, etc. while still matching the
original copybook.
Or, get your mainframe data into big data, in a big
data format
Want your data in a cloud-native format? Connect can translate on the fly!
• Big Data lacks robust connectivity to
mainframe.
• With Connect, you can:
• Securely access mainframe data
with FTPS, Connect:Direct.
• Transform data on the fly – no
staging.
• Import hundreds or Db2 tables to
your cluster with a few mouse
clicks.
Access
• Mainframe data needs to be joined with
other data sources for full benefit.
• With Connect, you can:
• Integrate mainframe, relational data,
files, streaming data, …
• Handle all kinds of data formats, old and
new: RDBMS, mainframe variable,
HDFS, DBFS, Hive, Delta Lake, text, Avro,
Kafka, Parquet, S3, …
Integrate
• SLA’s are shorter and data is growing.
• With Connect, you can:
• Access data in parallel.
• Integrate even streaming data with
Spark support.
• Keep Big Data in sync with
mainframe and relational database
changes in real-time with Connect’s
CDC capabilities.
Fast
Connect ETL
Mainframe
IBM i
Relational
databases,
EDW, DBMS
Flat files,
XML, JSON
Sources
Strategic Projects:
Real-time analytics, AI
and machine learning
Targets
Deploy Connect ETL
on a cluster,
Windows, Unix,
Linux, or cloud
Integrate,
Prepare,
Load, Cleanse,
Transform, Deliver
Amazon S3
On-prem
legacy
stores
Cloud
platforms
Big
data
Connect ETL
Precisely Connect
Offload data and processing
• Offload data and processing from mainframe to Cloudera, Databricks,
Snowflake, and more
• Direct, high-performance connectivity to the mainframe
• Ingest from a variety of file formats and data types
• Examples: VSAM, fixed length, variable length, OCCURS, OCCURS DEPENDING ON,
REDEFINES, packed decimals, COMP data types
• Translations from EBCDIC to a variety of encodings
• Examples: ASCII, UTF-8, ICU codepage encodings
• Convert compressed number formats to displayable number formats
• Archive in raw format to target and process without the need for conversion
Connect to any source
• Native drivers to extract data for guaranteed optimal speed and efficiency
• Direct mainframe connectivity
• Support for distributed sources, external sources, flat files, weblogs, etc.
• Join, transform, de-normalize, before landing to the target
• Unique approach to convert the data to native Big Data formats, such as Avro, Parquet,
and ORC
• Quickly create reports showing off the insights from their data
• Connections for Tableau and Qlik
• Read and write to/from Apache Kafka
Design once, deploy anywhere
No re-design, re-compile or re-work – ever.
• Design future-proof workflows for batch and streaming
data
• Cleanse, blend and transform data for context and
meaning
• Move from dev to test to production
• Move from on-premises to Cloud
• Move from one Cloud to another
Drive results through high performance
• Get excellent performance every time
without tuning, load balancing, etc.
Use existing skills to drive new projects
• No parallel programming – Java, MapReduce, Spark …
No worries about:
• Mappers, Reducers
• Big side or small side of joins …
Design
Once
in visual UX
Deploy Anywhere!
On-
premises ,
Cloud
Spark,
MapReduce,
Future
Platforms
Windows,
Unix,
Linux
Batch,
Streaming
Single
Node,
Cluster
Connect comes equipped with
Intelligent Execution
• No administration or management, Connect’s ETL engine
dynamically selects the most efficient algorithms based on the data
structures and systems attributes at run-time
• Highly performant with small-footprint and comprehensive support
for growing modern data warehouses
• Visual design that enables jobs to be created once, and deployed
them anywhere
— Spark, Hadoop, Linux, Unix, Windows
— Public, private, hybrid or multi-cloud
• Simplicity and ease of use - move applications from standalone
server environments and from MapReduce to Spark – as easy as
clicking on a dropdown menu
• Future-proof your organization from underlying complexities of
your technology stack
Connect ETL Advantages
• Connect ETL requires no installation on the mainframe
• No mainframe MIPS used
• Extract from the mainframe, apply translation and transformation in memory and load directly to
RDBMS in one step (no source staging needed), Cloud, Big Data platforms
• No source staging keeps data more secure
• Future proofing
• Design once, deploy anywhere
• Simple, lightweight architecture streamlines development and deployment
• No data repository required
• High performance light weight, self-tuning engine
• No coding, no tuning
Connect ETL Advantages
• We are the experts in mainframe data conversion with excellent support
• Seamlessly read or write mainframe data based on COBOL Copybook
• Apply multiple copybooks to a single mainframe file in one step
• No data loss or data type mismatches -
• Support for complex hierarchical data from Mainframe and IBM i
• Automatically convert from EBCDIC to ASCII including binary values!
• Support for Occurs Depending On (ODO)
• Support high/low values,
• Support for packed decimal
• Support for COMP data
Connect Data Replication (CDC)
Transform your business with Connect CDC
capabilities
• Build streaming data pipelines to power business decision-making with real-time
data
• Get a consistent view of the data across the enterprise and keep your business in
sync
• Keep data lakes fresh with changes made on transactional systems – including
mainframe
• Enable timely reporting and meet tightening SLAs
• Migrate data with zero downtime for database/application upgrades and system
re-platforming
• Resilient data delivery and support data governance and security requirements
Connect – Data Replication Overview
• IBM Db2 for i
• IBM Db2 for z
• IBM Db2 LUW
• IBM Informix
• IMS
Real-time Data Sync
• Without overloading networks.
• Without affecting source database
performance.
• Without coding or tuning.
Real-Time Data Replication +
Transformation
• Apache Kafka
• Oracle
• Sybase
• VSAM
• IMS
• Snowflake
Resilient Delivery – Reliable transfer
of data even if connectivity fails on
either side. No data loss. Auto restart.
High Performance – Captures
changes in source as they happen.
Updates table statistics for faster
queries.
Flexible – RDBMS, Cloud, Cluster
• Replicate data to RDBMSs,
enterprise data warehouses,
cloud systems, and Kafka
• Oracle
• Oracle
RAC
• Sybase
• VSAM
• MS SQL
Server
• IBM Db2 (i, z,
LUW)
• MS Azure SQL
Database
• MS SQL Server
• Teradata
• IBM Informix
Conflict Resolution,
Collision Monitoring,
Tracking and Auditing
Example Sources Example Targets
Connect’s CDC High-level Architecture
Source
Database
PROJECT
METADATA
CHANGE DATA
CAPTURE
Target
Database
PROJECT
METADATA
Connect SaaS
Flexible Replication Options
One Way Two Way
Cascade
Bi-Directional
Distribute
Consolidate
Choose a topology
or combine them
to meet your data
sharing needs
Resilient replication: enable information
accuracy
Ensures ongoing integrity
• Changes collected in queue on source
• Moved to target only after committed
on source
• Ensures write-order-consistency retained
• Queues retained until successfully applied
• No database table locking
Ensures failure integrity
• Automatically detects communications
errors
• Automatically recovers the connection and
processes
• Alerts administrator
• No data is lost
Email Alerting
Accurate tracking and data auditing
Detects and resolves conflicts
• Maintains data integrity
Model verification
• Validates data movement model
Audit Journal Mapping tracks
all updates and changes
• Records
• Before and after values for every column
• Type of transaction
• Type of sending DBMS
• Table name
• User name
• Transaction information
• Records to flat file or to database table
• Can assist with SOX, HIPAA audit
requirements
Replicate exactly what you need
Filters determine what data gets moved
• Select specific column and table
• Select specific rows and table
Transforms data exactly how you need to
• Transforms data into useful information
• 80+ built-in transformation methods
• Field transformations, such as:
• DECIMAL(5,2)
• nulltostring(ZIP_CODE,'00000')
• Table transformation, such as:
• Column merging
• Column splitting
• Creating derived columns
• Custom lookup tables
• Create custom data transformations using
powerful Java scripting interface
Connect CDC Advantages
• Extract from the mainframe, apply translation
and transformation in memory and load directly
to RDBMS in one step (no source staging
needed), Cloud, Big Data platforms
• No staging of source data keeps data more
secure
• Connect CDC uses minimal mainframe MIPS
• No data loss or data type mismatches -
• Support for complex hierarchical data from
Mainframe and IBM i
• Automatically convert from EBCDIC to ASCII
including binary values!
• Support for Occurs Depending On (ODO)
• Support high/low values,
• Support for packed decimal
• Support for COMP data
• Fast real-time capture for IMS,
VSAM, and DB2. Our capture uses
the active logs, where other products
can only look at the IMS archive logs
• Automatic recovery from network
failures for all sources including MF
• Simplicity – no coding, no tuning
• We are the experts in mainframe
data conversion with excellent
support
Precisely data integration top differentiators
1
Resilient & Fault-
Tolerant Replication
2
Scalability &
Performance
3
Design Once,
Deploy Anywhere
4
Native Tech
Partner Integration
Seamless, Real-Time Data Integration with Connect

More Related Content

PPTX
TechEvent Building a Data Lake
PDF
1200x630 1
PDF
Machine Learning for z/OS
PDF
Microsoft: Building a Massively Scalable System with DataStax and Microsoft's...
PDF
Simplifying Big Data Integration with Syncsort DMX and DMX-h
PPTX
Next Generation Enterprise Architecture
PDF
Simplifying Disaster Recovery with Delta Lake
PPTX
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent
TechEvent Building a Data Lake
1200x630 1
Machine Learning for z/OS
Microsoft: Building a Massively Scalable System with DataStax and Microsoft's...
Simplifying Big Data Integration with Syncsort DMX and DMX-h
Next Generation Enterprise Architecture
Simplifying Disaster Recovery with Delta Lake
Apache Kafka and the Data Mesh | Ben Stopford and Michael Noll, Confluent

What's hot (20)

PDF
Which Change Data Capture Strategy is Right for You?
PDF
Big Data Computing Architecture
PPTX
Transform Your Mainframe Data for the Cloud with Precisely and Apache Kafka
PDF
Data Lake and the rise of the microservices
PPTX
Innovation in the Enterprise Rent-A-Car Data Warehouse
PDF
The Hidden Value of Hadoop Migration
PDF
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
PDF
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
PDF
Virtual Flink Forward 2020: Netflix Data Mesh: Composable Data Processing - J...
PDF
Future of Data Engineering
PPTX
How Experian increased insights with Hadoop
PDF
Sidecars and a Microservices Mesh
PPTX
Big Data Education Webcast: Introducing DMX and DMX-h Release 8
PPTX
Modern Data Warehousing with the Microsoft Analytics Platform System
PPTX
Preventative Maintenance of Robots in Automotive Industry
PPTX
Accelerating Data Warehouse Modernization
PDF
IlOUG Tech Days 2016 - Big Data for Oracle Developers - Towards Spark, Real-T...
PDF
A7 storytelling with_oracle_analytics_cloud
PPTX
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
PPTX
Case Study: Elasticsearch Ingest Using StreamSets at Cisco Intercloud
Which Change Data Capture Strategy is Right for You?
Big Data Computing Architecture
Transform Your Mainframe Data for the Cloud with Precisely and Apache Kafka
Data Lake and the rise of the microservices
Innovation in the Enterprise Rent-A-Car Data Warehouse
The Hidden Value of Hadoop Migration
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
Virtual Flink Forward 2020: Netflix Data Mesh: Composable Data Processing - J...
Future of Data Engineering
How Experian increased insights with Hadoop
Sidecars and a Microservices Mesh
Big Data Education Webcast: Introducing DMX and DMX-h Release 8
Modern Data Warehousing with the Microsoft Analytics Platform System
Preventative Maintenance of Robots in Automotive Industry
Accelerating Data Warehouse Modernization
IlOUG Tech Days 2016 - Big Data for Oracle Developers - Towards Spark, Real-T...
A7 storytelling with_oracle_analytics_cloud
Journey to the Data Lake: How Progressive Paved a Faster, Smoother Path to In...
Case Study: Elasticsearch Ingest Using StreamSets at Cisco Intercloud
Ad

Similar to Seamless, Real-Time Data Integration with Connect (20)

PPTX
Streaming IBM i to Kafka for Next-Gen Use Cases
PDF
Transform Your Mainframe and IBM i Data for the Cloud with Precisely and Apac...
PDF
What's New Tajo 0.10 and Its Beyond
PPTX
Moving IBM i Applications to the Cloud with AWS and Precisely
PPTX
Get Mainframe and IBM i Data to Snowflake
PDF
IBM Sterling Connect: Direct
PPTX
Modernizing Mission-Critical Apps with SQL Server
PPTX
Foundational Strategies for Trusted Data: Getting Your Data to the Cloud
PDF
inmation Presentation
PDF
Ibm integrated analytics system
PPTX
Making the Case for Legacy Data in Modern Data Analytics Platforms
PPTX
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
PDF
Snowflake + Syncsort: Get Value from Your Mainframe Data
PDF
inmation Presentation_2017
PDF
2010.03.16 Pollock.Edw2010.Modern D Ifor Warehousing
PDF
Performance Analysis of Apache Spark and Presto in Cloud Environments
PPTX
Microsoft SQL Server 2012
PPTX
Foundational Strategies for Trusted Data: Getting Your Data to the Cloud
PPTX
Democratized Data & Analytics for the Cloud​
PDF
Data & Analytics ReInvent Recap [AWS Basel Meetup - Jan 2023]
Streaming IBM i to Kafka for Next-Gen Use Cases
Transform Your Mainframe and IBM i Data for the Cloud with Precisely and Apac...
What's New Tajo 0.10 and Its Beyond
Moving IBM i Applications to the Cloud with AWS and Precisely
Get Mainframe and IBM i Data to Snowflake
IBM Sterling Connect: Direct
Modernizing Mission-Critical Apps with SQL Server
Foundational Strategies for Trusted Data: Getting Your Data to the Cloud
inmation Presentation
Ibm integrated analytics system
Making the Case for Legacy Data in Modern Data Analytics Platforms
Bring Your SAP and Enterprise Data to Hadoop, Kafka, and the Cloud
Snowflake + Syncsort: Get Value from Your Mainframe Data
inmation Presentation_2017
2010.03.16 Pollock.Edw2010.Modern D Ifor Warehousing
Performance Analysis of Apache Spark and Presto in Cloud Environments
Microsoft SQL Server 2012
Foundational Strategies for Trusted Data: Getting Your Data to the Cloud
Democratized Data & Analytics for the Cloud​
Data & Analytics ReInvent Recap [AWS Basel Meetup - Jan 2023]
Ad

More from Precisely (20)

PDF
The Future of Automation: AI, APIs, and Cloud Modernization.pdf
PDF
Unlock new opportunities with location data.pdf
PDF
Reimagining Insurance: Connected Data for Confident Decisions.pdf
PDF
Introducing Syncsort™ Storage Management.pdf
PDF
Enable Enterprise-Ready Security on IBM i Systems.pdf
PDF
A Day in the Life of Location Data - Turning Where into How.pdf
PDF
Get More from Fiori Automation - What’s New, What Works, and What’s Next.pdf
PDF
Solving the CIO’s Dilemma: Speed, Scale, and Smarter SAP Modernization.pdf
PDF
Solving the Data Disconnect: Why Success Hinges on Pre-Linked Data.pdf
PDF
Cooking Up Clean Addresses - 3 Ways to Whip Messy Data into Shape.pdf
PDF
Building Confidence in AI & Analytics with High-Integrity Location Data.pdf
PDF
SAP Modernization Strategies for a Successful S/4HANA Journey.pdf
PDF
Precisely Demo Showcase: Powering ServiceNow Discovery with Precisely Ironstr...
PDF
The 2025 Guide on What's Next for Automation.pdf
PDF
Outdated Tech, Invisible Expenses – How Data Silos Undermine Operational Effi...
PDF
Modernización de SAP: Maximizando el Valor de su Migración a SAP S/4HANA.pdf
PDF
Outdated Tech, Invisible Expenses – The Hidden Cost of Disconnected Data Syst...
PDF
Migration vers SAP S/4HANA: Un levier stratégique pour votre transformation d...
PDF
Outdated Tech, Invisible Expenses: The Hidden Cost of Poor Data Integration o...
PDF
The Changing Compliance Landscape in 2025.pdf
The Future of Automation: AI, APIs, and Cloud Modernization.pdf
Unlock new opportunities with location data.pdf
Reimagining Insurance: Connected Data for Confident Decisions.pdf
Introducing Syncsort™ Storage Management.pdf
Enable Enterprise-Ready Security on IBM i Systems.pdf
A Day in the Life of Location Data - Turning Where into How.pdf
Get More from Fiori Automation - What’s New, What Works, and What’s Next.pdf
Solving the CIO’s Dilemma: Speed, Scale, and Smarter SAP Modernization.pdf
Solving the Data Disconnect: Why Success Hinges on Pre-Linked Data.pdf
Cooking Up Clean Addresses - 3 Ways to Whip Messy Data into Shape.pdf
Building Confidence in AI & Analytics with High-Integrity Location Data.pdf
SAP Modernization Strategies for a Successful S/4HANA Journey.pdf
Precisely Demo Showcase: Powering ServiceNow Discovery with Precisely Ironstr...
The 2025 Guide on What's Next for Automation.pdf
Outdated Tech, Invisible Expenses – How Data Silos Undermine Operational Effi...
Modernización de SAP: Maximizando el Valor de su Migración a SAP S/4HANA.pdf
Outdated Tech, Invisible Expenses – The Hidden Cost of Disconnected Data Syst...
Migration vers SAP S/4HANA: Un levier stratégique pour votre transformation d...
Outdated Tech, Invisible Expenses: The Hidden Cost of Poor Data Integration o...
The Changing Compliance Landscape in 2025.pdf

Recently uploaded (20)

PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Electronic commerce courselecture one. Pdf
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Empathic Computing: Creating Shared Understanding
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Cloud computing and distributed systems.
PDF
NewMind AI Monthly Chronicles - July 2025
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Modernizing your data center with Dell and AMD
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Reach Out and Touch Someone: Haptics and Empathic Computing
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
NewMind AI Weekly Chronicles - August'25 Week I
Unlocking AI with Model Context Protocol (MCP)
Electronic commerce courselecture one. Pdf
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Understanding_Digital_Forensics_Presentation.pptx
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Review of recent advances in non-invasive hemoglobin estimation
Empathic Computing: Creating Shared Understanding
The AUB Centre for AI in Media Proposal.docx
Cloud computing and distributed systems.
NewMind AI Monthly Chronicles - July 2025
“AI and Expert System Decision Support & Business Intelligence Systems”
Modernizing your data center with Dell and AMD
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
CIFDAQ's Market Insight: SEC Turns Pro Crypto

Seamless, Real-Time Data Integration with Connect

  • 1. Seamless, real-time data integration Precisely Connect Overview Liz Thompson | Sales Engineer
  • 2. The global leader in data integrity Trust your data. Build your possibilities. Our data integrity software and data enrichment products deliver accuracy and consistency to power confident business decisions. Brands you trust, trust us Data leaders partner with us of the Fortune 100 90 Customers in more than 100 2,000 employees customers 12,000 countries
  • 3. Modern Data Integration + Legacy Data = Easier Said Than Done!
  • 4. Integrate data seamlessly from legacy systems into next-gen cloud and data platforms with one solution. Batch data transformation Data replication via CDC High-performance, lightweight batch data processing for distributed platforms Real-time data replication to streaming platforms, cloud, databases and data warehouses Connect
  • 5. Connect is the best solution for accessing and integrating mainframe and IBM i data with cloud data platforms Quickly and efficiently integrate ALL enterprise data – including mainframe and IBM i Design-once, deploy anywhere approach to building data pipelines Minimal development costs and time Unrivaled scalability and performance
  • 6. Connect speaks mainframe Data Translation • Connect can translate mainframe on the fly • No mainframe expertise required • No need to stage extra copies Complex Copybooks • Connect leverages copybooks as-is - no matter how complex • Multiple redefines • Directives, SKIP, EJECT, etc. • Compiler tags • ODO’s, even nested • No padding/exploding the data • Smaller files = faster execution Variable Length Records • Record descriptor words make it hard to split variable length data • Connect handles distributing variable length data with ease. • Retains the integrity of data • If you choose, data can remain unchanged on object storage, HDFS, etc. while still matching the original copybook.
  • 7. Or, get your mainframe data into big data, in a big data format Want your data in a cloud-native format? Connect can translate on the fly! • Big Data lacks robust connectivity to mainframe. • With Connect, you can: • Securely access mainframe data with FTPS, Connect:Direct. • Transform data on the fly – no staging. • Import hundreds or Db2 tables to your cluster with a few mouse clicks. Access • Mainframe data needs to be joined with other data sources for full benefit. • With Connect, you can: • Integrate mainframe, relational data, files, streaming data, … • Handle all kinds of data formats, old and new: RDBMS, mainframe variable, HDFS, DBFS, Hive, Delta Lake, text, Avro, Kafka, Parquet, S3, … Integrate • SLA’s are shorter and data is growing. • With Connect, you can: • Access data in parallel. • Integrate even streaming data with Spark support. • Keep Big Data in sync with mainframe and relational database changes in real-time with Connect’s CDC capabilities. Fast
  • 9. Mainframe IBM i Relational databases, EDW, DBMS Flat files, XML, JSON Sources Strategic Projects: Real-time analytics, AI and machine learning Targets Deploy Connect ETL on a cluster, Windows, Unix, Linux, or cloud Integrate, Prepare, Load, Cleanse, Transform, Deliver Amazon S3 On-prem legacy stores Cloud platforms Big data Connect ETL Precisely Connect
  • 10. Offload data and processing • Offload data and processing from mainframe to Cloudera, Databricks, Snowflake, and more • Direct, high-performance connectivity to the mainframe • Ingest from a variety of file formats and data types • Examples: VSAM, fixed length, variable length, OCCURS, OCCURS DEPENDING ON, REDEFINES, packed decimals, COMP data types • Translations from EBCDIC to a variety of encodings • Examples: ASCII, UTF-8, ICU codepage encodings • Convert compressed number formats to displayable number formats • Archive in raw format to target and process without the need for conversion
  • 11. Connect to any source • Native drivers to extract data for guaranteed optimal speed and efficiency • Direct mainframe connectivity • Support for distributed sources, external sources, flat files, weblogs, etc. • Join, transform, de-normalize, before landing to the target • Unique approach to convert the data to native Big Data formats, such as Avro, Parquet, and ORC • Quickly create reports showing off the insights from their data • Connections for Tableau and Qlik • Read and write to/from Apache Kafka
  • 12. Design once, deploy anywhere No re-design, re-compile or re-work – ever. • Design future-proof workflows for batch and streaming data • Cleanse, blend and transform data for context and meaning • Move from dev to test to production • Move from on-premises to Cloud • Move from one Cloud to another Drive results through high performance • Get excellent performance every time without tuning, load balancing, etc. Use existing skills to drive new projects • No parallel programming – Java, MapReduce, Spark … No worries about: • Mappers, Reducers • Big side or small side of joins … Design Once in visual UX Deploy Anywhere! On- premises , Cloud Spark, MapReduce, Future Platforms Windows, Unix, Linux Batch, Streaming Single Node, Cluster
  • 13. Connect comes equipped with Intelligent Execution • No administration or management, Connect’s ETL engine dynamically selects the most efficient algorithms based on the data structures and systems attributes at run-time • Highly performant with small-footprint and comprehensive support for growing modern data warehouses • Visual design that enables jobs to be created once, and deployed them anywhere — Spark, Hadoop, Linux, Unix, Windows — Public, private, hybrid or multi-cloud • Simplicity and ease of use - move applications from standalone server environments and from MapReduce to Spark – as easy as clicking on a dropdown menu • Future-proof your organization from underlying complexities of your technology stack
  • 14. Connect ETL Advantages • Connect ETL requires no installation on the mainframe • No mainframe MIPS used • Extract from the mainframe, apply translation and transformation in memory and load directly to RDBMS in one step (no source staging needed), Cloud, Big Data platforms • No source staging keeps data more secure • Future proofing • Design once, deploy anywhere • Simple, lightweight architecture streamlines development and deployment • No data repository required • High performance light weight, self-tuning engine • No coding, no tuning
  • 15. Connect ETL Advantages • We are the experts in mainframe data conversion with excellent support • Seamlessly read or write mainframe data based on COBOL Copybook • Apply multiple copybooks to a single mainframe file in one step • No data loss or data type mismatches - • Support for complex hierarchical data from Mainframe and IBM i • Automatically convert from EBCDIC to ASCII including binary values! • Support for Occurs Depending On (ODO) • Support high/low values, • Support for packed decimal • Support for COMP data
  • 17. Transform your business with Connect CDC capabilities • Build streaming data pipelines to power business decision-making with real-time data • Get a consistent view of the data across the enterprise and keep your business in sync • Keep data lakes fresh with changes made on transactional systems – including mainframe • Enable timely reporting and meet tightening SLAs • Migrate data with zero downtime for database/application upgrades and system re-platforming • Resilient data delivery and support data governance and security requirements
  • 18. Connect – Data Replication Overview • IBM Db2 for i • IBM Db2 for z • IBM Db2 LUW • IBM Informix • IMS Real-time Data Sync • Without overloading networks. • Without affecting source database performance. • Without coding or tuning. Real-Time Data Replication + Transformation • Apache Kafka • Oracle • Sybase • VSAM • IMS • Snowflake Resilient Delivery – Reliable transfer of data even if connectivity fails on either side. No data loss. Auto restart. High Performance – Captures changes in source as they happen. Updates table statistics for faster queries. Flexible – RDBMS, Cloud, Cluster • Replicate data to RDBMSs, enterprise data warehouses, cloud systems, and Kafka • Oracle • Oracle RAC • Sybase • VSAM • MS SQL Server • IBM Db2 (i, z, LUW) • MS Azure SQL Database • MS SQL Server • Teradata • IBM Informix Conflict Resolution, Collision Monitoring, Tracking and Auditing Example Sources Example Targets
  • 19. Connect’s CDC High-level Architecture Source Database PROJECT METADATA CHANGE DATA CAPTURE Target Database PROJECT METADATA Connect SaaS
  • 20. Flexible Replication Options One Way Two Way Cascade Bi-Directional Distribute Consolidate Choose a topology or combine them to meet your data sharing needs
  • 21. Resilient replication: enable information accuracy Ensures ongoing integrity • Changes collected in queue on source • Moved to target only after committed on source • Ensures write-order-consistency retained • Queues retained until successfully applied • No database table locking Ensures failure integrity • Automatically detects communications errors • Automatically recovers the connection and processes • Alerts administrator • No data is lost Email Alerting
  • 22. Accurate tracking and data auditing Detects and resolves conflicts • Maintains data integrity Model verification • Validates data movement model Audit Journal Mapping tracks all updates and changes • Records • Before and after values for every column • Type of transaction • Type of sending DBMS • Table name • User name • Transaction information • Records to flat file or to database table • Can assist with SOX, HIPAA audit requirements
  • 23. Replicate exactly what you need Filters determine what data gets moved • Select specific column and table • Select specific rows and table
  • 24. Transforms data exactly how you need to • Transforms data into useful information • 80+ built-in transformation methods • Field transformations, such as: • DECIMAL(5,2) • nulltostring(ZIP_CODE,'00000') • Table transformation, such as: • Column merging • Column splitting • Creating derived columns • Custom lookup tables • Create custom data transformations using powerful Java scripting interface
  • 25. Connect CDC Advantages • Extract from the mainframe, apply translation and transformation in memory and load directly to RDBMS in one step (no source staging needed), Cloud, Big Data platforms • No staging of source data keeps data more secure • Connect CDC uses minimal mainframe MIPS • No data loss or data type mismatches - • Support for complex hierarchical data from Mainframe and IBM i • Automatically convert from EBCDIC to ASCII including binary values! • Support for Occurs Depending On (ODO) • Support high/low values, • Support for packed decimal • Support for COMP data • Fast real-time capture for IMS, VSAM, and DB2. Our capture uses the active logs, where other products can only look at the IMS archive logs • Automatic recovery from network failures for all sources including MF • Simplicity – no coding, no tuning • We are the experts in mainframe data conversion with excellent support
  • 26. Precisely data integration top differentiators 1 Resilient & Fault- Tolerant Replication 2 Scalability & Performance 3 Design Once, Deploy Anywhere 4 Native Tech Partner Integration