TechCon 2021
Connecting MQ and
Kafka together
Matt Leming
Architect, MQ for z/OS
lemingma@uk.ibm.com
© 2021 IBM Corporation
Messaging is essential for building fully connected, efficient and scalable solutions. More now than ever
before
Critical exchange of information from
one system to another
Messages must get through, no
questions asked. The system must be
secure and distributed
Real-time event notification for
microservice scalability
Must be frictionless, scalable and
lightweight. The system must be simple
to exploit, invisible to the application
Event streaming for data caching and
processing
Must maintain a projection of the data
for efficient repeatable processing. The
system must scale for large data
volumes
Messaging patterns
IBM MQ Kafka
Messages and events for communication and
notification
Systems rely on messages and events to
communicate between services in real time, not
just within single applications, but within
organisations and between them
Events for data persistence
Events represent past state changes; retaining
a record enables replay and blurs the line
between the role of messaging and data
storage
Architectural patterns
IBM MQ Kafka
Connecting with Kafka
Connecting MQ and Kafka:
why?
With IBM MQ and Apache Kafka specialising in
different aspects of the messaging spectrum, one on
connectivity and the other on data, solutions often
require messages to flow between the two
IBM MQ
Common scenarios:
Core banking system with MQ used as connectivity
backbone. Customer wants to take a copy of
messages from MQ and push them into Kafka for
analytics
Customer wants to extend core banking system to
emit data into Kafka, but doesn’t want to add
network latency that might affect SLAs, so uses
local queue manager as a bridge
Customer needs to get data into Kafka from z/OS
but doesn’t want to invest in Kafka skills, so uses
in-house MQ experience
Customer needs to get data into z/OS from
distributed. Distributed development team has
experience with Kafka, z/OS team want to exploit
MQ integration with CICS / IMS
Kafka Connect
As Kafka is commonly used for analytics there is a
need to be able to get data into it from a wide range of
sources
Kafka Connect provides a framework for this which
makes it easy to build connectors
Source connectors: data from external system into
Kafka
Sink connectors: data into external system from Kafka
Over 120 connectors exist today: IBM MQ, JDBC,
ElasticSearch, Splunk…
Some supported, some not
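Every connector, whatever its source or sink, is configured the same way: a small properties file naming the connector and its class. A hypothetical minimal example (the class and topic names are illustrative; `name`, `connector.class` and `tasks.max` are standard Kafka Connect keys):

```properties
# Minimal source connector configuration (illustrative names)
name=my-source-connector
connector.class=com.example.MySourceConnector
tasks.max=1
topic=MY.TOPIC
```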
[Diagram: Kafka Connect workers sit between an external system and the Kafka brokers. A source connector copies data from the external system into a Kafka topic; a sink connector copies data from a topic into the external system]
NB: a "connector" is actually split into a connector part, which
handles configuration and configuration changes, and a set of
tasks, which do the actual copying of data. In this presentation
"connector" refers to both
IBM MQ - Kafka Connector
Several connectors exist for connecting MQ queues
to Kafka topics
Both source and sink connectors exist. The source
connector is by far the more commonly used today
IBM provides functionally identical connectors in two
forms:
Unsupported open source
Supported with CP4I
Confluent also provides a supported connector
[Diagram: the MQ source connector copies messages from queue TO.KAFKA to Kafka topic FROM.MQ; the MQ sink connector copies messages from Kafka topic TO.MQ to queue FROM.KAFKA]
https://github.com/ibm-messaging/kafka-connect-mq-source
https://github.com/ibm-messaging/kafka-connect-mq-sink
Application options
The IBM source / sink connectors both require a
queue name
With the source connector there are three different
approaches that can be used to get messages onto
the queue to be consumed by the connector…
[Diagram: three ways to get messages onto the TO.KAFKA queue read by the source connector: an application putting direct to the queue, a subscription to a topic, and a streaming queue copy]
Direct to queue
New applications, or applications that are being
changed, can just put the relevant data to the queue
used by the connector
Tends to be used when you don’t want to generate a
copy of existing messages
Often used on z/OS to avoid the network latency
associated with a direct connection to Kafka via
REST, or to avoid more complicated approaches
like exploiting the Kafka Java API in CICS
[Diagram: an MQ application putting messages directly to the TO.KAFKA queue]
Subscribe to topic
If messages are already being published to a topic, it’s
trivial to generate another copy of them by using an
administrative subscription pointing to the queue used
by the connector
This approach is transparent to existing applications
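As a sketch, the subscription might be defined with MQSC like this (the subscription name and topic string are illustrative):

```
* Deliver a copy of everything published on PRICES to the connector's queue
DEFINE SUB('KAFKA.COPY.SUB') TOPICSTR('PRICES') DEST('TO.KAFKA')
```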
[Diagram: an MQ application publishing to a topic; an administrative subscription delivers a copy of each publication to the TO.KAFKA queue]
Subscribe to topic
This approach can also be used to take a copy of
messages that are being sent to a queue
The queue is changed into a queue alias pointing to a
topic. Two administrative subscriptions are created:
one for the application that used to consume from the
queue, the other for the connector
This approach might not be viable in all cases
depending on whether queue aliases have been used,
or whether the application can be adjusted to use
them.
It also relies on existing applications being able to
tolerate the changes to message ID, correlation ID,
etc when using pub / sub
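A hedged MQSC sketch of the shape this takes, using the object names from the diagram (the topic object and subscription names are illustrative):

```
* An alias queue that resolves to a topic
DEFINE TOPIC(APP.TOPIC) TOPICSTR('APP')
DEFINE QALIAS(ALIAS.TO.APP) TARGET(APP.TOPIC) TARGTYPE(TOPIC)
* One subscription feeds the existing consumer, the other feeds the connector
DEFINE SUB('APP.SUB') TOPICOBJ(APP.TOPIC) DEST(TO.APP)
DEFINE SUB('KAFKA.SUB') TOPICOBJ(APP.TOPIC) DEST(TO.KAFKA)
```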
[Diagram: applications put to the alias queue ALIAS.TO.APP, which resolves to a topic; one subscription delivers copies to TO.APP for the existing consumer, another delivers copies to TO.KAFKA for the connector]
Streaming queue copy
From MQ 9.2.3 on distributed platforms, there is an alternative
option: streaming queues
Streaming queues allow messages being put to one queue to
be copied to a second queue without affecting the applications
using the first queue
For example:
DEF QL(TO.APP) STREAMQ(TO.KAFKA)
STRMQOS(MUSTDUP)
DEF QL(TO.KAFKA)
This says: when a message is sent to TO.APP, a copy of that
message must be sent to TO.KAFKA
Enabling streaming queues has no effect on existing
applications as the original message doesn’t change, and the
message sent to the second queue is essentially identical to the
original: same payload, message & correlation ID,
etc
[Diagram: an MQ application puts to TO.APP, defined with STREAMQ(TO.KAFKA); each message is duplicated to TO.KAFKA for the connector]
The source connector in detail
The connector is Java based and so uses the MQ
JMS API to interact with MQ
Provided in the form of a jar file. If using the open
source connector, you build this yourself, otherwise
you download it from CP4I
The connector is installed into Kafka Connect and run
with a properties file containing configuration
Lots of flexibility in configuration:
Client / bindings mode
TLS including mutual auth
User id and password
JMS and MQ format messages
Message properties
Client mode connections are the default
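Alongside the bindings-mode example on this slide, a client-mode sketch; the property names follow the connector's GitHub README, but the host, channel and credential values are illustrative:

```properties
topic=FROM.MQ
mq.queue.manager=MQ1
mq.connection.mode=client
mq.connection.name.list=mqhost.example.com(1414)
mq.channel.name=KAFKA.SVRCONN
mq.queue=TO.KAFKA
mq.user.name=kafkauser
mq.password=********
```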
[Diagram: the MQ source connector, running in a Kafka Connect worker, reads from queue TO.KAFKA and produces to topic FROM.MQ on the Kafka brokers]
Example properties:
# The name of the target Kafka topic
topic=FROM.MQ
# The name of the MQ queue manager
mq.queue.manager=MQ1
mq.connection.mode=bindings
# The name of the source MQ queue
mq.queue=TO.KAFKA
Data conversion
Messages read in from MQ go through a number of
conversions before being sent to Kafka
If the connector receives a message it can’t convert, it will
stop rather than throw the message away, so
understanding the data format is important!
When a message is read from MQ, its payload is converted
to an internal format known as a source record
This conversion is done via a record builder. Two record
builders are provided with the connector (default and JSON), or
you can write your own
The source record is then converted into a Kafka message
This conversion is done via a converter provided by Kafka
(byte array, string, JSON)
The documentation for the connector provides
recommendations for common use cases
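As an illustrative sketch (these are not the connector's actual classes), the JSON path through the pipeline looks like this: the record builder parses the MQ payload into a structured value, and the converter serialises that value into the Kafka message:

```python
import json

def record_builder(payload_bytes):
    # Illustrative stand-in for the connector's JSON record builder:
    # parse the MQ message payload into a structured source record.
    return {"schema": None, "value": json.loads(payload_bytes.decode("utf-8"))}

def converter(source_record):
    # Illustrative stand-in for Kafka's JSON converter:
    # serialise the source record's value into the Kafka message value.
    return json.dumps(source_record["value"]).encode("utf-8")

mq_payload = b'{"account": "12345", "amount": 9.99}'
kafka_value = converter(record_builder(mq_payload))
```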
[Diagram: an MQ message (MQMD, optional MQRFH2, payload) is turned into a source record (schema, value) by the record builder, then into a Kafka message (key, value) by the converter]
Partitioning
When messages are published to a Kafka topic they
need to be spread across the different partitions of the
topic
This is either done by a round-robin process, or if the
message contains a key, the hash of the key is used
to select a partition (same key => same partition)
It’s also possible to write your own partitioning
implementation
By default the MQ source connector doesn’t specify a
key
However, it can be configured to use the MQ message
ID, correlation ID, or the queue name as the key
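A sketch of key-based partition selection; Kafka's default partitioner actually uses murmur2, but any stable hash illustrates the property that the same key always lands on the same partition:

```python
import hashlib

def choose_partition(key: bytes, num_partitions: int) -> int:
    # Hash the key and take it modulo the partition count, so a given
    # key deterministically maps to one partition.
    digest = hashlib.md5(key).digest()
    return int.from_bytes(digest[:4], "big") % num_partitions

# e.g. keying by MQ correlation ID keeps related messages on one partition
p1 = choose_partition(b"414D51204D5131", 3)
p2 = choose_partition(b"414D51204D5131", 3)
```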
[Diagram: the key of a Kafka message determines which partition, and therefore which broker, it is written to]
Fault tolerance and scalability
Both MQ and Kafka are highly fault tolerant and
scalable. This extends to Kafka Connect and the MQ
connectors
Kafka Connect can be run in two modes:
Standalone: a single Kafka Connect worker process
runs the connector. This is useful for getting started
with the connectors, or if you want guaranteed
message ordering, but it is a single point of failure and
can be a scalability bottleneck
Distributed: multiple Kafka Connect worker
processes run across a set of machines and form a
Kafka Connect cluster. Connectors are run across a
number of these workers depending on configuration.
If a worker process running a connector fails, the
connector can be restarted on a different worker in the
cluster. The workers collaborate to ensure they each
have about the same amount of work
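Distributed mode is driven by worker properties such as these (server and topic names are illustrative; the keys are standard Kafka Connect settings):

```properties
# Illustrative Kafka Connect distributed-mode worker settings
bootstrap.servers=kafka1:9092,kafka2:9092
# workers sharing a group.id form one Connect cluster and share tasks
group.id=mq-connect-cluster
# shared state lives in these Kafka topics
offset.storage.topic=connect-offsets
config.storage.topic=connect-configs
status.storage.topic=connect-status
```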
[Diagram: a Kafka Connect cluster of several workers sitting between a set of queue managers and a set of Kafka brokers]
Fault tolerance and scalability
Messages are received from MQ in batches using a
JMS transacted session, i.e. using a transaction
coordinated by the queue manager
If there is any failure in converting any MQ messages
then the transaction is rolled back and the messages
can be reprocessed later
The connector automatically deals with reconnections
to MQ if needed
Similarly the Kafka Connect framework automatically
deals with reconnections to Kafka if needed,
depending on configuration
Kafka doesn’t have the ability to take part in two-
phase commit transactions. Therefore some failure
scenarios might end up with MQ messages being
written to the Kafka topic multiple times
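The usual consumer-side answer is idempotence: treat a stable identifier (such as the MQ message ID carried with the Kafka message) as a deduplication key and skip repeats. The helper below is an illustrative sketch, not part of the connector:

```python
def deduplicate(records, seen_ids):
    # With no two-phase commit between MQ and Kafka, the same MQ message
    # can appear on the topic twice; drop records whose ID was already seen.
    fresh = []
    for msg_id, payload in records:
        if msg_id not in seen_ids:
            seen_ids.add(msg_id)
            fresh.append((msg_id, payload))
    return fresh

batch = [("id-1", "a"), ("id-2", "b"), ("id-1", "a")]  # "id-1" redelivered
unique = deduplicate(batch, set())
```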
Using the connector on z/OS
Lots of customers on z/OS have MQ and use it to
communicate between z/OS LPARs as well as
between z/OS and distributed
There has been a lot of interest in the MQ connector
on z/OS
It’s possible to run Kafka Connect on distributed and
connect to z/OS as a client
This is the same model you would likely use with
distributed MQ
However …
[Diagram: Kafka Connect and the MQ source connector running on a distributed platform, connecting to the queue manager in the z/OS LPAR as a client]
Using the connector on z/OS
An alternative is to run the connector on z/OS in USS
and connect to the queue manager in bindings
The connector then connects to the remote Kafka
cluster over the network. Connections to Kafka are
always network based
This model uses less MQ CPU, and because Kafka
Connect is Java based it is zIIP eligible, making costs
very competitive
Kafka Connect works fine on z/OS. However various
properties files and shell scripts need to be converted
to EBCDIC first
The conversion is documented here:
https://ibm.github.io/event-streams/connecting/mq/zos/
[Diagram: Kafka Connect and the MQ source connector running in USS on the z/OS LPAR, connecting to the queue manager in bindings mode and to the Kafka brokers over the network]
Connector location | Connection to MQ           | Total CPU US | MQ CPU US | Connector CPU US
USS                | Bindings                   | 100.6 (4.9)  | 2.4       | 98.2 (2.5)
USS                | Client (Advanced VUE only) | 152.9 (60.9) | 58.4      | 94.5 (2.5)
Distributed        | Client                     | 55.7         | 55.7      | N/A
Values in brackets are if maximum zIIP offload is achieved
https://www.imwuc.org/HigherLogic/System/DownloadDocumentFile.ashx?DocumentFileKey=179ebb75-7be2-42aa-6f0e-818aeef805f2
DEMO
Starting with the following on Linux:
MQ 9.2.3 installed
Latest Kafka installed
MQ source connector built and installed
Start Zookeeper
Start single Kafka broker
Define MQ queues
Send a couple of messages
Check they are there
Define the Kafka topic
Check connector is installed and configured
Start a Kafka consumer, nothing there…
Start the connector
Messages have now arrived
Send some more…
Thank you
© Copyright IBM Corporation 2021. All rights reserved. The information contained in these materials is provided for informational purposes only, and is provided AS IS without warranty of any kind,
express or implied. Any statement of direction represents IBM’s current intent, is subject to change or withdrawal, and represent only goals and objectives. IBM, the IBM logo, and ibm.com are trademarks
of IBM Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available at Copyright and
trademark information.
Matt Leming
Architect, MQ for z/OS
lemingma@uk.ibm.com

More Related Content

PDF
AWS Single Sign-On (SSO) 서비스 집중 탐구 - 윤석찬 :: AWS Unboxing 온라인 세미나
PDF
IBM MQ High Availability 2019
PDF
Terraform 0.12 + Terragrunt
PPTX
Amazon_SNS.pptx
PDF
Object Storage 1: The Fundamentals of Objects and Object Storage
PPTX
Building an Active-Active IBM MQ System
PPTX
Terraform
PDF
Google Anthos - Azure Stack - AWS Outposts :Comparison
AWS Single Sign-On (SSO) 서비스 집중 탐구 - 윤석찬 :: AWS Unboxing 온라인 세미나
IBM MQ High Availability 2019
Terraform 0.12 + Terragrunt
Amazon_SNS.pptx
Object Storage 1: The Fundamentals of Objects and Object Storage
Building an Active-Active IBM MQ System
Terraform
Google Anthos - Azure Stack - AWS Outposts :Comparison

What's hot (20)

PDF
Websphere MQ (MQSeries) fundamentals
PDF
클라우드 컴퓨팅 기반 기술과 오픈스택(Kvm) 기반 Provisioning
PDF
An introduction to Office 365 Advanced Threat Protection (ATP)
PDF
TechnicalTerraformLandingZones121120229238.pdf
PPTX
What's new with MQ on z/OS 9.3 and 9.3.1
PDF
IBM MQ and Kafka, what is the difference?
PDF
VMware Tanzu Introduction
PDF
[오픈소스컨설팅] 쿠버네티스와 쿠버네티스 on 오픈스택 비교 및 구축 방법
PDF
Amazon SageMaker 모델 빌딩 파이프라인 소개::이유동, AI/ML 스페셜리스트 솔루션즈 아키텍트, AWS::AWS AIML 스...
PDF
Loki - like prometheus, but for logs
PDF
OpenShift 4, the smarter Kubernetes platform
PDF
Aws glue를 통한 손쉬운 데이터 전처리 작업하기
PPTX
Azure WAF
PDF
Arm 기반의 AWS Graviton 프로세서로 구동되는 AWS 인스턴스 살펴보기 - 김종선, AWS솔루션즈 아키텍트:: AWS Summi...
PDF
Data platform data pipeline(Airflow, Kubernetes)
PPTX
Comprehensive Terraform Training
PDF
Cloud Migration 과 Modernization 을 위한 30가지 아이디어-박기흥, AWS Migrations Specialist...
PDF
kubernetes를 부탁해~ Prometheus 기반 Monitoring 구축&활용기
PPTX
AWS SQS SNS
PDF
Kubernetes in Docker
Websphere MQ (MQSeries) fundamentals
클라우드 컴퓨팅 기반 기술과 오픈스택(Kvm) 기반 Provisioning
An introduction to Office 365 Advanced Threat Protection (ATP)
TechnicalTerraformLandingZones121120229238.pdf
What's new with MQ on z/OS 9.3 and 9.3.1
IBM MQ and Kafka, what is the difference?
VMware Tanzu Introduction
[오픈소스컨설팅] 쿠버네티스와 쿠버네티스 on 오픈스택 비교 및 구축 방법
Amazon SageMaker 모델 빌딩 파이프라인 소개::이유동, AI/ML 스페셜리스트 솔루션즈 아키텍트, AWS::AWS AIML 스...
Loki - like prometheus, but for logs
OpenShift 4, the smarter Kubernetes platform
Aws glue를 통한 손쉬운 데이터 전처리 작업하기
Azure WAF
Arm 기반의 AWS Graviton 프로세서로 구동되는 AWS 인스턴스 살펴보기 - 김종선, AWS솔루션즈 아키텍트:: AWS Summi...
Data platform data pipeline(Airflow, Kubernetes)
Comprehensive Terraform Training
Cloud Migration 과 Modernization 을 위한 30가지 아이디어-박기흥, AWS Migrations Specialist...
kubernetes를 부탁해~ Prometheus 기반 Monitoring 구축&활용기
AWS SQS SNS
Kubernetes in Docker
Ad

Similar to Connecting mq&kafka (20)

PDF
IBM MQ - What's new in 9.2
PPTX
What's New In MQ 9.2 on z/OS
PDF
Hello, kafka! (an introduction to apache kafka)
PDF
IBM MQ Update, including 9.1.2 CD
PDF
What's new in IBM MQ, March 2018
PPTX
Introduction to Kafka
PPTX
Lessons Learned Building a Connector Using Kafka Connect (Katherine Stanley &...
PDF
What's new in MQ 9.1.* on z/OS
PPTX
Apache kafka
PPTX
REST APIs and MQ
PDF
SA UNIT II KAFKA.pdf
PDF
Whats new in MQ V9.1
PPTX
Apache kafka
PDF
IBM MQ for z/OS The Latest and Greatest Enhancements
PPT
CLUSTER COMPUTING
PDF
MQ Support for z/OS Connect
PDF
IBM IMPACT 2014 - AMC-1882 Building a Scalable & Continuously Available IBM M...
PDF
Cluster_Performance_Apache_Kafak_vs_RabbitMQ
PDF
IBM MQ Whats new - up to 9.3.4.pdf
PDF
Lessons Learned Building a Connector Using Kafka Connect (Katherine Stanley &...
IBM MQ - What's new in 9.2
What's New In MQ 9.2 on z/OS
Hello, kafka! (an introduction to apache kafka)
IBM MQ Update, including 9.1.2 CD
What's new in IBM MQ, March 2018
Introduction to Kafka
Lessons Learned Building a Connector Using Kafka Connect (Katherine Stanley &...
What's new in MQ 9.1.* on z/OS
Apache kafka
REST APIs and MQ
SA UNIT II KAFKA.pdf
Whats new in MQ V9.1
Apache kafka
IBM MQ for z/OS The Latest and Greatest Enhancements
CLUSTER COMPUTING
MQ Support for z/OS Connect
IBM IMPACT 2014 - AMC-1882 Building a Scalable & Continuously Available IBM M...
Cluster_Performance_Apache_Kafak_vs_RabbitMQ
IBM MQ Whats new - up to 9.3.4.pdf
Lessons Learned Building a Connector Using Kafka Connect (Katherine Stanley &...
Ad

More from Matt Leming (12)

PDF
533-MigratingYourMQIApplicationsToJMS.pdf
PPTX
IBM MQ Whats new - up to 9.3.4.pptx
PPTX
Going Deep with MQ
PDF
Building a resilient and scalable solution with IBM MQ on z/OS
PPTX
Where is my MQ message on z/OS?
PDF
What's new in MQ 9.1 on z/OS
PPTX
The enterprise differentiator of mq on zos
PDF
Where is My Message
PPTX
New Tools and Interfaces for Managing IBM MQ
PDF
HHM-2833: Where is My Message?: Using IBM MQ Tools to Work Out What Applicati...
PDF
HHM-3540: The IBM MQ Light API: From Developer Laptop to Enterprise Data Cen...
PDF
HHM-3481: IBM MQ for z/OS: Enhancing Application and Messaging Connectivity ...
533-MigratingYourMQIApplicationsToJMS.pdf
IBM MQ Whats new - up to 9.3.4.pptx
Going Deep with MQ
Building a resilient and scalable solution with IBM MQ on z/OS
Where is my MQ message on z/OS?
What's new in MQ 9.1 on z/OS
The enterprise differentiator of mq on zos
Where is My Message
New Tools and Interfaces for Managing IBM MQ
HHM-2833: Where is My Message?: Using IBM MQ Tools to Work Out What Applicati...
HHM-3540: The IBM MQ Light API: From Developer Laptop to Enterprise Data Cen...
HHM-3481: IBM MQ for z/OS: Enhancing Application and Messaging Connectivity ...

Recently uploaded (20)

PPTX
L1 - Introduction to python Backend.pptx
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PPTX
Introduction to Artificial Intelligence
PDF
2025 Textile ERP Trends: SAP, Odoo & Oracle
PDF
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
PPTX
ai tools demonstartion for schools and inter college
PDF
AI in Product Development-omnex systems
PDF
System and Network Administraation Chapter 3
PDF
How to Choose the Right IT Partner for Your Business in Malaysia
PDF
Adobe Illustrator 28.6 Crack My Vision of Vector Design
PDF
Softaken Excel to vCard Converter Software.pdf
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PPTX
history of c programming in notes for students .pptx
PPT
Introduction Database Management System for Course Database
PDF
How Creative Agencies Leverage Project Management Software.pdf
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PDF
PTS Company Brochure 2025 (1).pdf.......
PDF
Understanding Forklifts - TECH EHS Solution
PPTX
Online Work Permit System for Fast Permit Processing
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
L1 - Introduction to python Backend.pptx
Navsoft: AI-Powered Business Solutions & Custom Software Development
Introduction to Artificial Intelligence
2025 Textile ERP Trends: SAP, Odoo & Oracle
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
ai tools demonstartion for schools and inter college
AI in Product Development-omnex systems
System and Network Administraation Chapter 3
How to Choose the Right IT Partner for Your Business in Malaysia
Adobe Illustrator 28.6 Crack My Vision of Vector Design
Softaken Excel to vCard Converter Software.pdf
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
history of c programming in notes for students .pptx
Introduction Database Management System for Course Database
How Creative Agencies Leverage Project Management Software.pdf
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PTS Company Brochure 2025 (1).pdf.......
Understanding Forklifts - TECH EHS Solution
Online Work Permit System for Fast Permit Processing
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...

Connecting mq&kafka

  • 1. TechCon 2021 Connecting MQ and Kafka together Matt Leming Architect, MQ for z/OS lemingma@uk.ibm.com
  • 2. © 2021 IBM Corporation Messaging is essential for building fully connected, efficient and scalable solutions. More now than ever before
  • 3. Messaging is essential for building fully connected, efficient and scalable solutions. More now than ever before Critical exchange of information from one system to another Messages must get through, no questions asked. The system must be secure and distributed Real-time event notification for microservice scalability Must be frictionless, scalable and lightweight. The system must be simple to exploit, invisible to the application TechCon 2021 © 2021 IBM Corporation Event streaming for data caching and processing Must maintain a projection of the data for efficient repeatable processing. The system must scale for large data volumes Messaging patterns
  • 4. Messaging is essential for building fully connected, efficient and scalable solutions. More now than ever before Critical exchange of information from one system to another Messages must get through, no questions asked. The system must be secure and distributed Real-time event notification for microservice scalability Must be frictionless, scalable and lightweight. The system must be simple to exploit, invisible to the application TechCon 2021 © 2021 IBM Corporation Event streaming for data caching and processing Must maintain a projection of the data for efficient repeatable processing. The system must scale for large data volumes Messaging patterns IBM MQ Kafka
  • 5. Messaging is essential for building fully connected, efficient and scalable solutions. More now than ever before Messages and events for communication and notification Systems rely on messages and events to communicate between services in real time, not just within single applications, but across organisations and between them Events for data persistence Events represent past state changes, retaining a record enables reply and blurs the line between the role of messaging and data storage TechCon 2021 © 2021 IBM Corporation Architectural patterns IBM MQ Kafka
  • 7. © 2021 IBM Corporation Connecting MQ and Kafka: why? With IBM MQ and Apache Kafka specialising in different aspects of the messaging spectrum, one on connectivity and the other on data, solutions often require messages to flow between the two IBM MQ Common scenarios: Core banking system with MQ used as connectivity backbone. Customer wants to take a copy of messages from MQ and push them into Kafka for analytics Customer wants to extend core banking system to emit data into Kafka, but doesn’t want to add network latency that might affect SLAs, so uses local queue manager as a bridge Customer needs to get data into Kafka from z/OS but doesn’t want to invest in Kafka skills so uses in- house MQ experience Customer needs to get data into z/OS from distributed. Distributed development team has experience with Kafka, z/OS team want to exploit MQ integration with CICS / IMS
  • 8. © 2021 IBM Corporation Kafka Connect As Kafka is commonly used for analytics there is a need to be able to get data into it from a wide range of sources Kafka Connect provides a framework for this which makes it easy to build connectors Source connectors: data from external system into Kafka Sink connectors: data into external system from Kafka Over 120 connectors exist today: IBM MQ, JDBC, ElasticSearch, Splunk… Some supported, some not External System Kafka Connect worker TOPIC Kafka Connect worker SINK CONNECTOR TOPIC SOURCE CONNECTOR Kafka brokers NB: a connector is actually divided up into a connector which deals with configuration, and configuration changes, and a set of tasks which do the copying of data. For this presentation I will just use connector to refer to both
  • 9. © 2021 IBM Corporation IBM MQ - Kafka Connector Several connectors exist for connecting MQ queues to Kafka topics Both source and sink connectors exist. The source one is by far the most commonly used one today IBM provides functionally identical connectors in two forms: Unsupported open source Supported with CP4I Confluent also provides a supported connector IBM MQ QUEUE: TO.KAFKA QUEUE: FROM.KAFKA Kafka Connect worker TOPIC: FROM.MQ Kafka Connect worker MQ SINK CONNECTOR TOPIC: TO.MQ MQ SOURCE CONNECTOR Kafka brokers https://github.com/ibm-messaging/kafka-connect-mq- source https://github.com/ibm-messaging/kafka-connect-mq-sink
  • 10. © 2021 IBM Corporation Application options The IBM source / sink connectors both require a queue name With the source connector there are three different approaches that can be used to get messages onto the queue to be consumed by the connector… Application Application Application Application Direct to queue # TO.KAFKA TO.KAFKA TO.KAFKA Subscribe to topic Streaming queue copy CLIENT QUEUE: FROM.KAFKA Kafka Connect worker TOPIC: FROM.MQ Kafka Connect worker MQ SINK CONNECTOR TOPIC: TO.MQ MQ SOURCE CONNECTOR CLIENT Kafka brokers IBM MQ QUEUE: TO.KAFKA
  • 11. © 2021 IBM Corporation Direct to queue New applications, or applications that are being changed, can just put the relevant data to the queue used by the connector Tends to be used when you don’t want to generate a copy of existing messages Often used on z/OS to remove the need for network latency associated with a direct connection to Kafka via REST, or for having to make use of more complicated approaches like exploiting the Kafka Java API in CICS MQ Application TO.KAFKA
  • 12. © 2021 IBM Corporation Subscribe to topic If messages are already being published to a topic, its trivial to generate another copy of them by using an administrative subscription pointing to the queue used by the connector This approach is transparent to existing applications MQ Application TO.KAFKA #
  • 13. © 2021 IBM Corporation Subscribe to topic This approach can also be used to take a copy of messages that are being sent to a queue The queue is changed into a queue alias pointing to a topic. Two administrative subscriptions are created, one for application that used to consume from the queue, the other for the connector This approach might not be viable in all cases depending on whether queue aliases have been used, or whether the application can be adjusted to use them. It also relies on existing applications being able to tolerate the changes to message ID, correlation ID, etc when using pub / sub MQ Application TO.KAFKA # MQ Application TO.APP TO.APP MQ Application MQ Application ALIAS.TO.APP ALIAS.TO.APP
  • 14. © 2021 IBM Corporation Streaming queue copy From 9.2.3 on distributed, there is an alternative option: use streaming queues Streaming queues allows messages being put to one queue to be copied to a second queue without affecting the applications using the first queue For example: DEF QL(TO.APP) STREAMQ(TO.KAFKA) STRMQOS(MUSTDUP) DEF QL(TO.KAFKA) Says, when a message is sent to TO.APP a copy of that message must be sent to TO.KAFKA Enabling streaming queues has no effects on existing applications as the original message doesn’t change, the message sent to the second queue will is identical (*) to the original message: same payload, message & correlation ID, etc MQ Application TO.KAFKA TO.APP STREAMQ(TO.KAFKA) MQ Application
  • 15. © 2021 IBM Corporation The source connector in detail The connector is Java based and so uses the MQ JMS API to interact with MQ Provided in the form of a jar file. If using the open source connector, you build this yourself, otherwise you download it from CP4I The connector is installed into Kafka Connect and run with a properties file containing configuration Lots of flexibility in configuration: Client / bindings mode TLS including mutual auth User id and password JMS and MQ format messages Message properties Client mode connections are the default CLIENT IBM MQ QUEUE: TO.KAFKA Kafka Connect worker TOPIC: FROM.MQ MQ SOURCE CONNECTOR Kafka brokers # The name of the target Kafka topic topic=FROM.MQ # The name of the MQ queue manager mq.queue.manager=MQ1 mq.connection.mode=bindings # The name of the source MQ queue mq.queue=TO.KAFKA Example properties
  • 16. © 2021 IBM Corporation Data conversion Messages read in from MQ go through a number of conversions before being sent to Kafka If the connector receives a message it can’t convert it will stop to prevent throwing the message away, so understanding the data format is important! When a message is read from MQ it’s payload is converted to an internal format known as a source record This conversion is done via a record builder. Two record builders are provided with the connector (default, JSON) or you can write your own The source record is then converted into a Kafka message This conversion is done via a converter provided by Kafka (byte array, string, JSON) The documentation for the connector provides recommendations for common use cases MQ message MQMD (MQRFH2) Payload Source record Schema Value Kafka message Key Value MQ source connector IBM MQ Record builder Converter
  • 17. © 2021 IBM Corporation Partitioning When messages are published to a Kafka topic they need to be spread across the different partitions of the topic This is either done by a round-robin process, or if the message contains a key, the hash of the key is used to select a partition (same key => same partition) Its also possible to write your own partitioning implementation By default the MQ source connector doesn’t specify a key However it can be configured to use the MQ message ID, correlation ID, or the queue name as a key Kafka message Key Value Broker 1 Topic 1: partition 1 Broker 2 Topic 1: partition 2 Broker 3 Topic 1: partition 3 ?
  • 18. © 2021 IBM Corporation Fault tolerance and scalability Both MQ and Kafka are highly fault tolerant and scalable. This extends to Kafka Connect and the MQ connectors Kafka Connect can be run in two modes: Standalone: a single Kafka Connect worker process runs the connector. This is useful for getting started with the connectors, or if you want guaranteed message ordering, but is a single point of failure, and can be a scalability bottle neck Distributed: multiple Kafka Connect worker processes run across a set of machines and form a Kafka Connect cluster. Connectors are run across a number of these workers depending on configuration. If a worker process running a connector fails, the connector can be restarted on a different worker in the cluster. The workers collaborate to ensure they each have about the same amount of work IBM MQ Connect Worker Connect Worker Connect Worker Broker Broker Broker Queue Manager Queue Manager Queue Manager
  • 19. © 2021 IBM Corporation Fault tolerance and scalability
Messages are received from MQ in batches using a JMS transacted session, i.e. using a transaction coordinated by the queue manager. If there is any failure in converting any MQ messages, the transaction is rolled back and the messages can be reprocessed later.
The connector automatically deals with reconnections to MQ if needed. Similarly, the Kafka Connect framework automatically deals with reconnections to Kafka if needed, depending on configuration.
Kafka doesn't have the ability to take part in two-phase commit transactions. Therefore some failure scenarios might end up with MQ messages being written to the Kafka topic multiple times.
[Diagram: queue managers → Kafka Connect workers → Kafka brokers]
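Why the lack of two-phase commit leads to duplicates can be shown with a toy at-least-once loop: if a failure lands between producing to Kafka and committing the MQ transaction, the rollback puts the messages back on the queue and they are sent to Kafka a second time. Everything here (the fake queue, the failure injection) is illustrative, not connector code.

```python
class FakeQueue:
    """Stand-in for an MQ queue accessed via a JMS transacted session."""
    def __init__(self, messages):
        self.committed = list(messages)
        self.in_flight = []

    def get_batch(self, n):
        self.in_flight = self.committed[:n]
        return self.in_flight

    def commit(self):
        self.committed = self.committed[len(self.in_flight):]
        self.in_flight = []

    def rollback(self):
        self.in_flight = []  # messages remain on the queue

def drain(queue, kafka_topic, fail_before_commit_once=False):
    """At-least-once loop: a batch already produced to Kafka is re-sent after
    an MQ-side rollback, so duplicates can appear on the topic."""
    fail_next = fail_before_commit_once
    while queue.committed:
        batch = queue.get_batch(2)
        kafka_topic.extend(batch)      # produced to Kafka...
        if fail_next:                  # ...but crash before the MQ commit
            fail_next = False
            queue.rollback()
            continue
        queue.commit()

topic = []
drain(FakeQueue(["m1", "m2", "m3"]), topic, fail_before_commit_once=True)
# topic now contains m1 and m2 twice: at-least-once, not exactly-once
```

No message is ever lost (the MQ transaction guarantees that), but consumers of the Kafka topic must tolerate the occasional duplicate.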
  • 20. © 2021 IBM Corporation Using the connector on z/OS
Lots of customers on z/OS have MQ and use it to communicate between z/OS LPARs as well as between z/OS and distributed platforms. There has been a lot of interest in the MQ connector on z/OS.
It's possible to run Kafka Connect on distributed and connect to z/OS as a client. This is the same model you would likely use with distributed MQ. However …
[Diagram: Kafka Connect with the MQ source connector connecting as a CLIENT to IBM MQ for z/OS (queue TO.KAFKA) in a z/OS LPAR, publishing to topic FROM.MQ on the Kafka brokers]
  • 21. © 2021 IBM Corporation Using the connector on z/OS
An alternative is to run the connector on z/OS in USS and connect to the queue manager in bindings mode. The connector then connects to the remote Kafka cluster over the network; connections to Kafka are always network based.
This model uses less MQ CPU, and because Kafka Connect is Java based it is zIIP eligible, making costs very competitive.
Kafka Connect works fine on z/OS. However, various properties files and shell scripts need to be converted to EBCDIC first. The conversion is documented here: https://ibm.github.io/event-streams/connecting/mq/zos/
[Diagram: Kafka Connect with the MQ source connector running in USS, connecting in BINDINGS mode to IBM MQ for z/OS (queue TO.KAFKA) in the same z/OS LPAR, publishing to topic FROM.MQ on the Kafka brokers]

Connector location | Connection to MQ           | Total CPU uS | MQ CPU uS | Connector CPU uS
USS                | Bindings                   | 100.6 (4.9)  | 2.4       | 98.2 (2.5)
USS                | Client (Advanced VUE only) | 152.9 (60.9) | 58.4      | 94.5 (2.5)
Distributed        | Client                     | 55.7         | 55.7      | N/A

Values in brackets are if maximum zIIP offload is achieved.
https://www.imwuc.org/HigherLogic/System/DownloadDocumentFile.ashx?DocumentFileKey=179ebb75-7be2-42aa-6f0e-818aeef805f2
  • 22. © 2021 IBM Corporation DEMO
  • 23. © 2021 IBM Corporation
[Diagram: an MQ application puts to queue TO.APP, which has STREAMQ(TO.KAFKA) so duplicate messages flow to TO.KAFKA; a second MQ application gets from TO.APP]
Starting with the following on Linux:
MQ 9.2.3 installed
Latest Kafka installed
MQ source connector built and installed
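The TO.APP/TO.KAFKA arrangement uses the streaming queues feature delivered in MQ 9.2.3. As a sketch, the MQSC definitions behind the diagram might look like this (queue names from the slide; STRMQOS shown with its BESTEF, best effort, quality of service, the alternative being MUSTDUP):

```
DEFINE QLOCAL(TO.KAFKA)
DEFINE QLOCAL(TO.APP) STREAMQ(TO.KAFKA) STRMQOS(BESTEF)
```

The application keeps putting to and getting from TO.APP unchanged, while the queue manager streams a duplicate of every message to TO.KAFKA for the connector to pick up.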
  • 27. Send a couple of messages
  • 28. Check they are there
  • 30. Check connector is installed and configured
  • 31. Start a Kafka consumer, nothing there…
  • 33. Messages have now arrived
  • 35. Thank you © Copyright IBM Corporation 2021. All rights reserved. The information contained in these materials is provided for informational purposes only, and is provided AS IS without warranty of any kind, express or implied. Any statement of direction represents IBM’s current intent, is subject to change or withdrawal, and represents only goals and objectives. IBM, the IBM logo, and ibm.com are trademarks of IBM Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available at Copyright and trademark information. © 2021 IBM Corporation Matt Leming Architect, MQ for z/OS lemingma@uk.ibm.com

Editor's Notes

  • #16: Planned delivery date: IBM MQ v9.2.3 (July 2021) NOT ANNOUNCED User problem: Today, customers have to modify a point-to-point message flow to be a pub/sub flow to syphon off messages. The syphoned messages are not exact duplicates (so less useful for audit etc) and it impacts existing applications so the majority of customers see this as too risky a change. Examples: Customers want to tap into existing critical data that flows faultlessly over MQ without disrupting existing mission critical applications. #1 Customers want to route data in real-time into AI engines. Customers want to build new message flows to flow from MQ into event streaming applications and vice versa. #2 Administration teams can enable specific developers or application teams to access to ‘real world’ messages to more accurately simulate production workload for testing purposes (greater team agility as they no longer have to create sample message data for specific applications). Administrators will have the ability to gain greater confidence that changes aren’t going to break when pushed into production. Application teams who want to upgrade applications can compare characteristics using real-world data, such as performance, of old and new, side-by-side. #3 Simpler and more accurate data available for auditing purposes If a disaster occurs, customers can rebuild systems and replay messages from certain times. One thing to note for replay is that some customers may be ok using messages that are on a ‘best effort’ or that only have the message content for replay…. So we may be able to address this usecase within the initial release. Planned extensions longer term include using duplicated messages to better understand health and automate MQ tuning for optimal performance and architectural set up.