SlideShare a Scribd company logo
Real-time Stream Processing
using
Apache Apex
Bhupesh Chawda
bhupesh@apache.org
DataTorrent Software
Apache Apex - Stream Processing
● YARN - Native - Uses Hadoop YARN framework for resource negotiation
● Highly Scalable - Scales statically as well as dynamically
● Highly Performant - Can reach single digit millisecond end-to-end latency
● Fault Tolerant - Automatically recovers from failures - without manual intervention
● Stateful - Guarantees that no state will be lost
● Easily Operable - Exposes an easy API for developing Operators (part of an
application) and Applications
Project History
● Project development started in 2012 at DataTorrent
● Open-sourced in July 2015
● Apache Apex started incubation in August 2015
● 50+ committers from Apple, GE, Capital One, DirecTV, Silver Spring Networks,
Barclays, Ampool and DataTorrent
● Mentors from Class Software, MapR and Hortonworks
● Soon to be a top level Apache project
Apex Platform Overview
An Apex Application is a DAG
(Directed Acyclic Graph)
● A DAG is composed of vertices (Operators) and edges (Streams).
● A Stream is a sequence of data tuples which connects operators at end-points called Ports
● An Operator takes one or more input streams, performs computations & emits one or more output streams
● Each operator is USER’s business logic, or built-in operator from our open source library
● Operator may have multiple instances that run in parallel
Hadoop 1.0 vs 2.0 - YARN
Apex as a YARN Application
● YARN (Hadoop 2.0) replaces MapReduce with
a more generic Resource Management
Framework.
● Apex uses YARN for resource management
and HDFS for storing any persistent storage
Support for Windowing
● Apex splits incoming tuples into finite time slices - Streaming Windows
○ Transparent to the user
○ Apex Default = 500 ms
● Checkpointing and book-keeping done at Streaming window boundary
● Applications may need to perform computations in windows - Application Windows
○ Specified as a multiple of Streaming Window size
○ Call backs to user operator logic
■ beginWindow(long windowId)
■ endWindow()
○ Example - An application which identifies some aggregates and emits them every minute. Here
application window size = 60 secs = 30 Streaming Windows
● Sliding and Tumbling Application windows are supported natively
Buffer Server
● Staging area for outgoing tuples
● Downstream operators connect to upstream Buffer Server to subscribe for tuples
● Plays a role in recovery by replaying data to the downstream operator from a
particular checkpoint
● Spooling to disk is also supported
Fault Tolerance - Checkpointing
● During checkpointing all operator state is written to HDFS asynchronously
● This is decentralized and happens independently for each operator
● If all operators in the DAG have checkpointed a particular window, then that window
is said to be committed and all previous checkpoints are purged
O1 O2 O3 O4
3 3 3 2Checkpoint # --->
Committed Window # = 120180 180 180 120
Checkpoint
Window # --->
Committed Checkpoint # = 2
Checkpoint Window = 60 Streaming Windows
Recovery
● Apex Application Master detects the failure
of an operator based on the missing heart
beats from the operators or if windows are
not progressing
● All downstream operators from the failed
operator are restarted from the last
committed checkpoint to recover from
their states.
● Data is replayed from the same checkpoint
by the Buffer Server
● Recovery is automatic and does not require
manual intervention.
Scalability - Partitioning
● Operators can be “replicated” (partitioned) into
multiple instances to cope up with high speed
input streams.
● Can be specified at Application launch time
● User can control the distribution of tuples to
downstream partitions.
● Automatic Unifier to unify the tuples
Scalability - Dynamic scaling
● Auto scaling is also supported. Number of partitions may automatically increase or
decrease based on the incoming load. Can be customized by the user
● User has to define the trigger for auto scaling:
○ Example - Increase partitions if latency goes above 100 ms.
Apex Processing Semantics
● AT_LEAST_ONCE (default): Windows are processed at least once
● AT_MOST_ONCE: Windows are processed at most once
○ During recovery, all downstream operators are fast-forwarded to the window of latest checkpoint
● EXACTLY_ONCE: Windows are processed exactly once
○ Checkpoint every window
○ Checkpointing becomes blocking
Apex Guarantees
● Apex guarantees No loss of data and computational state - Checkpointed
periodically
● Automatic recovery ensures that processing resumes from where it left off
● Order of incoming data is guaranteed to be maintained
○ Not applicable in case of partitioning of operators
● Events in a window are always replayed in the same window in case of failures
Application Specification
1. Add Operators
2. Add Streams
Logical and Physical DAGs
Apex Malhar Library
1. Performance requirements
a. A system which can provide a very very low latency for decision making
(40 ms)
b. Ability to handle large volumes of data and ever changing rules (1,000
events per 20 ms burst)
c. 99.5% uptime. Which is about 1.5 days downtime in an year
➔ Apex achieved:
◆ 2 ms latency against the requirement of 40ms
◆ Was able to handle 2,000 events burst against requirement of 1,000
events burst at a net rate of 70,000 events/s.
◆ 99.9995% uptime against requirement of and 99.5% uptime and
2. Relevant Roadmap
3. Enterprise grade
4. Have a healthy and diverse community and committers, i.e. not controlled by one
vendor
Talk Slides: http://www.slideshare.net/ilganeli/nextgen-decision-making-in-under-2ms
DataTorrent Blog: https://www.datatorrent.com/blog/next-gen-decision-making-in-under-
2-milliseconds/
Decision Making in < 2ms
Decision making in < 2ms contd..
● Comparison finally boiled down to
○ Apache Storm
○ Apache Flink
○ Apache Apex
● Some problems in Storm and Flink among others
○ Nimbus is a single point of failure
○ Bolts / Spouts / Operators share a JVM. Hard to debug
○ No dynamic topologies
○ Restarting entire topologies in case of failures
Resources
● Mailing List
○ Developers dev@apex.incubator.apache.org
○ Users users@apex.incubator.apache.org
● Apache Apex http://apex.apache.org/
● Github
○ Apex Core: http://github.com/apache/incubator-apex-core
○ Apex Malhar: http://github.com/apache/incubator-apex-malhar
● DataTorrent: http://www.datatorrent.com
● Twitter @ApacheApex Follow - https://twitter.com/apacheapex
● Facebook https://www.facebook.com/ApacheApex/
● Meetup http://www.meetup.com/topics/apache-apex
● Startup Program Free Enterprise License for Startups, Universities, Non-Profits
Thank you!
Please send your questions to bhupesh@apache.org

More Related Content

PDF
Apex as yarn application
PPTX
IoT Ingestion & Analytics using Apache Apex - A Native Hadoop Platform
PPTX
Apache Big Data 2016: Next Gen Big Data Analytics with Apache Apex
PPTX
Apache Apex Fault Tolerance and Processing Semantics
PPTX
Apache Apex: Stream Processing Architecture and Applications
PPTX
Hadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache Apex
PPTX
Architectual Comparison of Apache Apex and Spark Streaming
PPTX
Introduction to Apache Apex
Apex as yarn application
IoT Ingestion & Analytics using Apache Apex - A Native Hadoop Platform
Apache Big Data 2016: Next Gen Big Data Analytics with Apache Apex
Apache Apex Fault Tolerance and Processing Semantics
Apache Apex: Stream Processing Architecture and Applications
Hadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache Apex
Architectual Comparison of Apache Apex and Spark Streaming
Introduction to Apache Apex

What's hot (20)

PPTX
Intro to Apache Apex - Next Gen Native Hadoop Platform - Hackac
PPTX
Introduction to Apache Apex
PDF
Ingestion file copy using apex
PPTX
Intro to Apache Apex (next gen Hadoop) & comparison to Spark Streaming
PDF
Actionable Insights with Apache Apex at Apache Big Data 2017 by Devendra Tagare
PDF
Building your first aplication using Apache Apex
PPTX
Intro to Apache Apex - Next Gen Platform for Ingest and Transform
PPTX
Java High Level Stream API
PDF
From Batch to Streaming with Apache Apex Dataworks Summit 2017
PDF
Log ingestion kafka -- impala using apex
PDF
Developing streaming applications with apache apex (strata + hadoop world)
PPTX
Introduction to Apache Apex and writing a big data streaming application
PPTX
Deep Dive into Apache Apex App Development
PPTX
Intro to Apache Apex @ Women in Big Data
PPTX
Smart Partitioning with Apache Apex (Webinar)
PDF
Apache Big Data EU 2016: Next Gen Big Data Analytics with Apache Apex
PPTX
DataTorrent Presentation @ Big Data Application Meetup
PPTX
Stream Processing with Apache Apex
PPTX
University program - writing an apache apex application
PDF
Extending The Yahoo Streaming Benchmark to Apache Apex
Intro to Apache Apex - Next Gen Native Hadoop Platform - Hackac
Introduction to Apache Apex
Ingestion file copy using apex
Intro to Apache Apex (next gen Hadoop) & comparison to Spark Streaming
Actionable Insights with Apache Apex at Apache Big Data 2017 by Devendra Tagare
Building your first aplication using Apache Apex
Intro to Apache Apex - Next Gen Platform for Ingest and Transform
Java High Level Stream API
From Batch to Streaming with Apache Apex Dataworks Summit 2017
Log ingestion kafka -- impala using apex
Developing streaming applications with apache apex (strata + hadoop world)
Introduction to Apache Apex and writing a big data streaming application
Deep Dive into Apache Apex App Development
Intro to Apache Apex @ Women in Big Data
Smart Partitioning with Apache Apex (Webinar)
Apache Big Data EU 2016: Next Gen Big Data Analytics with Apache Apex
DataTorrent Presentation @ Big Data Application Meetup
Stream Processing with Apache Apex
University program - writing an apache apex application
Extending The Yahoo Streaming Benchmark to Apache Apex
Ad

Viewers also liked (20)

PDF
Windowing in Apache Apex
PDF
Introduction to Real-Time Data Processing
PPTX
Writing an Apache Apex Application
PPTX
Apache Apex & Bigtop
PPTX
DataFlow & Beam
PPTX
The Avant-garde of Apache NiFi
PPTX
Apache NiFi in the Hadoop Ecosystem
PPTX
Apache Apex Introduction with PubMatic
PPTX
Integrating Apache NiFi and Apache Flink
PPTX
Introduction to Apache NiFi - Seattle Scalability Meetup
PPTX
Apache NiFi 1.0 in Nutshell
PPTX
Next Gen Big Data Analytics with Apache Apex
PDF
Introduction to Apache Beam
PPTX
Apache Beam: A unified model for batch and stream processing data
PPTX
Apache NiFi Crash Course Intro
PDF
Joe Witt presentation on Apache NiFi
PPTX
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
PPTX
Integrating Apache Spark and NiFi for Data Lakes
PDF
Introduction to Real-time data processing
PDF
Stream Processing use cases and applications with Apache Apex by Thomas Weise
Windowing in Apache Apex
Introduction to Real-Time Data Processing
Writing an Apache Apex Application
Apache Apex & Bigtop
DataFlow & Beam
The Avant-garde of Apache NiFi
Apache NiFi in the Hadoop Ecosystem
Apache Apex Introduction with PubMatic
Integrating Apache NiFi and Apache Flink
Introduction to Apache NiFi - Seattle Scalability Meetup
Apache NiFi 1.0 in Nutshell
Next Gen Big Data Analytics with Apache Apex
Introduction to Apache Beam
Apache Beam: A unified model for batch and stream processing data
Apache NiFi Crash Course Intro
Joe Witt presentation on Apache NiFi
Dataflow with Apache NiFi - Apache NiFi Meetup - 2016 Hadoop Summit - San Jose
Integrating Apache Spark and NiFi for Data Lakes
Introduction to Real-time data processing
Stream Processing use cases and applications with Apache Apex by Thomas Weise
Ad

Similar to Real-time Stream Processing using Apache Apex (20)

PPTX
An adaptive and eventually self healing framework for geo-distributed real-ti...
PDF
Introduction to Apache Apex by Thomas Weise
PPSX
GE IOT Predix Time Series & Data Ingestion Service using Apache Apex (Hadoop)
PDF
Stateful streaming data pipelines
PPTX
Apache Apex: Stream Processing Architecture and Applications
PPTX
Big Data Berlin v8.0 Stream Processing with Apache Apex
PPTX
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
PDF
Gatling - Bordeaux JUG
PDF
Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber
PPTX
Apache Apex - Hadoop Users Group
PPTX
February 2016 HUG: Apache Apex (incubating): Stream Processing Architecture a...
PDF
Apache flink
PPTX
Wayfair Storefront Performance Monitoring with InfluxEnterprise by Richard La...
PDF
M|18 Choosing the Right High Availability Strategy for You
ODP
Performance Test Automation With Gatling
ODP
Gatling
PDF
Things You MUST Know Before Deploying OpenStack: Bruno Lago, Catalyst IT
PPTX
February 2017 HUG: Exactly-once end-to-end processing with Apache Apex
PPTX
Netflix Data Pipeline With Kafka
PPTX
Netflix Data Pipeline With Kafka
An adaptive and eventually self healing framework for geo-distributed real-ti...
Introduction to Apache Apex by Thomas Weise
GE IOT Predix Time Series & Data Ingestion Service using Apache Apex (Hadoop)
Stateful streaming data pipelines
Apache Apex: Stream Processing Architecture and Applications
Big Data Berlin v8.0 Stream Processing with Apache Apex
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
Gatling - Bordeaux JUG
Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber
Apache Apex - Hadoop Users Group
February 2016 HUG: Apache Apex (incubating): Stream Processing Architecture a...
Apache flink
Wayfair Storefront Performance Monitoring with InfluxEnterprise by Richard La...
M|18 Choosing the Right High Availability Strategy for You
Performance Test Automation With Gatling
Gatling
Things You MUST Know Before Deploying OpenStack: Bruno Lago, Catalyst IT
February 2017 HUG: Exactly-once end-to-end processing with Apache Apex
Netflix Data Pipeline With Kafka
Netflix Data Pipeline With Kafka

More from Apache Apex (16)

PDF
Low Latency Polyglot Model Scoring using Apache Apex
PDF
Apache Big Data EU 2016: Building Streaming Applications with Apache Apex
PPTX
Hadoop Interacting with HDFS
PPTX
Introduction to Real-Time Data Processing
PPTX
Introduction to Yarn
PPTX
Introduction to Map Reduce
PPTX
HDFS Internals
PPTX
Intro to Big Data Hadoop
PPTX
Kafka to Hadoop Ingest with Parsing, Dedup and other Big Data Transformations
PPTX
Building Your First Apache Apex (Next Gen Big Data/Hadoop) Application
PPTX
Intro to YARN (Hadoop 2.0) & Apex as YARN App (Next Gen Big Data)
PPTX
Ingesting Data from Kafka to JDBC with Transformation and Enrichment
PPTX
Ingestion and Dimensions Compute and Enrich using Apache Apex
PPTX
Apache Beam (incubating)
PPTX
Making sense of Apache Bigtop's role in ODPi and how it matters to Apache Apex
PDF
Building Your First Apache Apex Application
Low Latency Polyglot Model Scoring using Apache Apex
Apache Big Data EU 2016: Building Streaming Applications with Apache Apex
Hadoop Interacting with HDFS
Introduction to Real-Time Data Processing
Introduction to Yarn
Introduction to Map Reduce
HDFS Internals
Intro to Big Data Hadoop
Kafka to Hadoop Ingest with Parsing, Dedup and other Big Data Transformations
Building Your First Apache Apex (Next Gen Big Data/Hadoop) Application
Intro to YARN (Hadoop 2.0) & Apex as YARN App (Next Gen Big Data)
Ingesting Data from Kafka to JDBC with Transformation and Enrichment
Ingestion and Dimensions Compute and Enrich using Apache Apex
Apache Beam (incubating)
Making sense of Apache Bigtop's role in ODPi and how it matters to Apache Apex
Building Your First Apache Apex Application

Recently uploaded (20)

PDF
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
PDF
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
PDF
How to Migrate SBCGlobal Email to Yahoo Easily
PDF
medical staffing services at VALiNTRY
PDF
Adobe Illustrator 28.6 Crack My Vision of Vector Design
PPTX
ISO 45001 Occupational Health and Safety Management System
PPTX
Introduction to Artificial Intelligence
PPTX
ManageIQ - Sprint 268 Review - Slide Deck
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 41
PDF
Nekopoi APK 2025 free lastest update
PDF
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PDF
PTS Company Brochure 2025 (1).pdf.......
PPTX
Transform Your Business with a Software ERP System
PDF
top salesforce developer skills in 2025.pdf
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
How to Migrate SBCGlobal Email to Yahoo Easily
medical staffing services at VALiNTRY
Adobe Illustrator 28.6 Crack My Vision of Vector Design
ISO 45001 Occupational Health and Safety Management System
Introduction to Artificial Intelligence
ManageIQ - Sprint 268 Review - Slide Deck
Internet Downloader Manager (IDM) Crack 6.42 Build 41
Nekopoi APK 2025 free lastest update
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
Design an Analysis of Algorithms II-SECS-1021-03
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
Upgrade and Innovation Strategies for SAP ERP Customers
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PTS Company Brochure 2025 (1).pdf.......
Transform Your Business with a Software ERP System
top salesforce developer skills in 2025.pdf

Real-time Stream Processing using Apache Apex

  • 1. Real-time Stream Processing using Apache Apex Bhupesh Chawda bhupesh@apache.org DataTorrent Software
  • 2. Apache Apex - Stream Processing ● YARN - Native - Uses Hadoop YARN framework for resource negotiation ● Highly Scalable - Scales statically as well as dynamically ● Highly Performant - Can reach single digit millisecond end-to-end latency ● Fault Tolerant - Automatically recovers from failures - without manual intervention ● Stateful - Guarantees that no state will be lost ● Easily Operable - Exposes an easy API for developing Operators (part of an application) and Applications
  • 3. Project History ● Project development started in 2012 at DataTorrent ● Open-sourced in July 2015 ● Apache Apex started incubation in August 2015 ● 50+ committers from Apple, GE, Capital One, DirecTV, Silver Spring Networks, Barclays, Ampool and DataTorrent ● Mentors from Class Software, MapR and Hortonworks ● Soon to be a top level Apache project
  • 5. An Apex Application is a DAG (Directed Acyclic Graph) ● A DAG is composed of vertices (Operators) and edges (Streams). ● A Stream is a sequence of data tuples which connects operators at end-points called Ports ● An Operator takes one or more input streams, performs computations & emits one or more output streams ● Each operator is USER’s business logic, or built-in operator from our open source library ● Operator may have multiple instances that run in parallel
  • 6. Hadoop 1.0 vs 2.0 - YARN
  • 7. Apex as a YARN Application ● YARN (Hadoop 2.0) replaces MapReduce with a more generic Resource Management Framework. ● Apex uses YARN for resource management and HDFS for storing any persistent storage
  • 8. Support for Windowing ● Apex splits incoming tuples into finite time slices - Streaming Windows ○ Transparent to the user ○ Apex Default = 500 ms ● Checkpointing and book-keeping done at Streaming window boundary ● Applications may need to perform computations in windows - Application Windows ○ Specified as a multiple of Streaming Window size ○ Call backs to user operator logic ■ beginWindow(long windowId) ■ endWindow() ○ Example - An application which identifies some aggregates and emits them every minute. Here application window size = 60 secs = 30 Streaming Windows ● Sliding and Tumbling Application windows are supported natively
  • 9. Buffer Server ● Staging area for outgoing tuples ● Downstream operators connect to upstream Buffer Server to subscribe for tuples ● Plays a role in recovery by replaying data to the downstream operator from a particular checkpoint ● Spooling to disk is also supported
  • 10. Fault Tolerance - Checkpointing ● During checkpointing all operator state is written to HDFS asynchronously ● This is decentralized and happens independently for each operator ● If all operators in the DAG have checkpointed a particular window, then that window is said to be committed and all previous checkpoints are purged O1 O2 O3 O4 3 3 3 2Checkpoint # ---> Committed Window # = 120180 180 180 120 Checkpoint Window # ---> Committed Checkpoint # = 2 Checkpoint Window = 60 Streaming Windows
  • 11. Recovery ● Apex Application Master detects the failure of an operator based on the missing heart beats from the operators or if windows are not progressing ● All downstream operators from the failed operator are restarted from the last committed checkpoint to recover from their states. ● Data is replayed from the same checkpoint by the Buffer Server ● Recovery is automatic and does not require manual intervention.
  • 12. Scalability - Partitioning ● Operators can be “replicated” (partitioned) into multiple instances to cope up with high speed input streams. ● Can be specified at Application launch time ● User can control the distribution of tuples to downstream partitions. ● Automatic Unifier to unify the tuples
  • 13. Scalability - Dynamic scaling ● Auto scaling is also supported. Number of partitions may automatically increase or decrease based on the incoming load. Can be customized by the user ● User has to define the trigger for auto scaling: ○ Example - Increase partitions if latency goes above 100 ms.
  • 14. Apex Processing Semantics ● AT_LEAST_ONCE (default): Windows are processed at least once ● AT_MOST_ONCE: Windows are processed at most once ○ During recovery, all downstream operators are fast-forwarded to the window of latest checkpoint ● EXACTLY_ONCE: Windows are processed exactly once ○ Checkpoint every window ○ Checkpointing becomes blocking
  • 15. Apex Guarantees ● Apex guarantees No loss of data and computational state - Checkpointed periodically ● Automatic recovery ensures that processing resumes from where it left off ● Order of incoming data is guaranteed to be maintained ○ Not applicable in case of partitioning of operators ● Events in a window are always replayed in the same window in case of failures
  • 16. Application Specification 1. Add Operators 2. Add Streams
  • 19. 1. Performance requirements a. A system which can provide a very very low latency for decision making (40 ms) b. Ability to handle large volumes of data and ever changing rules (1,000 events per 20 ms burst) c. 99.5% uptime. Which is about 1.5 days downtime in an year ➔ Apex achieved: ◆ 2 ms latency against the requirement of 40ms ◆ Was able to handle 2,000 events burst against requirement of 1,000 events burst at a net rate of 70,000 events/s. ◆ 99.9995% uptime against requirement of and 99.5% uptime and 2. Relevant Roadmap 3. Enterprise grade 4. Have a healthy and diverse community and committers, i.e. not controlled by one vendor Talk Slides: http://www.slideshare.net/ilganeli/nextgen-decision-making-in-under-2ms DataTorrent Blog: https://www.datatorrent.com/blog/next-gen-decision-making-in-under- 2-milliseconds/ Decision Making in < 2ms
  • 20. Decision making in < 2ms contd.. ● Comparison finally boiled down to ○ Apache Storm ○ Apache Flink ○ Apache Apex ● Some problems in Storm and Flink among others ○ Nimbus is a single point of failure ○ Bolts / Spouts / Operators share a JVM. Hard to debug ○ No dynamic topologies ○ Restarting entire topologies in case of failures
  • 21. Resources ● Mailing List ○ Developers dev@apex.incubator.apache.org ○ Users users@apex.incubator.apache.org ● Apache Apex http://apex.apache.org/ ● Github ○ Apex Core: http://github.com/apache/incubator-apex-core ○ Apex Malhar: http://github.com/apache/incubator-apex-malhar ● DataTorrent: http://www.datatorrent.com ● Twitter @ApacheApex Follow - https://twitter.com/apacheapex ● Facebook https://www.facebook.com/ApacheApex/ ● Meetup http://www.meetup.com/topics/apache-apex ● Startup Program Free Enterprise License for Startups, Universities, Non-Profits
  • 22. Thank you! Please send your questions to bhupesh@apache.org