IBM Spark Technology Center
Apache Big Data NA 2016
Leveraging Open Source Analytics for Making Game-Changing Decisions
Luciano Resende
IBM | Spark Technology Center
Apache Big Data Evolution
Image source: http://pepperdata.com/2014/06/the-10-hottest-words-at-hadoop-summit-2014/
The Analytics Operating System
At IBM, We Love Spark!
Enhance it! / Offer it! / Leverage it!
– Spark Technology Center @ SF
– Shipping with BigInsights / Spark as a Service
– Inside our products
– Open sourced Apache SystemML
– Open sourced Apache Quarks
IBM is Building on Apache Spark
IBM Analytics
IBM Commerce
IBM Watson
IBM Research
IBM Cloud
Image source: http://zdnet2.cbsistatic.com/hub/i/r/2015/06/15/1a23c9cd-74bc-4c8b-9e83-da45e977d97d/thumbnail/770x578/4a70eb03e79c794393d1d7d26bb34687/ibm-apache-spark.gif
How our customers are leveraging open source analytics
The Weather Company
Data volumes from weather are growing!
– ~30 billion API requests per day
– ~120 million active mobile users
– #3 most active mobile user base
– ~360 PB of traffic daily
– Billions of events per day (~1.3 M per sec)
– Keep data forever
The use case
– Efficient batch + streaming analysis
– Self-service data science
– BI / Visualization tool support
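
The volume figures on this slide are easier to compare once they share a time unit. The following is a minimal sketch of that arithmetic, not something taken from the talk. It assumes the quoted ~1.3 M events per second were sustained for a full day, which the slide does not state (it may be a peak rate), and the helper names `per_day` and `per_second` are hypothetical.

```python
# Back-of-the-envelope conversion between per-second and per-day rates.
# Assumption: the rates are sustained over a full 24-hour day.

SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds


def per_day(rate_per_second: float) -> float:
    """Convert a per-second rate into a per-day count."""
    return rate_per_second * SECONDS_PER_DAY


def per_second(count_per_day: float) -> float:
    """Convert a per-day count into an average per-second rate."""
    return count_per_day / SECONDS_PER_DAY


# ~1.3 M events per second, taken from the slide.
events_per_day = per_day(1.3e6)

# ~30 billion API requests per day, taken from the slide.
api_requests_per_second = per_second(30e9)

print(f"Events per day at 1.3 M/s: {events_per_day:,.0f}")
print(f"Average API requests per second: {api_requests_per_second:,.0f}")
```

Under that assumption, 1.3 M events per second works out to roughly 112 billion events per day, which is consistent with "billions of events per day", and 30 billion API requests per day averages about 347,000 requests per second.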
Healthcare Enterprise
Health Care Data Lakes
– Improve how health care is delivered
– Collect and combine data from dozens of sources
– Clinical, Operational, Financial
– Inside and outside your enterprise
Benefits
– Better medical outcomes for patients
– Control cost and improve quality
SystemML on Spark
– Predictive Risk Modeling
– Right patient intervention relating to adverse health events
Spark maps Customer Experience “journey”
The Challenge
– Improve Customer satisfaction rates
– Multiple channels for customer interactions
– Very large volumes of data
The need
– Create a 360-degree view of a customer
– Stitch all interactions across channels into a “Customer Experience Journey”
– Classify interaction sentiment and take necessary actions
[Architecture diagram. Labels: Voice & Text Data; PUB / SUB (MQTT / WebSockets / Flume / Kafka); Interaction & Journey Data; Journey Dashboards]
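
The "stitch and classify" need described on this slide can be illustrated with a small, single-machine sketch. The slides do not show the actual implementation, so everything here is an assumption made for illustration: the `Interaction` record and its fields, the channel names, and the toy keyword lexicon that stands in for a real sentiment model. In the deployment the slide describes, this logic would presumably run on Spark over data arriving through the pub/sub layer.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Interaction:
    """One customer interaction (hypothetical schema, not from the slides)."""
    customer_id: str
    channel: str    # e.g. "voice", "chat", "email" (illustrative names)
    timestamp: int  # seconds since epoch
    text: str


# Toy lexicon standing in for a real sentiment classifier (an assumption).
NEGATIVE_WORDS = {"cancel", "angry", "broken", "refund"}


def classify_sentiment(text: str) -> str:
    """Label text 'negative' if it contains any lexicon word, else 'neutral'."""
    words = set(text.lower().split())
    return "negative" if words & NEGATIVE_WORDS else "neutral"


def stitch_journeys(interactions):
    """Group interactions by customer and order each group by time."""
    journeys = defaultdict(list)
    for item in interactions:
        journeys[item.customer_id].append(item)
    for steps in journeys.values():
        steps.sort(key=lambda step: step.timestamp)
    return dict(journeys)


def flag_customers(journeys):
    """Return the sorted IDs of customers with any negative interaction."""
    return sorted(
        customer_id
        for customer_id, steps in journeys.items()
        if any(classify_sentiment(step.text) == "negative" for step in steps)
    )


events = [
    Interaction("c1", "chat", 20, "my router is broken"),
    Interaction("c1", "voice", 10, "hello"),
    Interaction("c2", "email", 5, "thanks"),
]
journeys = stitch_journeys(events)
print([step.channel for step in journeys["c1"]])
print(flag_customers(journeys))
```

Grouping by customer and sorting by timestamp is what turns separate per-channel records into a single journey. The flagged list is one simple way to represent "take necessary actions" as a follow-up queue.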
Image source: http://az616578.vo.msecnd.net/files/2016/03/21/6359412499310138501557867529_thank-you-1400x800-c-default.gif
