SlideShare a Scribd company logo
© 2016 IBM Corporation
Introduction to Big Data,
Analytics and the Hybrid
Cloud
© 2016 IBM Corporation2
About Me
Ian Balina
 Big data visionary and story teller
 10+ years in software industry
 Previous experience as software
developer and Deloitte consultant
Ian Balina
Open Source Analytics Sales Evangelist
Retail, CPG and Travel Industry
© 2016 IBM Corporation3
Agenda
 The story of Big Data
 Hadoop
 The emergence of Big Data
Analytics
 Spark
 The birth of the Cloud
 Hybrid Cloud
© 2016 IBM Corporation4
An overview on Big Data, Analytics and the Cloud
 The story of Big Data
 Expensive data
warehouse
 Commodity servers?
2.5 million items
per minute
300,000 tweets
per minute
200 million emails
per minute 220,000 photos
per minute
5 TB per flight
> 1 PB per day
gas turbines
© 2016 IBM Corporation5
An overview on Big Data, Analytics and the Cloud
 The story of Big Data
 Hadoop: reliable, scalable,
distributed computing and
data storage
© 2016 IBM Corporation6
An overview on Big Data, Analytics and the Cloud
 The story of Big Data
 Hadoop
 The emergence of Big Data Analytics
 FAST DATA
#PerishableInsights
Insights that can provide
exponentially more value
than traditional analytics
but the value expires and
evaporates once the
moment is gone
Forrester: Mike Gualtieri, Principal Analyst
Value
Event
Action with traditional
analytics
Immediate Action
Time
Lost Revenue
© 2016 IBM Corporation7
An overview on Big Data, Analytics and the Cloud
 The story of Big Data
 Hadoop
 The emergence of Big Data Analytics
 Spark: open source data processing engine built
for speed, ease of use, and sophisticated
analytics
Hadoop
, 110
Spark,
0.9
0
20
40
60
80
100
120
Logistic Regression in
Hadoop & Spark
Hadoop
Spark
Graph Analytics
Fast and integrated
graph computation
Stream Processing
Near real-time data
processing & analytics
Machine Learning
Incredibly fast, easy to
deploy algorithms
Unified Data Access
Fast, familiar query
language for all data
SparkCore
Spark SQL
Spark
Streaming
MLlib
(machine
learning)
GraphX
(graph)
© 2013 IBM Corporation8
“Using IBM Analytics for Apache Spark, we can
now give in-store teams valuable insight in
seconds.”
—Ram Himmatraopet, Founder & CEO, SmarterData
Business challenge
To help its clients navigate the uncertainties of the digital-age
retail industry, SmarterData wanted to find new ways to
provide relevant, actionable, data-driven insights into
consumer behavior.
Transformation
SmarterData uses IBM Analytics for Apache Spark to deliver intelligent
applications that combine operational and contextual data to help
retailers understand consumers’ behavior and desires.
Helping retailers redefine practices
for the digital age
Based in San Ramon, California, Smarter Data, Inc.
leverages advanced data science technologies –
predictive and prescriptive analytics – to help
companies achieve relevance with their customers
both online and in a retail environment, and manage
the demands of digital-age business challenges.
Business benefits:
Empowers
retailers with data-driven
insights into consumer
behavior, helping drive sales
Helps
in-store teams provide smarter
customer service based on
real-time analysis
Leverages
contextual data to predict
individual needs and create
personalized offers
© 2016 IBM Corporation9
An overview on Big Data, Analytics and the Cloud
 The story of Big Data
 Hadoop
 The emergence of Big Data Analytics
 Spark
 The birth of the Cloud
Infrastructure as
a Service
Code
Data
Runtime
Middleware
OS
Virtualization
Servers
Storage
Networking
Code
Data
Runtime
Middleware
OS
Virtualization
Servers
Storage
Networking
Platform as a
Service
Code
Data
Runtime
Middleware
OS
Virtualization
Servers
Storage
Networking
Code
Data
Runtime
Middleware
OS
Virtualization
Servers
Storage
Networking
Software as a
Service
Traditional IT – On-
premise or Hosted
Customer Managed
Service Provider Managed
© 2016 IBM Corporation10
An overview on Big Data, Analytics and the Cloud
 The story of Big Data
 Hadoop
 The emergence of Big Data Analytics
 Spark
 The birth of the Cloud
 Hybrid Cloud Private
Managed
Private
Hosted
Private PublicEnterprise
Hybrid Cloud
Integration
Enterprise
Data Center
Enterprise
Data Center
IBM
SO
SoftLayer
And IBM SO
Enterprise UsersEnterprise
Data Center
© 2016 IBM Corporation11
A US grocery store chain uses business intelligence to identify
insights that help make a proof of concept detailed and convincing
Business challenge: The CEO of this grocery store chain knew that analytics
and cloud-based computing were going to help take the company to the next
level by guiding marketing and merchandising decisions, but he needed to
convince key stakeholders. His team came to IBM for help developing a proof of
concept.
The smarter solution: The company used a business intelligence and
predictive modeling solution to develop a detailed and groundbreaking
understanding of the link between weather and grocery shopping behavior in
its US stores. By demonstrating that analytics can provide insight into which
items it should procure, feature and market during which kinds of weather,
the company not only convinced stakeholders of the value of analytics but also
gained valuable new insight into its business.
Using big data to anticipate the ebbs and flows of demand holds tremendous
potential in the grocery store industry in terms of procurement, merchandising
and staffing.
Half the cost
of similar projects, thanks to a
cloud-based infrastructure
75% faster
completion of proof of concept
than anticipated
Successful
in convincing stakeholders of the
value of cloud-based analytics

More Related Content

PPTX
Digital Shift in Insurance: How is the Industry Responding with the Influx of...
PPTX
Monitizing Big Data at Telecom Service Providers
PDF
Overview - IBM Big Data Platform
PPTX
ING's Customer-Centric Data Journey from Community Idea to Private Cloud Depl...
PPTX
Big Data Expo 2015 - Pentaho The Future of Analytics
PPTX
Capgemini Insights and Data
PPTX
Harnessing the Power of Big Data at Freddie Mac
PPTX
Gov & Private Sector Regulatory Compliance: Using Hadoop to Address Requirements
Digital Shift in Insurance: How is the Industry Responding with the Influx of...
Monitizing Big Data at Telecom Service Providers
Overview - IBM Big Data Platform
ING's Customer-Centric Data Journey from Community Idea to Private Cloud Depl...
Big Data Expo 2015 - Pentaho The Future of Analytics
Capgemini Insights and Data
Harnessing the Power of Big Data at Freddie Mac
Gov & Private Sector Regulatory Compliance: Using Hadoop to Address Requirements

What's hot (20)

PPTX
Who changed my data? Need for data governance and provenance in a streaming w...
PDF
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
PPTX
Apache Hadoop India Summit 2011 talk "Informatica and Big Data" by Snajeev Kumar
PDF
Key Considerations for Putting Hadoop in Production SlideShare
PPTX
Integrating Hadoop Into the Enterprise
PDF
Big Data Scotland 2017
PDF
A Big Data Telco Solution by Dr. Laura Wynter
PDF
Driving Digital Transformation Through Global Data Management
PDF
Strategyzing big data in telco industry
PDF
Ibm big data
PPTX
Top 5 Strategies for Retail Data Analytics
PPTX
San Antonio’s electric utility making big data analytics the business of the ...
PPT
8.17.11 big data and hadoop with informatica slideshare
PPTX
Deutsche Telekom on Big Data
PDF
Hadoop Crash Course
PDF
Banalytics - Monetizing corporate big data | Instarea
PDF
Why Data Virtualization? An Introduction.
PPTX
The Power of your Data Achieved - Next Gen Modernization
PDF
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...
PDF
Telco Big Data Workshop Sample
Who changed my data? Need for data governance and provenance in a streaming w...
Leverage Big Data to Enhance Customer Experience in Telecommunications – with...
Apache Hadoop India Summit 2011 talk "Informatica and Big Data" by Snajeev Kumar
Key Considerations for Putting Hadoop in Production SlideShare
Integrating Hadoop Into the Enterprise
Big Data Scotland 2017
A Big Data Telco Solution by Dr. Laura Wynter
Driving Digital Transformation Through Global Data Management
Strategyzing big data in telco industry
Ibm big data
Top 5 Strategies for Retail Data Analytics
San Antonio’s electric utility making big data analytics the business of the ...
8.17.11 big data and hadoop with informatica slideshare
Deutsche Telekom on Big Data
Hadoop Crash Course
Banalytics - Monetizing corporate big data | Instarea
Why Data Virtualization? An Introduction.
The Power of your Data Achieved - Next Gen Modernization
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...
Telco Big Data Workshop Sample
Ad

Viewers also liked (17)

PDF
IBM Watson March Madness 2017 Predictions
PDF
Bluemix Standard Deck for Clients
PDF
Ultimate hybrid cloud
PDF
A practical introduction to Web analytics for technical communicators
PPTX
MODAClouds Value - Solving Top Problems of Cloud Dev Lifecycle
PDF
Next generation cloud data center technologies
PDF
Big Data: Using free Bluemix Analytics Exchange Data with Big SQL
PPTX
Hybrid Cloud example for SlideShare
PDF
Predicting March Madness with IBM Watson Analytics
PDF
Hybrid Cloud Point of View - IBM Event, 2015
PDF
IBM Bluemix Paris meetup - Big Data & Analytics dans le Cloud - Epitech- 2016...
PPT
Choosing Public vs. Private vs. Hybrid Cloud Computing
PDF
DevOps in the Hybrid Cloud
PDF
IBM Bluemix Paris Meetup #22-20170315 Meetup @VillagebyCA- Bluemix, présent &...
PPTX
Orchestration tool roundup kubernetes vs. docker vs. heat vs. terra form vs...
PDF
Cloud orchestration major tools comparision
PPTX
CloudStack vs OpenStack vs Eucalyptus: IaaS Private Cloud Brief Comparison
IBM Watson March Madness 2017 Predictions
Bluemix Standard Deck for Clients
Ultimate hybrid cloud
A practical introduction to Web analytics for technical communicators
MODAClouds Value - Solving Top Problems of Cloud Dev Lifecycle
Next generation cloud data center technologies
Big Data: Using free Bluemix Analytics Exchange Data with Big SQL
Hybrid Cloud example for SlideShare
Predicting March Madness with IBM Watson Analytics
Hybrid Cloud Point of View - IBM Event, 2015
IBM Bluemix Paris meetup - Big Data & Analytics dans le Cloud - Epitech- 2016...
Choosing Public vs. Private vs. Hybrid Cloud Computing
DevOps in the Hybrid Cloud
IBM Bluemix Paris Meetup #22-20170315 Meetup @VillagebyCA- Bluemix, présent &...
Orchestration tool roundup kubernetes vs. docker vs. heat vs. terra form vs...
Cloud orchestration major tools comparision
CloudStack vs OpenStack vs Eucalyptus: IaaS Private Cloud Brief Comparison
Ad

Similar to Intro to Big Data Analytics and the Hybrid Cloud (20)

PDF
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
PDF
Cloudian 451-hortonworks - webinar
PDF
Big Data – wie aus Daten strategische Resourcen und Ihr Wettbewerbsvorteil we...
PDF
Ibm What we offer
PDF
Big Data Driven Transformations
PPTX
Finding business value in Big Data
PPTX
Big data solutions on cloud – the way forward
PPTX
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
PDF
IBM Technology Day 2013 BigData Salle Rome
PPTX
IBM Relay 2015: Open for Data
 
PDF
Key note big data analytics ecosystem strategy
PDF
Future of Power: Big Data - Søren Ravn
PDF
Getting started with Hadoop on the Cloud with Bluemix
PDF
IBM CDS Overview
PPTX
Big data4businessusers
PDF
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
PDF
OC Big Data Monthly Meetup #6 - Session 1 - IBM
PDF
SD Big Data Monthly Meetup #4 - Session 1 - IBM
PPTX
IBM Smarter Analytics
PDF
📊 Big Data: The Future of Data Analytics 🚀
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
Cloudian 451-hortonworks - webinar
Big Data – wie aus Daten strategische Resourcen und Ihr Wettbewerbsvorteil we...
Ibm What we offer
Big Data Driven Transformations
Finding business value in Big Data
Big data solutions on cloud – the way forward
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
IBM Technology Day 2013 BigData Salle Rome
IBM Relay 2015: Open for Data
 
Key note big data analytics ecosystem strategy
Future of Power: Big Data - Søren Ravn
Getting started with Hadoop on the Cloud with Bluemix
IBM CDS Overview
Big data4businessusers
Big Data: Introducing BigInsights, IBM's Hadoop- and Spark-based analytical p...
OC Big Data Monthly Meetup #6 - Session 1 - IBM
SD Big Data Monthly Meetup #4 - Session 1 - IBM
IBM Smarter Analytics
📊 Big Data: The Future of Data Analytics 🚀

Recently uploaded (20)

PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
IB Computer Science - Internal Assessment.pptx
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PDF
Mega Projects Data Mega Projects Data
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
Business Acumen Training GuidePresentation.pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Introduction to Knowledge Engineering Part 1
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
IB Computer Science - Internal Assessment.pptx
Miokarditis (Inflamasi pada Otot Jantung)
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Supervised vs unsupervised machine learning algorithms
Mega Projects Data Mega Projects Data
Business Ppt On Nestle.pptx huunnnhhgfvu
Data_Analytics_and_PowerBI_Presentation.pptx
Introduction-to-Cloud-ComputingFinal.pptx
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Clinical guidelines as a resource for EBP(1).pdf
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Business Acumen Training GuidePresentation.pptx

Intro to Big Data Analytics and the Hybrid Cloud

  • 1. © 2016 IBM Corporation Introduction to Big Data, Analytics and the Hybrid Cloud
  • 2. © 2016 IBM Corporation2 About Me Ian Balina  Big data visionary and story teller  10+ years in software industry  Previous experience as software developer and Deloitte consultant Ian Balina Open Source Analytics Sales Evangelist Retail, CPG and Travel Industry
  • 3. © 2016 IBM Corporation3 Agenda  The story of Big Data  Hadoop  The emergence of Big Data Analytics  Spark  The birth of the Cloud  Hybrid Cloud
  • 4. © 2016 IBM Corporation4 An overview on Big Data, Analytics and the Cloud  The story of Big Data  Expensive data warehouse  Commodity servers? 2.5 million items per minute 300,000 tweets per minute 200 million emails per minute 220,000 photos per minute 5 TB per flight > 1 PB per day gas turbines
  • 5. © 2016 IBM Corporation5 An overview on Big Data, Analytics and the Cloud  The story of Big Data  Hadoop: reliable, scalable, distributed computing and data storage
  • 6. © 2016 IBM Corporation6 An overview on Big Data, Analytics and the Cloud  The story of Big Data  Hadoop  The emergence of Big Data Analytics  FAST DATA #PerishableInsights Insights that can provide exponentially more value than traditional analytics but the value expires and evaporates once the moment is gone Forrester: Mike Gualtieri, Principal Analyst Value Event Action with traditional analytics Immediate Action Time Lost Revenue
  • 7. © 2016 IBM Corporation7 An overview on Big Data, Analytics and the Cloud  The story of Big Data  Hadoop  The emergence of Big Data Analytics  Spark: open source data processing engine built for speed, ease of use, and sophisticated analytics Hadoop , 110 Spark, 0.9 0 20 40 60 80 100 120 Logistic Regression in Hadoop & Spark Hadoop Spark Graph Analytics Fast and integrated graph computation Stream Processing Near real-time data processing & analytics Machine Learning Incredibly fast, easy to deploy algorithms Unified Data Access Fast, familiar query language for all data SparkCore Spark SQL Spark Streaming MLlib (machine learning) GraphX (graph)
  • 8. © 2013 IBM Corporation8 “Using IBM Analytics for Apache Spark, we can now give in-store teams valuable insight in seconds.” —Ram Himmatraopet, Founder & CEO, SmarterData Business challenge To help its clients navigate the uncertainties of the digital-age retail industry, SmarterData wanted to find new ways to provide relevant, actionable, data-driven insights into consumer behavior. Transformation SmarterData uses IBM Analytics for Apache Spark to deliver intelligent applications that combine operational and contextual data to help retailers understand consumers’ behavior and desires. Helping retailers redefine practices for the digital age Based in San Ramon, California, Smarter Data, Inc. leverages advanced data science technologies – predictive and prescriptive analytics – to help companies achieve relevance with their customers both online and in a retail environment, and manage the demands of digital-age business challenges. Business benefits: Empowers retailers with data-driven insights into consumer behavior, helping drive sales Helps in-store teams provide smarter customer service based on real-time analysis Leverages contextual data to predict individual needs and create personalized offers
  • 9. © 2016 IBM Corporation9 An overview on Big Data, Analytics and the Cloud  The story of Big Data  Hadoop  The emergence of Big Data Analytics  Spark  The birth of the Cloud Infrastructure as a Service Code Data Runtime Middleware OS Virtualization Servers Storage Networking Code Data Runtime Middleware OS Virtualization Servers Storage Networking Platform as a Service Code Data Runtime Middleware OS Virtualization Servers Storage Networking Code Data Runtime Middleware OS Virtualization Servers Storage Networking Software as a Service Traditional IT – On- premise or Hosted Customer Managed Service Provider Managed
  • 10. © 2016 IBM Corporation10 An overview on Big Data, Analytics and the Cloud  The story of Big Data  Hadoop  The emergence of Big Data Analytics  Spark  The birth of the Cloud  Hybrid Cloud Private Managed Private Hosted Private PublicEnterprise Hybrid Cloud Integration Enterprise Data Center Enterprise Data Center IBM SO SoftLayer And IBM SO Enterprise UsersEnterprise Data Center
  • 11. © 2016 IBM Corporation11 A US grocery store chain uses business intelligence to identify insights that help make a proof of concept detailed and convincing Business challenge: The CEO of this grocery store chain knew that analytics and cloud-based computing were going to help take the company to the next level by guiding marketing and merchandising decisions, but he needed to convince key stakeholders. His team came to IBM for help developing a proof of concept. The smarter solution: The company used a business intelligence and predictive modeling solution to develop a detailed and groundbreaking understanding of the link between weather and grocery shopping behavior in its US stores. By demonstrating that analytics can provide insight into which items it should procure, feature and market during which kinds of weather, the company not only convinced stakeholders of the value of analytics but also gained valuable new insight into its business. Using big data to anticipate the ebbs and flows of demand holds tremendous potential in the grocery store industry in terms of procurement, merchandising and staffing. Half the cost of similar projects, thanks to a cloud-based infrastructure 75% faster completion of proof of concept than anticipated Successful in convincing stakeholders of the value of cloud-based analytics

Editor's Notes

  • #5: The story of big data starts with Google. Back in the early 2000s, as a startup, Google was growing really fast thanks to the internet. Big Data Data was growing fast with the birth of social media and web 2.0 sites Data was growing faster than Google could keep up with. Expensive data warehouses were no longer a viable option. Google engineers had the idea of using commodity (cheap) servers as a replacement. They created a way to store and process data in parallel on commodity servers. They published these insights in a paper on MapReduce.
  • #6: So what is Hadoop? Interesting tidbit, Hadoop was developed by Doug Cutting while he was working at Yahoo. Hadoop is named after his son’s toy elephant. Apache Hadoop's MapReduce and HDFS components were inspired by research work done by Google. Apache Hadoop is an open-source software framework for distributed storage and distributed processing of very large data sets on clusters built on commodity hardware. At Hadoop’s core are 3 main components: HDFS, YARN, and MapReduce. HDFS provides the storage for Hadoop MapReduce provides the processing for Hadoop YARN coordinates the processing and scheduling of work across all the nodes HDFS was designed to be fault-tolerant and to run on commodity hardware, therefore blocks are replicated a number of times to ensure high data availability. By default – Hadoop’s replication factor is set to 3 meaning there would be one original block and two replicas. This can be adjusted. The true power of the Hadoop distributed computing architecture lies in its distribution. In other words, the ability to distribute work to many nodes in parallel permits Hadoop to scale to large infrastructures and, similarly, the processing of large amounts of data Hadoop was designed more for batch processing. It’s a sequential process that relies on disk to store intermediate results. Hadoop was driven by the need to capture and process the large volumes of data driven by the Big Data initiatives we talked about in the previous slides. These data storage needs include: The ability to store all types of data. These sources may be coming from sensors, social media feeds, log files or even traditional databases. The data storage can’t be limited to a particular format. Need the processing capability to analyze these large volumes - not a sampling – but actually review each record as part of the analysis. Ability to scale from from a single server to thousands of machines Need for a lower cost alternative to traditional data warehouses since the volume would not make these use cases practical once hardware, storage and software costs were taken into account
  • #7: Sometimes 1 minute is too late. How to quickly process, analyze and act on data - what opportunity are you missing? The challenges clients face when trying to capture real-time value is the cost associated with storing these high volumes of data for analysis. Once the data is stored, it needs to be inspected and analyzed to identify the signal from the noise that determines what should be acted. This requires storage and analysis – but at that point, it’s no longer relevant as the opportunity has passed. Take as an example a website that offers real-time personalization by presenting its visitors with an offer that’s appropriate based on what you’ve been viewing. To accomplish this, the website must understand your clickstream data, in real-time, to quickly serve up the offer relevant to your web visit. There is no time to store and analyze the data, at that point, the visitor has left the website. These clients need Streams to quickly stream in the clickstream data, analyze on the fly, and present the offer to the web visitor.
  • #8: Spark Streaming Process live streams of data (IoT, Twitter, Kafka, etc.) with the Spark engine to drive some action or be outputted in batches to various data stores Implementing near-realtime stream event processing (e.g. fraud / security detection) Mllib – Machine Learning Processing machine learning algorithms in areas such as clustering, classification, etc. Applicability in sentiment analysis, predictive intelligence, segmentation, modeling, etc. Building and deploying rich analytics models (e.g. risk metrics) Spark SQL – Interactive Analytics Query your structured data sets with SQL or other dataframe APIs. Use BI tools to connect and query via JDBC or ODBC. Interactive querying of very large data sets is a no-brainer, it’s one of the most important value adds enabled by Spark, versus Hadoop. The more interactive or the more iterative it is, the greater the performance improvement GraphX (graph) Represent and analyze systems represented by nodes and interconnections between them – transportation, person relationships, etc. Allows you to perform operations on the graph to determine relationships e.g. behavior propensity, churn and fraud detection as examples Data Processing and Integration Existing data processing workloads done much faster Coding that is simplified e.g. 3 lines of code instead of 6 pages in traditional programming
  • #9: Client Name SmarterData Company Background Based in San Ramon, California, Smarter Data, Inc. leverages advanced data science technologies – predictive and prescriptive analytics – to help companies achieve relevance with their customers both online and in a retail environment, and manage the demands of digital-age business challenges. Business challenge To help its retail clients navigate the uncertainties of the digital-age industry, SmarterData wanted to find new ways to provide relevant, actionable, data-driven insights into consumer behavior. The benefit SmarterData’s clients can now perform real-time analysis, utilizing everything from point-of-sale data to weather data, empowering in-store employees to take immediate action on the shop floor. Pull Quote “Using IBM Analytics for Apache Spark, we can now give in-store teams valuable insight in seconds.” —Ram Himmatraopet, Founder & CEO, SmarterData Solution components IBM® Analytics for Apache Spark IBM Bluemix® Case study Link http://www.ibm.com/common/ssi/cgi-bin/ssialias?subtype=AB&infotype=PM&htmlfid=YTC04066USEN&attachment=YTC04066USEN.PDF
  • #10: Customers can run their IT and development in one of 4 options. In Traditional IT applications can either be run at the customer’s location or On-premise or hosted at a 3rd party location. In this option the good news is the customer has the capability and responsibility to investigate the right solutions, source and buy the solutions, integrate them and control, run and manage the entire stack. The bad news is the customer HAS TO spend the time and money to research and test solutions and control, run, integrate and manage the entire stack! This can be incredibly expensive and time consuming and does not add value to the customer’s business For IaaS or Infrastructure as a Service the customer has the responsibility and requirements to run and manage the Operating System on up. The Service Provider manages the bottom layers. SoftLayer, AWS and Azure are IaaS solutions. For Platform as a Service, which is what Bluemix is, the service provider manages the infrastructure and the customer’s developers focus 100% on their application code and the data. For Software as a Service the service provider hosts 100% of the data, logic and infrastructure. The customer only gets a browser. Examples are Salesforce.com, Microsoft Office 365, facebook, eBay, LinkedIn and Concur are examples. Basically, All that is needed is a browser and a printer.
  • #11: Let’s look at the three major cloud deployment models. These are private, public and hybrid clouds. On the far left of the graphic, you see the enterprise data center, most clients will continue to maintain a traditional data center for some IT services and in this deployment model, the client owns and operates all of the hardware and software and their enterprise data center. The next box, the Private Cloud deployment model is a Private Cloud inside the client’s data center. The client owns and operates the infrastructure and software. The next type of Private Cloud is the Managed Private Cloud. In this deployment model the cloud is located in the client’s data center but IBM is operating and managing the cloud for the client. The next Private Cloud deployment model is the Hosted Private Cloud. This Private Cloud resides in an IBM Data Center, it is still owned by the client but IBM performs all of the operational and management support. Note, as we move further and further to the right, the client gives up more and more control to a third party. To Cloud Data Services sellers, the differences between Private Cloud types are not as important as they are to IBM GTS or GBS sellers. Sales will be of monthly or perpetual licenses, and someone else is selling the infrastructure and labor. The only major thing to watch out for is that Hosted Private may require additional selling of data security and data movement technology, since the client’s data is moving off-premise. To the far right, you see the Public Cloud deployment model and in this model a service provider makes resources such as applications and storage available to consumers over the internet. The client pays for the resources that they consume. SoftLayer and Amazon are examples of a Public Cloud. The final deployment model is the hybrid cloud shown at the bottom of the page. The hybrid cloud is an integrated cloud which may be cloud to enterprise or cloud to cloud integration, so the clients have the benefit of the seamless IT system. Many enterprise clients are moving to a hybrid cloud model.
  • #12: 11