Analytics	on	Cloud
From	prototyping	to	scalable	production	with	
Cloud	Data	Services
Wilfried	Hoge
Architect	Big	Data	Analytics
hoge@de.ibm.com
@wilfriedhoge
1.	IDEAS
In	the	beginning	there	is	an	idea:
• Collect	new	data
• Combine	internal	and	external	data
• Explore	data
• Analyze	data	in	new	ways
But	how	to	bring	the	ideas	to	life?
[Figure labels: Energy, Health, Geo, Science, Mobility, Internal Data, Shopping]
IMAGINE
I	work	in	a	company	that	offers	a	running	
app.
• I	already	have	a	fast	system	to	capture	
all	activities	of	my	customers
• The	activities	(and	tracks)	are	
collected	in	a	NoSQL	system	in	the	
cloud
Picture	by	jkd_shanghai
Cloud-Based	Systems	of	Engagement
(NoSQL,	Mobile	Apps,	Internet	of	Things,	Social	Media)
CLOUDANT
IBM’s	NoSQL store
• is	deployed	in	minutes	on	cloud
• easily	integrates	with	applications	and	
data	sources	in	the	Bluemix context
• includes analytic capabilities (e.g.
geospatial)
• is	flexible	in	scaling	and	replication
IDEA: MONETIZE DATA
I would like to see whether the data is
interesting to others, so that my
company can monetize it.
First, I have to analyze what can be
found in the data.
2.	PROTOTYPE
A	prototyping	environment	is	needed
• Fast	and	easy	deployment
• Various	technologies
• Integration	with	internal	structures
• Compatible	with	tools	of	choice
DASHDB
IBM’s	in-memory	analytics	database
• is	deployed	in	minutes	on	cloud
• easily	integrates	with	applications	
and	data	sources	in	the	Bluemix
context,	cloud	providers	(e.g.	
Amazon)	and	internal	IT
• has	a	rich	SQL	interface	and	in-
database	analytics	capabilities	(e.g.	
statistics,	geospatial)
• comes	with	R	environment	included
• is an analytics warehouse extension
of Cloudant
[Diagram labels: Cloud-Based Systems of Engagement (NoSQL, Mobile Apps, Internet of Things, Social Media); Schema Discovery Process; IBM & Third Party Integrations (Cognos, SPSS, SAS, Tableau, ESRI ArcGIS, R)]
Cloud Data Services - from prototyping to scalable analytics on cloud
DATA IN MY SANDBOX
• I	created	my	own	Sandbox	system	to	
analyze	the	data	on	Bluemix:	an	
analytical	database	(dashDB)
• I copied the data from Cloudant to
dashDB (schema discovery process)
• But	now	I	would	like	to	analyze	the	
data	interactively	with	my	favorite	
language:	Python
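Copying JSON documents from a NoSQL store into an analytical database means mapping nested documents to flat columns. The following is a minimal Python sketch of that idea only. It is not the actual Cloudant-to-dashDB schema discovery implementation, and the activity documents, field names, and helper functions are hypothetical.

```python
def flatten(doc, prefix=""):
    """Flatten nested dicts into one level, using dotted column names."""
    flat = {}
    for key, value in doc.items():
        name = f"{prefix}{key}"
        if isinstance(value, dict):
            # Recurse into sub-documents and prefix their keys.
            flat.update(flatten(value, prefix=f"{name}."))
        else:
            flat[name] = value
    return flat


def discover_schema(docs):
    """Map each column name to the set of Python type names seen for it."""
    schema = {}
    for doc in docs:
        for column, value in flatten(doc).items():
            schema.setdefault(column, set()).add(type(value).__name__)
    return schema


# Hypothetical activity documents as they might sit in a document store.
activities = [
    {"_id": "a1", "user": "u1", "track": {"id": "t1", "km": 5.2}},
    {"_id": "a2", "user": "u2", "track": {"id": "t2", "km": 10}, "device": "watch"},
]

schema = discover_schema(activities)
for column in sorted(schema):
    print(column, sorted(schema[column]))
```

Columns that appear with more than one type (here track.km) or only in some documents (here device) are the cases where a relational target needs a decision about the column type and about NULL values.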
Cloud-Based	Spark	environment
(Object	Storage,	SQL,	R,	Python,	Text	Analytics,	
Machine	Learning,	Jupyter notebooks)
SPARK
IBM’s	Spark-as-a-Service	offering
• is	deployed	in	minutes	on	cloud	
• is offered pay-as-you-go or as a reserved
environment
• has integrated object storage and scales with
the number of executors
• easily integrates with applications and
data sources in the Bluemix context
• uses Jupyter notebooks as frontend
I LIKE NOTEBOOKS
• I	create	a	Spark	service	on	Bluemix
• I	access	the	data	from	my	Jupyter
notebook
• In	the	notebook	I	can	create	a	
visualization	to	see	where	runners	
typically	do	their	training	in	Munich
• Interactively	I	can	dig	deeper	into	the	
data
• I	look	at	the	most	active	tracks	and	
the	toughest	tracks
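The two rankings mentioned above can be sketched in a few lines of plain Python. This is an illustration under stated assumptions, not the notebook from the talk: in a Spark notebook the same logic would more likely be written against DataFrames. The track names, run records, and elevation profiles are hypothetical, and "toughest" is assumed to mean the largest total ascent.

```python
from collections import Counter

# Hypothetical activity records: (runner_id, track_id)
runs = [
    ("u1", "track_a"),
    ("u2", "track_a"),
    ("u3", "track_b"),
    ("u1", "track_a"),
    ("u2", "track_c"),
]

# Hypothetical elevation samples in meters along each track
profiles = {
    "track_a": [510, 512, 511, 513],
    "track_b": [520, 525, 522, 530],
    "track_c": [520, 540, 560, 565],
}


def total_ascent(elevations):
    """Sum of all uphill differences between consecutive samples."""
    return sum(max(b - a, 0) for a, b in zip(elevations, elevations[1:]))


# Most active tracks: count how often each track was run.
popularity = Counter(track for _, track in runs)
most_active = popularity.most_common()

# Toughest tracks: sort by total ascent, highest first.
toughest = sorted(profiles, key=lambda t: total_ascent(profiles[t]), reverse=True)

print(most_active)
print(toughest)
```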
I FOUND SOMETHING
• Tracks that are both popular and
tough give me routes that are
suitable for placing ads
• The points of highest inclination are
the best places for
refreshment ads, e.g. for energy drinks
• Maybe we could sell that knowledge
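A rough Python sketch of the two findings: selecting tracks that are both popular and tough, and locating the steepest segment of a track. The thresholds, track names, counts, and profile samples are hypothetical, and inclination is assumed to be elevation gain divided by horizontal distance between consecutive samples.

```python
def ad_candidates(run_counts, ascents, min_runs, min_ascent):
    """Tracks that are both popular (run count) and tough (total ascent)."""
    return sorted(
        track
        for track, count in run_counts.items()
        if count >= min_runs and ascents.get(track, 0) >= min_ascent
    )


def steepest_segment(samples):
    """Return (grade, start_m, end_m) of the steepest segment, or None.

    samples: list of (distance_m, elevation_m) ordered by distance.
    """
    best = None
    for (d0, e0), (d1, e1) in zip(samples, samples[1:]):
        if d1 <= d0:
            continue  # skip duplicate or out-of-order samples
        grade = (e1 - e0) / (d1 - d0)
        if best is None or grade > best[0]:
            best = (grade, d0, d1)
    return best


# Hypothetical aggregates per track
run_counts = {"track_a": 120, "track_b": 15, "track_c": 90}
ascents = {"track_a": 40, "track_b": 200, "track_c": 150}

# Hypothetical profile of one track: (distance in m, elevation in m)
profile = [(0, 500.0), (100, 502.0), (200, 512.0), (300, 515.0)]

candidates = ad_candidates(run_counts, ascents, min_runs=50, min_ascent=100)
segment = steepest_segment(profile)
print(candidates, segment)
```

The segment returned by steepest_segment is where a refreshment ad would be placed in this simplified model.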
Cloud-Based	Hadoop environment
(Spark,	SQL,	R,	Text	Analytics,	Machine	Learning)
BIGINSIGHTS
IBM’s	Hadoop	distribution
• is	deployed	in	minutes	with	
virtualized	HW	on	cloud	
• is	deployed	in	days	with	bare	metal	
HW	for	high	performance	on	cloud	
• easily	integrates	with	applications	and	
data	sources	in	the	Bluemix context
• is	100%	open	source	with	IBM	added	
value	extensions
• is	flexible	in	scaling
WATSON	ANALYTICS
IBM’s	Self-service	analytics	for	business	
users	and	experts
• is	deployed	in	minutes	on	cloud
• automatically	analyzes	your	data	and		
provides	questions	for	you	to	explore	
• provides	automated	data	
visualizations
• identifies	predictors	of	your	target	
analysis
• lets you pin your results for use in
dashboards and storytelling
[Figure labels: Fully Automated Intelligence; Natural Language Dialogue; Guided Analytic Discovery; Single Analytics Experience]
DATAWORKS
IBM’s	Data	Refinery	Service	that
• is	deployed	in	minutes	on	cloud
• is	a	collection	of	cloud-based	data	
access	and	refinement	services
• integrates with data services on
Bluemix and on-premises
• offers	data	load,	data	profiling,	
probabilistic	match
Systems	of	Record	&	Insight	Integration
(Watson	Analytics,	DB2,	Oracle,	Hadoop,	
dashDB,	flat	files)
Cloud	Data	Services	offers	the	most	complete	portfolio	of	data	&	analytics	services
Retrieve and Rank, Natural Language Classifier, Tone Analyzer
Watson’s	APIs	are	also	around	the	corner	– the	cognitive	building	blocks	that	harness	your	data
3.	FAIL	FAST
If	the	idea	is	not	working	or	gives	no	
valuable	insights
• Don’t	spend	time	on	it	any	more
• Throw	your	prototyping	
environment	away	
• Move	on	to	the	next	idea
• All	the	IBM	offerings	described	are	
cloud-based,	have	entry	versions	free	
of	charge	and	are	paid	on	a	monthly	
basis
• If	you	don’t	need	them	any	more,	just	
stop	the	subscription
WE CAN DO MORE
• The Bluemix development platform offers
many services to build a complete
solution
• Use	weather	data	in	combination	
with	location	info	and	our	knowledge	
while	customers	use	the	app	for	an	
activity
• Push special offers to customers
related to current conditions
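The idea of combining weather, location, and the knowledge about tracks can be sketched as a simple rule. This is only an illustration: the Context fields, the temperature threshold, and the offer names are assumptions, and a real solution would get these inputs from a weather service and the running app.

```python
from dataclasses import dataclass
from typing import Optional

HOT_THRESHOLD_C = 25.0  # assumed cut-off for "weather is hot"


@dataclass
class Context:
    temperature_c: float     # current temperature at the runner's location
    at_high_intensity: bool  # runner is at a known high-intensity point


def pick_offer(ctx: Context) -> Optional[str]:
    """Choose an offer to push, or None if no offer should be sent."""
    if not ctx.at_high_intensity:
        return None  # only push offers at high-intensity points
    if ctx.temperature_c >= HOT_THRESHOLD_C:
        return "hot_weather_refreshment"
    return "standard_refreshment"


print(pick_offer(Context(temperature_c=26.3, at_high_intensity=True)))
```

Keeping the rule in one small function makes it easy to replace the fixed threshold later with something learned from the data.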
[Figure: customer is at high intensity point; Right Now 26.3°; weather is hot]
Picture	by	Shinya	Suzuki
4.	PRODUCTION
• If the ideas are proven to give insights
or benefits, bring them to production
• But don’t reinvent the wheel
• Adapt	the	procedures	implemented	in	
the	prototyping	phase	directly	to	
production	
• The	IBM	offerings	are	not	just	
small	prototyping	
environments	
• They have production-ready
variants or can be deployed
on-premises
1.	IDEAS
2.	PROTOTYPE
3.	FAIL	FAST
4.	PRODUCTION
• IBM can help you bring your
innovations to life
• Prototype ideas, fail fast, bring
successful ideas into production: in
the cloud, on-premises, or hybrid
• Innovations are born in the cloud and
mature where it best fits your
business
Wilfried	Hoge
Architect	Big	Data	Analytics
hoge@de.ibm.com
@wilfriedhoge
