SlideShare a Scribd company logo
Big Data && Fast Data
Vitaliy Rudnytskiy, SAP
August 2014, Budapest
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 2
Disclaimer
This presentation outlines our general product direction and should not be relied on in making a
purchase decision. This presentation is not subject to your license agreement or any other agreement
with SAP. SAP has no obligation to pursue any course of business outlined in this presentation or to
develop or release any functionality mentioned in this presentation. This presentation and SAP's
strategy and possible future developments are subject to change and may be changed by SAP at any
time for any reason without notice. This document is provided without a warranty of any kind, either
express or implied, including but not limited to, the implied warranties of merchantability, fitness for a
particular purpose, or non-infringement. SAP assumes no responsibility for errors or omissions in this
document, except if such damages were caused by SAP intentionally or grossly negligent.
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 3
Let’s start with…
… me :) Vitaliy Rudnytskiy
@Sygyzmundovych
SAP’s Developer Center team
- developers.sap.com
- Data Management and Analytics
Live in Wrocław, Poland
What is …
Big Data?
4
What is Big Data?
5
3xV: Volume, Variety, Velocity
4xV: + Veracity
5xV: + Value
6xV: + Vitaliy 
„One Terabyte Club” sounds like terayears ago!
7
Big Data Definitions accordingly to CIOs
8
Instant Access to Data
9
SAP Road to In-Memory Technology
11
Source: “SAP HANA Essentials”, Jeffrey Word
Principle #1
Keep all required data (aka “hot data”) in computers’ main memory
Rarely accessed data (aka “cold data”) can be moved to cheaper storage
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 13
Data Access Latencies and Bandwidth
Source: Intel
Principle #2
Compress data to minimize the footprint
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 17
Columnar and Row Based Data Storage
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 18
Data Compression in Column Store
1. Dictionary Compression
2. Compression of the Value ID
Sequence
• Prefix encoding
• Run Length encoding
• Cluster encoding
• Sparse encoding
• Indirect encoding
3. Further Compression
for Main Dictionary
Principle #3
Cache-sensitive data layout and cache-aware algorithms
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 21
„RAM Locality is King”
Source: Microsoft Research: research.microsoft.com/en.../Flash_is_Good.ppt
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 22
Bandwidth between CPU and Main Memory
(a) Intel Shared Front Side Bus (UMA Architecture)
(b) Intel Quick Path Interconnect (NUMA Architecture)
Source: “In-Memory Data Management: An Inflection Point for Enterprise Applications” by Hasso Plattner, Alexander Zeier
Principle #4
Software performance growth is in the parallel processing,
not in increasing clock speed
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 25
Examples of Parallelization in a Column Store
Principle #5
Move data-intensive operations to the data layer
Integrate data processing engines (relational, text, graph, streaming…)
Include built-in libraries (geospecial, predictive, 3rd parties)
Have you seen…
1
Petabyte? 31
The SAP HANA One Petabyte Test
33
Source: http://www.saphana.com/community/blogs/blog/2012/11/12/the-sap-hana-one-petabyte-test
Guinness World Record: 12.1 PB DW
34
Source: http://www.guinnessworldrecords.com/world-records/5000/largest-data-warehouse
Here comes Hadoop
2.8 ZB in 2012
85% from New Data Types
15x Machine Data by 2020
40 ZB by 2020
New Sources (Sentiment,
Clickstream, Geo, Sensor)
Ref:Hortonworks
37
SAP HANA && Hadoop: Comparison
38Source: „How to Use Hadoop with Your SAP® Software Landscape”, http://www.saphana.com/docs/DOC-3777
Big Data && Fast Data
39
Query FederationTwo-Speed Analytics
Big Data is Strategic to SAP and Partners
40
http://www.cloudera.com/content/cloudera/en/
solutions/partner/SAP.html http://hortonworks.com/partner/sap/
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 41
Planned Innovations Future DirectionToday
SAP HANA and Hadoop
Product road map overview – key themes and capabilities
This is the current state of planning and may be changed by SAP at any time.
Integrate:
 Usability
– Data Virtualization (Smart Data Access ) to Hive
via ODBC connectivity
– Richer SQL access from SAP HANA studio to
Hive Tables
 Co-processing
– Compute SQL operation between Hive and HANA
 Performance
– SDA optimizations: function completion
– Remote caching of HIVE jobs
 Enterprise readiness
– Hadoop Distributions Offering from our Partners
a. Co-innovation & Reselling with Intel and
Hortonworks
b. Open for other distro’s using HIVE odbc drivers
Optimize:
 Usability
– Execute custom map reduce methods from SAP
HANA and consume the results from HANA queries
via SQL User Defined Function (UDF).
– SDA support for Data Provisioning for the SAP HANA
Service/Adapter Framework
 Performance
– SDA optimization for connectivity, semi joins &
relocation when large amounts of data is queried
from SAP HANA and the number of Hadoop nodes
are increased
– Concurrent execution of HIVE jobs from SAP HANA
 Enterprise readiness
– * Support for HANA Authorization/ Authentication
and encryption capabilities with partner Hadoop
Disributions
– * Integrate and validate Cloudera CDH-5.1 and
Hortonworks HDP-2.0 Hadoop distributions with
latest SAP HANA platform
– Support Apache Spark SQL via partners distro
See Appendix for abbreviations
Synthesize:
 Usability
– Trigger and consume output of PIG jobs from HANA
– Support HBase as a store for HANA federated queries
 Co-processing
– Advanced SDA for Hadoop windowing, table
partitioned functions
– Hadoop as extended data store for HANA (Big Unified
Table)
– Tiered storage: hot/cold data controlled by HANA
 Performance
– Distributed in-memory caching of hot HDFS data
– Parallel connections from HANA to Hadoop for loading
and queries
– Extend HANA compute engine to Hadoop
 Enterprise readiness
– Integration of Partner Hadoop Distro features into SAP
HANA Cockpit
– Send Hadoop logs to SAP Solution Manager
– HANA snapshot to HDFS with restore
– Common authentication and authorization
Want to learn and code in…
SAP
HANA? 42
43
Free online education: open.sap.com
45
Community and developer access:
http://developers.sap.com/hana
• Free developer editions
hosted in the cloud
(AWS, Azure, CloudShare…)
• How-to guides
• Community support
• Blogs
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 46
Examples of some cool ideas from the community
„HADOOP HDFS Explorer built with HANA XS and SAPUI5” by Aron MacDonald
http://scn.sap.com/community/developer-center/hana/blog/2014/07/03/hadoop-hdfs-explorer-built-with-hana-xs-and-sapui5
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 47
Examples of some cool ideas from the community
„Experiences with SAP HANA Geo-Spatial Features” by Trinoy Hazarika
http://scn.sap.com/community/developer-center/hana/blog/2014/02/25/experiences-with-sap-hana-geo-spatial-features-part-1
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 48
Examples of some cool ideas from the community
„Predicting My Next Twitter Follower with SAP HANA PAL” by Lucas Sparvieri
*PAL – Predictive Analysis Library
http://scn.sap.com/community/developer-center/hana/blog/2013/09/02/predicting-my-next-twitter-follower-with-sap-hana-pal
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 49
Examples of some cool ideas from the community
„Detecting World Cup GOAL using Twitter and SAP HANA” by Stevanic Artana
http://scn.sap.com/community/developer-center/hana/blog/2014/07/03/goal-detection-using-twitter-and-sap-hana
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 50
Examples of some cool ideas from the community
„A Simple Door Monitoring System with HANA XS and Raspberry Pi” by Ferry Gunawan
http://scn.sap.com/community/developer-center/hana/blog/2014/07/09/build-a-door-sensor-with-raspberry-pi-and-hana
1000+ Startups working with us
51More info and how to join: http://startups.saphana.com
Semantic Vision from Czech Rep.
52
More: http://www.saphana.com/community/learn/startups/news-views/blog/2013/10/04/beyond-the-big-hype-
using-semantic-search-to-predict-the-way-the-universe-will-behave
Startups in SAP Startup Forum Program
53
2/3rds in
these 4
categories
Area of Startup Focus % of Startups
Visualization / BI / Market Insight 25%
Social Media / Collaboration / Gaming 17%
Predictive Analytics / Complex Analytics 13%
Sensor Network Data / Internet of Things 10%
Geospatial / Geo-Location / 3D Analysis 8%
Primarily a Mobile Solution 7%
Big Data Infrastructure / Appliance + SAP HANA 6%
Enterprise Process Acceleration 6%
Extension of specific SAP Solutions and SAP Expertise 4%
Energy Management / Sustainability 3%
54
Community and developer access:
developers.sap.com
Contact information:
Vitaliy Rudnytskiy
vitaliy.rudnytskiy@sap.com
@Sygyzmundovych
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 55
© 2013 SAP AG or an SAP affiliate company. All rights reserved.
No part of this publication may be reproduced or transmitted in any form or for any purpose without the express permission of SAP AG.
The information contained herein may be changed without prior notice.
Some software products marketed by SAP AG and its distributors contain proprietary software components of other software vendors.
National product specifications may vary.
These materials are provided by SAP AG and its affiliated companies ("SAP Group") for informational purposes only, without representation or warranty of any kind, and
SAP Group shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP Group products and services are those that are set forth
in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty.
SAP and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP AG in Germany and
other countries.
Please see http://www.sap.com/corporate-en/legal/copyright/index.epx#trademark for additional trademark information and notices.

More Related Content

PPTX
Finance month closing with HANA
PDF
SAP HANA and Apache Hadoop for Big Data Management (SF Scalable Systems Meetup)
PDF
Big data/Hadoop/HANA Basics
PDF
Big Data, Big Thinking: Untapped Opportunities
PDF
SAP HANA Use Cases in 27 Industries
PDF
CIO Guide to Using SAP HANA Platform For Big Data
PDF
How Old Is Your Data? Don't Settle For Bad Data!
PPTX
Leveraging SAP, Hadoop, and Big Data to Redefine Business
Finance month closing with HANA
SAP HANA and Apache Hadoop for Big Data Management (SF Scalable Systems Meetup)
Big data/Hadoop/HANA Basics
Big Data, Big Thinking: Untapped Opportunities
SAP HANA Use Cases in 27 Industries
CIO Guide to Using SAP HANA Platform For Big Data
How Old Is Your Data? Don't Settle For Bad Data!
Leveraging SAP, Hadoop, and Big Data to Redefine Business

What's hot (20)

PDF
SAP Inside Track Belgium 2018 - SAP Leonardo Machine Learning Demystified
PPTX
SAP EIM Overview
PPTX
SAP Predictive Analytics
PDF
Business intelligence in the era of big data
PDF
In-Memory Analytics - SAP Big Data - Analytics Tools Selection - SAP HANA & ...
PPTX
SAP HANA in Healthcare: Real-Time Big Data Analysis
PDF
How PepsiCo's Big Data Strategy is Disrupting CPG Retail Analytics
PDF
SAP HANA Vora SITMTY 20160707
PDF
Enterprise data science - What it takes to build?
PDF
Revolutionizing Executive Insight - The SAP Digital Boardroom
PDF
SAP HANA Interactive Use Case Map
PPTX
Sap hana l1 -reinventing real-time businesses through innovation, value & si...
PPTX
Hadoop, Spark and Big Data Summit presentation with SAP HANA Vora and a path ...
PPTX
SAP Data Services
PDF
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
PDF
Accelerate Your Move to an Intelligent Enterprise with SAP Cloud Platform and...
PDF
Predictive Analytics with SAP HANA
PDF
#askSAP Analytics Innovations Community Call: Delivering Big Data Inisghts wi...
PDF
20100430 introduction to business objects data services
PPTX
SAP Helps Reduce Silos Between Business and Spatial Data
SAP Inside Track Belgium 2018 - SAP Leonardo Machine Learning Demystified
SAP EIM Overview
SAP Predictive Analytics
Business intelligence in the era of big data
In-Memory Analytics - SAP Big Data - Analytics Tools Selection - SAP HANA & ...
SAP HANA in Healthcare: Real-Time Big Data Analysis
How PepsiCo's Big Data Strategy is Disrupting CPG Retail Analytics
SAP HANA Vora SITMTY 20160707
Enterprise data science - What it takes to build?
Revolutionizing Executive Insight - The SAP Digital Boardroom
SAP HANA Interactive Use Case Map
Sap hana l1 -reinventing real-time businesses through innovation, value & si...
Hadoop, Spark and Big Data Summit presentation with SAP HANA Vora and a path ...
SAP Data Services
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Accelerate Your Move to an Intelligent Enterprise with SAP Cloud Platform and...
Predictive Analytics with SAP HANA
#askSAP Analytics Innovations Community Call: Delivering Big Data Inisghts wi...
20100430 introduction to business objects data services
SAP Helps Reduce Silos Between Business and Spatial Data
Ad

Similar to SAP HANA - Big Data and Fast Data (20)

PPTX
Big data tim
PPTX
In-Memory Database Platform for Big Data
PPTX
The SAP Startup Focus Program – Tackling Big Data With the Power of Small by ...
PDF
IMCSummit 2015 - Day 1 IT Business Track - In-memory computing with SAP HANA:...
PDF
Developing and Deploying Applications on the SAP HANA Platform
PPTX
Harnessing Big Data in Real-Time
PDF
Autodesk Technical Webinar: SAP HANA in-memory database
PPTX
SDA - POC
PDF
Interactive SAP Big Data Overview
PDF
D14,C21 ビックデータ・イノベーションを起こすSAPのリアルタイム・データ・プラットフォームのご紹介 by Ryo Saso
PDF
Capturing big value in big data
PDF
Future of Enterprise PaaS
PDF
Next Generation Data Platforms - Deon Thomas
PPTX
Big data4businessusers
PPTX
Future of Enterprise PaaS (Cloud Foundry Summit 2014)
PDF
Big data analysis concepts and references
PPTX
Dr. Bjarne Berg for Knowledge Stream
PDF
"Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio...
PDF
Comment rendre votre architecture BI plus flexible avec HANA?
PDF
Dba to data scientist -Satyendra
Big data tim
In-Memory Database Platform for Big Data
The SAP Startup Focus Program – Tackling Big Data With the Power of Small by ...
IMCSummit 2015 - Day 1 IT Business Track - In-memory computing with SAP HANA:...
Developing and Deploying Applications on the SAP HANA Platform
Harnessing Big Data in Real-Time
Autodesk Technical Webinar: SAP HANA in-memory database
SDA - POC
Interactive SAP Big Data Overview
D14,C21 ビックデータ・イノベーションを起こすSAPのリアルタイム・データ・プラットフォームのご紹介 by Ryo Saso
Capturing big value in big data
Future of Enterprise PaaS
Next Generation Data Platforms - Deon Thomas
Big data4businessusers
Future of Enterprise PaaS (Cloud Foundry Summit 2014)
Big data analysis concepts and references
Dr. Bjarne Berg for Knowledge Stream
"Integration of Hadoop in Business landscape", Michal Alexa, IT and Innovatio...
Comment rendre votre architecture BI plus flexible avec HANA?
Dba to data scientist -Satyendra
Ad

More from Vitaliy Rudnytskiy (20)

PDF
SIT Wrocław 2019 - Intro
PDF
Wroclaw SAP Meetup 2019/02
PDF
Wrocław SAP Meetup - 2018/02
PDF
Gentle Introduction into Geospatial (using SQL in SAP HANA)
PDF
IoT at Scale
PDF
Welcome to SAP Community of Developers!
PDF
Wroclaw SAP Meetup 2017/10
PDF
SAP Vora CodeJam
PDF
SAP HANA and SAP Vora
PDF
Mobile of People and Internet of Things: State of the Union
PDF
Wroclaw SAP Meetup - 2017/01
PDF
Wroclaw SAP Meetup - 2016/10
PDF
Quantify your drive: IoT on a personal scale with SAP technologies
PDF
Overview of SAP HANA Cloud Platform
PDF
PDF
Welcome to SAP Community of Developers!
PDF
SAP Developer Center - March 2016 update
PDF
SAP Tech Innovation for Business - 2014.05
PDF
SAP CodeJam Mobile - Poland 2013
PDF
SAP Store (in Polish / po polsku)
SIT Wrocław 2019 - Intro
Wroclaw SAP Meetup 2019/02
Wrocław SAP Meetup - 2018/02
Gentle Introduction into Geospatial (using SQL in SAP HANA)
IoT at Scale
Welcome to SAP Community of Developers!
Wroclaw SAP Meetup 2017/10
SAP Vora CodeJam
SAP HANA and SAP Vora
Mobile of People and Internet of Things: State of the Union
Wroclaw SAP Meetup - 2017/01
Wroclaw SAP Meetup - 2016/10
Quantify your drive: IoT on a personal scale with SAP technologies
Overview of SAP HANA Cloud Platform
Welcome to SAP Community of Developers!
SAP Developer Center - March 2016 update
SAP Tech Innovation for Business - 2014.05
SAP CodeJam Mobile - Poland 2013
SAP Store (in Polish / po polsku)

Recently uploaded (20)

PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Encapsulation theory and applications.pdf
PDF
cuic standard and advanced reporting.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Approach and Philosophy of On baking technology
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Electronic commerce courselecture one. Pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Unlocking AI with Model Context Protocol (MCP)
Encapsulation theory and applications.pdf
cuic standard and advanced reporting.pdf
Network Security Unit 5.pdf for BCA BBA.
NewMind AI Weekly Chronicles - August'25 Week I
Approach and Philosophy of On baking technology
Spectral efficient network and resource selection model in 5G networks
Building Integrated photovoltaic BIPV_UPV.pdf
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Chapter 3 Spatial Domain Image Processing.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
20250228 LYD VKU AI Blended-Learning.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
“AI and Expert System Decision Support & Business Intelligence Systems”
Electronic commerce courselecture one. Pdf
MYSQL Presentation for SQL database connectivity
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Dropbox Q2 2025 Financial Results & Investor Presentation
Agricultural_Statistics_at_a_Glance_2022_0.pdf
How UI/UX Design Impacts User Retention in Mobile Apps.pdf

SAP HANA - Big Data and Fast Data

  • 1. Big Data && Fast Data Vitaliy Rudnytskiy, SAP August 2014, Budapest
  • 2. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 2 Disclaimer This presentation outlines our general product direction and should not be relied on in making a purchase decision. This presentation is not subject to your license agreement or any other agreement with SAP. SAP has no obligation to pursue any course of business outlined in this presentation or to develop or release any functionality mentioned in this presentation. This presentation and SAP's strategy and possible future developments are subject to change and may be changed by SAP at any time for any reason without notice. This document is provided without a warranty of any kind, either express or implied, including but not limited to, the implied warranties of merchantability, fitness for a particular purpose, or non-infringement. SAP assumes no responsibility for errors or omissions in this document, except if such damages were caused by SAP intentionally or grossly negligent.
  • 3. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 3 Let’s start with… … me :) Vitaliy Rudnytskiy @Sygyzmundovych SAP’s Developer Center team - developers.sap.com - Data Management and Analytics Live in Wrocław, Poland
  • 4. What is … Big Data? 4
  • 5. What is Big Data? 5 3xV: Volume, Variety, Velocity 4xV: + Veracity 5xV: + Value 6xV: + Vitaliy 
  • 6. „One Terabyte Club” sounds like terayears ago! 7
  • 7. Big Data Definitions accordingly to CIOs 8
  • 9. SAP Road to In-Memory Technology 11 Source: “SAP HANA Essentials”, Jeffrey Word
  • 10. Principle #1 Keep all required data (aka “hot data”) in computers’ main memory Rarely accessed data (aka “cold data”) can be moved to cheaper storage
  • 11. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 13 Data Access Latencies and Bandwidth Source: Intel
  • 12. Principle #2 Compress data to minimize the footprint
  • 13. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 17 Columnar and Row Based Data Storage
  • 14. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 18 Data Compression in Column Store 1. Dictionary Compression 2. Compression of the Value ID Sequence • Prefix encoding • Run Length encoding • Cluster encoding • Sparse encoding • Indirect encoding 3. Further Compression for Main Dictionary
  • 15. Principle #3 Cache-sensitive data layout and cache-aware algorithms
  • 16. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 21 „RAM Locality is King” Source: Microsoft Research: research.microsoft.com/en.../Flash_is_Good.ppt
  • 17. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 22 Bandwidth between CPU and Main Memory (a) Intel Shared Front Side Bus (UMA Architecture) (b) Intel Quick Path Interconnect (NUMA Architecture) Source: “In-Memory Data Management: An Inflection Point for Enterprise Applications” by Hasso Plattner, Alexander Zeier
  • 18. Principle #4 Software performance growth is in the parallel processing, not in increasing clock speed
  • 19. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 25 Examples of Parallelization in a Column Store
  • 20. Principle #5 Move data-intensive operations to the data layer Integrate data processing engines (relational, text, graph, streaming…) Include built-in libraries (geospecial, predictive, 3rd parties)
  • 22. The SAP HANA One Petabyte Test 33 Source: http://www.saphana.com/community/blogs/blog/2012/11/12/the-sap-hana-one-petabyte-test
  • 23. Guinness World Record: 12.1 PB DW 34 Source: http://www.guinnessworldrecords.com/world-records/5000/largest-data-warehouse
  • 24. Here comes Hadoop 2.8 ZB in 2012 85% from New Data Types 15x Machine Data by 2020 40 ZB by 2020 New Sources (Sentiment, Clickstream, Geo, Sensor) Ref:Hortonworks 37
  • 25. SAP HANA && Hadoop: Comparison 38Source: „How to Use Hadoop with Your SAP® Software Landscape”, http://www.saphana.com/docs/DOC-3777
  • 26. Big Data && Fast Data 39 Query FederationTwo-Speed Analytics
  • 27. Big Data is Strategic to SAP and Partners 40 http://www.cloudera.com/content/cloudera/en/ solutions/partner/SAP.html http://hortonworks.com/partner/sap/
  • 28. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 41 Planned Innovations Future DirectionToday SAP HANA and Hadoop Product road map overview – key themes and capabilities This is the current state of planning and may be changed by SAP at any time. Integrate:  Usability – Data Virtualization (Smart Data Access ) to Hive via ODBC connectivity – Richer SQL access from SAP HANA studio to Hive Tables  Co-processing – Compute SQL operation between Hive and HANA  Performance – SDA optimizations: function completion – Remote caching of HIVE jobs  Enterprise readiness – Hadoop Distributions Offering from our Partners a. Co-innovation & Reselling with Intel and Hortonworks b. Open for other distro’s using HIVE odbc drivers Optimize:  Usability – Execute custom map reduce methods from SAP HANA and consume the results from HANA queries via SQL User Defined Function (UDF). – SDA support for Data Provisioning for the SAP HANA Service/Adapter Framework  Performance – SDA optimization for connectivity, semi joins & relocation when large amounts of data is queried from SAP HANA and the number of Hadoop nodes are increased – Concurrent execution of HIVE jobs from SAP HANA  Enterprise readiness – * Support for HANA Authorization/ Authentication and encryption capabilities with partner Hadoop Disributions – * Integrate and validate Cloudera CDH-5.1 and Hortonworks HDP-2.0 Hadoop distributions with latest SAP HANA platform – Support Apache Spark SQL via partners distro See Appendix for abbreviations Synthesize:  Usability – Trigger and consume output of PIG jobs from HANA – Support HBase as a store for HANA federated queries  Co-processing – Advanced SDA for Hadoop windowing, table partitioned functions – Hadoop as extended data store for HANA (Big Unified Table) – Tiered storage: hot/cold data controlled by HANA  Performance – Distributed in-memory caching of hot HDFS data – Parallel connections from HANA to Hadoop for loading and queries – Extend HANA compute engine to Hadoop  Enterprise readiness – Integration of Partner Hadoop Distro features into SAP HANA Cockpit – Send Hadoop logs to SAP Solution Manager – HANA snapshot to HDFS with restore – Common authentication and authorization
  • 29. Want to learn and code in… SAP HANA? 42
  • 31. 45 Community and developer access: http://developers.sap.com/hana • Free developer editions hosted in the cloud (AWS, Azure, CloudShare…) • How-to guides • Community support • Blogs
  • 32. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 46 Examples of some cool ideas from the community „HADOOP HDFS Explorer built with HANA XS and SAPUI5” by Aron MacDonald http://scn.sap.com/community/developer-center/hana/blog/2014/07/03/hadoop-hdfs-explorer-built-with-hana-xs-and-sapui5
  • 33. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 47 Examples of some cool ideas from the community „Experiences with SAP HANA Geo-Spatial Features” by Trinoy Hazarika http://scn.sap.com/community/developer-center/hana/blog/2014/02/25/experiences-with-sap-hana-geo-spatial-features-part-1
  • 34. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 48 Examples of some cool ideas from the community „Predicting My Next Twitter Follower with SAP HANA PAL” by Lucas Sparvieri *PAL – Predictive Analysis Library http://scn.sap.com/community/developer-center/hana/blog/2013/09/02/predicting-my-next-twitter-follower-with-sap-hana-pal
  • 35. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 49 Examples of some cool ideas from the community „Detecting World Cup GOAL using Twitter and SAP HANA” by Stevanic Artana http://scn.sap.com/community/developer-center/hana/blog/2014/07/03/goal-detection-using-twitter-and-sap-hana
  • 36. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 50 Examples of some cool ideas from the community „A Simple Door Monitoring System with HANA XS and Raspberry Pi” by Ferry Gunawan http://scn.sap.com/community/developer-center/hana/blog/2014/07/09/build-a-door-sensor-with-raspberry-pi-and-hana
  • 37. 1000+ Startups working with us 51More info and how to join: http://startups.saphana.com
  • 38. Semantic Vision from Czech Rep. 52 More: http://www.saphana.com/community/learn/startups/news-views/blog/2013/10/04/beyond-the-big-hype- using-semantic-search-to-predict-the-way-the-universe-will-behave
  • 39. Startups in SAP Startup Forum Program 53 2/3rds in these 4 categories Area of Startup Focus % of Startups Visualization / BI / Market Insight 25% Social Media / Collaboration / Gaming 17% Predictive Analytics / Complex Analytics 13% Sensor Network Data / Internet of Things 10% Geospatial / Geo-Location / 3D Analysis 8% Primarily a Mobile Solution 7% Big Data Infrastructure / Appliance + SAP HANA 6% Enterprise Process Acceleration 6% Extension of specific SAP Solutions and SAP Expertise 4% Energy Management / Sustainability 3%
  • 40. 54 Community and developer access: developers.sap.com Contact information: Vitaliy Rudnytskiy vitaliy.rudnytskiy@sap.com @Sygyzmundovych
  • 41. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 55 © 2013 SAP AG or an SAP affiliate company. All rights reserved. No part of this publication may be reproduced or transmitted in any form or for any purpose without the express permission of SAP AG. The information contained herein may be changed without prior notice. Some software products marketed by SAP AG and its distributors contain proprietary software components of other software vendors. National product specifications may vary. These materials are provided by SAP AG and its affiliated companies ("SAP Group") for informational purposes only, without representation or warranty of any kind, and SAP Group shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP Group products and services are those that are set forth in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty. SAP and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP AG in Germany and other countries. Please see http://www.sap.com/corporate-en/legal/copyright/index.epx#trademark for additional trademark information and notices.