SlideShare a Scribd company logo
Big Data – Are You Ready?
Thomas Kyte
http://asktom.oracle.com
Big Data Buzz
“Keeping Afloat
in a Sea of 'Big
Data”
ITBusinessEdge – 9/6/11
“Why big data
is a big deal”
InfoWorld – 9/1/11
“The challenge–
and opportunity–
of big data”
McKinsey Quarterly—5/11
“Getting a Handle
on Big Data with
Hadoop”
Businessweek-9/7/11
“Ten reasons why
Big Data will
change the travel
industry”
Tnooz -8/15/11
“The promise of
Big Data”
Intelligent Utility-8/28/11
Big Data Use Cases
Today’s Challenge New Data What’s Possible
Healthcare
Expensive office visits
Remote patient monitoring
Preventive care, reduced
hospitalization
Manufacturing
In-person support
Product sensors Automated diagnosis, support
Location-Based Services
Based on home zip code
Real time location data
Geo-advertising, traffic, local
search
Public Sector
Standardized services
Citizen surveys
Tailored services,
cost reductions
Retail
One size fits all marketing
Social media
Sentiment analysis
segmentation
What Makes it Big Data?
VOLUME VELOCITY VARIETY VALUE
SOCIAL
BLOG
SMART
METER
101100101001
001001101010
101011100101
010100100101
Make
Better
Decisions
Using
Big Data
Big Data in Action
ANALYZE
DECIDE ACQUIRE
ORGANIZE
Acquire all
available data
Big Data in Action
ANALYZE
DECIDE
ORGANIZE
ACQUIRE
Acquiring Big Data Challenge
Application will need
tochange
frequently
Need toprocess
high volume, low-
density information
Must scale out to
meet aggressive
roll out plan
Oracle NoSQL Database
Key value pair database
Dynamic data model
Highly scalable, available
Transparent load balancing
Built using BerkeleyDB
Nodes
East
Nodes
West
Nodes
Central
Nodes
NoSQL Driver
Application
NoSQL Driver
Application
…
Nodes
…
Read
Delete
Read
Update
Oracle NoSQL Database
Key value pair database
Dynamic data model
Highly scalable, available
Transparent load balancing
Built using BerkeleyDB
Oracle NoSQL Database
Key value pair database
Dynamic data model
Highly scalable, available
Transparent load balancing
Built using BerkeleyDB
Oracle NoSQL: Practically ACID
…
The serious part of Oracle NoSQL is a practical
approximation of ACID compliance, the standard that
SQL databases like to offer. ACID means "Atomic,
Consistent, Isolated, Durable transactions," and there's a
robust debate about just what this translates to in
excruciating detail. Most NoSQL systems promise a
different acronym, BASE, which stands for "Basically
Available, Soft State, and Eventually Consistent." In
other words, you'll probably get the right answer except
when you don't.
Oracle NoSQL Database
Key value pair database
Dynamic data model
Highly scalable, available
Transparent load balancing
Built using BerkeleyDB
In all, Oracle NoSQL was a pleasure to try
because it offered so many serious features
developed by a company with a deep history of
serious data management. There are dozens of
small ways in which the tool is more thorough
and sophisticated than the simpler NoSQL
projects. You get a number of different options
for increasing the durability in the face of a node
crash or trading that durability for speed. The
documentation is solid and written by working
engineers with deep experience in storing data
for enterprise customers.
Big Data in Action
ANALYZE
DECIDE
ORGANIZE
ACQUIRE
Oracle NoSQL Database
Organize and
distill big data
using massive
parallelism
Big Data in Action
ANALYZE
DECIDE ACQUIRE
ORGANIZE
Organizing Big Data Challenge
Also want to
perform analysis
on big data
Have existing
Oracle data
warehouse
Can’t negatively
impact data
warehouse SLAs
Analysis Sandbox
Provides analysis workspace
Controlled access to
resources and data
Doesn’t impact production
system
Sandboxing with Oracle Enterprise Manager
Simple to set up
Efficient server utilization
Secure and scalable
Accountable via charge back
Ideal for Oracle Exadata
Big Data in Action
ANALYZE
DECIDE ACQUIRE
ORGANIZE
Oracle NoSQL Database
Oracle Enterprise Manager
Organizing and Distilling Big Data Challenge
Want toavoid
writing lots of
Hadoop code
Must transform big
data into something
easily analyzed
Need toload data
quickly into Oracle
Data Warehouse
Hadoop Architecture
Management/Monitoring
Hadoop Distributed File System (HDFS)
MapReduce
Distributed file system with
redundant storage
Map/Reduce programming
paradigm
Highly scalable data
processing
Cost-effective model for high
volume, low density data
A Map/Reduce Pipeline
SHUFFLE
/SORT
SHUFFLE
/SORT
MAP
MAP
MAP
MAP
SHUFFLE
/SORT
REDUCE
REDUCE
SHUFFLE
/SORT
SHUFFLE
/SORT
REDUCE
REDUCE
REDUCE
INPUT
2
INPUT
1
OUTPUT
2
OUTPUT
1
MAP
MAP
MAP
MAP
MAP
REDUCE
REDUCE
REDUCE
MAP
MAP
MAP
MAP
MAP
MAP
REDUCE
REDUCE
MAP
MAP
MAP
MAP
MAP
REDUCE
REDUCE
REDUCE
Oracle Data Integrator
Reduces Hadoop
complexities
through graphical
tooling
Oracle Loader for Hadoop
SHUFFLE
/SORT
SHUFFLE
/SORT
MAP
MAP
MAP
MAP
SHUFFLE
/SORT
REDUCE
REDUCE
SHUFFLE
/SORT
SHUFFLE
/SORT
REDUCE
REDUCE
REDUCE
INPUT
2
INPUT
1
MAP
MAP
MAP
MAP
MAP
REDUCE
REDUCE
REDUCE
MAP
MAP
MAP
MAP
MAP
MAP
REDUCE
REDUCE
MAP
MAP
MAP
MAP
MAP
REDUCE
REDUCE
REDUCE
Big Data in Action
ANALYZE
DECIDE ACQUIRE
ORGANIZE
Oracle NoSQL Database
Oracle Enterprise Manager
Oracle Data Integrator
Oracle Loader for Hadoop
Analyze all your
data, at once
Big Data in Action
ANALYZE
DECIDE ACQUIRE
ORGANIZEANALYZE
Analyzing Big Data Challenge
Want toperform
statistical analysis
using R
Require access to
all data
Doing analysis on a
laptop is slow and
not secure
R Statistical Programming Language
Open source language and
environment
Used for statistical
computing and graphics
Strength in easily producing
publication-quality graphs
Highly extensible
Small data models
only are stored and
run on user’s laptop
Why R Wasn’t Ready for the Enterprise
Oracle R Enterprise Approach
Models run in-database
Processes large data sets
Uses the power of Oracle
Database 11g and Exadata
Same code, much faster
Big Data in Action
ANALYZE
DECIDE ACQUIRE
ORGANIZEANALYZE
Oracle NoSQL Database
Oracle Enterprise Manager
Oracle Data Integrator
Oracle Loader for Hadoop
Oracle R Enterprise
Decide based
on real-time
big data
Big Data in Action
ANALYZE
ACQUIRE
ORGANIZE
DECIDE
Making Decisions Based on Big Data Challenge
Want toadd new
insights into BI
dashboard
Big data has been
transformed into
actionable insight
How do we quickly
integrate R analytics
into dashboard?
Dashboard Analytics
•Oracle Business Intelligence Enterprise Edition
‒Advanced dashboard visualization
‒Runs BI and EPM applications
•Integrating R Analytics
‒Embed R script’s web interface in BI dashboard
‒Graphics will stream to BI dashboard
Oracle Integrated Solution Stack for Big Data
ACQUIRE
Oracle NoSQL
Database
HDFS
Enterprise
Applications
ORGANIZE
Hadoop
(MapReduce)
Oracle Loader
for Hadoop
Oracle Data
Integrator
DECIDE
Analytic
Applications
ANALYZE
In-Database
Analytics
Data
Warehouse
Oracle Exalytics Hardware
Engineered for extreme analytics
•40 Intel processor cores
•1 Terabyte main memory
•40 Gb InfiniBand connection to Oracle Exadata
Oracle Exalytics Software
•Oracle TimesTen In-Memory Database
‒Adaptive in-memory caching of analytics
‒In-memory columnar compression
‒Tightly integrated with Oracle Exadata
‒Enables speed-of-thought visualization
•Oracle Business Intelligence Foundation Suite
•18 Sun X4270 M2 Servers
–48 GB memory per node = 864 GB memory
–12 Intel cores per node = 216 cores
–24 TB storage per node = 432 TB storage
•40 Gb p/sec InfiniBand
•10 Gb p/sec Ethernet
Oracle Big Data Appliance Hardware
•Oracle Linux
•Java Hotspot VM
•Apache Hadoop Distribution
•R Distribution
•Oracle NoSQL Database
•Oracle Data Integrator for Hadoop
•Oracle Loader for Hadoop
Oracle Big Data Appliance Software
Big Data and Oracle - 2013

More Related Content

PPTX
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
PPTX
An Operational Data Layer is Critical for Transformative Banking Applications
PDF
Transforming GE Healthcare with Data Platform Strategy
PPTX
Moving to the Cloud: Modernizing Data Architecture in Healthcare
PPT
Webinar: 2 Billion Data Points Each Day
PPTX
Webinar: Transforming Customer Experience Through an Always-On Data Platform
PPTX
Introduction to Data Engineering
PPTX
How to get Real-Time Value from your IoT Data - Datastax
Webinar - Delivering Enhanced Message Processing at Scale With an Always-on D...
An Operational Data Layer is Critical for Transformative Banking Applications
Transforming GE Healthcare with Data Platform Strategy
Moving to the Cloud: Modernizing Data Architecture in Healthcare
Webinar: 2 Billion Data Points Each Day
Webinar: Transforming Customer Experience Through an Always-On Data Platform
Introduction to Data Engineering
How to get Real-Time Value from your IoT Data - Datastax

What's hot (20)

PDF
Horses for Courses: Database Roundtable
PPTX
Augmented Analytics and Automation in the Age of the Data Scientist
PPTX
Webinar: Comparing DataStax Enterprise with Open Source Apache Cassandra
PDF
Webinar - Bringing Game Changing Insights with Graph Databases
PPTX
Rethink Analytics with an Enterprise Data Hub
PDF
Making Big Data Easy for Everyone
PDF
Modernizing to a Cloud Data Architecture
PDF
How Virtual Reality and Machine Learning Are Powering the New Age of Network ...
PDF
You're the New CDO, Now What?
PPTX
Live Seminar Cloudera & Big Data Ecosystem
PPTX
Webinar - Data Management for the "Right-Now" Economy - The 5 Key Ingredients
PDF
Analytics in a Day Virtual Workshop
 
PPTX
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Panel - Interactive Applic...
PDF
Intro to Data Science on Hadoop
PPTX
How To Tell if Your Business Needs NoSQL
PPTX
Beyond Batch: Is ETL still relevant in the API economy?
PPTX
Getting Big Value from Big Data
PPTX
Webinar - Case Study: ProtectWise enhances network security with DataStax alw...
PDF
Benefits of the Azure Cloud
PPTX
Webinar: Comparing DataStax Enterprise with Open Source Apache Cassandra
Horses for Courses: Database Roundtable
Augmented Analytics and Automation in the Age of the Data Scientist
Webinar: Comparing DataStax Enterprise with Open Source Apache Cassandra
Webinar - Bringing Game Changing Insights with Graph Databases
Rethink Analytics with an Enterprise Data Hub
Making Big Data Easy for Everyone
Modernizing to a Cloud Data Architecture
How Virtual Reality and Machine Learning Are Powering the New Age of Network ...
You're the New CDO, Now What?
Live Seminar Cloudera & Big Data Ecosystem
Webinar - Data Management for the "Right-Now" Economy - The 5 Key Ingredients
Analytics in a Day Virtual Workshop
 
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Panel - Interactive Applic...
Intro to Data Science on Hadoop
How To Tell if Your Business Needs NoSQL
Beyond Batch: Is ETL still relevant in the API economy?
Getting Big Value from Big Data
Webinar - Case Study: ProtectWise enhances network security with DataStax alw...
Benefits of the Azure Cloud
Webinar: Comparing DataStax Enterprise with Open Source Apache Cassandra
Ad

Similar to Big Data and Oracle - 2013 (20)

PDF
Presentation big dataappliance-overview_oow_v3
PPTX
Big Data & Oracle Technologies
PDF
Big Data
PDF
User 2013-oracle-big-data-analytics-1971985
PDF
Meetup Oracle Database BCN: 2.1 Data Management Trends
PPTX
Big data oracle_introduccion
PPTX
Oracle Big Data Appliance and Big Data SQL for advanced analytics
PDF
MySQL London Tech Tour March 2015 - Big Data
PPTX
Tangenz big data
PPTX
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
PPTX
Oracle Big data at work
PPTX
Expand a Data warehouse with Hadoop and Big Data
PPTX
Oracle big data appliance and solutions
PPTX
Big Data Roundtable. Why, how, where, which, and when to start doing Big Data
PDF
Conociendo y entendiendo a tu cliente mediante monitoreo, analíticos y big data
PDF
A6 big data_in_the_cloud
PPTX
1 PSUT Big Data Class, introduction
PDF
K2 oracle big data at work transform your business with analytics
PPTX
oracleadvancedanalyticsv2otn-2859525.pptx
PDF
Big Data: Myths and Realities
Presentation big dataappliance-overview_oow_v3
Big Data & Oracle Technologies
Big Data
User 2013-oracle-big-data-analytics-1971985
Meetup Oracle Database BCN: 2.1 Data Management Trends
Big data oracle_introduccion
Oracle Big Data Appliance and Big Data SQL for advanced analytics
MySQL London Tech Tour March 2015 - Big Data
Tangenz big data
Meetup Oracle Database MAD: 2.1 Data Management Trends: SQL, NoSQL y Big Data
Oracle Big data at work
Expand a Data warehouse with Hadoop and Big Data
Oracle big data appliance and solutions
Big Data Roundtable. Why, how, where, which, and when to start doing Big Data
Conociendo y entendiendo a tu cliente mediante monitoreo, analíticos y big data
A6 big data_in_the_cloud
1 PSUT Big Data Class, introduction
K2 oracle big data at work transform your business with analytics
oracleadvancedanalyticsv2otn-2859525.pptx
Big Data: Myths and Realities
Ad

More from Connor McDonald (20)

PDF
Flashback ITOUG
PDF
Sangam 19 - PLSQL still the coolest
PDF
Sangam 19 - Analytic SQL
PDF
UKOUG - 25 years of hints and tips
PDF
Sangam 19 - Successful Applications on Autonomous
PDF
Sangam 2019 - The Latest Features
PDF
UKOUG 2019 - SQL features
PDF
APEX tour 2019 - successful development with autonomous
PDF
APAC Groundbreakers 2019 - Perth/Melbourne
PDF
OOW19 - Flashback, not just for DBAs
PDF
OOW19 - Read consistency
PDF
OOW19 - Slower and less secure applications
PDF
OOW19 - Killing database sessions
PDF
OOW19 - Ten Amazing SQL features
PDF
Latin America Tour 2019 - 18c and 19c featues
PDF
Latin America tour 2019 - Flashback
PDF
Latin America Tour 2019 - 10 great sql features
PDF
Latin America Tour 2019 - pattern matching
PDF
Latin America Tour 2019 - slow data and sql processing
PDF
ANSI vs Oracle language
Flashback ITOUG
Sangam 19 - PLSQL still the coolest
Sangam 19 - Analytic SQL
UKOUG - 25 years of hints and tips
Sangam 19 - Successful Applications on Autonomous
Sangam 2019 - The Latest Features
UKOUG 2019 - SQL features
APEX tour 2019 - successful development with autonomous
APAC Groundbreakers 2019 - Perth/Melbourne
OOW19 - Flashback, not just for DBAs
OOW19 - Read consistency
OOW19 - Slower and less secure applications
OOW19 - Killing database sessions
OOW19 - Ten Amazing SQL features
Latin America Tour 2019 - 18c and 19c featues
Latin America tour 2019 - Flashback
Latin America Tour 2019 - 10 great sql features
Latin America Tour 2019 - pattern matching
Latin America Tour 2019 - slow data and sql processing
ANSI vs Oracle language

Recently uploaded (20)

PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Machine learning based COVID-19 study performance prediction
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Empathic Computing: Creating Shared Understanding
PPTX
Spectroscopy.pptx food analysis technology
PDF
Encapsulation theory and applications.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Cloud computing and distributed systems.
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Approach and Philosophy of On baking technology
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Per capita expenditure prediction using model stacking based on satellite ima...
Spectral efficient network and resource selection model in 5G networks
Machine learning based COVID-19 study performance prediction
Understanding_Digital_Forensics_Presentation.pptx
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Empathic Computing: Creating Shared Understanding
Spectroscopy.pptx food analysis technology
Encapsulation theory and applications.pdf
Unlocking AI with Model Context Protocol (MCP)
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Cloud computing and distributed systems.
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Approach and Philosophy of On baking technology
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton

Big Data and Oracle - 2013

  • 1. Big Data – Are You Ready? Thomas Kyte http://asktom.oracle.com
  • 2. Big Data Buzz “Keeping Afloat in a Sea of 'Big Data” ITBusinessEdge – 9/6/11 “Why big data is a big deal” InfoWorld – 9/1/11 “The challenge– and opportunity– of big data” McKinsey Quarterly—5/11 “Getting a Handle on Big Data with Hadoop” Businessweek-9/7/11 “Ten reasons why Big Data will change the travel industry” Tnooz -8/15/11 “The promise of Big Data” Intelligent Utility-8/28/11
  • 3. Big Data Use Cases Today’s Challenge New Data What’s Possible Healthcare Expensive office visits Remote patient monitoring Preventive care, reduced hospitalization Manufacturing In-person support Product sensors Automated diagnosis, support Location-Based Services Based on home zip code Real time location data Geo-advertising, traffic, local search Public Sector Standardized services Citizen surveys Tailored services, cost reductions Retail One size fits all marketing Social media Sentiment analysis segmentation
  • 4. What Makes it Big Data? VOLUME VELOCITY VARIETY VALUE SOCIAL BLOG SMART METER 101100101001 001001101010 101011100101 010100100101
  • 5. Make Better Decisions Using Big Data Big Data in Action ANALYZE DECIDE ACQUIRE ORGANIZE
  • 6. Acquire all available data Big Data in Action ANALYZE DECIDE ORGANIZE ACQUIRE
  • 7. Acquiring Big Data Challenge Application will need tochange frequently Need toprocess high volume, low- density information Must scale out to meet aggressive roll out plan
  • 8. Oracle NoSQL Database Key value pair database Dynamic data model Highly scalable, available Transparent load balancing Built using BerkeleyDB Nodes East Nodes West Nodes Central Nodes NoSQL Driver Application NoSQL Driver Application … Nodes … Read Delete Read Update
  • 9. Oracle NoSQL Database Key value pair database Dynamic data model Highly scalable, available Transparent load balancing Built using BerkeleyDB
  • 10. Oracle NoSQL Database Key value pair database Dynamic data model Highly scalable, available Transparent load balancing Built using BerkeleyDB Oracle NoSQL: Practically ACID … The serious part of Oracle NoSQL is a practical approximation of ACID compliance, the standard that SQL databases like to offer. ACID means "Atomic, Consistent, Isolated, Durable transactions," and there's a robust debate about just what this translates to in excruciating detail. Most NoSQL systems promise a different acronym, BASE, which stands for "Basically Available, Soft State, and Eventually Consistent." In other words, you'll probably get the right answer except when you don't.
  • 11. Oracle NoSQL Database Key value pair database Dynamic data model Highly scalable, available Transparent load balancing Built using BerkeleyDB In all, Oracle NoSQL was a pleasure to try because it offered so many serious features developed by a company with a deep history of serious data management. There are dozens of small ways in which the tool is more thorough and sophisticated than the simpler NoSQL projects. You get a number of different options for increasing the durability in the face of a node crash or trading that durability for speed. The documentation is solid and written by working engineers with deep experience in storing data for enterprise customers.
  • 12. Big Data in Action ANALYZE DECIDE ORGANIZE ACQUIRE Oracle NoSQL Database
  • 13. Organize and distill big data using massive parallelism Big Data in Action ANALYZE DECIDE ACQUIRE ORGANIZE
  • 14. Organizing Big Data Challenge Also want to perform analysis on big data Have existing Oracle data warehouse Can’t negatively impact data warehouse SLAs
  • 15. Analysis Sandbox Provides analysis workspace Controlled access to resources and data Doesn’t impact production system
  • 16. Sandboxing with Oracle Enterprise Manager Simple to set up Efficient server utilization Secure and scalable Accountable via charge back Ideal for Oracle Exadata
  • 17. Big Data in Action ANALYZE DECIDE ACQUIRE ORGANIZE Oracle NoSQL Database Oracle Enterprise Manager
  • 18. Organizing and Distilling Big Data Challenge Want toavoid writing lots of Hadoop code Must transform big data into something easily analyzed Need toload data quickly into Oracle Data Warehouse
  • 19. Hadoop Architecture Management/Monitoring Hadoop Distributed File System (HDFS) MapReduce Distributed file system with redundant storage Map/Reduce programming paradigm Highly scalable data processing Cost-effective model for high volume, low density data
  • 21. Oracle Data Integrator Reduces Hadoop complexities through graphical tooling
  • 22. Oracle Loader for Hadoop SHUFFLE /SORT SHUFFLE /SORT MAP MAP MAP MAP SHUFFLE /SORT REDUCE REDUCE SHUFFLE /SORT SHUFFLE /SORT REDUCE REDUCE REDUCE INPUT 2 INPUT 1 MAP MAP MAP MAP MAP REDUCE REDUCE REDUCE MAP MAP MAP MAP MAP MAP REDUCE REDUCE MAP MAP MAP MAP MAP REDUCE REDUCE REDUCE
  • 23. Big Data in Action ANALYZE DECIDE ACQUIRE ORGANIZE Oracle NoSQL Database Oracle Enterprise Manager Oracle Data Integrator Oracle Loader for Hadoop
  • 24. Analyze all your data, at once Big Data in Action ANALYZE DECIDE ACQUIRE ORGANIZEANALYZE
  • 25. Analyzing Big Data Challenge Want toperform statistical analysis using R Require access to all data Doing analysis on a laptop is slow and not secure
  • 26. R Statistical Programming Language Open source language and environment Used for statistical computing and graphics Strength in easily producing publication-quality graphs Highly extensible
  • 27. Small data models only are stored and run on user’s laptop Why R Wasn’t Ready for the Enterprise
  • 28. Oracle R Enterprise Approach Models run in-database Processes large data sets Uses the power of Oracle Database 11g and Exadata Same code, much faster
  • 29. Big Data in Action ANALYZE DECIDE ACQUIRE ORGANIZEANALYZE Oracle NoSQL Database Oracle Enterprise Manager Oracle Data Integrator Oracle Loader for Hadoop Oracle R Enterprise
  • 30. Decide based on real-time big data Big Data in Action ANALYZE ACQUIRE ORGANIZE DECIDE
  • 31. Making Decisions Based on Big Data Challenge Want toadd new insights into BI dashboard Big data has been transformed into actionable insight How do we quickly integrate R analytics into dashboard?
  • 32. Dashboard Analytics •Oracle Business Intelligence Enterprise Edition ‒Advanced dashboard visualization ‒Runs BI and EPM applications •Integrating R Analytics ‒Embed R script’s web interface in BI dashboard ‒Graphics will stream to BI dashboard
  • 33. Oracle Integrated Solution Stack for Big Data ACQUIRE Oracle NoSQL Database HDFS Enterprise Applications ORGANIZE Hadoop (MapReduce) Oracle Loader for Hadoop Oracle Data Integrator DECIDE Analytic Applications ANALYZE In-Database Analytics Data Warehouse
  • 34. Oracle Exalytics Hardware Engineered for extreme analytics •40 Intel processor cores •1 Terabyte main memory •40 Gb InfiniBand connection to Oracle Exadata
  • 35. Oracle Exalytics Software •Oracle TimesTen In-Memory Database ‒Adaptive in-memory caching of analytics ‒In-memory columnar compression ‒Tightly integrated with Oracle Exadata ‒Enables speed-of-thought visualization •Oracle Business Intelligence Foundation Suite
  • 36. •18 Sun X4270 M2 Servers –48 GB memory per node = 864 GB memory –12 Intel cores per node = 216 cores –24 TB storage per node = 432 TB storage •40 Gb p/sec InfiniBand •10 Gb p/sec Ethernet Oracle Big Data Appliance Hardware
  • 37. •Oracle Linux •Java Hotspot VM •Apache Hadoop Distribution •R Distribution •Oracle NoSQL Database •Oracle Data Integrator for Hadoop •Oracle Loader for Hadoop Oracle Big Data Appliance Software