SlideShare a Scribd company logo
Oracle Big Data Discovery
Product Overview
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Richard Tomlinson
Director, Product Management
Oracle Big Data Discovery
Kip Bowes
VP Services
Cloudera
2
Speakers
3© Cloudera, Inc. All rights reserved.
Strong Partnership for Enterprise Big Data
+
4© Cloudera, Inc. All rights reserved.
Data Changes How We Work
Everything that can be
measured will be measured.
Employees and customers expect
more personal interactions, but
not at the cost of their privacy.
The most innovative companies
embrace experimentation
and agility.
Instrumentation Consumerization Experimentation
5© Cloudera, Inc. All rights reserved.
Cloudera Enterprise powered by Apache Hadoop
A new kind of data platform.
• One place for unlimited data
• Unified, multi-framework data access
Only with Cloudera:
• Enterprise Security
• Data Governance
• Complete Management
• Open source, open standards
Security and Administration
Unlimited Storage
Process Discove
r
Model Serve
Deployment
Flexibility
On-Premises
Appliances
Engineered Systems
Public Cloud
Private Cloud
Hybrid Cloud
6© Cloudera, Inc. All rights reserved.
Data Discovery is the #1 fastest growing
workload for enterprise analytics.
7© Cloudera, Inc. All rights reserved.
Data Discovery & Analytics (DD&A) :
The ability to find enterprise data and
quickly uncover new insights and
optimize existing analytics.
(AKA: Self-service BI, BI, Data Discovery, Advance Analytics, Machine Learning)
8© Cloudera, Inc. All rights reserved.
Discovery and Analytics is an Iterative Process
Report, Model,
or Rules
Ingest
Transformatio
n
80% of Time Preparing
Diverse Ingest
Search and lineage
Agile Transforms
20% of Time Analyzing
SQL
Statistical
Machine Learning
Implement
Point Solution
Custom App
Analysis
Technique
Access
Data
Generatio
n
Data Discovery
& Analytics
Flow
9© Cloudera, Inc. All rights reserved.
The Challenge for Data Discovery Projects
How do we make data
preparation 20% of the
effort so businesses can
focus 80% of their time on
executing from analytics?
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Requires a Fundamentally New Approach
10
quickly transform
and enrich it to make
it better
unlock big data for
anyone to discover
and share new value
A single intuitive and visual user interface, to...
find and explore big
data to understand its
potential
find explore transform discover share
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 11
Oracle Big Data Discovery. The Visual Face of Hadoop
find explore transform discover share
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Oracle Big Data Discovery. The Visual Face of Hadoop
12
find explore transform discover share
See the potential in big data
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Catalog
13
• Access a rich,
interactive catalog
of all data in
Hadoop
• Familiar search and
guided navigation
for ease of use
• See data set
summaries, user
annotation and
recommendations
• Provision personal
and enterprise data
to Hadoop via self-
service
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Explore
14
• Visualize all
attributes by type
• Sort attributes by
information
potential
• Assess attribute
statistics, data
quality and
outliers
• Use scratch pad to
uncover
correlations
between
attributes
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Oracle Big Data Discovery. The Visual Face of Hadoop
15
find explore transform discover share
Quickly make big data better
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 1616
• Intuitive, user
driven data
wrangling
• Extensive library of
powerful data
transformations
and enrichments
• Preview results,
undo, commit and
replay transforms
• Test on sample data
then apply to full
data set in Hadoop
Transform
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Oracle Big Data Discovery. The Visual Face of Hadoop
17
find explore transform discover share
Unlock big data for everyone
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 18
• Join and blend data
for deeper
perspectives
• Compose project
pages via drag and
drop
• Use powerful
search and guided
navigation to ask
questions
• See new patterns in
rich, interactive
data visualizations
Discover
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 19
• Share projects,
bookmarks and
snapshots with
others
• Build galleries and
tell big data stories
• Collaborate and
iterate as a team
• Publish blended
data to HDFS for
leverage in other
tools
Share
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Oracle Big Data Discovery. Technical Innovation on CDH
Oracle Confidential – Internal 20
Oracle Big Data Discovery Workloads
Hadoop Cluster
(BDA or Commodity
Hardware)
BDD node
data node
data node
data node
data node
name node
Data Processing, Workflow & Monitoring
• Profiling: catalog entry creation, data type &
language detection, schema configuration
• Sampling: dgraph (index) file creation
• Transforms: >100 functions
• Enrichments: location (geo), text (cleanup,
sentiment, entity, key-phrase, whitelist tagging)
Self-Service Provisioning & Data Transfer
• Personal Data: Upload CSV and XLS to HDFS
In-Memory Discovery Indexes
• DGraph: Search, Guided Navigation, Analytics
Studio
• Web UI: Find, Explore, Transform, Discover, Share
Hadoop 2.x
Filesystem
(HDFS)
Workload Mgmt
(YARN)
Metadata
(HCatalog)
Other Hadoop
Workloads
MapReduce
Spark
Hive
Pig
Oracle Big Data SQL
(BDA only)
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Oracle Big Data Discovery. A Game Changing Platform
Business Benefits
• Get value faster. Rapidly turn raw data
into actionable insights, leveraged across
the enterprise
• Democratize value from Big Data.
Increase the size, diversify the skills, and
improve the efficiency of Big Data teams
21
See the Potential in Big Data, Quickly Make it Better and Unlock Value for Everyone
Technical Benefits
• Destroy existing technical barriers. Run
natively on Hadoop cluster for maximum
scalability and performance
• Publish, secure and leverage. Integrate
with Hadoop open standards and
leverage the unified Oracle big data
ecosystem
Product Demo
www.oracle.com/bigdatadiscovery
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 23
Questions?
Please submit questions via the
“Q and A” box and we will
answer them live.

More Related Content

PDF
Big Data Discovery
PDF
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
PDF
6 enriching your data warehouse with big data and hadoop
PDF
Contexti / Oracle - Big Data : From Pilot to Production
PPTX
Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...
PDF
The Maturity Model: Taking the Growing Pains Out of Hadoop
PDF
Setting Up the Data Lake
PDF
You're the New CDO, Now What?
Big Data Discovery
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
6 enriching your data warehouse with big data and hadoop
Contexti / Oracle - Big Data : From Pilot to Production
Beyond a Big Data Pilot: Building a Production Data Infrastructure - Stampede...
The Maturity Model: Taking the Growing Pains Out of Hadoop
Setting Up the Data Lake
You're the New CDO, Now What?

What's hot (20)

PDF
The Data Lake - Balancing Data Governance and Innovation
PPTX
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
PDF
Making Big Data Easy for Everyone
PPTX
Big Data: Setting Up the Big Data Lake
PDF
The Emerging Data Lake IT Strategy
PDF
Data Discovery and BI - Is there Really a Difference?
PPTX
Data Governance, Compliance and Security in Hadoop with Cloudera
PPTX
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
PDF
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
PDF
Agile Big Data Analytics Development: An Architecture-Centric Approach
PPTX
Big Data's Impact on the Enterprise
PDF
Incorporating the Data Lake into Your Analytic Architecture
PDF
The Emerging Role of the Data Lake
PPTX
Govern This! Data Discovery and the application of data governance with new s...
PDF
Smart data for a predictive bank
PDF
IDC Retail Insights - What's Possible with a Modern Data Architecture?
PDF
Data lake benefits
PPTX
Data Mining - The Big Picture!
PDF
Splunk Business Analytics
PDF
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
The Data Lake - Balancing Data Governance and Innovation
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
Making Big Data Easy for Everyone
Big Data: Setting Up the Big Data Lake
The Emerging Data Lake IT Strategy
Data Discovery and BI - Is there Really a Difference?
Data Governance, Compliance and Security in Hadoop with Cloudera
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
Agile Big Data Analytics Development: An Architecture-Centric Approach
Big Data's Impact on the Enterprise
Incorporating the Data Lake into Your Analytic Architecture
The Emerging Role of the Data Lake
Govern This! Data Discovery and the application of data governance with new s...
Smart data for a predictive bank
IDC Retail Insights - What's Possible with a Modern Data Architecture?
Data lake benefits
Data Mining - The Big Picture!
Splunk Business Analytics
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
Ad

Similar to Oracle big data discovery 994294 (20)

PDF
Big Data at Oracle - Strata 2015 San Jose
PDF
Big Data: Myths and Realities
PPTX
2013 05 Oracle big_dataapplianceoverview
PPTX
Oracle Big Data Appliance and Big Data SQL for advanced analytics
PPTX
Expand a Data warehouse with Hadoop and Big Data
PDF
Long-Term Outcomes: Customer-Centered Product Strategy For Machine Intelligen...
PDF
Big Data
PDF
Presentation big dataappliance-overview_oow_v3
PPTX
Big data oracle_introduccion
PPTX
Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...
PPTX
Bridging Oracle Database and Hadoop by Alex Gorbachev, Pythian from Oracle Op...
PDF
Aleksejs Nemirovskis - Manage your data using oracle BDA
PPTX
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
PDF
5 big data at work linking discovery and bi to improve business outcomes from...
PPTX
Big data4businessusers
PPTX
Insights into Real-world Data Management Challenges
PPTX
Deutsche Telekom on Big Data
PDF
A6 big data_in_the_cloud
PDF
User 2013-oracle-big-data-analytics-1971985
PPTX
Essential Tools For Your Big Data Arsenal
Big Data at Oracle - Strata 2015 San Jose
Big Data: Myths and Realities
2013 05 Oracle big_dataapplianceoverview
Oracle Big Data Appliance and Big Data SQL for advanced analytics
Expand a Data warehouse with Hadoop and Big Data
Long-Term Outcomes: Customer-Centered Product Strategy For Machine Intelligen...
Big Data
Presentation big dataappliance-overview_oow_v3
Big data oracle_introduccion
Big Data Management System: Smart SQL Processing Across Hadoop and your Data ...
Bridging Oracle Database and Hadoop by Alex Gorbachev, Pythian from Oracle Op...
Aleksejs Nemirovskis - Manage your data using oracle BDA
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
5 big data at work linking discovery and bi to improve business outcomes from...
Big data4businessusers
Insights into Real-world Data Management Challenges
Deutsche Telekom on Big Data
A6 big data_in_the_cloud
User 2013-oracle-big-data-analytics-1971985
Essential Tools For Your Big Data Arsenal
Ad

More from Edgar Alejandro Villegas (20)

PDF
What's New in Predictive Analytics IBM SPSS - Apr 2016
PDF
Actian Ingres10.2 Datasheet
PDF
Actian Matrix Datasheet
PDF
Actian Matrix Whitepaper
PDF
Actian Vector Whitepaper
PDF
Actian DataFlow Whitepaper
PDF
The Four Pillars of Analytics Technology Whitepaper
PDF
SQL in Hadoop To Boldly Go Where no Data Warehouse Has Gone Before
PDF
Realtime analytics with_hadoop
PDF
SQL – The Natural Language for Analysis - Oracle - Whitepaper - 2431343
PDF
Hadoop and Your Enterprise Data Warehouse
PDF
Big Data SurVey - IOUG - 2013 - 594292
PDF
Best Practices for Oracle Exadata and the Oracle Optimizer
PDF
Best Practices – Extreme Performance with Data Warehousing on Oracle Databa...
PDF
Big Data and Enterprise Data - Oracle -1663869
PDF
Fast and Easy Analytics: - Tableau - Data Base Trends - Dbt06122013slides
PDF
BITGLASS - DATA BREACH DISCOVERY DATASHEET
PDF
Four Pillars of Business Analytics - e-book - Actuate
PDF
Sas hpa-va-bda-exadata-2389280
PDF
Splice machine-bloor-webinar-data-lakes
What's New in Predictive Analytics IBM SPSS - Apr 2016
Actian Ingres10.2 Datasheet
Actian Matrix Datasheet
Actian Matrix Whitepaper
Actian Vector Whitepaper
Actian DataFlow Whitepaper
The Four Pillars of Analytics Technology Whitepaper
SQL in Hadoop To Boldly Go Where no Data Warehouse Has Gone Before
Realtime analytics with_hadoop
SQL – The Natural Language for Analysis - Oracle - Whitepaper - 2431343
Hadoop and Your Enterprise Data Warehouse
Big Data SurVey - IOUG - 2013 - 594292
Best Practices for Oracle Exadata and the Oracle Optimizer
Best Practices – Extreme Performance with Data Warehousing on Oracle Databa...
Big Data and Enterprise Data - Oracle -1663869
Fast and Easy Analytics: - Tableau - Data Base Trends - Dbt06122013slides
BITGLASS - DATA BREACH DISCOVERY DATASHEET
Four Pillars of Business Analytics - e-book - Actuate
Sas hpa-va-bda-exadata-2389280
Splice machine-bloor-webinar-data-lakes

Recently uploaded (20)

PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Electronic commerce courselecture one. Pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Encapsulation theory and applications.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
1. Introduction to Computer Programming.pptx
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
MYSQL Presentation for SQL database connectivity
Electronic commerce courselecture one. Pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Empathic Computing: Creating Shared Understanding
Dropbox Q2 2025 Financial Results & Investor Presentation
Reach Out and Touch Someone: Haptics and Empathic Computing
Encapsulation theory and applications.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Agricultural_Statistics_at_a_Glance_2022_0.pdf
NewMind AI Weekly Chronicles - August'25-Week II
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
1. Introduction to Computer Programming.pptx
Encapsulation_ Review paper, used for researhc scholars
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Assigned Numbers - 2025 - Bluetooth® Document
The Rise and Fall of 3GPP – Time for a Sabbatical?

Oracle big data discovery 994294

  • 1. Oracle Big Data Discovery Product Overview
  • 2. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Richard Tomlinson Director, Product Management Oracle Big Data Discovery Kip Bowes VP Services Cloudera 2 Speakers
  • 3. 3© Cloudera, Inc. All rights reserved. Strong Partnership for Enterprise Big Data +
  • 4. 4© Cloudera, Inc. All rights reserved. Data Changes How We Work Everything that can be measured will be measured. Employees and customers expect more personal interactions, but not at the cost of their privacy. The most innovative companies embrace experimentation and agility. Instrumentation Consumerization Experimentation
  • 5. 5© Cloudera, Inc. All rights reserved. Cloudera Enterprise powered by Apache Hadoop A new kind of data platform. • One place for unlimited data • Unified, multi-framework data access Only with Cloudera: • Enterprise Security • Data Governance • Complete Management • Open source, open standards Security and Administration Unlimited Storage Process Discove r Model Serve Deployment Flexibility On-Premises Appliances Engineered Systems Public Cloud Private Cloud Hybrid Cloud
  • 6. 6© Cloudera, Inc. All rights reserved. Data Discovery is the #1 fastest growing workload for enterprise analytics.
  • 7. 7© Cloudera, Inc. All rights reserved. Data Discovery & Analytics (DD&A) : The ability to find enterprise data and quickly uncover new insights and optimize existing analytics. (AKA: Self-service BI, BI, Data Discovery, Advance Analytics, Machine Learning)
  • 8. 8© Cloudera, Inc. All rights reserved. Discovery and Analytics is an Iterative Process Report, Model, or Rules Ingest Transformatio n 80% of Time Preparing Diverse Ingest Search and lineage Agile Transforms 20% of Time Analyzing SQL Statistical Machine Learning Implement Point Solution Custom App Analysis Technique Access Data Generatio n Data Discovery & Analytics Flow
  • 9. 9© Cloudera, Inc. All rights reserved. The Challenge for Data Discovery Projects How do we make data preparation 20% of the effort so businesses can focus 80% of their time on executing from analytics?
  • 10. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Requires a Fundamentally New Approach 10 quickly transform and enrich it to make it better unlock big data for anyone to discover and share new value A single intuitive and visual user interface, to... find and explore big data to understand its potential find explore transform discover share
  • 11. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 11 Oracle Big Data Discovery. The Visual Face of Hadoop find explore transform discover share
  • 12. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Oracle Big Data Discovery. The Visual Face of Hadoop 12 find explore transform discover share See the potential in big data
  • 13. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Catalog 13 • Access a rich, interactive catalog of all data in Hadoop • Familiar search and guided navigation for ease of use • See data set summaries, user annotation and recommendations • Provision personal and enterprise data to Hadoop via self- service
  • 14. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Explore 14 • Visualize all attributes by type • Sort attributes by information potential • Assess attribute statistics, data quality and outliers • Use scratch pad to uncover correlations between attributes
  • 15. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Oracle Big Data Discovery. The Visual Face of Hadoop 15 find explore transform discover share Quickly make big data better
  • 16. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 1616 • Intuitive, user driven data wrangling • Extensive library of powerful data transformations and enrichments • Preview results, undo, commit and replay transforms • Test on sample data then apply to full data set in Hadoop Transform
  • 17. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Oracle Big Data Discovery. The Visual Face of Hadoop 17 find explore transform discover share Unlock big data for everyone
  • 18. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 18 • Join and blend data for deeper perspectives • Compose project pages via drag and drop • Use powerful search and guided navigation to ask questions • See new patterns in rich, interactive data visualizations Discover
  • 19. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 19 • Share projects, bookmarks and snapshots with others • Build galleries and tell big data stories • Collaborate and iterate as a team • Publish blended data to HDFS for leverage in other tools Share
  • 20. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Oracle Big Data Discovery. Technical Innovation on CDH Oracle Confidential – Internal 20 Oracle Big Data Discovery Workloads Hadoop Cluster (BDA or Commodity Hardware) BDD node data node data node data node data node name node Data Processing, Workflow & Monitoring • Profiling: catalog entry creation, data type & language detection, schema configuration • Sampling: dgraph (index) file creation • Transforms: >100 functions • Enrichments: location (geo), text (cleanup, sentiment, entity, key-phrase, whitelist tagging) Self-Service Provisioning & Data Transfer • Personal Data: Upload CSV and XLS to HDFS In-Memory Discovery Indexes • DGraph: Search, Guided Navigation, Analytics Studio • Web UI: Find, Explore, Transform, Discover, Share Hadoop 2.x Filesystem (HDFS) Workload Mgmt (YARN) Metadata (HCatalog) Other Hadoop Workloads MapReduce Spark Hive Pig Oracle Big Data SQL (BDA only)
  • 21. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Oracle Big Data Discovery. A Game Changing Platform Business Benefits • Get value faster. Rapidly turn raw data into actionable insights, leveraged across the enterprise • Democratize value from Big Data. Increase the size, diversify the skills, and improve the efficiency of Big Data teams 21 See the Potential in Big Data, Quickly Make it Better and Unlock Value for Everyone Technical Benefits • Destroy existing technical barriers. Run natively on Hadoop cluster for maximum scalability and performance • Publish, secure and leverage. Integrate with Hadoop open standards and leverage the unified Oracle big data ecosystem
  • 23. Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 23 Questions? Please submit questions via the “Q and A” box and we will answer them live.