SlideShare a Scribd company logo
Big Data Analytics – Realize the Investment from Your Big Data Clusters
Mark Davis| Senior Architect and Principal Engineer, Dell Inc.
Big Data and Society
How Is Big Data Affecting Our World?
200EB = 1018 B
1ZB = 1021 B
10EB
100TB
2000198519001750
Industrial
Revolution
#1
Industrial
Revolution
#2
Industrial
Revolution
#3
Industrial
Revolution
#4
R. J. Gordon: Is US economic growth over? Faltering innovation confronts
the six headwinds. CEPR Policy Insight No 63
Distributed
File System
MapReduce
Eventually
Consistent
Column Store
Analytics
Database
NoSQL
Structured
Semi-structured
Unstructured
Text Analytics
Machine
Learning
The Big Data “Zoo”
Big Data Use Cases
How Is Big Data Being Consumed Today?
SourcesKAS
GOAL: Improve force effectiveness
SOURCES: Situation reports and acquired multi-
source intelligence
ANALYSIS: Extract named entities and
relationships, classify and label, normalize
geospatial and temporal metadata; visually
understand relationships and trends
ACTION: Identify mission objectives and create
priorities
Defense Intelligence
Visualization
metadata
relationships
data
Visual
Understanding
entities
* Current system doesn’t scale
* Oracle with text plug-in
* Overwhelmed by intelligence needs
* Need analytic capability with search
US Army
SourcesKAS
GOAL: Be more competitive
SOURCES: Patents, PR announcements, legal
documents, whitepapers, crawled websites
ANALYSIS: Extract named entities and
relationships, classify and label; visually
understand relationships and trends
ACTION: Change R&D priorities and improve
marketing approaches
Competitive Intelligence
Viz/Search
metadata
relationships
data
Understanding
entities
* Understand IP among competitors
* Assist legal team with litigation
* Custom search experience
* Custom extractors:
Electronic parts
Memory types
Flash memory
Customer: Technology Company
SourcesKAS
GOAL: Discover new drugs, detect side-effects,
speed R&D
SOURCES: Published research reports, patents,
adverse effects databases, genomics and
proteomics databases
ANALYSIS: Extract named entities and
relationships, classify and label; visually
discover trends and relationships
ACTION: Change R&D priorities
Drug Discovery
Viz/Seach
relationships
data
Understanding
entities
pathways
sequences
* Lousy search
* Internal regulators can’t find by accession number
* Custom extractors:
Accession number
Ontology of active ingredients
Drug names
FDA
SourcesKAS
GOAL: Scalable analysis of customer relationship
engagements
SOURCES: Call center and web help contact
narratives
ANALYSIS: Ingest massive data sets; visually
discover trends, novelty, and relationships
ACTION: Predict new product issues
CRM Analytics
Viz/Search
relationships
data
Understanding
My iPhone is
very hot…
SourcesKAS
GOAL: Scalable analysis of network
failures
SOURCES: Uploaded syslog data and
configuration for routers and switches
ANALYSIS: Ingest massive data sets;
visually discover trends and relationships
ACTION: Solve network problems
Network Analytics
Viz/Search
relationships
data
Understanding
* Unable to manage customer network signals
* RDBMS
* Tiger team dumps database and runs Perl scripts for analysis
Router/Switch Vendor
SourcesKAS
GOAL: Reduce fraud
SOURCES: Analysis customer data
ANALYSIS: Extract patterns of web and service
usage, classify, label with normalized
geospatial and temporal metadata; visually
understand relationships and trends.
ACTION: Indentify fraudulent transactions and
patterns
Financial Services: Fraud
Viz/Search
metadata relationships
data
Understanding
SourcesKAS
GOAL: Identify what people want to buy
SOURCES: Crawl Twitter, blogs, and websites
ANALYSIS: Extract sentiments about products,
classify, label with normalized geospatial and
temporal metadata; visually understand
relationships and trends.
ACTION: Target sales and enhance offerings
Buy Signals
Viz/Search
metadata relationships
data
Understanding
sentiments
SourcesKAS
GOAL: Find case-supporting and actionable
information
SOURCES: Email repositories, Office
documents, patents, memos
ANALYSIS: Extract named entities and
relationships, classify and label; visually
discover trends and relationships
ACTION: Develop legal theories and prepare for
arguments
Legal Informatics
Viz/Search
metadata
relationships
data
Understanding
entities
Dell’s Kitenga Analytics Suite
 Aggregate
 Count
 Extract
 Transform
 Chart
 Graph
 Model
 Visualize
 Search
 Predict
Transform Big Data into Actionable Intelligence
Search
Facetted Search,
Visualization
Analytics
Extract, Crawl, Index,
NLP, Transform,
Machine Learning
Analytical
Producer
Analytical
Consumer
Visualization
Visualize, Model,
Interact
C* Summit 2013: Big Data Analytics – Realize the Investment from Your Big Data Clusters by Mark Davis
C* Summit 2013: Big Data Analytics – Realize the Investment from Your Big Data Clusters by Mark Davis
C* Summit 2013: Big Data Analytics – Realize the Investment from Your Big Data Clusters by Mark Davis
C* Summit 2013: Big Data Analytics – Realize the Investment from Your Big Data Clusters by Mark Davis
C* Summit 2013: Big Data Analytics – Realize the Investment from Your Big Data Clusters by Mark Davis
C* Summit 2013: Big Data Analytics – Realize the Investment from Your Big Data Clusters by Mark Davis
Cassandra in the Zoo
How Dell Is Integrating Cassandra
Cassandra Integration
Toad
IC
Cassandra
RDBMS
Salesforce
KAS
Cassandra
Crawls
Feeds
THANK YOU

More Related Content

PDF
DataScava
PPT
Your big data audience insight big data show 24 apr 2013
PDF
Using Graphs to Enable National-Scale Analytics
PPTX
5 Big Data Use Cases for 2013
PDF
Big Data analytics
PDF
BigData and Beyond
PDF
6 great competitive intelligence data sources
PPTX
Big data
DataScava
Your big data audience insight big data show 24 apr 2013
Using Graphs to Enable National-Scale Analytics
5 Big Data Use Cases for 2013
Big Data analytics
BigData and Beyond
6 great competitive intelligence data sources
Big data

What's hot (15)

PDF
Getting down to business on Big Data analytics
PPTX
Webinar | Using Big Data and Predictive Analytics to Empower Distribution and...
PDF
Thwart Fraud Using Graph-Enhanced Machine Learning and AI
PPTX
BIG DATA & DATA ANALYTICS
PDF
Big agendas for big data analytics projects
PDF
2015 Trends in Data Intelligence
PDF
Sqrrl Datasheet: Cyber Hunting
PPTX
Semantic Data Lake
PPTX
big data Presentation
PDF
Business case for Big Data Analytics
PPTX
What is big data
PPTX
PhD Projects in Big Data Analytics Research Guidance
PPTX
Big Data: 8 facts and 8 fictions
PPTX
Digital Velocity 2014: "The Holy Grail of Digital Data Analytics"
PDF
Denodo Platform 7.0: What's New?
Getting down to business on Big Data analytics
Webinar | Using Big Data and Predictive Analytics to Empower Distribution and...
Thwart Fraud Using Graph-Enhanced Machine Learning and AI
BIG DATA & DATA ANALYTICS
Big agendas for big data analytics projects
2015 Trends in Data Intelligence
Sqrrl Datasheet: Cyber Hunting
Semantic Data Lake
big data Presentation
Business case for Big Data Analytics
What is big data
PhD Projects in Big Data Analytics Research Guidance
Big Data: 8 facts and 8 fictions
Digital Velocity 2014: "The Holy Grail of Digital Data Analytics"
Denodo Platform 7.0: What's New?
Ad

Similar to C* Summit 2013: Big Data Analytics – Realize the Investment from Your Big Data Clusters by Mark Davis (20)

PDF
Why Big Data is Really about Small Data
PDF
Level Seven - Expedient Big Data presentation
PPTX
Big Data & Business Analytics: Understanding the Marketspace
PPTX
000 introduction to big data analytics 2021
PPTX
Big data insights part i
PDF
Big Data Analytics
PDF
Random notes on big data
PPTX
20150118 s snet analytics vca
PPT
"Big Data Dreams"
PPTX
Finance and Accounting BPM
PPTX
Data sciences and marketing analytics
PDF
Comprehensive Notes on Big Data Concepts and Applications Based on University...
PDF
Big data in marketing at harvard business club nick1 june 15 2013
PPTX
Big data unit 2
PPTX
Big Data Analytics
PDF
Big Data Analytics PowerPoint Presentation Slides
PPT
01-introduction.ppt the paper that you can unless you want to join me because...
PDF
Random notes on big data
PDF
Revolution in Business Analytics-Zika Virus Example
PDF
SIMPosium presentation_Bardess Qlik
Why Big Data is Really about Small Data
Level Seven - Expedient Big Data presentation
Big Data & Business Analytics: Understanding the Marketspace
000 introduction to big data analytics 2021
Big data insights part i
Big Data Analytics
Random notes on big data
20150118 s snet analytics vca
"Big Data Dreams"
Finance and Accounting BPM
Data sciences and marketing analytics
Comprehensive Notes on Big Data Concepts and Applications Based on University...
Big data in marketing at harvard business club nick1 june 15 2013
Big data unit 2
Big Data Analytics
Big Data Analytics PowerPoint Presentation Slides
01-introduction.ppt the paper that you can unless you want to join me because...
Random notes on big data
Revolution in Business Analytics-Zika Virus Example
SIMPosium presentation_Bardess Qlik
Ad

More from DataStax Academy (20)

PDF
Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
PPTX
Introduction to DataStax Enterprise Graph Database
PPTX
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
PPTX
Cassandra on Docker @ Walmart Labs
PDF
Cassandra 3.0 Data Modeling
PPTX
Cassandra Adoption on Cisco UCS & Open stack
PDF
Data Modeling for Apache Cassandra
PDF
Coursera Cassandra Driver
PDF
Production Ready Cassandra
PDF
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
PPTX
Cassandra @ Sony: The good, the bad, and the ugly part 1
PPTX
Cassandra @ Sony: The good, the bad, and the ugly part 2
PDF
Standing Up Your First Cluster
PDF
Real Time Analytics with Dse
PDF
Introduction to Data Modeling with Apache Cassandra
PDF
Cassandra Core Concepts
PPTX
Enabling Search in your Cassandra Application with DataStax Enterprise
PPTX
Bad Habits Die Hard
PDF
Advanced Data Modeling with Apache Cassandra
PDF
Advanced Cassandra
Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
Introduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Cassandra on Docker @ Walmart Labs
Cassandra 3.0 Data Modeling
Cassandra Adoption on Cisco UCS & Open stack
Data Modeling for Apache Cassandra
Coursera Cassandra Driver
Production Ready Cassandra
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 2
Standing Up Your First Cluster
Real Time Analytics with Dse
Introduction to Data Modeling with Apache Cassandra
Cassandra Core Concepts
Enabling Search in your Cassandra Application with DataStax Enterprise
Bad Habits Die Hard
Advanced Data Modeling with Apache Cassandra
Advanced Cassandra

Recently uploaded (20)

PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Empathic Computing: Creating Shared Understanding
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Machine learning based COVID-19 study performance prediction
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPT
Teaching material agriculture food technology
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PPTX
Cloud computing and distributed systems.
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Electronic commerce courselecture one. Pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Modernizing your data center with Dell and AMD
Mobile App Security Testing_ A Comprehensive Guide.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Empathic Computing: Creating Shared Understanding
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Machine learning based COVID-19 study performance prediction
Chapter 3 Spatial Domain Image Processing.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
Advanced methodologies resolving dimensionality complications for autism neur...
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Teaching material agriculture food technology
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Cloud computing and distributed systems.
Network Security Unit 5.pdf for BCA BBA.
Review of recent advances in non-invasive hemoglobin estimation
Encapsulation_ Review paper, used for researhc scholars
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Electronic commerce courselecture one. Pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
Modernizing your data center with Dell and AMD

C* Summit 2013: Big Data Analytics – Realize the Investment from Your Big Data Clusters by Mark Davis

  • 1. Big Data Analytics – Realize the Investment from Your Big Data Clusters Mark Davis| Senior Architect and Principal Engineer, Dell Inc.
  • 2. Big Data and Society How Is Big Data Affecting Our World?
  • 3. 200EB = 1018 B 1ZB = 1021 B 10EB 100TB 2000198519001750 Industrial Revolution #1 Industrial Revolution #2 Industrial Revolution #3 Industrial Revolution #4 R. J. Gordon: Is US economic growth over? Faltering innovation confronts the six headwinds. CEPR Policy Insight No 63
  • 5. Big Data Use Cases How Is Big Data Being Consumed Today?
  • 6. SourcesKAS GOAL: Improve force effectiveness SOURCES: Situation reports and acquired multi- source intelligence ANALYSIS: Extract named entities and relationships, classify and label, normalize geospatial and temporal metadata; visually understand relationships and trends ACTION: Identify mission objectives and create priorities Defense Intelligence Visualization metadata relationships data Visual Understanding entities
  • 7. * Current system doesn’t scale * Oracle with text plug-in * Overwhelmed by intelligence needs * Need analytic capability with search US Army
  • 8. SourcesKAS GOAL: Be more competitive SOURCES: Patents, PR announcements, legal documents, whitepapers, crawled websites ANALYSIS: Extract named entities and relationships, classify and label; visually understand relationships and trends ACTION: Change R&D priorities and improve marketing approaches Competitive Intelligence Viz/Search metadata relationships data Understanding entities
  • 9. * Understand IP among competitors * Assist legal team with litigation * Custom search experience * Custom extractors: Electronic parts Memory types Flash memory Customer: Technology Company
  • 10. SourcesKAS GOAL: Discover new drugs, detect side-effects, speed R&D SOURCES: Published research reports, patents, adverse effects databases, genomics and proteomics databases ANALYSIS: Extract named entities and relationships, classify and label; visually discover trends and relationships ACTION: Change R&D priorities Drug Discovery Viz/Seach relationships data Understanding entities pathways sequences
  • 11. * Lousy search * Internal regulators can’t find by accession number * Custom extractors: Accession number Ontology of active ingredients Drug names FDA
  • 12. SourcesKAS GOAL: Scalable analysis of customer relationship engagements SOURCES: Call center and web help contact narratives ANALYSIS: Ingest massive data sets; visually discover trends, novelty, and relationships ACTION: Predict new product issues CRM Analytics Viz/Search relationships data Understanding My iPhone is very hot…
  • 13. SourcesKAS GOAL: Scalable analysis of network failures SOURCES: Uploaded syslog data and configuration for routers and switches ANALYSIS: Ingest massive data sets; visually discover trends and relationships ACTION: Solve network problems Network Analytics Viz/Search relationships data Understanding
  • 14. * Unable to manage customer network signals * RDBMS * Tiger team dumps database and runs Perl scripts for analysis Router/Switch Vendor
  • 15. SourcesKAS GOAL: Reduce fraud SOURCES: Analysis customer data ANALYSIS: Extract patterns of web and service usage, classify, label with normalized geospatial and temporal metadata; visually understand relationships and trends. ACTION: Indentify fraudulent transactions and patterns Financial Services: Fraud Viz/Search metadata relationships data Understanding
  • 16. SourcesKAS GOAL: Identify what people want to buy SOURCES: Crawl Twitter, blogs, and websites ANALYSIS: Extract sentiments about products, classify, label with normalized geospatial and temporal metadata; visually understand relationships and trends. ACTION: Target sales and enhance offerings Buy Signals Viz/Search metadata relationships data Understanding sentiments
  • 17. SourcesKAS GOAL: Find case-supporting and actionable information SOURCES: Email repositories, Office documents, patents, memos ANALYSIS: Extract named entities and relationships, classify and label; visually discover trends and relationships ACTION: Develop legal theories and prepare for arguments Legal Informatics Viz/Search metadata relationships data Understanding entities
  • 19.  Aggregate  Count  Extract  Transform  Chart  Graph  Model  Visualize  Search  Predict Transform Big Data into Actionable Intelligence
  • 20. Search Facetted Search, Visualization Analytics Extract, Crawl, Index, NLP, Transform, Machine Learning Analytical Producer Analytical Consumer Visualization Visualize, Model, Interact
  • 27. Cassandra in the Zoo How Dell Is Integrating Cassandra