SlideShare a Scribd company logo
November 2013

BIG ANALYTICS
THE GOOD & THE VALUE
About Think Big Analytics

¨ 

Formed in 2010 to help clients launch and scale-out Big Data solutions

¨ 

Services include Big Data strategy, training, engineering and data science

¨ 

¨ 

Management Background: Quantcast, Cambridge Technology, Oracle, Sun
Microsystems, Accenture
Blue chip clients, including:
Ø 

Internet Transactions Security Global #1

Ø 

Retail 2 of Global Top 5

Ø 

Banking 4 of Global Top 1; Financial Services 2 of Global Top 5

Ø 

Asset Management Global #1

Ø 

Disk Manufacturing Global #1

Ø 

Social Networking Global #1

CONFIDENTIAL

|

2

2
Think Big Integrated Value

Integrated Value
Advisory
¨ 
¨ 

¨ 

¨ 

Understand true
business needs
Evaluate suitability of
new technologies

¨ 

Provide perspective on
market ideas
¨ 

Ensure engineering
and analytics support
business goals
Help establish realistic
and attainable
objectives
Drive client-specific
innovation

Implement
¨ 

¨ 

¨ 

Understand technology
preferences and
limitations
Assess talent skills and
development needs
Develop deep knowledge
of the data and tools

CONFIDENTIAL

|

3
Big Analytics

Ÿ 
Ÿ 
Ÿ 
Ÿ 

New data
Yielding new opportunities
Enabled by new approaches
With supporting organization

CONFIDENTIAL

|

4
New Data

CONFIDENTIAL

|

5
Nontraditional Formats

Ÿ  Unstructured data != text
- Call logs
- Raw video
- Satellite photos

CONFIDENTIAL

|

6
Exhaust

Ÿ  Byproduct data
Ÿ  Driving interest in the Internet
of Things

Ÿ  Our machines tell a story about
us

CONFIDENTIAL

|

7
Data about Data

Ÿ  Data usage patterns
Ÿ  Driving next generation
organizations
- Data access patterns as KPI
- Systems access as employee
engagement

CONFIDENTIAL

|

8
New Opportunity

CONFIDENTIAL

|

9
Fingerprinting

Ÿ  Unintentional patterns define us
- ATM rhythm
- Botnet synchronization

Ÿ  More connected world exposes
more fingerprints
- Mobile installs and settings +
NFC
- Sensory data at shopping mall
displays

CONFIDENTIAL

|

10
Dark Data Insights

Ÿ 
Ÿ 
Ÿ 
Ÿ 
Ÿ 

It’s back from the dead!
Audit data
Fund manager predictions
Employee logs
Architectural records

CONFIDENTIAL

|

11
New Approaches

CONFIDENTIAL

|

12
Unstructured Analysis

Ÿ  Non-traditional structures
- Path models
- High dimensionality

Ÿ  Text
- POS
- Classification

Ÿ  Images
- Object recognition
- Time differentials

CONFIDENTIAL

|

13
Deep Learning

Ÿ  MapReduce built for
- Bootstrapped models
- Partitioning data by complex
logic

Ÿ  Backpropagation is hard
Ÿ  Feature learning isn’t (always)

CONFIDENTIAL

|

14
Challenges Incorporating Data
Science

CONFIDENTIAL

|

15
Organizational Integration

Ÿ  Traditionally under engineering
Ÿ  Integrated with data creators,
not data consumers

Ÿ  Disconnected from business
priorities

CONFIDENTIAL

|

16
Success Loops

Ÿ  We take BI for granted
- Analysts find novel patterns
- Business sees new trends
- Statistics is balanced by
domain knowledge
- Integration of actors aware of
feasibility, cost, and impact

Ÿ  Where does your data scientist
sit?

CONFIDENTIAL

|

17
Successful Incorporation of Data
Science

CONFIDENTIAL

|

18
Partnership

Ÿ  Business is a partner, not a
customer

Ÿ  New insights, capabilities, and
products are not born in a
vacuum

CONFIDENTIAL

|

19
Cross Functional Teams

Ÿ  Data science is a process, not
a job role

- Engineering
- Research
- Statistics
- Business
- Salesmanship

Ÿ  Successful Big Analytics blends
skills, perspectives, and pushes
boundaries

CONFIDENTIAL

|

20
Measurement

Ÿ  Requires KPI/KRI
Ÿ  Performance metrics
- Direct actions
- Create purpose

CONFIDENTIAL

|

21
Client Success

CONFIDENTIAL

|

22
Example Client Phase 1

Ÿ  First phase: Big Analytics
execution

Ÿ  New methods of Botnet
detection

Ÿ  Led to patent

CONFIDENTIAL

|

23
Example Client Phase 2

Ÿ  Further analysis
- Improvement of Botnet models

Ÿ  Expansion of cross functional
Big Analytic team
- Tool selection
- Training
- Early win identification
- Self-selected group

CONFIDENTIAL

|

24
Example Client Phase 3

Ÿ  Cross-Functional Analytic
Organization

Ÿ 
Ÿ 
Ÿ 
Ÿ 

Governance
Ownership and accountability
Process
Roadmap

CONFIDENTIAL

|

25
Questions?
www.thinkbiganalytics.com
www.linkedin.com/in/danmallinger
@danmallinger

CONFIDENTIAL

|

26

More Related Content

PPTX
Big data-science-oanyc
PDF
"Data Pipelines for Small, Messy and Tedious Data", Vladislav Supalov, CAO & ...
PDF
Data Science Salon: Building a Data Science Culture
PPTX
Data Science Salon: Applying Machine Learning to Modernize Business Processes
PDF
Data Science Salon: Culture, Data Engineering and Hamburger Stands: Thoughts ...
PPTX
Big data webinar may23 nrit by sunil
PPTX
Data Science Salon: Introduction to Machine Learning - Marketing Use Case
PDF
Neo4j Innovation Lab – Bringing the Best of Data Science and Design Thinking ...
Big data-science-oanyc
"Data Pipelines for Small, Messy and Tedious Data", Vladislav Supalov, CAO & ...
Data Science Salon: Building a Data Science Culture
Data Science Salon: Applying Machine Learning to Modernize Business Processes
Data Science Salon: Culture, Data Engineering and Hamburger Stands: Thoughts ...
Big data webinar may23 nrit by sunil
Data Science Salon: Introduction to Machine Learning - Marketing Use Case
Neo4j Innovation Lab – Bringing the Best of Data Science and Design Thinking ...

What's hot (20)

PDF
GraphTour 2020 - Customer Journey with Neo4j Services
PDF
"What we learned from 5 years of building a data science software that actual...
PPTX
Big data perspective solution & technology
PDF
Data & Analytics at Scale
PDF
PASS Summit Data Storytelling with R Power BI and AzureML
PDF
SpeedTrack Tech Overview 2015
PDF
Building a data platform tnt
PDF
Building up a Data Science Team from Scratch
PDF
The 3 Key Barriers Keeping Companies from Deploying Data Products
PDF
Double Your Hadoop Performance with Hortonworks SmartSense
PDF
How a global manufacturing company built a data science capability from scratch
PDF
Vishal resume
PDF
Knowi Overview: NoSQL Analytics and Business Intelligence
PPTX
Dataiku data science studio
PDF
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
PDF
The Role(s) of Data Science in Modern Organizations
PPTX
Handoop training in bangalore
PPTX
Week1day2 (1)
PDF
Strategy toolbox for startsups
GraphTour 2020 - Customer Journey with Neo4j Services
"What we learned from 5 years of building a data science software that actual...
Big data perspective solution & technology
Data & Analytics at Scale
PASS Summit Data Storytelling with R Power BI and AzureML
SpeedTrack Tech Overview 2015
Building a data platform tnt
Building up a Data Science Team from Scratch
The 3 Key Barriers Keeping Companies from Deploying Data Products
Double Your Hadoop Performance with Hortonworks SmartSense
How a global manufacturing company built a data science capability from scratch
Vishal resume
Knowi Overview: NoSQL Analytics and Business Intelligence
Dataiku data science studio
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
The Role(s) of Data Science in Modern Organizations
Handoop training in bangalore
Week1day2 (1)
Strategy toolbox for startsups
Ad

Viewers also liked (11)

PPTX
BIG Data Science: A Path Forward
PPTX
Real Time Data Processing Using Spark Streaming
PDF
R+Hadoop - Ask Bigger (and New) Questions and Get Better, Faster Answers
PDF
Apachecon Europe 2012: Operating HBase - Things you need to know
PDF
R + 15 minutes = Hadoop cluster
PPTX
Dan Mallinger, Data Science Practice Manager, Think Big Analytics at MLconf NYC
PPTX
January 2015 HUG: Apache Flink: Fast and reliable large-scale data processing
PPTX
Are You Ready for Big Data Big Analytics?
PDF
HBase and Impala Notes - Munich HUG - 20131017
PDF
High Performance Predictive Analytics in R and Hadoop
PDF
Predictive Analytics using R
BIG Data Science: A Path Forward
Real Time Data Processing Using Spark Streaming
R+Hadoop - Ask Bigger (and New) Questions and Get Better, Faster Answers
Apachecon Europe 2012: Operating HBase - Things you need to know
R + 15 minutes = Hadoop cluster
Dan Mallinger, Data Science Practice Manager, Think Big Analytics at MLconf NYC
January 2015 HUG: Apache Flink: Fast and reliable large-scale data processing
Are You Ready for Big Data Big Analytics?
HBase and Impala Notes - Munich HUG - 20131017
High Performance Predictive Analytics in R and Hadoop
Predictive Analytics using R
Ad

Similar to Big Analytics: Building Lasting Value (20)

PDF
Dan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATL
PDF
IDC Retail Insights - What's Possible with a Modern Data Architecture?
PDF
Data strategy demistifying data
PDF
Competitive Advantage from the Data Lake
PDF
Acctiva: expertise in Business Intelligence, Data Warehousing, Data Governance
PDF
The Softer Skills Analysts need to make an impact
PDF
Building a Winning Roadmap for Analytics
PPTX
Big Data Strategies
PDF
ThoughtWorks: Monetising Open Banking
PDF
Mastering Data Science: A Key to Unlocking Business Potential
PDF
Agile BI success factors
PPTX
Scaling Your Enterprise With Data Science
PPTX
From 'I think' to 'I know'
PDF
Team undiscovered opportunuity analysis report presentation- venture lab 2012
PDF
Big data analytics and innovation
PDF
From Customer Insights to Action
PDF
ASAS 2014 - Klasien Postma
PDF
Developing a Modernization Strategy: Evaluating the Options by Chris Koppe
PDF
Data strategy in a Big Data world
PDF
Mainframe Day 2022 -The NRB Group - the best partner of your z-modernization.pdf
 
Dan Mallinger – Data Science Practice Manager, Think Big Analytics at MLconf ATL
IDC Retail Insights - What's Possible with a Modern Data Architecture?
Data strategy demistifying data
Competitive Advantage from the Data Lake
Acctiva: expertise in Business Intelligence, Data Warehousing, Data Governance
The Softer Skills Analysts need to make an impact
Building a Winning Roadmap for Analytics
Big Data Strategies
ThoughtWorks: Monetising Open Banking
Mastering Data Science: A Key to Unlocking Business Potential
Agile BI success factors
Scaling Your Enterprise With Data Science
From 'I think' to 'I know'
Team undiscovered opportunuity analysis report presentation- venture lab 2012
Big data analytics and innovation
From Customer Insights to Action
ASAS 2014 - Klasien Postma
Developing a Modernization Strategy: Evaluating the Options by Chris Koppe
Data strategy in a Big Data world
Mainframe Day 2022 -The NRB Group - the best partner of your z-modernization.pdf
 

Recently uploaded (20)

PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Encapsulation theory and applications.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
sap open course for s4hana steps from ECC to s4
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
Cloud computing and distributed systems.
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
KodekX | Application Modernization Development
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Empathic Computing: Creating Shared Understanding
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Machine learning based COVID-19 study performance prediction
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
MIND Revenue Release Quarter 2 2025 Press Release
NewMind AI Weekly Chronicles - August'25 Week I
Encapsulation theory and applications.pdf
Encapsulation_ Review paper, used for researhc scholars
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
sap open course for s4hana steps from ECC to s4
Understanding_Digital_Forensics_Presentation.pptx
Cloud computing and distributed systems.
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Network Security Unit 5.pdf for BCA BBA.
KodekX | Application Modernization Development
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
The AUB Centre for AI in Media Proposal.docx
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Empathic Computing: Creating Shared Understanding
Advanced methodologies resolving dimensionality complications for autism neur...
Digital-Transformation-Roadmap-for-Companies.pptx
Machine learning based COVID-19 study performance prediction
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Review of recent advances in non-invasive hemoglobin estimation
MIND Revenue Release Quarter 2 2025 Press Release

Big Analytics: Building Lasting Value