Operating in a Multi-execution Engine Hadoop Environment by Erik Halseth of Datameer
© 2014 Datameer, Inc. All rights reserved.
Datameer’s Vision

Make big data analytics simple for everyone
What Datameer Offers
Wizard-led Data Integration
• No ETL
• 59 Connectors + plug-in API
• Smart Sampling
Point-and-click Analytics
• Interactive spreadsheet UI
• 270 pre-built analytic functions
• Macros & function plug-in API
Drag-and-Drop Visualization
• Blank canvas for design
• HTML5, consumable on any device
• Visualization plug-in API
Smart Analytics
Column Dependencies
Decision Tree
Recommendation Engine
Clustering
Where does Datameer sit?
Classic Business Analytics Data Flow
New Business Analytics Data Flow
Datameer On-Premise Installation
Datameer Implementation - Cloud
Stefan Groschupf, CEO, Co-Founder
Problem
Typical Data Analytics Funnel
Raw Data (TB-PB)
Insights (KB)
• More sophisticated
• Less change
• High value
• Power users
• Planned / scheduled
• More ad hoc
• More change
• High & low value
• Casual users
• Interactive sessions
5 - 15 steps, iterative algorithms
Explore
Summarize
Prepare
Learn
Aggregate
Present
Slice
Current Approaches: Either - Or
MapReduce
Raw Data (TB-PB)
Insights (KB)
• Inefficient for small data
• High latency
In-Memory
Raw Data (<TB)
Insights (KB)
• Only small data
• Very expensive
• Not Hadoop
Not New
Small Data, Big Machine
VS
600h Spent on Jobs < 100MB
Stefan Groschupf, CEO, Co-Founder
Our Solution
Smart Execution
Raw Data (TB-PB)
Insights (KB)
New
Optimized MapReduce
In-Memory
Single Node
Architecture
Hadoop
MapReduce
Dataflow Graph Engine
YARN
Smart Execution Engine
In-Memory
Tez
Others
Data Integration
Visualization
Spreadsheet
Other (SQL)
Single Node
Workflow
Data Sets
System Resources
Optimized MapReduce
Single Node
In-Memory
Future Technology
Analytics
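
The workflow above (data sets and system resources in, an execution engine out) can be sketched roughly as follows. This is only an illustrative guess at the kind of decision a smart execution layer might make, not Datameer's actual logic: the names `Job`, `Cluster`, `choose_engine`, and the 100 MB threshold (borrowed from the "Jobs < 100MB" slide) are all assumptions.

```python
from dataclasses import dataclass
from enum import Enum


class Engine(Enum):
    SINGLE_NODE = "Single Node"
    IN_MEMORY = "In-Memory"
    OPTIMIZED_MAPREDUCE = "Optimized MapReduce"


@dataclass
class Job:
    input_bytes: int  # total size of the data sets the job reads


@dataclass
class Cluster:
    free_memory_bytes: int  # memory currently available for in-memory work


# Hypothetical cutoff, chosen for illustration only.
SINGLE_NODE_LIMIT = 100 * 1024**2  # 100 MB


def choose_engine(job: Job, cluster: Cluster) -> Engine:
    """Pick an engine from data size and available resources (a sketch)."""
    if job.input_bytes < SINGLE_NODE_LIMIT:
        # Small inputs: avoid distributed scheduling overhead entirely.
        return Engine.SINGLE_NODE
    if job.input_bytes <= cluster.free_memory_bytes:
        # Input fits in available memory: keep it in memory.
        return Engine.IN_MEMORY
    # Otherwise fall back to disk-based distributed processing.
    return Engine.OPTIMIZED_MAPREDUCE
```

A caller would describe the job and the cluster state and let the function decide, which is what keeps the choice transparent to the end user.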
DAG Processing
vs.
Transparent for End Users
@Datameer
