Exploratory Data Analysis
Pranav Agarwal (SDE-3 @ Flipkart)
Bala Nathan (Architect @ Flipkart)
The Fifth Elephant
Bangalore, 2015
Most enterprises have more than one
analytics system
Different kinds of analysis
Apache Lens
● Incubating since October 2014
● Cube abstraction across multiple
datastores
○ Facts
○ Dimensions
○ Partitions
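To make the cube vocabulary on this slide concrete, here is a minimal plain-Python sketch of a fact table, a dimension table, and date partitions, with a roll-up and a drill-down over them. It is not the Apache Lens API: the names `FACT_SALES`, `DIM_CITY`, and `aggregate`, and all of the data, are hypothetical and exist only for illustration.

```python
from collections import defaultdict

# Hypothetical fact rows: one measure (amount) keyed by a dimension id,
# tagged with the date partition ("dt") the row would be stored under.
FACT_SALES = [
    {"dt": "2015-07-01", "city_id": 1, "amount": 120.0},
    {"dt": "2015-07-01", "city_id": 2, "amount": 80.0},
    {"dt": "2015-07-02", "city_id": 1, "amount": 50.0},
    {"dt": "2015-07-02", "city_id": 3, "amount": 70.0},
]

# Hypothetical dimension table: attributes that describe each city_id.
DIM_CITY = {
    1: {"city": "Bangalore", "state": "Karnataka"},
    2: {"city": "Mysore", "state": "Karnataka"},
    3: {"city": "Chennai", "state": "Tamil Nadu"},
}


def aggregate(facts, dim, level, partitions=None):
    """Sum `amount` grouped by one dimension attribute.

    If `partitions` is given, rows outside those date partitions are skipped,
    which mimics partition pruning.
    """
    totals = defaultdict(float)
    for row in facts:
        if partitions is not None and row["dt"] not in partitions:
            continue
        key = dim[row["city_id"]][level]
        totals[key] += row["amount"]
    return dict(totals)


# Roll up to the coarser attribute, then drill down to the finer one.
by_state = aggregate(FACT_SALES, DIM_CITY, "state")
by_city = aggregate(FACT_SALES, DIM_CITY, "city")

# Restrict the same roll-up to a single date partition.
one_day = aggregate(FACT_SALES, DIM_CITY, "state", partitions={"2015-07-01"})
```

The same `aggregate` call serves both directions: only the grouping attribute changes between a roll-up and a drill-down, which is the kind of simplification a cube layer is meant to offer.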
Apache Zeppelin
● Incubating since December 2014
● Web-based notebook
● Collaborative data analytics
● Visualization tool
● Integrates with Apache Spark and
Apache Lens
Demo
Key Take-aways
● Simplified drill-downs and roll-ups
● Easy and consistent mechanism to
discover data across systems
● In-place visualization of big data
● Web notebook interface helps you
organize, revisit and collaborate on findings
Q & A
More questions? Feel free to send them to:
user@lens.incubator.apache.org
praagarw@gmail.com
balaknathan@gmail.com

Exploratory Data Analysis Using Apache Lens and Apache Zeppelin
