SlideShare a Scribd company logo
Data Exploration & BI
Cristian Guajardo Garcia
CristiánGuajardo-García
www.5561.cl
MBA Politecnico di Milano
& IIM Lucknow (India) B-
Schools.
Has worked in Chile &
Italy with companies such
as Universidad Andrés
Bello, ProChile, IBM,
Luxottica.
Agenda
1. Why data matters | BI
2. Data gathering: Scraping with Import.io
3. Data cleaning: Spreadsheets
4. Data visualization: Tableau
5. Insights for Marketing
6. Business Case: Luxottica
7. Q&A
1. Why data matters | BI
2. Data gathering: Scraping with Import.io
3. Data cleaning: Spreadsheets
4. Data visualization: Tableau
5. Insights for Marketing
6. Business Case: Luxottica
7. Q&A
Why Data matters | BI
Today we have several sources generating tons of data per
second. Businesses need to anticipate the consumer in order
to remain competitive.
SQL = Sequence or structured
NOSQL = Unstructured data.
The goal of BI is to make the right information available to
the right people in the right time.
Big Data is nothing else but gathering tons of structured
and unstructured data, filtering it, cleaning it,
visualizing it and last but not least, getting insights from
it.
Data Exploration & BI
How and where do I start?
By Exploring the Data I don’t actually know where this will
take me. I just collect data from several sources and start
to explore it
The 2 most common scenarios to
start working with dat
By Answering a Question I know I have one question to ask.
This will lead my research and let me get rid of data that
is useless for this stage.
The Steps
1. Web Scraping (gather data)
2. Clean
3. Visualize and Analyze
1. Why data matters | BI
2. Data gathering: Scraping with Import.io
3. Data cleaning: Spreadsheets
4. Data visualization: Tableau
5. Insights for Marketing
6. Business Case: Luxottica
7. Q&A
Data Gathering: Scraping with
Import.io
What is Scraping
Scraping is a technique used to extract data from one place
to another one, which is usually, a table.
Tabula = Extracts data from PDF
OCR = Extracts data from images
Import.io = Extracts data from
the web
Extractor Crawlers Connectors
Scraping is a very basic -yet useful- artificial
intelligence technique.
What is it?
● Machine reading the web
● Real time crawling through API
● Map Data of website
● Point & Click UI
● Turn data to structured data
● Tailor made crawlers
● Cloud scaling
● Wide integration options
From a minimum input get a
maximized output.
How does it work
Answer Question: What is the average € of
a Nike sneaker on eBay Italy?
1. Open Import.io
2. Create a new Connector
3. Go to ebay.it
4. Click “I’m there” button
5. Click the red button which will
record our click trail (now Import.
io will start recording your clicks)
6. Click stop button
7. Now you tell import.io what matters
to you and what is it (image, text,
link etc).
Pieces of information
Now you have the data you needed
You will create a bot that
basically gets pieces of
information that will be stored
in a table.
Once you have trained the bot to
crawl the whole results, you can
clean columns that you might not
use.
Now is time to manipulate the
data and get info like average
price, most common products and
so on.
1. Why data matters | BI
2. Data gathering: Scraping with Import.io
3. Data cleaning: Spreadsheets
4. Data visualization: Tableau
5. Insights for Marketing
6. Business Case: Luxottica
7. Q&A
Data Cleaning: Spreadsheets
Store and clean
Once you have gathered the data, you might want to hide or
erase columns. Fill the n/a spaces or do some pivot table
maneuver. Whatever the case, Spreadsheet is a great way to
go.
Pivot Table: summarize big info
HLookup and Vlookup: target
specific info store in columns
and rows.
1. Why data matters | BI
2. Data gathering: Scraping with Import.io
3. Data cleaning: Spreadsheets
4. Data visualization: Tableau
5. Insights for Marketing
6. Business Case: Luxottica
7. Q&A
Data Visualization: Tableau
What is it for?
Tableau is the ultimate desktop and cloud solution for
visualizing data coming from several sources.
Remember: privacy is an illusion
It works perfectly merging info from several sources:
Survey data, Social media, SEM and Analytics visualized in
one dashboard.
Perfect for reporting and meeting the needs of several
clients.
Why is it useful?
1. Tailor made dashboards
2. Several layers (and sources) of
information
3. Set clear goals and KPI’s
4. Easy to export
5. Works for several industries
and roles
Visualize and analyze data
Example of
Tableau
1. Why data matters | BI
2. Data gathering: Scraping with Import.io
3. Data cleaning: Excel
4. Data visualization: Tableau
5. Insights for Marketing
6. Business Case: Luxottica
7. Q&A
How we can harness the power of the web
When we start working with data we stop “believing” and
start thinking. All the data available can help us to create
consumer profiles, specific interests, potential issues with
our product or even new ways to connect with them.
1. Forecast (where the puck is going)
2. The Rise of the Robots (automation)
3. Cross selling and tailor made dashboards per client
4. Insights like you’ve never seen before
Business Case
Applied to Marketing
The scenario
Untapped customer intelligence Luxottica
needed to analyze historical data pertaining
to more than 100 million customers to
increase marketing effectiveness.
© Copyright IBM Corporation 2013
The Impact
Centralized analytics The company deployed
advanced analytics technology from IBM to
create a 360-degree view of customers.
Actionable insights By identifying the
highest-value customers and creating
individualized marketing campaigns,
Luxottica anticipates a 10 percent boost in
marketing effectiveness.
1. Anticipates a 10 %
improvement in marketing
effectiveness
2. Identifies the highest-value
customers out of nearly 100
million
3. Targets individual
customers based on unique
preferences and histories
Recommendations
Adds on for Google Sheets
● Merge Sheets
● Data Everywhere
● Mapping Sheets
● Find Fuzzy Matches
● DukeDeploy
● BlockSpring
● Text Analysis
● Translate My Sheet
● AppSheet
● BigML
Q&A

More Related Content

PPTX
Big Data, Data Visualization, Machine Learning & Artificial Intelligence by...
PDF
The Present - the History of Business Intelligence
PDF
Seven Trends in Government Business Intelligence
PPTX
Big Data & Business Analytics: Understanding the Marketspace
PPTX
An introduction to Business intelligence
PPTX
IBM CDO Fall Summit 2016 Keynote: Driving innovation in the cognitive era
PDF
Artificial intelligence
PDF
BigData & Supply Chain: A "Small" Introduction
Big Data, Data Visualization, Machine Learning & Artificial Intelligence by...
The Present - the History of Business Intelligence
Seven Trends in Government Business Intelligence
Big Data & Business Analytics: Understanding the Marketspace
An introduction to Business intelligence
IBM CDO Fall Summit 2016 Keynote: Driving innovation in the cognitive era
Artificial intelligence
BigData & Supply Chain: A "Small" Introduction

What's hot (20)

PDF
Paths to more personal and collaborative knowledge graphs
PPTX
Strata Data Conference 2019 : Scaling Visualization for Big Data in the Cloud
PPTX
Am I a Business Intelligence Hound?
PDF
Keynote GraphTour Europe 2019, Emil Eifrem, CEO & Co-Founder Neo4j
PPT
Big data and your career final
PPTX
Making Sense of your data - eLearning Network April 2014
PPTX
Importance of Big data for your Business
PPTX
BDAS-2017 | Lesson learned from the application of data science at BBVA
PDF
Top 20 artificial intelligence companies to watch out in 2022
PDF
The Past - the History of Business Intelligence
PPTX
Tseesuren - Data is the Key for Innovation
PDF
GraphTour - Opening Keynote
PDF
Graphs in Retail: Know Your Customers and Make Your Recommendations Engine Learn
PDF
Smart Data Webinar: Transforming Industries with Artificial Intelligence (AI)...
PDF
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
PPTX
Big Data, Big True
PPTX
Munkhzorig - Digital Transformation
PDF
Building a Data Platform Strata SF 2019
PPTX
Big Data Analytics
PDF
Dcaf transformation & kg adoption 2022 -alan morrison
Paths to more personal and collaborative knowledge graphs
Strata Data Conference 2019 : Scaling Visualization for Big Data in the Cloud
Am I a Business Intelligence Hound?
Keynote GraphTour Europe 2019, Emil Eifrem, CEO & Co-Founder Neo4j
Big data and your career final
Making Sense of your data - eLearning Network April 2014
Importance of Big data for your Business
BDAS-2017 | Lesson learned from the application of data science at BBVA
Top 20 artificial intelligence companies to watch out in 2022
The Past - the History of Business Intelligence
Tseesuren - Data is the Key for Innovation
GraphTour - Opening Keynote
Graphs in Retail: Know Your Customers and Make Your Recommendations Engine Learn
Smart Data Webinar: Transforming Industries with Artificial Intelligence (AI)...
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
Big Data, Big True
Munkhzorig - Digital Transformation
Building a Data Platform Strata SF 2019
Big Data Analytics
Dcaf transformation & kg adoption 2022 -alan morrison
Ad

Viewers also liked (20)

PPTX
The Rensselaer IDEA: Data Exploration
PPTX
Self Service Buisness Intelligence - Tech Talk
PPTX
General Presentation The Selfservice company
PDF
SAS Visual Analytics
PDF
SAS Visual Analytics
PDF
Data exploration validation and sanitization
PPT
Data visualisation - Big data
PDF
Sas visual-analytics-startup-guide
PPTX
Data Visualization
PPTX
Big Data and BI Best Practices
PPTX
How different between Big Data, Business Intelligence and Analytics ?
PDF
Brief introduction to data visualization
PPTX
Benefits of data visualization
PDF
Model building in credit card and loan approval
PPTX
Decision tree
PDF
Big Data Visualization
PPTX
Sas visual analytics training presentation
PDF
The Importance of Data Visualization
PPTX
Credit Risk Model Building Steps
PPTX
The 8 Hats of Data Visualisation
The Rensselaer IDEA: Data Exploration
Self Service Buisness Intelligence - Tech Talk
General Presentation The Selfservice company
SAS Visual Analytics
SAS Visual Analytics
Data exploration validation and sanitization
Data visualisation - Big data
Sas visual-analytics-startup-guide
Data Visualization
Big Data and BI Best Practices
How different between Big Data, Business Intelligence and Analytics ?
Brief introduction to data visualization
Benefits of data visualization
Model building in credit card and loan approval
Decision tree
Big Data Visualization
Sas visual analytics training presentation
The Importance of Data Visualization
Credit Risk Model Building Steps
The 8 Hats of Data Visualisation
Ad

Similar to Data Exploration & BI (20)

PPT
Business Intelligence for kids (example project)
PPTX
Business intelligence
PDF
The 3 Key Barriers Keeping Companies from Deploying Data Products
PPTX
Why 4Segment
PPTX
Why 4Segments
PDF
Enabling a Bimodal IT Framework for Advanced Analytics with Data Virtualization
PPTX
Why Everything You Know About bigdata Is A Lie
PDF
BIGDATA-DIGITAL TRANSFORMATION AND STRATEGY
PPTX
[DSC Europe 22] The Making of a Data Organization - Denys Holovatyi
PDF
Business Intelligence
PDF
Building your data driven business with Reactive Marketing Technology
PDF
Take Action on Big Data With Actian's Action Apps
PDF
Faster Ways To Data Insights
PPTX
Data mining concepts
PPTX
TOP Business Intelligence Predictions for 2015
PPTX
Shivraj MCA Evaluation 2010981527.pptx
PDF
My latest white paper
PPTX
Google Analytics Training - full 2017
PDF
Accelerate Self-Service Analytics with Data Virtualization and Visualization
PPTX
Every angle jacques adriaansen
Business Intelligence for kids (example project)
Business intelligence
The 3 Key Barriers Keeping Companies from Deploying Data Products
Why 4Segment
Why 4Segments
Enabling a Bimodal IT Framework for Advanced Analytics with Data Virtualization
Why Everything You Know About bigdata Is A Lie
BIGDATA-DIGITAL TRANSFORMATION AND STRATEGY
[DSC Europe 22] The Making of a Data Organization - Denys Holovatyi
Business Intelligence
Building your data driven business with Reactive Marketing Technology
Take Action on Big Data With Actian's Action Apps
Faster Ways To Data Insights
Data mining concepts
TOP Business Intelligence Predictions for 2015
Shivraj MCA Evaluation 2010981527.pptx
My latest white paper
Google Analytics Training - full 2017
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Every angle jacques adriaansen

More from Cristian Guajardo-Garcia (20)

PDF
Tag Fundamentals de Google
PDF
Data Scientist Toolbox
PDF
PDF
Tendencias Cosmética Natural a Base Vegetal en Italia
PDF
Using BigSheets for Spreadsheet-like Analytics by IBM
PDF
Hadoop Fundamentals by IBM
PDF
Big Data Fundamentals
PDF
Understanding Europe: Why It Matters and What It Can Offer You
PDF
Best OCU Luxottica MBA 2014
PDF
Coursera Competitive Strategy 2014
PPTX
Making Sense of Data
PDF
Ogilvy & Mather's Workshop, India.
PDF
Coursera Global Sports Business 2013
PDF
Diploma Curso Creatividad UNAM (México)
PDF
Design Thinking Stanford University Diploma
PDF
Coffee Industry Analysis
PDF
Business model
PDF
PDF
Construcción de marca
PDF
Ideas en la era del Capital Intelectual
Tag Fundamentals de Google
Data Scientist Toolbox
Tendencias Cosmética Natural a Base Vegetal en Italia
Using BigSheets for Spreadsheet-like Analytics by IBM
Hadoop Fundamentals by IBM
Big Data Fundamentals
Understanding Europe: Why It Matters and What It Can Offer You
Best OCU Luxottica MBA 2014
Coursera Competitive Strategy 2014
Making Sense of Data
Ogilvy & Mather's Workshop, India.
Coursera Global Sports Business 2013
Diploma Curso Creatividad UNAM (México)
Design Thinking Stanford University Diploma
Coffee Industry Analysis
Business model
Construcción de marca
Ideas en la era del Capital Intelectual

Recently uploaded (20)

PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Database Infoormation System (DBIS).pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
Introduction to machine learning and Linear Models
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
1_Introduction to advance data techniques.pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPT
Quality review (1)_presentation of this 21
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Database Infoormation System (DBIS).pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Miokarditis (Inflamasi pada Otot Jantung)
Qualitative Qantitative and Mixed Methods.pptx
Introduction to machine learning and Linear Models
IB Computer Science - Internal Assessment.pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Introduction to Knowledge Engineering Part 1
Data_Analytics_and_PowerBI_Presentation.pptx
IBA_Chapter_11_Slides_Final_Accessible.pptx
1_Introduction to advance data techniques.pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Quality review (1)_presentation of this 21
Introduction-to-Cloud-ComputingFinal.pptx

Data Exploration & BI

  • 1. Data Exploration & BI Cristian Guajardo Garcia
  • 2. CristiánGuajardo-García www.5561.cl MBA Politecnico di Milano & IIM Lucknow (India) B- Schools. Has worked in Chile & Italy with companies such as Universidad Andrés Bello, ProChile, IBM, Luxottica.
  • 3. Agenda 1. Why data matters | BI 2. Data gathering: Scraping with Import.io 3. Data cleaning: Spreadsheets 4. Data visualization: Tableau 5. Insights for Marketing 6. Business Case: Luxottica 7. Q&A
  • 4. 1. Why data matters | BI 2. Data gathering: Scraping with Import.io 3. Data cleaning: Spreadsheets 4. Data visualization: Tableau 5. Insights for Marketing 6. Business Case: Luxottica 7. Q&A
  • 5. Why Data matters | BI Today we have several sources generating tons of data per second. Businesses need to anticipate the consumer in order to remain competitive. SQL = Sequence or structured NOSQL = Unstructured data. The goal of BI is to make the right information available to the right people in the right time. Big Data is nothing else but gathering tons of structured and unstructured data, filtering it, cleaning it, visualizing it and last but not least, getting insights from it.
  • 7. How and where do I start? By Exploring the Data I don’t actually know where this will take me. I just collect data from several sources and start to explore it The 2 most common scenarios to start working with dat By Answering a Question I know I have one question to ask. This will lead my research and let me get rid of data that is useless for this stage.
  • 8. The Steps 1. Web Scraping (gather data) 2. Clean 3. Visualize and Analyze
  • 9. 1. Why data matters | BI 2. Data gathering: Scraping with Import.io 3. Data cleaning: Spreadsheets 4. Data visualization: Tableau 5. Insights for Marketing 6. Business Case: Luxottica 7. Q&A
  • 10. Data Gathering: Scraping with Import.io
  • 11. What is Scraping Scraping is a technique used to extract data from one place to another one, which is usually, a table. Tabula = Extracts data from PDF OCR = Extracts data from images Import.io = Extracts data from the web Extractor Crawlers Connectors Scraping is a very basic -yet useful- artificial intelligence technique.
  • 12. What is it? ● Machine reading the web ● Real time crawling through API ● Map Data of website ● Point & Click UI ● Turn data to structured data ● Tailor made crawlers ● Cloud scaling ● Wide integration options From a minimum input get a maximized output.
  • 13. How does it work Answer Question: What is the average € of a Nike sneaker on eBay Italy? 1. Open Import.io 2. Create a new Connector 3. Go to ebay.it 4. Click “I’m there” button 5. Click the red button which will record our click trail (now Import. io will start recording your clicks) 6. Click stop button 7. Now you tell import.io what matters to you and what is it (image, text, link etc). Pieces of information
  • 14. Now you have the data you needed You will create a bot that basically gets pieces of information that will be stored in a table. Once you have trained the bot to crawl the whole results, you can clean columns that you might not use. Now is time to manipulate the data and get info like average price, most common products and so on.
  • 15. 1. Why data matters | BI 2. Data gathering: Scraping with Import.io 3. Data cleaning: Spreadsheets 4. Data visualization: Tableau 5. Insights for Marketing 6. Business Case: Luxottica 7. Q&A
  • 17. Store and clean Once you have gathered the data, you might want to hide or erase columns. Fill the n/a spaces or do some pivot table maneuver. Whatever the case, Spreadsheet is a great way to go. Pivot Table: summarize big info HLookup and Vlookup: target specific info store in columns and rows.
  • 18. 1. Why data matters | BI 2. Data gathering: Scraping with Import.io 3. Data cleaning: Spreadsheets 4. Data visualization: Tableau 5. Insights for Marketing 6. Business Case: Luxottica 7. Q&A
  • 20. What is it for? Tableau is the ultimate desktop and cloud solution for visualizing data coming from several sources. Remember: privacy is an illusion It works perfectly merging info from several sources: Survey data, Social media, SEM and Analytics visualized in one dashboard. Perfect for reporting and meeting the needs of several clients.
  • 21. Why is it useful? 1. Tailor made dashboards 2. Several layers (and sources) of information 3. Set clear goals and KPI’s 4. Easy to export 5. Works for several industries and roles
  • 22. Visualize and analyze data Example of Tableau
  • 23. 1. Why data matters | BI 2. Data gathering: Scraping with Import.io 3. Data cleaning: Excel 4. Data visualization: Tableau 5. Insights for Marketing 6. Business Case: Luxottica 7. Q&A
  • 24. How we can harness the power of the web When we start working with data we stop “believing” and start thinking. All the data available can help us to create consumer profiles, specific interests, potential issues with our product or even new ways to connect with them. 1. Forecast (where the puck is going) 2. The Rise of the Robots (automation) 3. Cross selling and tailor made dashboards per client 4. Insights like you’ve never seen before
  • 26. The scenario Untapped customer intelligence Luxottica needed to analyze historical data pertaining to more than 100 million customers to increase marketing effectiveness. © Copyright IBM Corporation 2013 The Impact Centralized analytics The company deployed advanced analytics technology from IBM to create a 360-degree view of customers. Actionable insights By identifying the highest-value customers and creating individualized marketing campaigns, Luxottica anticipates a 10 percent boost in marketing effectiveness. 1. Anticipates a 10 % improvement in marketing effectiveness 2. Identifies the highest-value customers out of nearly 100 million 3. Targets individual customers based on unique preferences and histories
  • 27. Recommendations Adds on for Google Sheets ● Merge Sheets ● Data Everywhere ● Mapping Sheets ● Find Fuzzy Matches ● DukeDeploy ● BlockSpring ● Text Analysis ● Translate My Sheet ● AppSheet ● BigML
  • 28. Q&A