SlideShare a Scribd company logo
VisualizationFor analysis and communication
Chen “Gwen” Shapira
Reveal Structure
in Data
Visualization
Visualization
Verify Your Findings
• Prior knowledge
• Statistical tools
• Graphs are only the starting point
Not all visuals are
created equal
Visualization
Visualization
Numerical quantities focus on
expected values – graphical
summaries on unexpected values
– John Tukey
How long does it take
to run full export on
ITGDB10?
5 Hours and 45
minutes. On average.
Visualization
Most of the time it
take 3 to 6.5 hours.
But it can take as long
as 20 hours!
Visualization
5 hours on average,
when the storage works.
Visualization
I got rid of the outliers.
Am I normal now?
Visualization
Visualization
What about the rest of
the servers?
Visualization
Visualization
Does our maintenance
have impact on response
times?
Visualization
Visualization
Visualization
Visualization
0
2
4
6
8
10
12
14 RowLabels
24-SEP-0922.00.00
25-SEP-0902.00.00
25-SEP-0906.00.00
25-SEP-0910.00.00
25-SEP-0914.00.00
25-SEP-0918.00.00
25-SEP-0922.00.00
26-SEP-0902.00.00
26-SEP-0906.00.00
26-SEP-0910.00.00
26-SEP-0914.00.00
26-SEP-0918.00.00
26-SEP-0922.00.00
27-SEP-0902.00.00
27-SEP-0906.00.00
27-SEP-0910.00.00
27-SEP-0914.00.00
27-SEP-0918.00.00
27-SEP-0922.00.00
28-SEP-0902.00.00
28-SEP-0906.00.00
28-SEP-0910.00.00
28-SEP-0914.00.00
28-SEP-0918.00.00
28-SEP-0922.00.00
29-SEP-0902.00.00
29-SEP-0906.00.00
29-SEP-0910.00.00
29-SEP-0914.00.00
29-SEP-0918.00.00
29-SEP-0922.00.00
30-SEP-0902.00.00
30-SEP-0906.00.00
30-SEP-0910.00.00
30-SEP-0914.00.00
30-SEP-0918.00.00
30-SEP-0922.00.00
01-OCT-0902.00.00
01-OCT-0906.00.00
01-OCT-0910.00.00
01-OCT-0914.00.00
01-OCT-0918.00.00
Series5 Series7 Series9
Visualization
Visualization
Visualization
Communicating
Information
Visualization
0
10
20
30
40
50
60
70
80
90
100
AxisTitle
Axis Title
oracle
0
20
40
60
80
100
120
oracle
India
Pakistan
Singapore
Kenya
Sri Lanka
Nigeria
Hong Kong
South Korea
Japan
El Salvador
Jordan
China
United Arab Emirates
Taiwan
United States
Guatemala
Costa Rica
Ecuador
Russian Federation
South Africa
0 20 40 60 80 100 120
Russian Federation
Costa Rica
Ecuador
United Arab Emirates
Taiwan
United States
Guatemala
China
Jordan
Japan
El Salvador
South Korea
Hong Kong
Nigeria
Kenya
Sri Lanka
Singapore
Pakistan
India
Oracle Google Searches - By Region, Normalized
0 20 40 60 80 100 120
Russian Federation
Costa Rica
Ecuador
United Arab Emirates
Taiwan
United States
Guatemala
China
Jordan
Japan
El Salvador
South Korea
Hong Kong
Nigeria
Kenya
Sri Lanka
Singapore
Pakistan
India
Oracle Google Searches - By Region, Normalized
Visualization

More Related Content

PPTX
Improving Mental Focus
PPT
Spatiotemporal Analysis of Rambling Activities: Approach to Inferring Visitor...
PPTX
2014-03-25 De-Mystifying the IT Assessment
PPTX
Compiling Analysis Results
PDF
Integrated Data Warehouse with Hadoop and Oracle Database
PDF
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
PPTX
When will it be done? (Lean Agile Forecasting)
PDF
Agile Data Science
Improving Mental Focus
Spatiotemporal Analysis of Rambling Activities: Approach to Inferring Visitor...
2014-03-25 De-Mystifying the IT Assessment
Compiling Analysis Results
Integrated Data Warehouse with Hadoop and Oracle Database
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
When will it be done? (Lean Agile Forecasting)
Agile Data Science

Similar to Visualization (20)

PPT
Operation Analysis - Process Mapping
PPTX
"Making Scrum "More" Effective : What can we learn from Other Body of Knowled...
PPTX
Cycle times and the Evolution From Story Points
PPTX
Work Unit Analysis Tool
PPTX
The Business Case for DevOps - Justifying the Journey
PPTX
Lean Back Offices Project
PDF
Metrics in Security Operations
PDF
Kanban - A Crash Course
PPTX
Azure stream analytics by Nico Jacobs
PPTX
Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...
PPTX
Problem solving and design
PPTX
Real World Performance - OLTP
PPTX
1_1_First Class_F2023.pptx statistics course
PPTX
The agile forecast joe tristano southern fried agile 2018_ final
PDF
Value streammapping cascadiait2014-mceniry
PDF
Agile Dev West 2018_Measuring Flow: Metrics that Matter
PDF
Everybody Lies
PDF
Kanban seminar
PPTX
Agile Estimation @ Lean Agile Manchester: Make Estimates Small!
PPTX
#Measurecamp : 18 Simple Ways to F*** up Your AB Testing
Operation Analysis - Process Mapping
"Making Scrum "More" Effective : What can we learn from Other Body of Knowled...
Cycle times and the Evolution From Story Points
Work Unit Analysis Tool
The Business Case for DevOps - Justifying the Journey
Lean Back Offices Project
Metrics in Security Operations
Kanban - A Crash Course
Azure stream analytics by Nico Jacobs
Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...
Problem solving and design
Real World Performance - OLTP
1_1_First Class_F2023.pptx statistics course
The agile forecast joe tristano southern fried agile 2018_ final
Value streammapping cascadiait2014-mceniry
Agile Dev West 2018_Measuring Flow: Metrics that Matter
Everybody Lies
Kanban seminar
Agile Estimation @ Lean Agile Manchester: Make Estimates Small!
#Measurecamp : 18 Simple Ways to F*** up Your AB Testing
Ad

More from Gwen (Chen) Shapira (20)

PPTX
Velocity 2019 - Kafka Operations Deep Dive
PPTX
Lies Enterprise Architects Tell - Data Day Texas 2018 Keynote
PPTX
Gluecon - Kafka and the service mesh
PPTX
Multi-Cluster and Failover for Apache Kafka - Kafka Summit SF 17
PPTX
Papers we love realtime at facebook
PPTX
Kafka reliability velocity 17
PPTX
Multi-Datacenter Kafka - Strata San Jose 2017
PPTX
Streaming Data Integration - For Women in Big Data Meetup
PPTX
Kafka at scale facebook israel
PPTX
Kafka connect-london-meetup-2016
PPTX
Fraud Detection for Israel BigThings Meetup
PPT
Kafka Reliability - When it absolutely, positively has to be there
PPTX
Nyc kafka meetup 2015 - when bad things happen to good kafka clusters
PPTX
Fraud Detection Architecture
PPTX
Have your cake and eat it too
PPTX
Kafka for DBAs
PPTX
Data Architectures for Robust Decision Making
PPTX
Kafka and Hadoop at LinkedIn Meetup
PPTX
Kafka & Hadoop - for NYC Kafka Meetup
PPTX
Twitter with hadoop for oow
Velocity 2019 - Kafka Operations Deep Dive
Lies Enterprise Architects Tell - Data Day Texas 2018 Keynote
Gluecon - Kafka and the service mesh
Multi-Cluster and Failover for Apache Kafka - Kafka Summit SF 17
Papers we love realtime at facebook
Kafka reliability velocity 17
Multi-Datacenter Kafka - Strata San Jose 2017
Streaming Data Integration - For Women in Big Data Meetup
Kafka at scale facebook israel
Kafka connect-london-meetup-2016
Fraud Detection for Israel BigThings Meetup
Kafka Reliability - When it absolutely, positively has to be there
Nyc kafka meetup 2015 - when bad things happen to good kafka clusters
Fraud Detection Architecture
Have your cake and eat it too
Kafka for DBAs
Data Architectures for Robust Decision Making
Kafka and Hadoop at LinkedIn Meetup
Kafka & Hadoop - for NYC Kafka Meetup
Twitter with hadoop for oow
Ad

Recently uploaded (20)

PDF
Computing-Curriculum for Schools in Ghana
PPTX
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
PDF
Practical Manual AGRO-233 Principles and Practices of Natural Farming
PDF
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
PDF
LNK 2025 (2).pdf MWEHEHEHEHEHEHEHEHEHEHE
PDF
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
PPTX
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PDF
1_English_Language_Set_2.pdf probationary
PPTX
History, Philosophy and sociology of education (1).pptx
PPTX
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
PPTX
Unit 4 Skeletal System.ppt.pptxopresentatiom
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Indian roads congress 037 - 2012 Flexible pavement
PDF
Classroom Observation Tools for Teachers
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
advance database management system book.pdf
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
SOIL: Factor, Horizon, Process, Classification, Degradation, Conservation
Computing-Curriculum for Schools in Ghana
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
Practical Manual AGRO-233 Principles and Practices of Natural Farming
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
LNK 2025 (2).pdf MWEHEHEHEHEHEHEHEHEHEHE
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
202450812 BayCHI UCSC-SV 20250812 v17.pptx
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
1_English_Language_Set_2.pdf probationary
History, Philosophy and sociology of education (1).pptx
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
Unit 4 Skeletal System.ppt.pptxopresentatiom
Final Presentation General Medicine 03-08-2024.pptx
Indian roads congress 037 - 2012 Flexible pavement
Classroom Observation Tools for Teachers
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
advance database management system book.pdf
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
SOIL: Factor, Horizon, Process, Classification, Degradation, Conservation

Visualization

Editor's Notes

  • #2: Visualization - visual display of graphical information. I am going to show how to be more effective in analyzing and communication information using graphical methods. Visualization is sometimes discarded as a cop-out. Newbies and managers use graphs because they are not manly enough. Real DBAs use numbers and command line! In the excellent book “Lies, Damn Lies and Statistics” there is entire chapter dedicated to graphs and the author says something like: People use graphs because they are afraid of numbers, maybe a trauma from school. This is a bit like saying that people use cars because they are too lazy to walk. Sometimes its true. But it ignores the fact that cars are really more efficient. In the same way, graphs are really a more efficient way to display information. In fact, for reasons I’ll show soon, graphs are even more useful experts than they are for beginners. What I’ll take about: Why using graphics is so efficient New graphical methods Simple design principals
  • #3: Structure = Trends, repetitions and outliers, etc. High bandwidth information channel. Apply pattern matching skills and prior knowledge to analysis of data.
  • #4: We can easily find information in very ambiguous data. Its an evolutionary thing.
  • #6: First line of attack.
  • #8: Quantifiable visual differences – comparative length of parallel lines. 2D location.
  • #9: Differences between color shades and sizes of shapes are difficult to compare and quantify
  • #12: Average describes normal distributions quite well. Give height as an example for why average is a good descriptor for normal distribution.
  • #13: Extremely Skewed distribution! Its not even close to normal. Average does not really describe how slow export can get.
  • #14: That looks like a good description. But wait!
  • #15: Sometimes export doesn’t run at all. I can explain the outliers (both low and high) - those 5 days one Netapp head was down and we didn’t run exports, and when we did performance was awful. Since I can explain the outliers – I know I can remove them.
  • #19: histogram. Looks kind of normal, but hard to tell.
  • #20: qqnorm. Yep, looks normal with some noise. You don’t see a consistent skew.
  • #22: Multiple Boxplots
  • #23: Scatter plot
  • #33: Less is more. Be clear and to the point. Do not distort or mislead. Think of your data as a fashion model – you look at her and photograph her from all positions and angles, but only the best photos appear in the magazine – often hiding as much as they reveal!