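Chengjun Wang
@ Computational Communication Experimental Research Center
Data Storytelling and Visualization Applications Training Camp
An Overview of Data Storytelling: Starting from Data Visualization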
Outline
1. Concepts
2. Functions
3. Process
4. Theory
CONCEPTS
Defining Visualization
• "Using images, diagrams, and animations to communicate" (Wikipedia)
  – Images: illustrations; photographs, especially modified photos
  – Diagrams: structural diagrams, blueprints, plots & charts
  – Animations: based on simulation or other specifications
  – Includes, but is not limited to, statistical graphics
• Visualization (often abbreviated "Vis"; cf. IEEE InfoVis)
  – Scientific visualization: transformation and representation of data for exploration
  – Data visualization: schematic form
    e.g., relational database form (tuples of attribute values)
    "Data vis" is often synonymous with "statistical vis"
  – Information visualization: a spectrum from "raw data" to "info" to "knowledge"
    Premise: info is more structured, organized, and abstract than data
  – Emphasis on computational tools
  – Working with (especially analyzing) large data sets
A Periodic Table of Visualization Methods
http://www.visual-literacy.org/periodic_table/periodic_table.html
Data Visualization
• Data visualization (DataViz) is an umbrella term, usually covering both information and scientific visualization.
• It converts data into a visual representation (such as charts, graphs, maps, and sometimes even just tables).
• Static vs. interactive vs. dynamic
Source: Angela Zoss, http://guides.library.duke.edu/datavis/
Yu Ji Tu (Map of the Tracks of Yu)
Earliest grid map (Song Dynasty, 960–1279 CE)
Flow Map
Minard, 1865 – French Wine Exports
FUNCTIONS
• Graphical excellence: complex ideas communicated with
  – Clarity
  – Precision
  – Efficiency
E. R. Tufte (2001). The Visual Display of Quantitative Information. Yale University. http://bit.ly/16Se1
Excellent Visualization
Clear Communication
Principle: Questions to keep in mind
Apprehension: Does the graph maximize apprehension of the relations among variables?
Clarity: Are the most important elements or relations visually most prominent?
Consistency: Are the elements, symbol shapes, and colors consistent with their use in previous graphs?
Efficiency: Are the elements of the graph economically used? Is the graph easy to interpret?
Necessity: Is the graph a more useful way to represent the data than alternatives (table, text)? Are all the graph elements necessary to convey the relations?
Truthfulness: Are the graph elements accurately positioned and scaled?
D. A. Burn (1993), "Designing Effective Statistical Graphs". In C. R. Rao, ed., Handbook of Statistics, vol. 9, Chapter 22.
Principles of Graphical Excellence
• What should a good visualization do?
  – Show the data
  – Induce the viewer to think about the data
  – Avoid distorting what the data have to say
  – Present many numbers in a small space
  – Make large data sets coherent
  – Encourage the eye to compare different pieces of data
  – Reveal the data at several levels of detail, from overview to fine structure
  – Serve a clear purpose: description, exploration, tabulation, or decoration
  – Be closely integrated with the statistical and verbal descriptions of a data set
(Tufte 2001/1983)
The 1854 Broad Street Cholera Outbreak in London
Reading the Visualization
http://www.selkirkgis.com/blog/tag/program-collaboration/
The Retreat of Napoleon's Army in 1812
An artistic depiction of Napoleon's retreat from Moscow, by Adolph Northen
How Would a Journalist Report the War?
Geographic Space, Army Size, and Temperature
Reading the Visualization
Charles Joseph Minard's famous graph showing the decreasing size of the Grande Armée as it marches to Moscow (brown line, from left to right) and back (black line, from right to left), with the size of the army equal to the width of the line. Temperature is plotted on the lower graph for the return journey (multiply Réaumur temperatures by 1¼ to get Celsius, e.g. −30 °R = −37.5 °C).
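The conversion quoted in the caption can be checked directly. This is a minimal sketch that assumes only the 1¼ factor stated above; the function name `reaumur_to_celsius` is illustrative and does not come from the slides.

```python
def reaumur_to_celsius(deg_r: float) -> float:
    """Convert degrees Réaumur to degrees Celsius (80 °R = 100 °C)."""
    return deg_r * 1.25


# Print a small conversion table for temperatures on the Réaumur scale.
for deg_r in (0, -10, -20, -30):
    print(f"{deg_r} °R = {reaumur_to_celsius(deg_r)} °C")
```

The last row reproduces the caption's own example of −30 °R.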
How Can Statistical Results Be Presented Better?
The Beauty of Visualization
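Data Journalism & Digital Storytelling
• Data journalism
  – The Data Journalism Handbook
• Why should journalists use data?
• Visualization as an important tool of data journalism
• Telling stories with visualization
• From visualization to narrative
  – Question + Visual Data + Context = Story (Shapiro, 2010, p. 16)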
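The Business Case for Data Journalism
Caixin Traffic Data
Shuzishuo (数字说, "Data Talks") channel, October 2013 to May 2014
Published 30 interactive data journalism pieces
More than 300 infographics
More than 8.7 million visits in total
The most-visited single piece drew nearly 1 million visits.
On the day of the Qingdao explosion, total site traffic reached 10 million visits, a record high
Zhou Yongkang coverage: more than 4 million visits across Caixin and NetEase
Related Weibo posts were reposted 50,000 times and drew 40,000 comments
Weibo views: 20 million
http://djchina.org/2014/04/06/favorite_viz_2013/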
Data Journalism in Practice
http://www.informationisbeautiful.net/2010/peak-break-up-times-on-facebook/
PROCESS
The Seven Stages of Data Visualization
• Acquire, parse (clean), filter, mine, represent, refine, interact
(Fry, 2008)
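The seven stages can be sketched as a chain of small functions. This is a toy illustration and not Fry's own code: the data, the function names, and the text-based bar chart are all assumptions made for the example.

```python
import csv
import io

# Hypothetical raw data; city "B" has a missing value on purpose.
RAW = "city,visits\nA,120\nB,\nC,45\nD,300\n"


def acquire() -> str:
    """Acquire: obtain the raw data (here, an in-memory string)."""
    return RAW


def parse(text):
    """Parse: give the raw text structure (a list of dict rows)."""
    return list(csv.DictReader(io.StringIO(text)))


def filter_rows(rows):
    """Filter: drop rows that have no usable value."""
    return [r for r in rows if r["visits"].strip()]


def mine(rows):
    """Mine: convert values to numbers and compute the maximum for scaling."""
    pairs = [(r["city"], int(r["visits"])) for r in rows]
    return {"rows": pairs, "max": max(v for _, v in pairs)}


def represent(stats, width=20):
    """Represent: map each value to a bar length (a basic visual encoding)."""
    return [(name, round(width * v / stats["max"]), v) for name, v in stats["rows"]]


def refine(marks):
    """Refine: sort the bars and add value labels for readability."""
    ordered = sorted(marks, key=lambda m: m[2], reverse=True)
    return [f"{name} | {'#' * length} {value}" for name, length, value in ordered]


def interact(width):
    """Interact: let the reader control a parameter (here, the chart width)."""
    rows = filter_rows(parse(acquire()))
    return refine(represent(mine(rows), width))


chart = interact(width=20)
print("\n".join(chart))
```

Calling `interact` with a different `width` re-runs the whole chain, which mirrors how a change made at the interaction stage feeds back into the earlier stages.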
Acquiring, Cleaning, and Filtering Data
Anscombe, F. J. (1973). Graphs in Statistical Analysis. The American Statistician, Vol. 27, No. 1, pp. 17-21.
Representing Relationships in Data
Anscombe, F. J. (1973). Graphs in Statistical Analysis. The American Statistician, Vol. 27, No. 1, pp. 17-21.
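Anscombe's paper is usually remembered for its four small data sets, which share nearly identical summary statistics yet look completely different when plotted. The sketch below recomputes those statistics with the Python standard library. The values are the quartet as commonly reproduced from the 1973 paper, and the helper `pearson` is our own name, not something from the slides.

```python
from math import sqrt
from statistics import mean

X123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
QUARTET = {
    "I": (X123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (X123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (X123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
           [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Summary statistics per data set: (mean of x, mean of y, correlation).
summary = {
    name: (mean(xs), round(mean(ys), 2), round(pearson(xs, ys), 3))
    for name, (xs, ys) in QUARTET.items()
}

for name, stats in summary.items():
    print(name, stats)
```

The printed rows are almost indistinguishable, which is the point: the differences between the four data sets only become visible once they are drawn as scatterplots.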
Visualization Goals
See relationships among data points: Scatterplot, Matrix Chart, Network Diagram
Compare a set of values: Bar Chart, Block Histogram, Bubble Chart
Track rises and falls over time: Line Graph, Stack Graph, Stack Graph for Categories
See the parts of a whole: Pie Chart, Treemap, Treemap for Comparisons
Analyze a text: Word Tree, Tag Cloud, Phrase Net
See the world: Map
http://www.manyeyes.com/software/analytics/manyeyes/page/Visualization_Options.html
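The goal-to-chart grouping above works as a simple lookup table. The sketch below encodes it as a dictionary; the short keys (`"relationships"`, `"time"`, and so on) and the function name `suggest_charts` are our own labels for illustration, while the chart names are the ones listed above.

```python
# Chart options grouped by analytic goal, as listed on the slide.
CHART_OPTIONS = {
    "relationships": ["Scatterplot", "Matrix Chart", "Network Diagram"],
    "comparison": ["Bar Chart", "Block Histogram", "Bubble Chart"],
    "time": ["Line Graph", "Stack Graph", "Stack Graph for Categories"],
    "parts": ["Pie Chart", "Treemap", "Treemap for Comparisons"],
    "text": ["Word Tree", "Tag Cloud", "Phrase Net"],
    "geography": ["Map"],
}


def suggest_charts(goal: str) -> list:
    """Return candidate chart types for a goal key, or an empty list if unknown."""
    return CHART_OPTIONS.get(goal.strip().lower(), [])


print(suggest_charts("time"))
print(suggest_charts("Geography"))
```

A lookup like this only narrows the candidates; the final choice still depends on the data types and the relations among the variables.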
From Data to Visualization
1. Data types: What data types are present in the data source?
2. Data relationships: How are the variables likely to relate?
3. Visualization type: What visualization type seems to be the best fit for the goal?
Visualization Basics
1. Types of data
   1) Nominal
   2) Ordinal
   3) Scale
2. Forms of structure
   1) Census
   2) Financial
   3) Social network
   4) Web data
Visualization Basics
1. Position
2. Shape
3. Size
4. Brightness
5. Color
6. Orientation
7. Texture
8. Motion
Basic Charts
• Single-variable visualization
  – Histograms
  – Pie charts
  – Time series
• Two continuous variables
  – Scatterplots
• Two variables, one categorical
  – Boxplots
  – Bar charts
• Maps
• Social networks
• Interactive and dynamic graphs
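THEORY
Visualization as Visual Communication
• Viewer
• Text
• Interaction
• Frame
Framing Theory and Visual Communication
• Society as seen by the media
  – Cultivation theory
  – The world in American media
• Framing theory
  – The world in visualization graphics
http://www.ted.com/talks/alisa_miller_shares_the_news_about_the_news#t-17151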
Data-Driven
• Data visualization is primarily data-driven: DataViz differs from general graphic design in that it is of the data, by the data, and for the data.
  – By the data: guided primarily by data results rather than aesthetic considerations
  – For the data: to tell accurate, informative, and understandable quantitative stories
  – Of the data: an integrated phase of discovery rather than a post-analysis phase to decorate the findings
Graphic Integrity
• Consistency in labeling and baselines
• Consistency in time (independent axis)
• Dangers of partial annual data
• Need for data normalization
• Do not ignore the whole: context ("Compared to what?")
• Do not treat continuous variables as ordinal (the "Pravda School of Ordinal Graphics")
Tufte's Six Principles
1. Make representation of numbers proportional to quantities
   – Ratio of size to numerical value should be close to 1
   – As physically measured on the surface of the graphic
2. Use clear, detailed, thorough labeling
   – Don't introduce or propagate graphical distortion or ambiguity
   – Write out explanations of the data on the graphic itself
   – Label important events in the data
3. Show data variation, not design variation
4. Use standardized (e.g., inflation-adjusted) units, not nominal
5. Depict N data dimensions with N variable dimensions
   – Don't use more than N information-carrying dimensions for N-D data
   – When graphing data in N-D, use the N-D ratio (see principle 1)
6. Quote data in full context (don't quote out of context)
See also How to Lie With Statistics (Huff, 1984): http://bit.ly/3wAgS0
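Lie Factor
• The lie factor is the ratio of the size of the effect shown in the graphic to the size of the effect in the data, i.e., the degree to which the graphic distorts the change in the data.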
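The definition translates into a short calculation. Following Tufte, each "effect" is measured as a relative change, (second − first) / first. The numbers below are hypothetical and chosen only to make the arithmetic easy to follow; the function names are ours.

```python
def relative_change(first: float, second: float) -> float:
    """Size of an effect, measured as relative change from first to second."""
    return (second - first) / first


def lie_factor(graphic_first, graphic_second, data_first, data_second) -> float:
    """Ratio of the effect shown in the graphic to the effect in the data."""
    shown = relative_change(graphic_first, graphic_second)
    actual = relative_change(data_first, data_second)
    return shown / actual


# Hypothetical case: the data rise from 100 to 150 (+50%),
# but the bars are drawn 2 cm and 6 cm tall (+200%).
factor = lie_factor(graphic_first=2, graphic_second=6, data_first=100, data_second=150)
print(factor)
```

A lie factor of 1 means the graphic is faithful to the data; here the drawn change is four times the real one.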
http://news.qq.com/newspedia/baogao.htm
Where the Axis Starts
Source: http://data.heapanalytics.com/how-to-lie-with-data-visualization/
http://static.guim.co.uk/sys-images/Guardian/Pix/pictures/2013/8/1/1375343461201/misleading.jpg
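The effect of a truncated axis can be quantified by comparing drawn bar heights with the underlying values. This is a sketch with hypothetical numbers; `apparent_ratio` is an illustrative helper, not a function from any charting library.

```python
def apparent_ratio(a: float, b: float, baseline: float = 0.0) -> float:
    """Ratio of the drawn bar heights for values a and b on an axis starting at baseline."""
    return (b - baseline) / (a - baseline)


# Hypothetical values: 50 and 55, a real difference of 10%.
honest = apparent_ratio(50, 55, baseline=0)      # axis starts at zero
truncated = apparent_ratio(50, 55, baseline=45)  # axis starts at 45

print(f"zero baseline: second bar looks {honest:.2f}x the first")
print(f"baseline 45:   second bar looks {truncated:.2f}x the first")
```

With the axis starting at 45, a 10% difference is drawn as a bar twice as tall, which is why bar charts are normally expected to start at zero.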
Cumulative Growth Curves
Source: http://data.heapanalytics.com/how-to-lie-with-data-visualization/
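A cumulative curve can never go down as long as each period's value is non-negative, so it can hide a decline in the underlying series. The sketch below shows this with hypothetical quarterly sales figures; the variable and function names are illustrative.

```python
from itertools import accumulate

# Hypothetical sales per quarter: growth at first, then a steady decline.
sales = [10, 12, 9, 7, 5]

# Running total of the same series.
cumulative = list(accumulate(sales))


def is_non_decreasing(values) -> bool:
    """True if no value is smaller than the one before it."""
    return all(a <= b for a, b in zip(values, values[1:]))


print("per quarter:", sales, "always rising?", is_non_decreasing(sales))
print("cumulative: ", cumulative, "always rising?", is_non_decreasing(cumulative))
```

Plotting the per-period series next to the cumulative one makes the decline visible again.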
Apple's Sales
Source: http://qz.com/122921/the-chart-tim-cook-doesnt-want-you-to-see/
Avoid Pie Charts Where Possible
http://flowingdata.com/2009/11/26/fox-news-makes-the-best-pie-chart-ever/
Chartjunk
• Edward Tufte (1942-), statistician
• Data-ink ratio: the share of a graphic's ink that is spent on the data
• Data density: how much data is shown within a given amount of space
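Both measures are simple ratios as Tufte defines them: data-ink ratio is data-ink divided by the total ink used in the graphic, and data density is the number of entries in the data matrix divided by the area of the graphic. The sketch below uses hypothetical measurements (pixel counts and square centimetres); the function names are ours.

```python
def data_ink_ratio(data_ink: float, total_ink: float) -> float:
    """Share of the graphic's ink that presents data (between 0 and 1)."""
    if total_ink <= 0:
        raise ValueError("total_ink must be positive")
    return data_ink / total_ink


def data_density(n_entries: int, area: float) -> float:
    """Number of data entries shown per unit area of the graphic."""
    if area <= 0:
        raise ValueError("area must be positive")
    return n_entries / area


# Hypothetical chart: 1,200 of 4,800 inked pixels encode data,
# and 60 numbers are shown on 24 square centimetres.
ratio = data_ink_ratio(1200, 4800)
density = data_density(60, 24)
print(f"data-ink ratio: {ratio:.2f}")
print(f"data density:   {density:.1f} entries per cm^2")
```

Removing gridlines, borders, and decoration lowers `total_ink` without touching `data_ink`, which is how the ratio is pushed toward 1.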
Graphical Excellence
• Gives the viewer
  – The greatest number of ideas (data)
  – In the shortest time ("ink ratio" is really a rate per time, i.e., cognitive effort)
  – With the least ink (filled space, pixels, primitives, rendered objects)
  – In the smallest space (total size of graphic, page, viewport, window)
Data-Ink Ratio
"Duck" here refers to self-promoting decorative graphics
Data Storytelling Is Both a Science and an Art
Finding the right way to view your data is as much an art as a science.
How Does a Visualization Capture Its Readers?
Borkin, M. A., Vo, A. A., Bylinskii, Z., Isola, P., Sunkavalli, S., Oliva, A., & Pfister, H. (2013). What Makes a Visualization Memorable? IEEE Transactions on Visualization and Computer Graphics (Proceedings of InfoVis 2013).
Intuitive vs. Abstract?
• Is chartjunk useful?
  – It's easy to spot a "bad" data visualization: one packed with too much text, excessive ornamentation, gaudy colors, and clip art.
  – Design guru Edward Tufte derided such decorations as redundant at best, useless at worst, labeling them "chart junk."
  – Yet a debate still rages among visualization experts: Can these reviled extra elements serve a purpose?
• Intuitive results (e.g., attributes like color and the inclusion of a human-recognizable object enhance memorability)
• Less intuitive results (e.g., common graphs are less memorable than unique visualization types)
Adding Creativity
Skills Required for Data Journalism
– Traditional reporting
– Math and statistics
– Programming for data analysis
– Web programming
– Graphic design
– Interaction design
– Writing
Readings
1. Tufte, E. R. (2001). The Visual Display of Quantitative Information (2nd ed.). Cheshire, CT: Graphics Press.
2. Cairo, A. (2013). The Functional Art: An Introduction to Information Graphics and Visualization. Berkeley, CA: New Riders.
3. Fry, B. (2008). Visualizing Data. Sebastopol, CA: O'Reilly Media, Inc.
THANK YOU

Data Visualization: Concepts, Cases, and Methods. Chengjun Wang, 20140104