SlideShare a Scribd company logo
A Linguistic Approach to
Categorical Color Assignment for
Data Visualization
Contents:
 INTRODUCTION
 COLOR NAME AND COGNITION
 COLOR AND LANGUAGE
 MAPPING NAME TO COLORS
 PALETTE GENERATION
 DISCUSSION AND FUTURE WORK
 CONCLUSION
 REFERENCE
A LinguisticApproach to Categorical Color
Assignment for DataVisualization
Introduction
 Here for the data visualization the coloring is done by a well-designed default categorical
palette, with colors that are optimized for legibility and mapped to basic color names.
 There is no semantic relationship between the colors used in the visualization and those
commonly associated with these data.
 The goal of this research is to aid in the semantic mapping of coloring to data, both by
presenting a specific technique and by discussing the challenges and trade-offs
discovered in this work.
A LinguisticApproach to Categorical Color
Assignment for DataVisualization
Color names and cognition
 Focuses on basic color names, studying how these names are represented in different
languages and cultures.
 For English (and many similar European languages), there are 11 basic color names: red, green,
blue, yellow, orange, purple, pink, brown, black, white and gray. Furthermore, there is strong
agreement on the color stimuli, in this case, defined by a set of Munsell color chips that are
exemplars of these names.
A LinguisticApproach to Categorical Color
Assignment for DataVisualization
 There is a fundamental link between language and color cognition is demonstrated by the
Stroop effect. When the color of the letters is in conflict with the color described by the word
(for example, the word ‘blue’ is written in red letters), subjects find it harder to name the color of
the letters.
A LinguisticApproach to Categorical Color
Assignment for DataVisualization
Color and Language
 This suggests that linguistic data sources that consider the semantics of color names might
provide for better reference, selection and retrieval of colors for various tasks, including for
categorical palettes in data visualizations
 use n-gram analysis to determine the relationship between terms and basic colors, they use the
links provided in ConceptNet.
 They do not use images to determine colors, but depend instead on finding color centroids
semantically. In addition, their focus is on returning one ideal color, not a set of color options.
A LinguisticApproach to Categorical Color
Assignment for DataVisualization
Mapping Names to Colors
 A common way to map names to colors is to fit statistical models to human judgments of color-
name associations.
 Typical color naming data .However, is created by asking people to describe a displayed color in
language.
 The data for these models is created by showing people color patches and asked to name them,
sometimes freely, and sometimes with a constrained vocabulary.
 Categorical data typically does not directly describe colors, but are rather concepts with a strong
color association. Therefore, we need a way to map an arbitrary word or phrase to a color.
A LinguisticApproach to Categorical Color
Assignment for DataVisualization
Search Parameters
 For any word or phrase, the Google n-gram analysis provides a colorability score and a list of
basic colors that are strongly associated with that phrase.
 This information is used to determine if a given term is colorable, and if so, the list of basic
colors is used as a query parameter to obtain images with those dominant colors.
A LinguisticApproach to Categorical Color
Assignment for DataVisualization
Query Expansion and Image Retrieval
 The basic color names can be used as input queries to an image search engine, the query words
may be different than the ones used in the metadata describing the semantics of the imagery.
 We use query expansion to augment related terms to each of the queries for improving search
precision and recall.
 Once the query is executed, the result set of images returned are accompanied by a normalized
confidence measure from the Google search engine
A LinguisticApproach to Categorical Color
Assignment for DataVisualization
A LinguisticApproach to Categorical Color
Assignment for DataVisualization
Canonical colors retrieved from Google Images using dominant filtering for ‘taxi’, ‘lizard’
and ‘saffron.’
PALETTE GENERATION
 Effective categorical color palettes encode visually distinct colors to different categorical values
for distinction, in addition to the color being as semantically meaningful to the corresponding
value.
A LinguisticApproach to Categorical Color
Assignment for DataVisualization
Color Palette
Color Palettes A LinguisticApproach to Categorical Color
Assignment for DataVisualization
Color Chart
Color Hex Codes A LinguisticApproach to Categorical Color
Assignment for DataVisualization
Color Hex Codes
Color Hex Values A LinguisticApproach to Categorical Color
Assignment for DataVisualization
Color Hex Codes
Future Work
For a set of data categories, our techniques can provide two useful pieces of information –
 Whether the terms have strong associations to color.
 Second, if there are such color associations, what are the corresponding semantic colors. In
addition, we can construct palettes that contain distinctly different colors, either as defined in
CIELAB space or by mapping to a pre-defined palette. In some cases, the results are adequate as
returned. In others, there needs to be additional refinement.
A LinguisticApproach to Categorical Color
Assignment for DataVisualization
data visualization tools
 Tableau
 D3.js
 Fusion Charts
 Chart.js
 Google Charts
 Data wrapper
 Power BI
A LinguisticApproach to Categorical Color
Assignment for DataVisualization
CONCLUSION
Two ways to leverage natural language techniques to map category names to
semantically appropriate colors.
 The first uses n-gram analysis with respect to basic color names.
 The second semantic context for the data category, to find identity colors from
visual symbols such as logos.
 Both return scores that can be used to determine if a data term has a semantic
coloring
A LinguisticApproach to Categorical Color
Assignment for DataVisualization

More Related Content

PPTX
Language Models for Information Retrieval
PDF
Data Science and Analytics Brown Bag
PPTX
Adversarial and reinforcement learning-based approaches to information retrieval
PDF
Survey of Generative Clustering Models 2008
PPTX
Neural Models for Document Ranking
PDF
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
PDF
Language Models for Information Retrieval
PDF
Data Tactics Analytics Brown Bag (Aug 22, 2013)
Language Models for Information Retrieval
Data Science and Analytics Brown Bag
Adversarial and reinforcement learning-based approaches to information retrieval
Survey of Generative Clustering Models 2008
Neural Models for Document Ranking
Topic Modeling for Information Retrieval and Word Sense Disambiguation tasks
Language Models for Information Retrieval
Data Tactics Analytics Brown Bag (Aug 22, 2013)

Viewers also liked (20)

PPTX
Westby DMT - Week 01 color
PDF
Introduction to Design Thinking
PDF
Data Viz CE 2014 Storytelling
PDF
Color for Data Visualization
PDF
Staying on target with IBM Design Thinking and Lean UX
PPT
Chapter 4
PDF
Data Viz CE 2014 Toolbox
PDF
Data Viz CE 2014 Color
PPTX
Introduction to Design Thinking and finding True Love
PDF
Design thinking - Piktochart presentation for Barcamp Penang 2013
PDF
UX and Design Thinking for Startups
PDF
Design Thinking, Agile, DevOps - fuel the innovation delivery
PDF
Design thinking in an agile world
PDF
Agile design thinking and you... ux australia2011
PPT
Color Theory Book
PPTX
The 8 Hats of Data Visualisation
PDF
Design thinking & scrum v 3.0
PDF
Design thinking and agile development
PDF
Design Thinking & Agile Innovation Workshop
PDF
Fundamental Ways We Use Data Visualizations
Westby DMT - Week 01 color
Introduction to Design Thinking
Data Viz CE 2014 Storytelling
Color for Data Visualization
Staying on target with IBM Design Thinking and Lean UX
Chapter 4
Data Viz CE 2014 Toolbox
Data Viz CE 2014 Color
Introduction to Design Thinking and finding True Love
Design thinking - Piktochart presentation for Barcamp Penang 2013
UX and Design Thinking for Startups
Design Thinking, Agile, DevOps - fuel the innovation delivery
Design thinking in an agile world
Agile design thinking and you... ux australia2011
Color Theory Book
The 8 Hats of Data Visualisation
Design thinking & scrum v 3.0
Design thinking and agile development
Design Thinking & Agile Innovation Workshop
Fundamental Ways We Use Data Visualizations
Ad

Similar to A linguistic approach to categorical color assignment (20)

PDF
05 Scalar Visualization
PDF
Colourful Language
PDF
Choosing Effective Colours for Data Visualization
PDF
design principles for visualization
PDF
Colours and Accessibility (a11y) - WordCamp Europe 2014 Sofia
PPTX
Data Visualization 2
PDF
Subzero Report (1).pdf
PPTX
Data-Driven Color Palettes for Categorical Maps
PPTX
Data Visualization by David Kretch
PDF
12. Map Visualization .pdf
PDF
IDIO2020_B3_DataViz_EdoraFNguyenJ_acc.pdf
PPTX
mapping Interpretation and map design.pptx
PPTX
Using Color to Convey Data in Charts
PDF
A Framework for Harmonic Color Measures
PDF
Information Visualisation: perception and principles
PDF
Powerpoint
PDF
Painting a clearer picture
PPTX
Data/Visualization - Digital Center Cohort - 13_0222
PPTX
image processing
PDF
New Directions In Colour Studies Carole P Biggam Carole A Hough
05 Scalar Visualization
Colourful Language
Choosing Effective Colours for Data Visualization
design principles for visualization
Colours and Accessibility (a11y) - WordCamp Europe 2014 Sofia
Data Visualization 2
Subzero Report (1).pdf
Data-Driven Color Palettes for Categorical Maps
Data Visualization by David Kretch
12. Map Visualization .pdf
IDIO2020_B3_DataViz_EdoraFNguyenJ_acc.pdf
mapping Interpretation and map design.pptx
Using Color to Convey Data in Charts
A Framework for Harmonic Color Measures
Information Visualisation: perception and principles
Powerpoint
Painting a clearer picture
Data/Visualization - Digital Center Cohort - 13_0222
image processing
New Directions In Colour Studies Carole P Biggam Carole A Hough
Ad

Recently uploaded (20)

PDF
Well-logging-methods_new................
PPTX
Lecture Notes Electrical Wiring System Components
PDF
PPT on Performance Review to get promotions
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PPTX
Welding lecture in detail for understanding
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
DOCX
573137875-Attendance-Management-System-original
PPTX
Construction Project Organization Group 2.pptx
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
additive manufacturing of ss316l using mig welding
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPTX
Sustainable Sites - Green Building Construction
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
Well-logging-methods_new................
Lecture Notes Electrical Wiring System Components
PPT on Performance Review to get promotions
CYBER-CRIMES AND SECURITY A guide to understanding
Welding lecture in detail for understanding
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
573137875-Attendance-Management-System-original
Construction Project Organization Group 2.pptx
UNIT 4 Total Quality Management .pptx
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
additive manufacturing of ss316l using mig welding
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
Sustainable Sites - Green Building Construction
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
Mitigating Risks through Effective Management for Enhancing Organizational Pe...

A linguistic approach to categorical color assignment

  • 1. A Linguistic Approach to Categorical Color Assignment for Data Visualization
  • 2. Contents:  INTRODUCTION  COLOR NAME AND COGNITION  COLOR AND LANGUAGE  MAPPING NAME TO COLORS  PALETTE GENERATION  DISCUSSION AND FUTURE WORK  CONCLUSION  REFERENCE A LinguisticApproach to Categorical Color Assignment for DataVisualization
  • 3. Introduction  Here for the data visualization the coloring is done by a well-designed default categorical palette, with colors that are optimized for legibility and mapped to basic color names.  There is no semantic relationship between the colors used in the visualization and those commonly associated with these data.  The goal of this research is to aid in the semantic mapping of coloring to data, both by presenting a specific technique and by discussing the challenges and trade-offs discovered in this work. A LinguisticApproach to Categorical Color Assignment for DataVisualization
  • 4. Color names and cognition  Focuses on basic color names, studying how these names are represented in different languages and cultures.  For English (and many similar European languages), there are 11 basic color names: red, green, blue, yellow, orange, purple, pink, brown, black, white and gray. Furthermore, there is strong agreement on the color stimuli, in this case, defined by a set of Munsell color chips that are exemplars of these names. A LinguisticApproach to Categorical Color Assignment for DataVisualization
  • 5.  There is a fundamental link between language and color cognition is demonstrated by the Stroop effect. When the color of the letters is in conflict with the color described by the word (for example, the word ‘blue’ is written in red letters), subjects find it harder to name the color of the letters. A LinguisticApproach to Categorical Color Assignment for DataVisualization
  • 6. Color and Language  This suggests that linguistic data sources that consider the semantics of color names might provide for better reference, selection and retrieval of colors for various tasks, including for categorical palettes in data visualizations  use n-gram analysis to determine the relationship between terms and basic colors, they use the links provided in ConceptNet.  They do not use images to determine colors, but depend instead on finding color centroids semantically. In addition, their focus is on returning one ideal color, not a set of color options. A LinguisticApproach to Categorical Color Assignment for DataVisualization
  • 7. Mapping Names to Colors  A common way to map names to colors is to fit statistical models to human judgments of color- name associations.  Typical color naming data .However, is created by asking people to describe a displayed color in language.  The data for these models is created by showing people color patches and asked to name them, sometimes freely, and sometimes with a constrained vocabulary.  Categorical data typically does not directly describe colors, but are rather concepts with a strong color association. Therefore, we need a way to map an arbitrary word or phrase to a color. A LinguisticApproach to Categorical Color Assignment for DataVisualization
  • 8. Search Parameters  For any word or phrase, the Google n-gram analysis provides a colorability score and a list of basic colors that are strongly associated with that phrase.  This information is used to determine if a given term is colorable, and if so, the list of basic colors is used as a query parameter to obtain images with those dominant colors. A LinguisticApproach to Categorical Color Assignment for DataVisualization
  • 9. Query Expansion and Image Retrieval  The basic color names can be used as input queries to an image search engine, the query words may be different than the ones used in the metadata describing the semantics of the imagery.  We use query expansion to augment related terms to each of the queries for improving search precision and recall.  Once the query is executed, the result set of images returned are accompanied by a normalized confidence measure from the Google search engine A LinguisticApproach to Categorical Color Assignment for DataVisualization
  • 10. A LinguisticApproach to Categorical Color Assignment for DataVisualization Canonical colors retrieved from Google Images using dominant filtering for ‘taxi’, ‘lizard’ and ‘saffron.’
  • 11. PALETTE GENERATION  Effective categorical color palettes encode visually distinct colors to different categorical values for distinction, in addition to the color being as semantically meaningful to the corresponding value. A LinguisticApproach to Categorical Color Assignment for DataVisualization Color Palette
  • 12. Color Palettes A LinguisticApproach to Categorical Color Assignment for DataVisualization Color Chart
  • 13. Color Hex Codes A LinguisticApproach to Categorical Color Assignment for DataVisualization Color Hex Codes
  • 14. Color Hex Values A LinguisticApproach to Categorical Color Assignment for DataVisualization Color Hex Codes
  • 15. Future Work For a set of data categories, our techniques can provide two useful pieces of information –  Whether the terms have strong associations to color.  Second, if there are such color associations, what are the corresponding semantic colors. In addition, we can construct palettes that contain distinctly different colors, either as defined in CIELAB space or by mapping to a pre-defined palette. In some cases, the results are adequate as returned. In others, there needs to be additional refinement. A LinguisticApproach to Categorical Color Assignment for DataVisualization
  • 16. data visualization tools  Tableau  D3.js  Fusion Charts  Chart.js  Google Charts  Data wrapper  Power BI A LinguisticApproach to Categorical Color Assignment for DataVisualization
  • 17. CONCLUSION Two ways to leverage natural language techniques to map category names to semantically appropriate colors.  The first uses n-gram analysis with respect to basic color names.  The second semantic context for the data category, to find identity colors from visual symbols such as logos.  Both return scores that can be used to determine if a data term has a semantic coloring A LinguisticApproach to Categorical Color Assignment for DataVisualization