SlideShare a Scribd company logo
Taxonomy-Based Glyph Design– with a Case Study on
     Visualizing Workflows of Biological Experiments
      Eamonn Maguire, Philippe Rocca-Serra, Susanna-Assunta Sansone, Jim Davies and Min Chen
                                                                   University of Oxford, UK
The Road Map…




Aid quicker exploration and comparison of experimental workflows for
biologists performing experiments and curators who validate them.
Glyph: A glyph is a small visual object composed of a number
            Getting there                             of visual channels which can be used independently as well as
                                                      constructively to depict attributes of a data record.




e.g. Chernoff Faces                                  e.g. Glyph based on rectangles using color and
Image Sources: http://kspark.kaist.ac.kr/Human       orientation.
%20Engineering.files/Chernoff/Chernoff%20Faces.htm
                                                     Healey, C et al. Perceptually-Based Brush Strokes for
                                                     Nonphotorealistic Visualization 2004
Some of the ~4000 concepts
                                                        labeling
But, we have a problem…                                 nucleic acid extraction
                                                        hybridization
                                                        feature extraction
                                                        bioassay data transformation
                                                        Growth
We have 21,000 studies with > 500,000 individual        cultured cells
                                                        saccharomyces cerevisiae scr101
experiments giving > 60 processes (actions on           pool
materials) and >4000 inputs/outputs to those            image acquisition
                                                        behavioral stimulus
processes.                                              purify
                                                        pcr amplification
                                                        normalization
Creating 1000s of glyphs for each individual concept    lowess group normalization
is simply not scalable.                                 extraction
                                                        Scanning
                                                        feature extraction and analysis
We need a systematic process for glyph creation based   immunoprecipitation
                                                        compound based treatment
on the properties of these concepts.                    transformation protocol
                                                        linear amplification
                                                        fresh frozen tissue
                                                        saccharomyces cerevisiae bqs252.
                                                        exponential growth in ypd.
Solution outline




Create a taxonomy                       Order Visual Channels…   Map taxonomy to visual channels.
A structured hierarchical arrangement   Color > shape > size >   We can create a glyph for items
of concepts.                            orientation > texture.   based on the position in taxonomy.
One or more concepts represented by
leaf nodes                              And provide design       Higher levels in the taxonomy will
                                        guidelines.              command better visual channels.
Creating the taxonomy
Creating the taxonomy…input format




In each scheme, there are sub-classifications (4 in S1).
If a concept can be classified with this classification, it is assigned a 1, otherwise 0.
Creating the taxonomy…general workings




                                 The algorithm runs recursively,
                                 selecting each best scheme S
                                 and attempting to sub-classify
                                 each classification C

                            But how do we select the best
                            scheme?
Metric 1: Coverage
100% coverage yields value of 1
The more concepts a scheme can classify, the better.
Metric 2: Potential Use
Higher occurrence yields value closer to 1
Metric 3: Sub tree balance
Low standard deviation in number of concepts in each classification yields value closer to 1
A balanced tree is desirable and prevents a tree from having excessive height (greater height = need for more visual channels).
Metric 4: Number of Subclasses
Low number of classes yields value close to 1
Schemes with a high number of subclasses are penalized since a high number of subclasses would mean a high number of
levels to map to with the selected visual channels.




                                                                                    Only consider subclasses that are used.
Application to our case study

  We have 8 schemes shown here, focusing mainly on processes.
Application to our case study

We have 21,000 biological studies with > 500,000 individual
experiments giving > 60 processes (actions on materials) and
>4000 inputs/outputs to those processes.


1.   Concepts were extracted
2.   Categories were created by a domain expert.
3.   Taxonomy algorithm applied.
4.   Taxonomy on the right created >>

Next we attempt to order visual channels and
   create design guidelines.
Guidelines for design
Ordering Visual Channels
Bertin’s Visual Channels
Associative                   Selective                      Ordered             Quantitative
facilitate grouping of all    facilitate selection of one    facilitate visual   permits extraction of ratios
elements of a variable        category of data and ignore    ranking of data:    without the need to inspect a
despite differing values:     others:                                            legend:

texture, color, orientation   texture, color, orientation,   texture, color &    planar & size.
and shape.                    shape, planar, size &          size.
                              brightness.
Pop-out effect
(Williams 67, Duncan 89, Luck 94, Bertin 83, Green 98, Wolfe 89, Treisman 77, Palmer 77, Parkhurst 02)
Visual Hierarchy

In particular we look at:
1.top-down (global);
2.salient feature detection of edges, points and colors.

Since they are most relevant to overview level processing of a scene.

[Palmer 77, Navon 77, Shor 71, Love 99, Kinchla 79]
Metaphor is important!




Material Combination   Material Amplification   Material Separation   Material Collection
From Taxonomy to Visual Channels
Visual Mapping
                 Select design options based on the guidelines and the
                 level of the classification in the taxonomy and map the
                 scheme to selected Visual Channels and structure


                                       C1        C3        C2




                                     In Vitro   In Vivo   In Silico
Visual Mapping
Crush test.
Schemes high up in the taxonomy should be distinguishable at low resolution...overview level.




                                                         We should be able to distinguish high-
                                                         level classes in the taxonomy even at
                                                         low resolutions.
Implementation & Dissemination




                                 Towards interoperable bioscience data
                                 Sansone et al, 2012
                                 Nature Genetics
Contributions
 1. Systematic Approach For Glyph Design
     • Ordering of concepts
     • Ordering of visual channels according to
         psychological literature
     • Mapping between them

 2. Application
     • Biological Metadata
     • Biological Workflows
Questions?
Funders




Thanks to the organizers and everyone here for listening!

More Related Content

PDF
Clusterix at VDS 2016
PDF
ShawnQuinnCSS581FinalProjectReport
PPTX
Report: "MolGAN: An implicit generative model for small molecular graphs"
PDF
DataEngConf: Feature Extraction: Modern Questions and Challenges at Google
PDF
Gradient-Based Meta-Learning with Learned Layerwise Metric and Subspace
PPT
Random Neural Network (Erol) by Engr. Edgar Carrillo II
PPTX
One shot learning
PPTX
Developing Document Image Retrieval System
Clusterix at VDS 2016
ShawnQuinnCSS581FinalProjectReport
Report: "MolGAN: An implicit generative model for small molecular graphs"
DataEngConf: Feature Extraction: Modern Questions and Challenges at Google
Gradient-Based Meta-Learning with Learned Layerwise Metric and Subspace
Random Neural Network (Erol) by Engr. Edgar Carrillo II
One shot learning
Developing Document Image Retrieval System

What's hot (20)

PDF
Transformer based approaches for visual representation learning
PDF
CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION
PDF
3D 딥러닝 동향
PPTX
MultiModal Retrieval Image
PDF
Btv thesis defense_v1.02-final
PDF
Learning to learn unlearned feature for segmentation
PDF
Machine learning for_finance
PDF
Matching Network
PDF
Identification of Relevant Sections in Web Pages Using a Machine Learning App...
PPTX
Automatic Image Annotation
PDF
Bhadale group of companies ai neural networks and algorithms catalogue
PDF
Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Anal...
PPTX
Recommendation system using collaborative deep learning
PDF
“Introducing Machine Learning and How to Teach Machines to See,” a Presentati...
PDF
Integrated Hidden Markov Model and Kalman Filter for Online Object Tracking
PDF
IRJET - Object Detection using Deep Learning with OpenCV and Python
PDF
Domain Invariant Representation Learning with Domain Density Transformations
PDF
Learning with Relative Attributes
PDF
Introduction to ambient GAN
PDF
Evolutionary Design of Swarms (SSCI 2014)
Transformer based approaches for visual representation learning
CNN FEATURES ARE ALSO GREAT AT UNSUPERVISED CLASSIFICATION
3D 딥러닝 동향
MultiModal Retrieval Image
Btv thesis defense_v1.02-final
Learning to learn unlearned feature for segmentation
Machine learning for_finance
Matching Network
Identification of Relevant Sections in Web Pages Using a Machine Learning App...
Automatic Image Annotation
Bhadale group of companies ai neural networks and algorithms catalogue
Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Anal...
Recommendation system using collaborative deep learning
“Introducing Machine Learning and How to Teach Machines to See,” a Presentati...
Integrated Hidden Markov Model and Kalman Filter for Online Object Tracking
IRJET - Object Detection using Deep Learning with OpenCV and Python
Domain Invariant Representation Learning with Domain Density Transformations
Learning with Relative Attributes
Introduction to ambient GAN
Evolutionary Design of Swarms (SSCI 2014)
Ad

Viewers also liked (7)

PDF
Visual Compression of Workflow Visualizations with Automated Detection of Mac...
PDF
PDF
HEPData Open Repositories 2016 Talk
PDF
Reproducible, Open Data Science in the Life Sciences
PDF
Principles of Data Visualization
PDF
Web valley talk - usability, visualization and mobile app development
PDF
Visualization of Publication Impact
Visual Compression of Workflow Visualizations with Automated Detection of Mac...
HEPData Open Repositories 2016 Talk
Reproducible, Open Data Science in the Life Sciences
Principles of Data Visualization
Web valley talk - usability, visualization and mobile app development
Visualization of Publication Impact
Ad

Similar to Taxonomy-Based Glyph Design (20)

PDF
Modeling XCS in class imbalances: Population sizing and parameter settings
PDF
CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...
PDF
Learning where to look: focus and attention in deep vision
PDF
Performance Evaluation of Classifiers used for Identification of Encryption A...
PPTX
AI Class Topic 6: Easy Way to Learn Deep Learning AI Technologies
PDF
Executable Biology Tutorial
PPTX
Deep learning from a novice perspective
PDF
Semantic Hybridized Image Features in Visual Diagnostic of Plant Health
PDF
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
PPT
Multiclassification system
KEY
Content-based Image Retrieval
PPTX
PPT.pptx
PPT
Renikko
PDF
Machine learning in the life sciences with knime
PPTX
Action Genome: Action As Composition of Spatio Temporal Scene Graphs
PDF
[Paper] DetectoRS for Object Detection
PPTX
Object Discovery using CNN Features in Egocentric Videos
PDF
Novel Class Detection Using RBF SVM Kernel from Feature Evolving Data Streams
DOC
Chapter6.doc
PPTX
Elettronica: Multimedia Information Processing in Smart Environments by Aless...
Modeling XCS in class imbalances: Population sizing and parameter settings
CCIA'2008: Can Evolution Strategies Improve Learning Guidance in XCS? Design ...
Learning where to look: focus and attention in deep vision
Performance Evaluation of Classifiers used for Identification of Encryption A...
AI Class Topic 6: Easy Way to Learn Deep Learning AI Technologies
Executable Biology Tutorial
Deep learning from a novice perspective
Semantic Hybridized Image Features in Visual Diagnostic of Plant Health
HIS'2008: New Crossover Operator for Evolutionary Rule Discovery in XCS
Multiclassification system
Content-based Image Retrieval
PPT.pptx
Renikko
Machine learning in the life sciences with knime
Action Genome: Action As Composition of Spatio Temporal Scene Graphs
[Paper] DetectoRS for Object Detection
Object Discovery using CNN Features in Egocentric Videos
Novel Class Detection Using RBF SVM Kernel from Feature Evolving Data Streams
Chapter6.doc
Elettronica: Multimedia Information Processing in Smart Environments by Aless...

Recently uploaded (20)

PPTX
Slide gioi thieu VietinBank Quy 2 - 2025
PPTX
svnfcksanfskjcsnvvjknsnvsdscnsncxasxa saccacxsax
PPTX
basic introduction to research chapter 1.pptx
PPTX
Sales & Distribution Management , LOGISTICS, Distribution, Sales Managers
PPT
Lecture 3344;;,,(,(((((((((((((((((((((((
PDF
Susan Semmelmann: Enriching the Lives of others through her Talents and Bless...
PDF
Satish NS: Fostering Innovation and Sustainability: Haier India’s Customer-Ce...
PPTX
TRAINNING, DEVELOPMENT AND APPRAISAL.pptx
PPTX
interschool scomp.pptxzdkjhdjvdjvdjdhjhieij
PDF
ANALYZING THE OPPORTUNITIES OF DIGITAL MARKETING IN BANGLADESH TO PROVIDE AN ...
PDF
1911 Gold Corporate Presentation Aug 2025.pdf
PDF
Blood Collected straight from the donor into a blood bag and mixed with an an...
DOCX
80 DE ÔN VÀO 10 NĂM 2023vhkkkjjhhhhjjjj
DOCX
Handbook of Entrepreneurship- Chapter 5: Identifying business opportunity.docx
PDF
Solara Labs: Empowering Health through Innovative Nutraceutical Solutions
PDF
Daniels 2024 Inclusive, Sustainable Development
PDF
Booking.com The Global AI Sentiment Report 2025
PDF
Keppel_Proposed Divestment of M1 Limited
PPTX
Slide gioi thieu VietinBank Quy 2 - 2025
PDF
PMB 401-Identification-of-Potential-Biotechnological-Products.pdf
Slide gioi thieu VietinBank Quy 2 - 2025
svnfcksanfskjcsnvvjknsnvsdscnsncxasxa saccacxsax
basic introduction to research chapter 1.pptx
Sales & Distribution Management , LOGISTICS, Distribution, Sales Managers
Lecture 3344;;,,(,(((((((((((((((((((((((
Susan Semmelmann: Enriching the Lives of others through her Talents and Bless...
Satish NS: Fostering Innovation and Sustainability: Haier India’s Customer-Ce...
TRAINNING, DEVELOPMENT AND APPRAISAL.pptx
interschool scomp.pptxzdkjhdjvdjvdjdhjhieij
ANALYZING THE OPPORTUNITIES OF DIGITAL MARKETING IN BANGLADESH TO PROVIDE AN ...
1911 Gold Corporate Presentation Aug 2025.pdf
Blood Collected straight from the donor into a blood bag and mixed with an an...
80 DE ÔN VÀO 10 NĂM 2023vhkkkjjhhhhjjjj
Handbook of Entrepreneurship- Chapter 5: Identifying business opportunity.docx
Solara Labs: Empowering Health through Innovative Nutraceutical Solutions
Daniels 2024 Inclusive, Sustainable Development
Booking.com The Global AI Sentiment Report 2025
Keppel_Proposed Divestment of M1 Limited
Slide gioi thieu VietinBank Quy 2 - 2025
PMB 401-Identification-of-Potential-Biotechnological-Products.pdf

Taxonomy-Based Glyph Design

  • 1. Taxonomy-Based Glyph Design– with a Case Study on Visualizing Workflows of Biological Experiments Eamonn Maguire, Philippe Rocca-Serra, Susanna-Assunta Sansone, Jim Davies and Min Chen University of Oxford, UK
  • 2. The Road Map… Aid quicker exploration and comparison of experimental workflows for biologists performing experiments and curators who validate them.
  • 3. Glyph: A glyph is a small visual object composed of a number Getting there of visual channels which can be used independently as well as constructively to depict attributes of a data record. e.g. Chernoff Faces e.g. Glyph based on rectangles using color and Image Sources: http://kspark.kaist.ac.kr/Human orientation. %20Engineering.files/Chernoff/Chernoff%20Faces.htm Healey, C et al. Perceptually-Based Brush Strokes for Nonphotorealistic Visualization 2004
  • 4. Some of the ~4000 concepts labeling But, we have a problem… nucleic acid extraction hybridization feature extraction bioassay data transformation Growth We have 21,000 studies with > 500,000 individual cultured cells saccharomyces cerevisiae scr101 experiments giving > 60 processes (actions on pool materials) and >4000 inputs/outputs to those image acquisition behavioral stimulus processes. purify pcr amplification normalization Creating 1000s of glyphs for each individual concept lowess group normalization is simply not scalable. extraction Scanning feature extraction and analysis We need a systematic process for glyph creation based immunoprecipitation compound based treatment on the properties of these concepts. transformation protocol linear amplification fresh frozen tissue saccharomyces cerevisiae bqs252. exponential growth in ypd.
  • 5. Solution outline Create a taxonomy Order Visual Channels… Map taxonomy to visual channels. A structured hierarchical arrangement Color > shape > size > We can create a glyph for items of concepts. orientation > texture. based on the position in taxonomy. One or more concepts represented by leaf nodes And provide design Higher levels in the taxonomy will guidelines. command better visual channels.
  • 7. Creating the taxonomy…input format In each scheme, there are sub-classifications (4 in S1). If a concept can be classified with this classification, it is assigned a 1, otherwise 0.
  • 8. Creating the taxonomy…general workings The algorithm runs recursively, selecting each best scheme S and attempting to sub-classify each classification C But how do we select the best scheme?
  • 9. Metric 1: Coverage 100% coverage yields value of 1 The more concepts a scheme can classify, the better.
  • 10. Metric 2: Potential Use Higher occurrence yields value closer to 1
  • 11. Metric 3: Sub tree balance Low standard deviation in number of concepts in each classification yields value closer to 1 A balanced tree is desirable and prevents a tree from having excessive height (greater height = need for more visual channels).
  • 12. Metric 4: Number of Subclasses Low number of classes yields value close to 1 Schemes with a high number of subclasses are penalized since a high number of subclasses would mean a high number of levels to map to with the selected visual channels. Only consider subclasses that are used.
  • 13. Application to our case study We have 8 schemes shown here, focusing mainly on processes.
  • 14. Application to our case study We have 21,000 biological studies with > 500,000 individual experiments giving > 60 processes (actions on materials) and >4000 inputs/outputs to those processes. 1. Concepts were extracted 2. Categories were created by a domain expert. 3. Taxonomy algorithm applied. 4. Taxonomy on the right created >> Next we attempt to order visual channels and create design guidelines.
  • 18. Associative Selective Ordered Quantitative facilitate grouping of all facilitate selection of one facilitate visual permits extraction of ratios elements of a variable category of data and ignore ranking of data: without the need to inspect a despite differing values: others: legend: texture, color, orientation texture, color, orientation, texture, color & planar & size. and shape. shape, planar, size & size. brightness.
  • 19. Pop-out effect (Williams 67, Duncan 89, Luck 94, Bertin 83, Green 98, Wolfe 89, Treisman 77, Palmer 77, Parkhurst 02)
  • 20. Visual Hierarchy In particular we look at: 1.top-down (global); 2.salient feature detection of edges, points and colors. Since they are most relevant to overview level processing of a scene. [Palmer 77, Navon 77, Shor 71, Love 99, Kinchla 79]
  • 21. Metaphor is important! Material Combination Material Amplification Material Separation Material Collection
  • 22. From Taxonomy to Visual Channels
  • 23. Visual Mapping Select design options based on the guidelines and the level of the classification in the taxonomy and map the scheme to selected Visual Channels and structure C1 C3 C2 In Vitro In Vivo In Silico
  • 25. Crush test. Schemes high up in the taxonomy should be distinguishable at low resolution...overview level. We should be able to distinguish high- level classes in the taxonomy even at low resolutions.
  • 26. Implementation & Dissemination Towards interoperable bioscience data Sansone et al, 2012 Nature Genetics
  • 27. Contributions 1. Systematic Approach For Glyph Design • Ordering of concepts • Ordering of visual channels according to psychological literature • Mapping between them 2. Application • Biological Metadata • Biological Workflows
  • 28. Questions? Funders Thanks to the organizers and everyone here for listening!

Editor's Notes

  • #3: In biological experiment workflows, we are showing the biological materials and protocols enacted on materials which result in some data files…e.g. DNA sequence data, protein expression data, etc. The current representation is the representation on the left. It does not facilitate pattern discovery/recognition and requires zooming in to get any information about the nodes. In other words, it is absent of a valid overview level visualization. Our solution is to use glyphs to replace these text-labeled boxes with glyphs. The presence of iconic memory was introduced by Sperling in 1960. It is shown to facilitate rapid comparison between glyphs in the same display, whereas the effect is less so for text.
  • #4: Sperling, 1960 - The presence of iconic memory may facilitate rapid comparison between glyphs in the same display, whereas it is less so for texts.
  • #5: In our problem domain, we have the following numbers at our disposal. >4,000 qualitative terms
  • #6: Taxonomy and visual channels. Through formulating a glyphs representation based on the taxonomy, we implicitly construct rules for how a glyph is constructed.
  • #8: Animate.
  • #10: The first metric for selecting a scheme is the coverage.
  • #11: Same coverage but Scheme 1 is better for potential use.
  • #12: Make fair comparison with 3 each side.
  • #13: When mapped to more abstract visual channels, e.g. color, too many mappings are hard to learn. Unless the color is metaphoric.
  • #14: Move information about concepts in to earlier slide.
  • #15: Move information about concepts in to earlier slide.
  • #16: Move information about concepts in to earlier slide.
  • #17: I will simply show how we built this table but relatively briefly. If you want more information, you can read the paper and/or speak to me throughout the course of the meeting.
  • #20: Pop-out effect We looked at many sources of literature on pop-out effectThe power of Visual Channels differ in their ability to contribute to pop out effect. Integral/Separable Visual Channels Some visual channels do not interact well with others, for instance, motion & flicker or width and height are common examples of what are termed integral dimensions .
  • #21: There are a few theories on how we process information in the visual hierarchy. Local, Global, middle-out and salient feature detection. Some disagreement over exact mechanism used. In our work we focus mainly on the top-down and salient feature detection theories since the glyphs will often be small in relation to the overall visualization.
  • #22: In Particular domain-specific metaphor... Learning, recognition + memorizing… “ Natural mappings ” [Siirtola 02] between data and their visual counterparts can make it easier for users to infer meaning from the glyph with less effort required to learn and remember them.
  • #23: We have the taxonomic order and the visual channel order, now we can map between them.
  • #24: Explain the first level mapping in more detail.
  • #25: Animate creation of the tree.
  • #26: Because of the ordering, the top level in the taxonomy will be distinguishable even at low levels.
  • #27: Franks and Bransford’s study on transformation of prototypes suggested that humans can learn to recognize glyphs by rules consciously as well as unconsciously.
  • #28: We’ve provided a rule based encoding Franks and Bransford’s study on transformation of prototypes suggested that humans can learn to recognize glyphs by rules consciously as well as unconsciously.