Approximated and User Steerable tSNE
for Progressive Visual Analytics
Nicola Pezzotti, Boudewijn P.F. Lelieveldt, Laurens van der Maaten,
Thomas Höllt, Elmar Eisemann, Anna Vilanova
Non-Linear Dimensionality-Reduction
t-Distributed Stochastic Neighbor Embedding
Non-linear dimensionality-reduction algorithm
• Preserves small neighborhoods
• Reveals global structures
Visualizing data using t-SNE - Van der Maaten & Hinton - 2008
tSNE as a Black Box
[Diagram: tSNE pipeline. Similarities computation, followed by gradient descent minimization.]
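The two stages named in the diagram (similarities computation, then gradient descent minimization) can be sketched in a few lines of NumPy. This is a minimal illustration, not the implementation used in the talk: the function names are hypothetical, and a single fixed Gaussian bandwidth `sigma` is assumed, whereas t-SNE normally tunes a per-point bandwidth from a perplexity setting. Momentum and early exaggeration are also left out.

```python
import numpy as np

def pairwise_sq_dists(X):
    """Squared Euclidean distances between all rows of X."""
    s = np.sum(X * X, axis=1)
    D = s[:, None] + s[None, :] - 2.0 * (X @ X.T)
    np.fill_diagonal(D, 0.0)
    return np.maximum(D, 0.0)

def similarities(X, sigma=1.0):
    """Stage 1: symmetric high-dimensional similarities P.
    Assumption: one fixed sigma instead of a perplexity search."""
    D = pairwise_sq_dists(X)
    P = np.exp(-D / (2.0 * sigma ** 2))
    np.fill_diagonal(P, 0.0)
    P /= P.sum(axis=1, keepdims=True)      # conditional p_{j|i}
    P = (P + P.T) / (2.0 * len(X))         # symmetrize; entries sum to 1
    return np.maximum(P, 1e-12)

def kl_and_grad(P, Y):
    """KL(P || Q) and its gradient for a low-dimensional embedding Y."""
    W = 1.0 / (1.0 + pairwise_sq_dists(Y))  # Student-t kernel
    np.fill_diagonal(W, 0.0)
    Q = np.maximum(W / W.sum(), 1e-12)
    kl = float(np.sum(P * np.log(P / Q)))
    A = (P - Q) * W
    grad = 4.0 * (A.sum(axis=1)[:, None] * Y - A @ Y)
    return kl, grad

def tsne(X, n_iter=200, lr=10.0, seed=0):
    """Stage 2: plain gradient descent on the KL divergence."""
    rng = np.random.default_rng(seed)
    P = similarities(X)
    Y = 1e-2 * rng.standard_normal((len(X), 2))
    history = []
    for _ in range(n_iter):
        kl, grad = kl_and_grad(P, Y)
        history.append(kl)
        Y = Y - lr * grad
    return Y, history
```

Calling `tsne(X)` on an `(n, d)` array returns the 2D embedding and the cost recorded at every iteration. In this black-box form the caller sees nothing until the last iteration finishes.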
Progressive Visual Analytics (PVA)
[Diagram: PVA - tSNE. Similarities computation, followed by gradient descent minimization; partial results are computed and sent to the visualization.]
Progressive Visual Analytics: User-Driven Visual Exploration of In-Progress Analytics - Stolper et al. - 2014
Opening the Black Box: Strategies for Increased User Involvement in Existing Algorithm Implementations - Mühlbacher et al. - 2014
Progressive Analytics: A Computation Paradigm for Exploratory Data Analysis - Fekete & Primet - 2016
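The "compute partial results" idea can be sketched as a Python generator that hands intermediate states of an iterative algorithm to a consumer, which may stop early. All names here (`progressive`, `toy_step`, `demo`) are hypothetical, and the toy update rule only stands in for a real gradient descent step.

```python
import numpy as np

def progressive(step, state, n_iter, report_every=10):
    """Run an iterative computation and yield partial results.
    `step` maps the current state to the next one."""
    for it in range(1, n_iter + 1):
        state = step(state)
        if it % report_every == 0 or it == n_iter:
            yield it, state.copy()

def demo():
    target = np.array([1.0, -2.0])

    def toy_step(y):
        # Stand-in for one gradient descent iteration.
        return y - 0.2 * (y - target)

    for it, partial in progressive(toy_step, np.zeros(2), n_iter=100):
        # A real system would redraw the visualization here.
        if np.linalg.norm(partial - target) < 1e-3:
            break  # the analyst (here: a threshold) stops the run early
    return it, partial
```

The consumer stays in control of the loop: it can inspect each partial result, stop, or let the computation continue.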
Approximated Computations in PVA
Approximated tSNE
[Diagram: Approximated - tSNE, shown next to PVA - tSNE. Approximated similarities computation; partial results are computed and sent to the visualization.]
Approximated similarities computation
[Figure panels: K-Nearest-Neighborhood; Approximated K-Nearest-Neighborhood [1], Precision: 50%]
[1] Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration - Muja & Lowe - 2009
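The precision figure can be made concrete with a small sketch. It assumes precision means the fraction of approximated neighbors that also appear among the exact k nearest neighbors. The approximation used here is a simple stand-in (each point searches only a random subset of candidates), not the algorithm from [1]; all function names are hypothetical.

```python
import numpy as np

def exact_knn(X, k):
    """Brute-force k nearest neighbors (indices) for every row of X."""
    D = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    np.fill_diagonal(D, np.inf)
    return np.argsort(D, axis=1)[:, :k]

def approx_knn(X, k, n_candidates, seed=0):
    """Stand-in approximation: search a random candidate subset per point.
    Requires k <= n_candidates <= len(X) - 1."""
    rng = np.random.default_rng(seed)
    n = len(X)
    out = np.empty((n, k), dtype=int)
    for i in range(n):
        others = np.delete(np.arange(n), i)
        cand = rng.choice(others, size=n_candidates, replace=False)
        d = ((X[cand] - X[i]) ** 2).sum(-1)
        out[i] = cand[np.argsort(d)[:k]]
    return out

def knn_precision(approx, exact):
    """Fraction of approximated neighbors that are true neighbors."""
    hits = sum(len(set(a) & set(e)) for a, e in zip(approx, exact))
    return hits / exact.size
```

With this stand-in, searching about half of the other points gives a precision near 50%, because a true neighbor is found exactly when it lands in the candidate subset.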
Approximated tSNE
[Diagram: Approximated - tSNE. Approximated similarities computation; partial results are computed and sent to the visualization. Refinement steps lead from approximated (Approx.) to exact similarities.]
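One possible reading of the refinement step, sketched under the assumption that refinement means recomputing exact neighborhoods only for a subset of points (for example, points the user has selected). The function name and the index-array layout of `neighbors` are hypothetical.

```python
import numpy as np

def refine_neighbors(X, neighbors, selected):
    """Replace the approximated neighbor lists of `selected` points
    with exact ones; all other rows are left untouched."""
    refined = neighbors.copy()
    k = neighbors.shape[1]
    for i in selected:
        d = ((X - X[i]) ** 2).sum(axis=1)
        d[i] = np.inf                    # a point is not its own neighbor
        refined[i] = np.argsort(d)[:k]
    return refined
```

The cost grows with the number of selected points rather than with the size of the whole data set, which is what makes local refinement cheap compared with an exact computation for all points.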
tSNE: Time: 3191.8 s
A-tSNE (Precision: 35%): Time: 30.1 s
Speed up: 100x
Precision 35%?
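The speed-up follows directly from the two timings on this slide; the slide rounds the ratio down to 100x. A short check (the helper name is hypothetical):

```python
def speedup(baseline_s, accelerated_s):
    """Ratio of baseline time to accelerated time (both in seconds)."""
    return baseline_s / accelerated_s

# Timings from the slide: tSNE 3191.8 s, A-tSNE 30.1 s.
ratio = speedup(3191.8, 30.1)   # roughly 106
```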
Approximated similarities computation
Steerability & Approximation visualization
• Density-based visualization
• Supports brushing & linking
• Approximation is visualized and removed if requested
• 3 Strategies
• Local minima avoidance
A-tSNE Precision: 5%
Preprocessing: 12 s
Case Study I : Gene Expression in the Mouse Brain
Case Study I : Gene expression
[Figure: Sagittal, Axial, and Coronal views; 3D Volume]
61164 data points (Voxels), 4345 dimensions (Gene expression)
Case Study I : Gene expression
A-tSNE 50 seconds – tSNE 3 hours and 50 minutes
Speed up: 250x
Case Study II : High-dimensional data streams
Case Study II : High-dimensional data streams
Chest - Ankle - Wrist
52 Dimensions every 100 ms
Image courtesy of www.activ8all.com
Conclusions
• Approximation in Progressive Visual Analytics
• Approximated-tSNE
• Data manipulation
• Refinement
• Scalability issues of the gradient descent
• Hierarchical SNE [1]
[1] Hierarchical Stochastic Neighbor Embedding - Pezzotti et al. - 2016
Thank you for your attention!
tSNE: Precomp. 3195 s
A-tSNE, Precision: 35%: Similarities computation time: 29 s
A-tSNE, Precision: 5%: Similarities computation time: 12 s
Speed 4x