SlideShare a Scribd company logo
Generalized Low Rank Models
Anqi Fu
Machine Learning Scientist, H2O.ai
anqi@h2o.ai
October 29, 2015
Based on work by Stephen Boyd, Madeleine Udell, Corinne Horn and Reza Zadeh
Anqi Fu (H2O.ai) Generalized Low Rank Models
October 29, 2015[1ex] Based on work b
/ 8
What is a Low Rank Model?
Given: Data table A with m rows and n columns
Find: Compressed representation as numeric tables X and Y , where
# cols in X = # rows in Y = small user-specified k max(m, n)
m




 A


n
≈ m




X


k
Y k
n
Row of Y = archetypal feature created from cols of A
Row of X = row of A in reduced feature space
Can approximately reconstruct A from product XY
Anqi Fu (H2O.ai) Generalized Low Rank Models
October 29, 2015[1ex] Based on work b
/ 8
Why use Low Rank Models?
Reduce storage space, e.g. 10 GB compressed to 100 MB
Increase prediction speed, e.g. 10x speed-up with no accuracy loss
Identify and visualize important features
Impute missing data
Anqi Fu (H2O.ai) Generalized Low Rank Models
October 29, 2015[1ex] Based on work b
/ 8
Example 1: Visualizing Walking Stances
time forehead (x) forehead (y) · · · right toe (y) right toe (z)
t1 1.4 2.7 · · · -0.5 -0.1
t2 2.7 3.5 · · · 1.3 0.9
t3 3.3 -.9 · · · 4.2 1.8
...
...
...
...
...
...
A contains 151 rows (observations over time) by 124 columns
(location of body parts)
Build a low rank model X, Y with rank k = 10
Rows of Y are principal stances person takes while walking
Rows of X decompose each bodily position into combination of
principal stances
Anqi Fu (H2O.ai) Generalized Low Rank Models
October 29, 2015[1ex] Based on work b
/ 8
Example 2: Compressing Zip Codes
repeat violator (y) ZCTA state · · · violations compensation
N/A 70525 LA · · · 9 8100
R 75189 TX · · · 6 935
RW 95621 CA · · · 4 1155
...
...
...
...
...
...
Train: U.S. Labor Department Wage & Hour Division (WHD)
compliance actions contains 208,806 rows (cases) and 252 cols
(violation info)
Response: Was firm a repeat and/or willful violator?
Predictors: ZCTA, state, number of violations, civil penalties, etc
Anqi Fu (H2O.ai) Generalized Low Rank Models
October 29, 2015[1ex] Based on work b
/ 8
Example 2: Compressing Zip Codes
ZCTA associate degree bachelor’s degree · · · welsh west indian
01001 1584 1953 · · · 34 57
01002 510 3098 · · · 332 181
01003 27 49 · · · 40 134
...
...
...
...
...
...
American Community Survey (ACS) data contains 32,989 rows by
150 columns (population info)
Each row represents a unique zip code tabulation area (ZCTA)
Build a low rank model X, Y with rank k = 10 and regularization to
sparsify features
Rows of Y are demographic archetypes
Rows of X map ZCTAs into combination of demographic archetypes
Anqi Fu (H2O.ai) Generalized Low Rank Models
October 29, 2015[1ex] Based on work b
/ 8
Example 2: Compressing Zip Codes
Train =



y ZCTA ···
N/A 70525 · · ·
...
... · · ·
R 01002 · · ·


 X =





ZCTA archetypes
01001 x1
01002 x2
...
...
70525 x24128





Replace ZCTA col of training data with low rank model (X) of ACS
Predict if firm will be a repeat violator using modified training data
repeat violator (y) archetypes state · · · violations compensation
N/A x24128 LA · · · 9 8100
...
...
...
...
...
...
R x2 MA · · · 2 225
Anqi Fu (H2O.ai) Generalized Low Rank Models
October 29, 2015[1ex] Based on work b
/ 8
References
M. Udell, et al (2014), Generalized Low Rank Models
Example 1: Visualizing Walking Stance
Walking Gait Data
Walking Gait Data with Missing Values
Example 2: Compressing Zip Codes
Wage and Hour Division Data
American Community Survey Data
Anqi Fu (H2O.ai) Generalized Low Rank Models
October 29, 2015[1ex] Based on work b
/ 8

More Related Content

PDF
H2O World - Generalized Low Rank Models - Madeleine Udell
PDF
QMC: Undergraduate Workshop, Introduction to Monte Carlo Methods with 'R' Sof...
PPTX
Spline interpolation numerical methods presentation
PPTX
Curve sketching 1
PDF
A25.8 modelingquads
PPTX
2.9 graphs of factorable rational functions t
PPTX
Integration
PDF
AP Calculus Slides December 14, 2007
H2O World - Generalized Low Rank Models - Madeleine Udell
QMC: Undergraduate Workshop, Introduction to Monte Carlo Methods with 'R' Sof...
Spline interpolation numerical methods presentation
Curve sketching 1
A25.8 modelingquads
2.9 graphs of factorable rational functions t
Integration
AP Calculus Slides December 14, 2007

What's hot (19)

PDF
Pre-Cal 40S Slides February 29, 2008
PDF
CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...
PPT
Spline Interpolation
PDF
AP Calculus Slides September 18, 2007
PDF
Curve fitting and Optimization
PPT
Contrastive Divergence Learning
PPTX
Logistic Modeling with Applications to Marketing and Credit Risk in the Autom...
PPT
Functions for Grade 10
PPTX
rational expression
PPT
PDF
Lesson 25: The Definite Integral
PDF
AP Calculus Jauary 13, 2009
PPT
Tutorials--Factoring Quadratics
PPT
PDF
AP Calculus Slides December 10, 2007
PDF
Presentation of my master thesis - Image Processing
PDF
Day 1 examples u1f13
PDF
Day 1 examples u1f13
PPTX
Rational function representation
Pre-Cal 40S Slides February 29, 2008
CLIM Fall 2017 Course: Statistics for Climate Research, Spatial Data: Models ...
Spline Interpolation
AP Calculus Slides September 18, 2007
Curve fitting and Optimization
Contrastive Divergence Learning
Logistic Modeling with Applications to Marketing and Credit Risk in the Autom...
Functions for Grade 10
rational expression
Lesson 25: The Definite Integral
AP Calculus Jauary 13, 2009
Tutorials--Factoring Quadratics
AP Calculus Slides December 10, 2007
Presentation of my master thesis - Image Processing
Day 1 examples u1f13
Day 1 examples u1f13
Rational function representation
Ad

More from Sri Ambati (20)

PDF
H2O Label Genie Starter Track - Support Presentation
PDF
H2O.ai Agents : From Theory to Practice - Support Presentation
PDF
H2O Generative AI Starter Track - Support Presentation Slides.pdf
PDF
H2O Gen AI Ecosystem Overview - Level 1 - Slide Deck
PDF
An In-depth Exploration of Enterprise h2oGPTe Slide Deck
PDF
Intro to Enterprise h2oGPTe Presentation Slides
PDF
Enterprise h2o GPTe Learning Path Slide Deck
PDF
H2O Wave Course Starter - Presentation Slides
PDF
Large Language Models (LLMs) - Level 3 Slides
PDF
Data Science and Machine Learning Platforms (2024) Slides
PDF
Data Prep for H2O Driverless AI - Slides
PDF
H2O Cloud AI Developer Services - Slides (2024)
PDF
LLM Learning Path Level 2 - Presentation Slides
PDF
LLM Learning Path Level 1 - Presentation Slides
PDF
Hydrogen Torch - Starter Course - Presentation Slides
PDF
Presentation Resources - H2O Gen AI Ecosystem Overview - Level 2
PDF
H2O Driverless AI Starter Course - Slides and Assignments
PPTX
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
PDF
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
PPTX
Generative AI Masterclass - Model Risk Management.pptx
H2O Label Genie Starter Track - Support Presentation
H2O.ai Agents : From Theory to Practice - Support Presentation
H2O Generative AI Starter Track - Support Presentation Slides.pdf
H2O Gen AI Ecosystem Overview - Level 1 - Slide Deck
An In-depth Exploration of Enterprise h2oGPTe Slide Deck
Intro to Enterprise h2oGPTe Presentation Slides
Enterprise h2o GPTe Learning Path Slide Deck
H2O Wave Course Starter - Presentation Slides
Large Language Models (LLMs) - Level 3 Slides
Data Science and Machine Learning Platforms (2024) Slides
Data Prep for H2O Driverless AI - Slides
H2O Cloud AI Developer Services - Slides (2024)
LLM Learning Path Level 2 - Presentation Slides
LLM Learning Path Level 1 - Presentation Slides
Hydrogen Torch - Starter Course - Presentation Slides
Presentation Resources - H2O Gen AI Ecosystem Overview - Level 2
H2O Driverless AI Starter Course - Slides and Assignments
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
Generative AI Masterclass - Model Risk Management.pptx
Ad

Recently uploaded (20)

PDF
Digital Strategies for Manufacturing Companies
PDF
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
PPTX
Introduction to Artificial Intelligence
PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PPTX
Odoo POS Development Services by CandidRoot Solutions
PPTX
history of c programming in notes for students .pptx
PPTX
Operating system designcfffgfgggggggvggggggggg
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PDF
Softaken Excel to vCard Converter Software.pdf
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PDF
How Creative Agencies Leverage Project Management Software.pdf
PDF
Wondershare Filmora 15 Crack With Activation Key [2025
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PPTX
CHAPTER 2 - PM Management and IT Context
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
PDF
top salesforce developer skills in 2025.pdf
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PDF
2025 Textile ERP Trends: SAP, Odoo & Oracle
Digital Strategies for Manufacturing Companies
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
Introduction to Artificial Intelligence
VVF-Customer-Presentation2025-Ver1.9.pptx
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
Odoo POS Development Services by CandidRoot Solutions
history of c programming in notes for students .pptx
Operating system designcfffgfgggggggvggggggggg
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
Design an Analysis of Algorithms II-SECS-1021-03
Softaken Excel to vCard Converter Software.pdf
Navsoft: AI-Powered Business Solutions & Custom Software Development
How Creative Agencies Leverage Project Management Software.pdf
Wondershare Filmora 15 Crack With Activation Key [2025
Upgrade and Innovation Strategies for SAP ERP Customers
CHAPTER 2 - PM Management and IT Context
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
top salesforce developer skills in 2025.pdf
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
2025 Textile ERP Trends: SAP, Odoo & Oracle

Generalized Low Rank Models

  • 1. Generalized Low Rank Models Anqi Fu Machine Learning Scientist, H2O.ai anqi@h2o.ai October 29, 2015 Based on work by Stephen Boyd, Madeleine Udell, Corinne Horn and Reza Zadeh Anqi Fu (H2O.ai) Generalized Low Rank Models October 29, 2015[1ex] Based on work b / 8
  • 2. What is a Low Rank Model? Given: Data table A with m rows and n columns Find: Compressed representation as numeric tables X and Y , where # cols in X = # rows in Y = small user-specified k max(m, n) m      A   n ≈ m     X   k Y k n Row of Y = archetypal feature created from cols of A Row of X = row of A in reduced feature space Can approximately reconstruct A from product XY Anqi Fu (H2O.ai) Generalized Low Rank Models October 29, 2015[1ex] Based on work b / 8
  • 3. Why use Low Rank Models? Reduce storage space, e.g. 10 GB compressed to 100 MB Increase prediction speed, e.g. 10x speed-up with no accuracy loss Identify and visualize important features Impute missing data Anqi Fu (H2O.ai) Generalized Low Rank Models October 29, 2015[1ex] Based on work b / 8
  • 4. Example 1: Visualizing Walking Stances time forehead (x) forehead (y) · · · right toe (y) right toe (z) t1 1.4 2.7 · · · -0.5 -0.1 t2 2.7 3.5 · · · 1.3 0.9 t3 3.3 -.9 · · · 4.2 1.8 ... ... ... ... ... ... A contains 151 rows (observations over time) by 124 columns (location of body parts) Build a low rank model X, Y with rank k = 10 Rows of Y are principal stances person takes while walking Rows of X decompose each bodily position into combination of principal stances Anqi Fu (H2O.ai) Generalized Low Rank Models October 29, 2015[1ex] Based on work b / 8
  • 5. Example 2: Compressing Zip Codes repeat violator (y) ZCTA state · · · violations compensation N/A 70525 LA · · · 9 8100 R 75189 TX · · · 6 935 RW 95621 CA · · · 4 1155 ... ... ... ... ... ... Train: U.S. Labor Department Wage & Hour Division (WHD) compliance actions contains 208,806 rows (cases) and 252 cols (violation info) Response: Was firm a repeat and/or willful violator? Predictors: ZCTA, state, number of violations, civil penalties, etc Anqi Fu (H2O.ai) Generalized Low Rank Models October 29, 2015[1ex] Based on work b / 8
  • 6. Example 2: Compressing Zip Codes ZCTA associate degree bachelor’s degree · · · welsh west indian 01001 1584 1953 · · · 34 57 01002 510 3098 · · · 332 181 01003 27 49 · · · 40 134 ... ... ... ... ... ... American Community Survey (ACS) data contains 32,989 rows by 150 columns (population info) Each row represents a unique zip code tabulation area (ZCTA) Build a low rank model X, Y with rank k = 10 and regularization to sparsify features Rows of Y are demographic archetypes Rows of X map ZCTAs into combination of demographic archetypes Anqi Fu (H2O.ai) Generalized Low Rank Models October 29, 2015[1ex] Based on work b / 8
  • 7. Example 2: Compressing Zip Codes Train =    y ZCTA ··· N/A 70525 · · · ... ... · · · R 01002 · · ·    X =      ZCTA archetypes 01001 x1 01002 x2 ... ... 70525 x24128      Replace ZCTA col of training data with low rank model (X) of ACS Predict if firm will be a repeat violator using modified training data repeat violator (y) archetypes state · · · violations compensation N/A x24128 LA · · · 9 8100 ... ... ... ... ... ... R x2 MA · · · 2 225 Anqi Fu (H2O.ai) Generalized Low Rank Models October 29, 2015[1ex] Based on work b / 8
  • 8. References M. Udell, et al (2014), Generalized Low Rank Models Example 1: Visualizing Walking Stance Walking Gait Data Walking Gait Data with Missing Values Example 2: Compressing Zip Codes Wage and Hour Division Data American Community Survey Data Anqi Fu (H2O.ai) Generalized Low Rank Models October 29, 2015[1ex] Based on work b / 8