Using Deep Neural Networks
for Fashion Applications
Ahmad Qamar
ahmad@threadgenius.co
motivation
challenges
related work
data collection
Thread Genius
applications
future work
demos
threadgenius.co
set of terms used interchangeably
● features, fingerprints, representations, latent
factors, vectors
● attributes, labels, concepts
● product, SKU, in-store
● street, wild-type, UGC
motivation
[figure: the web in 2000 vs. today]
→ web content increasingly image-heavy
→ more participation and engagement
+ 1.8B+ photos uploaded daily (2014)
- only 15% contain relevant metadata
→ millennials are increasingly brand agnostic
- logo detection fails in extracting signal
millennials prefer branded content that is authentic and information dense
● social is saturated with content
● attention is a limited commodity
challenges
+ attribute extraction works well
- limited and generic taxonomy
of attributes
- word attributes don’t fully
capture image
visual understanding is becoming ubiquitous
- visual search suffers from
poor results
- lack of focus on specific
domain
[figure: query image with results from an out-of-the-box model vs. a fashion domain model]
collaborative filtering not fit for fashion
“people who buy a also buy b”
- no information about content
- half-life of fashion products is ~1.5 months
- higher quantity (vs. movies), lower volume (vs. music)
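A minimal sketch of the "people who buy a also buy b" idea, to make the first limitation concrete: the recommender only counts co-purchases and never looks at the product itself, so an item with no purchase history gets no recommendations. The baskets and item names are hypothetical.

```python
from collections import Counter
from itertools import permutations


def co_purchase_counts(baskets):
    """Count, for each ordered pair (a, b), how many baskets contain both."""
    counts = Counter()
    for basket in baskets:
        for a, b in permutations(set(basket), 2):
            counts[(a, b)] += 1
    return counts


def also_bought(item, counts, k=3):
    """Return up to k items most often bought together with `item`."""
    scored = [(b, n) for (a, b), n in counts.items() if a == item]
    # Highest count first; ties broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return [b for b, _ in scored[:k]]


# Hypothetical purchase histories (illustrative only).
baskets = [
    ["denim jacket", "white tee"],
    ["denim jacket", "white tee", "chelsea boots"],
    ["white tee", "chelsea boots"],
]
counts = co_purchase_counts(baskets)
recs = also_bought("denim jacket", counts)
# A product that just launched has no co-purchase counts at all.
cold_start = also_bought("new-season bomber", counts)
```

With a product half-life of ~1.5 months, much of the catalog sits in the `cold_start` situation at any given time.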
image models require lots of data to prevent
overfitting
- limited public datasets for fashion
- taxonomy requires domain consideration
- collecting training data is painstaking
- ambiguity and variability present in fashion products
and photos
related work
Convnets 101: convnets learn features and the
classifier simultaneously
images from Sander Dieleman
[Kiapour et al] Where to Buy It: Matching Street
Clothing Photos in Online Shops
+ learn a NN similarity function
between query and candidate to
output a match score; works well
- querying is expensive since score
must be computed for all candidates
- learn separate functions for each
category (tops, footwear, …)
[Liu et al] DeepFashion: Powering Robust Clothes Recognition and
Retrieval with Rich Annotations
+ provide 800K fashion images for academic community
+ perform landmark detection for localization
- questionable data quality: taxonomy contains irrelevant
words (e.g. brooklyn, kurt), images are mislabeled, bounding boxes
are too tight
images for
“brooklyn”
[Bell et al] Learning visual similarity for product design with
convolutional neural networks
+ learn image embeddings that place similar products close
- requires cropped images for querying
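Once similar products sit close together in embedding space, retrieval reduces to a nearest-neighbor lookup. A minimal sketch, assuming cosine similarity as the distance and hand-written 3-d vectors standing in for convnet embeddings (both are assumptions for illustration, not details from the paper):

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def most_similar(query, catalog, k=2):
    """Return the k catalog names whose embeddings are closest to the query."""
    ranked = sorted(
        catalog,
        key=lambda name: cosine_similarity(query, catalog[name]),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical embeddings; a real system would get these from the convnet.
catalog = {
    "black varsity jacket": [0.9, 0.1, 0.0],
    "navy bomber jacket": [0.8, 0.3, 0.1],
    "square-toe pumps": [0.0, 0.2, 0.9],
}
query = [1.0, 0.2, 0.0]  # embedding of a cropped query image
top = most_similar(query, catalog)
```

Unlike a pairwise match-score network, the catalog embeddings here are computed once and reused for every query.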
train on two tasks
● metric learning
● category prediction
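A sketch of how the two tasks can share one objective. It assumes a contrastive loss for the metric-learning term and softmax cross-entropy for category prediction; the `margin` and `weight` values are illustrative assumptions, not numbers from the paper.

```python
import math


def euclidean(u, v):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def contrastive_loss(u, v, same, margin=1.0):
    """Pull matching pairs together; push non-matching pairs past the margin."""
    d = euclidean(u, v)
    if same:
        return d ** 2
    return max(0.0, margin - d) ** 2


def cross_entropy(logits, target):
    """Softmax cross-entropy for one example (log-sum-exp for stability)."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[target]


def joint_loss(u, v, same, logits, target, weight=1.0):
    """Metric-learning term plus a weighted category-prediction term."""
    return contrastive_loss(u, v, same, margin=1.0) + weight * cross_entropy(logits, target)


# Hypothetical embeddings for a matching pair and category logits for one image.
loss = joint_loss([0.1, 0.2], [0.1, 0.3], True, [2.0, 0.5, -1.0], 0)
```

The category term keeps the embedding aware of product type while the pairwise term shapes distances.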
[Ren et al] Faster R-CNN: Towards Real-Time Object
Detection with Region Proposal Networks
object detection model
train a mini-network that
classifies objectness and
regresses bounding boxes
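A sketch of what the mini-network's two outputs mean for a single anchor box. The box decoding follows the standard Faster R-CNN parameterization (center offsets scaled by anchor size, log-scale width and height). Scoring objectness with a single sigmoid logit is a simplification assumed here, and the anchor values are hypothetical.

```python
import math


def decode_box(anchor, deltas):
    """Apply regressed deltas (tx, ty, tw, th) to an anchor (cx, cy, w, h)."""
    xa, ya, wa, ha = anchor
    tx, ty, tw, th = deltas
    return (xa + tx * wa, ya + ty * ha, wa * math.exp(tw), ha * math.exp(th))


def objectness(logit):
    """Probability that the anchor contains any object (sigmoid of the logit)."""
    return 1.0 / (1.0 + math.exp(-logit))


def propose(anchors, deltas, logits, threshold=0.5):
    """Keep decoded boxes whose objectness score clears the threshold."""
    proposals = []
    for anchor, delta, logit in zip(anchors, deltas, logits):
        score = objectness(logit)
        if score >= threshold:
            proposals.append((decode_box(anchor, delta), score))
    return proposals


# Two hypothetical anchors: one confidently an object, one background.
anchors = [(50.0, 50.0, 20.0, 40.0), (120.0, 80.0, 40.0, 40.0)]
deltas = [(0.5, -0.25, 0.0, 0.0), (0.0, 0.0, 0.0, 0.0)]
logits = [2.0, -2.0]
proposals = propose(anchors, deltas, logits)
```

In a full detector the surviving proposals would then go through non-maximum suppression and a classification head, both omitted here.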
data collection
fashion products exhibit different types of structure
● category: top, footwear, ...
● product: sweater, pumps, ...
● detail: shawl collar, square toe, ...
● color: yellow, anthracite, ...
● pattern: paisley, colorblock, ...
● aesthetic: preppy, scandalous, …
- degrees of freedom make similarity search subjective
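One way to see why these degrees of freedom make similarity subjective: score candidates by which facets they share with the query, and let the facet weights vary by shopper. The products, the "shirt" value, and both weight profiles below are hypothetical.

```python
FACETS = ("category", "product", "detail", "color", "pattern", "aesthetic")


def facet_similarity(a, b, weights):
    """Sum the weights of the facets on which two annotated products agree."""
    return sum(
        weights.get(f, 0.0)
        for f in FACETS
        if f in a and a.get(f) == b.get(f)
    )


def best_match(query, candidates, weights):
    """Name of the candidate with the highest weighted facet overlap."""
    return max(candidates, key=lambda name: facet_similarity(query, candidates[name], weights))


query = {"category": "top", "product": "sweater", "color": "yellow", "pattern": "colorblock"}
candidates = {
    "same product, other look": {
        "category": "top", "product": "sweater", "color": "anthracite", "pattern": "paisley",
    },
    "other product, same look": {
        "category": "top", "product": "shirt", "color": "yellow", "pattern": "colorblock",
    },
}

# Two hypothetical shoppers who weigh the facets differently.
cares_about_product = {"category": 1.0, "product": 3.0, "color": 1.0, "pattern": 1.0}
cares_about_look = {"category": 1.0, "product": 1.0, "color": 2.0, "pattern": 2.0}

pick_a = best_match(query, candidates, cares_about_product)
pick_b = best_match(query, candidates, cares_about_look)
```

The same query yields a different "most similar" item under each profile, so there is no single correct ranking to train against.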
compiled a taxonomy of ~1000 fashion attributes
for each attribute, images are sourced from
● retail websites
● social media networks (Pinterest, The Hunt)
● fashion resale networks (Poshmark, Vinted)
Long sleeve silk chiffon shirt-style
dress featuring graphic pattern in
navy, burgundy, and green. Vented
crewneck collar. Gathering at
front and back yokes. Detachable
self-tie fastening at waist.
Two-button barrel cuffs.
Detachable viscose chiffon slip
lining in black. Tonal stitching.
Body: 100% silk. Lining: 100%
viscose. Imported.
+ image-attribute pairs help model learn fashion feature
detectors
- image-attribute pairs are not enough: attribute classifiers
simply compute histograms over visual features
≈
+ image-image pairs allow for unsupervised
learning of similarity
+ captures invariances
images require cleaning
[figure: + and - example images, from [Kiapour et al]]
3M+ images annotated
Thread Genius
unified space of images, products, and metadata
● images/UGC
● products: OPENING CEREMONY $495, ZARA $125
● metadata: varsity jacket, color black, standup collar
● object detector: RPN model trained on bounding box labels
● feature extractor: alternate training on attribute classification and metric learning
● indexing
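A toy sketch of the indexing stage. It uses random hyperplane hashing as a stand-in for an approximate nearest-neighbor index; this is not the API or the algorithm of any particular library, and the class name, parameters, and vectors are hypothetical.

```python
import math
import random
from collections import defaultdict


def _cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))


class HyperplaneIndex:
    """Toy approximate nearest-neighbor index using random hyperplane hashing."""

    def __init__(self, dim, n_planes=8, seed=0):
        rng = random.Random(seed)  # fixed seed so bucket assignment is reproducible
        self.planes = [[rng.gauss(0.0, 1.0) for _ in range(dim)]
                       for _ in range(n_planes)]
        self.buckets = defaultdict(list)
        self.vectors = {}

    def _key(self, vector):
        # One bit per hyperplane: which side of the plane the vector falls on.
        return tuple(
            sum(p * x for p, x in zip(plane, vector)) >= 0.0
            for plane in self.planes
        )

    def add(self, item_id, vector):
        self.vectors[item_id] = vector
        self.buckets[self._key(vector)].append(item_id)

    def query(self, vector, k=5):
        # Only items in the query's bucket are scored. That is what makes the
        # search approximate: true neighbors in other buckets are missed.
        candidates = self.buckets.get(self._key(vector), [])
        return sorted(candidates,
                      key=lambda i: _cosine(self.vectors[i], vector),
                      reverse=True)[:k]


index = HyperplaneIndex(dim=3)
index.add("black varsity jacket", [1.0, 0.0, 0.0])
index.add("opposite direction", [-1.0, 0.0, 0.0])
hits = index.query([2.0, 0.0, 0.0])
```

The trade-off is the usual one for approximate search: a query touches one bucket instead of the whole catalog, at the cost of occasionally missing a true neighbor.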
Tech Stack
● research pipeline: training data, data annotation, model training + experimentation + validation
● deployment pipeline: workflow manager, TG API
● inputs: product inventory, lookbooks, fashion blogs, ...
● libraries: Lasagne, Annoy
● compute + storage + server: TG GPU server
applications
retail
alternative products to sold-out
inventory and pricey items
shop.threadgenius.co
visual marketing
making Instagram
shoppable
+ 2-3x lift in conversion
audience generation
build custom audiences
for specific products
+ 3x increase in CTR
future work
experimental model [beta]
word2vec on collections of images and text
● words represented by embedding / lookup table
● images represented by convnet
Long sleeve silk chiffon shirt-style
dress featuring graphic pattern in
navy, burgundy, and green. Vented
crewneck collar. Gathering at
front and back yokes. Detachable
self-tie fastening at waist.
Two-button barrel cuffs.
Detachable viscose chiffon slip
lining in black. Tonal stitching.
Body: 100% silk. Lining: 100%
viscose. Imported.
document
other
● combined model that directly extracts features of
component apparel items (convnet+RNN)
● refine training of experimental image-text model
demos
experimental model: semantic
arithmetic with images and text
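A toy illustration of semantic arithmetic in a shared image-text space: take an image vector, subtract one word vector, add another, and look up the nearest image. The 4-d vectors are hand-written for illustration; in the experimental model they would come from the convnet and the word lookup table.

```python
import math


def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))


def add(u, v):
    return [a + b for a, b in zip(u, v)]


def sub(u, v):
    return [a - b for a, b in zip(u, v)]


def nearest_image(vector, images, exclude=()):
    """Name of the image embedding closest to `vector` by cosine similarity."""
    names = [n for n in images if n not in exclude]
    return max(names, key=lambda n: cosine(vector, images[n]))


# Hypothetical shared space; dimensions are (jacket, pumps, black, red).
images = {
    "black varsity jacket": [1.0, 0.0, 1.0, 0.0],
    "red varsity jacket": [1.0, 0.0, 0.0, 1.0],
    "red pumps": [0.0, 1.0, 0.0, 1.0],
}
words = {
    "black": [0.0, 0.0, 1.0, 0.0],
    "red": [0.0, 0.0, 0.0, 1.0],
}

# image("black varsity jacket") - word("black") + word("red")
query = add(sub(images["black varsity jacket"], words["black"]), words["red"])
result = nearest_image(query, images, exclude=("black varsity jacket",))
```

The query image is excluded from the lookup, as is usual for analogy-style queries, so the result reflects the edit rather than the starting point.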
robo-Bill Cunningham
Questions?
we’re hiring for backend
and ML roles
ahmad@threadgenius.co
