VARIATIONAL DISCRIMINATOR BOTTLENECK:
IMPROVING IMITATION LEARNING, INVERSE RL, AND GANS BY
CONSTRAINING INFORMATION FLOW
Yawei Luo
The Notorious Problem with D & G
D can always pick up on nonessential information in G(z) to judge a sample as “fake”.
-> Uninformative gradients -> Unstable training!
How can we force D to focus on the essential information in G(z)?
Preliminaries
• Mutual Information
• Objective function from an information-theoretic view
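As a concrete illustration of mutual information, the sketch below computes I(X; Z) for a small discrete joint distribution, using the standard definition as the KL divergence between the joint p(x, z) and the product of its marginals p(x)p(z). The function name `mutual_information` and the two toy tables are assumptions made for this example, not material from the slides.

```python
import math

def mutual_information(joint):
    """Mutual information I(X; Z) in nats for a discrete joint table p(x, z).

    joint[i][j] is p(X = i, Z = j); entries are non-negative and sum to 1.
    """
    px = [sum(row) for row in joint]          # marginal p(x)
    pz = [sum(col) for col in zip(*joint)]    # marginal p(z)
    mi = 0.0
    for i, row in enumerate(joint):
        for j, pxz in enumerate(row):
            if pxz > 0.0:                     # 0 * log 0 is taken as 0
                mi += pxz * math.log(pxz / (px[i] * pz[j]))
    return mi

# Toy tables (hypothetical): Z independent of X, and Z an exact copy of X.
independent = [[0.25, 0.25], [0.25, 0.25]]
exact_copy = [[0.5, 0.0], [0.0, 0.5]]

mi_independent = mutual_information(independent)
mi_copy = mutual_information(exact_copy)
```

When Z carries no information about X the value is zero; when Z is an exact copy of a fair binary X it equals log 2 nats, the full entropy of X.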
Preliminaries
• Information Bottleneck
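The information bottleneck can be written as a constrained objective. The block below is a sketch in the usual variational notation, assuming a stochastic encoder E(z|x), a decoder q(y|z), a fixed prior r(z) such as a standard Gaussian, and an information budget I_c; these symbols follow common usage and should be checked against the paper.

```latex
% Constrained objective: predict y from z while limiting what z retains of x
J(q, E) = \min_{q, E} \;
  \mathbb{E}_{x, y \sim p(x, y)}
  \Big[ \mathbb{E}_{z \sim E(z \mid x)} \big[ -\log q(y \mid z) \big] \Big]
  \quad \text{s.t.} \quad I(X, Z) \le I_c

% Variational upper bound that makes the constraint tractable
I(X, Z) \le
  \mathbb{E}_{x \sim p(x)}
  \Big[ \mathrm{KL}\big[ E(z \mid x) \,\|\, r(z) \big] \Big]

% Lagrangian form with multiplier beta >= 0
\min_{q, E} \max_{\beta \ge 0} \;
  \mathbb{E}_{x, y \sim p(x, y)}
  \Big[ \mathbb{E}_{z \sim E(z \mid x)} \big[ -\log q(y \mid z) \big] \Big]
  + \beta \Big(
      \mathbb{E}_{x \sim p(x)}
      \Big[ \mathrm{KL}\big[ E(z \mid x) \,\|\, r(z) \big] \Big] - I_c
    \Big)
```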
Preliminaries
q: decoder
E: encoder
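A minimal sketch of the encoder side, assuming E(z|x) is a diagonal Gaussian and the prior r(z) is a standard normal (a common choice, taken here as an assumption). The values of `mu` and `log_var` are hypothetical placeholders for what an encoder network would output for one input x.

```python
import math
import random

def sample_latent(mu, log_var, rng):
    """Reparameterised sample z = mu + sigma * eps, with eps ~ N(0, I)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]

def kl_to_standard_normal(mu, log_var):
    """KL[N(mu, diag(exp(log_var))) || N(0, I)] in nats, closed form."""
    return 0.5 * sum(m * m + math.exp(lv) - lv - 1.0
                     for m, lv in zip(mu, log_var))

rng = random.Random(0)                    # fixed seed for repeatability
mu, log_var = [0.5, -1.0], [0.0, 0.0]     # hypothetical encoder outputs
z = sample_latent(mu, log_var, rng)       # latent code passed to the decoder q
kl = kl_to_standard_normal(mu, log_var)   # per-sample bottleneck penalty
```

Averaging this KL term over a batch gives the quantity that upper-bounds I(X, Z) and is compared against the budget Ic.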
Back to GANs
Vanilla GAN:
GAN with VIB:
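For reference, the two objectives can be written as follows. This is a sketch in standard notation, assuming p*(x) is the data distribution, G(x) the distribution of generator samples, p̃ = ½p* + ½G their mixture, and r(z) a fixed prior; the discriminator D acts on x in the vanilla case and on the encoded z in the bottlenecked case.

```latex
% Vanilla GAN
\max_{G} \min_{D} \;
  \mathbb{E}_{x \sim p^{*}(x)} \big[ -\log D(x) \big]
  + \mathbb{E}_{x \sim G(x)} \big[ -\log \big( 1 - D(x) \big) \big]

% GAN with a variational information bottleneck on the discriminator
J(D, E) = \min_{D, E} \max_{\beta \ge 0} \;
  \mathbb{E}_{x \sim p^{*}(x)}
    \Big[ \mathbb{E}_{z \sim E(z \mid x)} \big[ -\log D(z) \big] \Big]
  + \mathbb{E}_{x \sim G(x)}
    \Big[ \mathbb{E}_{z \sim E(z \mid x)} \big[ -\log \big( 1 - D(z) \big) \big] \Big]
  + \beta \Big(
      \mathbb{E}_{x \sim \tilde{p}(x)}
      \Big[ \mathrm{KL}\big[ E(z \mid x) \,\|\, r(z) \big] \Big] - I_c
    \Big)
```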
Training
I(X, Z) > Ic -> increase beta (tighten the bottleneck)
I(X, Z) < Ic -> decrease beta (relax the bottleneck)
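The rule above can be sketched as a single dual-ascent step on beta. The function name `update_beta`, the step size, and the numbers below are illustrative assumptions; `kl_estimate` stands for a batch estimate of the KL upper bound on I(X, Z).

```python
def update_beta(beta, kl_estimate, i_c, step_size):
    """One dual ascent step on beta, projected back onto beta >= 0."""
    return max(0.0, beta + step_size * (kl_estimate - i_c))

beta = 0.1
# Estimated information above the budget: beta goes up.
beta_up = update_beta(beta, kl_estimate=0.8, i_c=0.5, step_size=0.1)
# Estimated information below the budget: beta goes down.
beta_down = update_beta(beta, kl_estimate=0.2, i_c=0.5, step_size=0.1)
# A large negative step is clipped at zero.
beta_floor = update_beta(beta, kl_estimate=0.0, i_c=5.0, step_size=1.0)
```

In a training loop this step would alternate with the gradient updates of the discriminator and encoder, so beta tracks how tightly the constraint currently binds.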
Experiments - IMITATION LEARNING
Experiments - INVERSE REINFORCEMENT LEARNING
Experiments - IMAGE GENERATION
