The Frontier of Deep Learning in
2020 and Beyond
Recent advances, Trends and Opportunities
Bhav Ashok
#ISSLearningFest
Overview
1. Brief history of Deep Learning
2. Trends
1. Move to free data
2. Learn everything
3. Do more with less
3. Future of Deep Learning
1. GPT-3 (Generative-Pretraining-3)
2. NeRF (Neural Radiance Fields)
#ISSLearningFest
Brief history of Deep Learning
How it all began
#ISSLearningFest
History of deep learning
#ISSLearningFest
2012 2016 2020< 2012
Classical
ML
History of deep learning
#ISSLearningFest
2012 2016 2020< 2012
Large-scale
Deep Learning
ResNet-101
(2015)
VGG-16
(2014)
AlexNet
(2012)
Classical
ML
2012-2016
• GPU training of deep CNNs
(Convolutional Neural Network)
− AlexNet wins ImageNet 2012
− First superhuman performance on ImageNet
• Deeper CNNs improve performance
• More innovation in CNNs
− VGGNet in 2014
− ResNet wins ImageNet 2015
• Birth of large-scale deep learning
#ISSLearningFest
History of deep learning
#ISSLearningFest
2012 2016 2020< 2012
(2019)
Transformers
(2018)
NAS/AutoML
(2017)
AlphaGO
(2016)
Large-scale
Deep Learning
ResNet-101
(2015)
VGG-16
(2014)
AlexNet
(2012) Realistic
GANs
(2019)
Classical
ML
2016-2020
• Trend 1: Move to free data
• Trend 2: Learn everything (AutoML)
• Trend 3: Do more with less data
• Deep Learning scales on other tasks
− Language models: Transformers
− Speech synthesis: Tacotron
− Deep RL: Alpha GO
− Generative models: Hyper-realistic GANs (Generative Adversarial
Networks)
#ISSLearningFest
History of deep learning
#ISSLearningFest
2012 2016 2020< 2012
GPT-3
(2020)
(2019)
Transformers
(2018)
NAS/AutoML
(2017)
AlphaGO
(2016)
Self-driving
AR
Large-scale
Deep Learning
ResNet-101
(2015)
VGG-16
(2014)
AlexNet
(2012) Realistic
GANs
(2019)
Classical
ML
2020+
• Previous success in deep learning mostly in controlled environments
− Web
− Games (video and board games)
− Academic benchmarks
• Real world applications start to mature
− Self-driving, GPT-3, AR
• Promising research in 3D vision
• Few shot learning and domain adaptation.
#ISSLearningFest
Trend 1: Free data
The push for more annotated data at a cheaper cost
#ISSLearningFest
Motivation
• Accuracy scales with data
− Unreasonable effectiveness of data revisited
(2017)
• Problem: Labeling data is expensive
− Requires human annotators
− Around $0.50 - $10 per image
• Problem: User privacy
− Majority of training data is generated by consumers
− GDPR (2016-2018) #ISSLearningFest
Technological advances
• Techniques in improving data labelling efficiency
• Knowledge distillation
• Machines label data for you
• 3D to 2D supervision
• Label in 3D, $$ in 2D
• Synthetic data
• Generate data and labels synthetically
#ISSLearningFest
Knowledge distillation
• Use a pre-trained “Teacher” model to label unlabeled data
• Train a “Student” model using newly annotated dataset
• Machines annotate data instead of humans
− Promise:
− Free labels on real images
− Problems:
− Might be noisy and reinforce errors present in Teacher model
− Recent advances in self-distillation, Multi-Teacher distillation,
Human in the loop (active learning)
#ISSLearningFest
Teacher
Student
Fig: “Teacher” distills knowledge into “Student”
Image
Label
3D to 2D supervision
• Reconstruct scene, annotate in 3D, profit in 2D
− Cheaper supervision on real 2D images
• Problems
⚫ Depends on quality of 3D reconstruction of scene.
⚫ 3D annotation is expensive
⚫ though total cost may still be cheaper than 2D annotation.
#ISSLearningFest
Rendered annotations
w/ real image.
Source:
http://www.scan-net.org/
Synthetic data (Sim2Real)
• Create environments and get annotation for free.
• Free annotations and unlimited variation for cheap
• Problems
⚫ Difficulty in generalizing to real world domain
⚫ Some human input required to generate useful simulations
#ISSLearningFest
Rendered
Image
Rendered
Depth map
Rendered
Segmentation
annotation
Map of scene
Source: https://github.com/facebookresearch/House3D
Source: https://venturebeat.com/2020/07/17/why-unity-claims-synthetic-data-sets-can-improve-computer-vision-
models/
Trend 2: Learn everything
From data to architectures and optimizers (AutoML).
#ISSLearningFest
Learn everything
• Classical paradigm of Deep Learning
• Optimizing all parts of the stack
− Architectures
− Optimizers
− Data augmentation
− Learning schedules and more
• Also known as AutoML or “Learning to learn”
#ISSLearningFest
Network
Loss
function
Data
Data
augmentation
Optimizer
Classical paradigm of Deep Learning
Architectures
• People previously believed that human intuition was essential in
architecture design
• Early architectures followed intuitions from pattern recognition
• Reinforcement Learning, Supervised Learning and Evolutionary
Algorithms are commonly used in AutoML
• Neural Architecture Search (ICLR 2018)
• Used over 800 GPUs
#ISSLearningFest
Neural Architecture Search
• Idea: Use reinforcement learning to train a neural network to create a
high performing neural network
• Reward function: Accuracy of generated architecture
• Result: RL agent learns to generate architectures
that produce high accuracy on the task.
#ISSLearningFest
Example of generated architecture
Trend 3: Do more with less
Semi-supervised learning, self-attention, self-play and more
#ISSLearningFest
Do more with less
⚫ Self-training
⚫ Learn more from unlabeled data.
⚫ Self-play
⚫ AIs compete amongst themselves to improve.
⚫ Self-attention
⚫ Learn better associations within input data
#ISSLearningFest
Self-training
• Self-training with noisy student
− Idea: perform self-distillation on unlabeled data but add noise
during training to increase robustness of model.
− Current state of the Art on ImageNet
#ISSLearningFest
Model
Step 1: Label example using current model
“Cat”
Step 2: Train with noisy example
+ noise
Model
Train with label
from previous step
Self-play
• AIs compete to maximize Reinforcement Learning reward function
• Vital to breakthroughs in
− AlphaGO
− OpenAI Five
#ISSLearningFest AlphaGo vs. AlphaGo Match 41OpenAI Five Self-Play
Self-attention
• Transformers introduced in paper “Attention is all you need”
− Breakthrough in language models
− State of the Art on multiple language tasks
• Example
• “The animal didn't cross the street because it was too tired”
• What does “it” refer to? Animal or Street?
• Self-attention allows model to look at entire sentence and
form associations by training on language understanding tasks.
#ISSLearningFest
Source: jalammar.github.io/illustrated-transformer
Future of Deep Learning
Expanding to the real world
#ISSLearningFest
Real world deep learning
• Real world is
• Complex
• Physical space is in 3D
• Applications need to be
• Robust and adapt to changing environments
• Understand 3D if deployed in physical world
• Research
• Domain adaptation, few-shot learning
• 3D scene understanding
#ISSLearningFest
Recent breakthroughs
• GPT-3 (Generative Pre-Training 3 - 2020)
• Adapts to environment using few-shot learning.
• Generalizes surprisingly well to many useful applications.
• NeRF (Neural Radiance Fields - 2020)
• Able to reconstruct and model 3D environments completely within a
single neural network.
• Qualitative results much better than classical reconstruction methods
on real world data.
#ISSLearningFest
Generative Pre-Training-3 (GPT-3)
• Based on transformer model
• Uses pretraining and adapts to environment using few-shot learning.
• Very very large scale
• 175 Billion parameters (>10x more than any previous model)
• Trained using over a trillion words
• Cost US$ 12 million to train
#ISSLearningFest
GPT-3 - Writing code
#ISSLearningFest
GPT-3 – Designing UI
#ISSLearningFest
Source:
https://twitter.com/jsngr/status/1284511080715362304
GPT-3 – Learning
#ISSLearningFestSource: https://learnfromanyone.com/
GPT-3 – Other applications
#ISSLearningFestSource: https://learnfromanyone.com/
• Writing stories
• Psychotherapy
• Food recipes
• Medical diagnosis
• Compose music
• And more
Neural Radiance Fields (NeRF)
• Training data: Sparse set of images + viewpoints
• Note: viewpoints can be recovered using traditional pose estimation
techniques so really, this needs only a set of images.
• Result: 3D reconstruction and view dependent rendering
• Learns a function that maps rays passing through images to rgb + density
#ISSLearningFest
Source: https://www.matthewtancik.com/nerf
Viewing
position +
angle
Image
pixels +
density
NeRF
• Photorealistic rendering that accounts for lighting/materials/viewpoint.
#ISSLearningFest
NeRF – photorealistic rendering
#ISSLearningFest
Example of classical reconstruction + rendering
Source: youtube.com/watch?v=OsZvBEkJ6Vg
NeRF rendering
Source: https://www.matthewtancik.com/nerf
NeRF - results
#ISSLearningFest
Rendering with specularities Result with fixed viewing position but varying
viewing angle
Source: https://www.matthewtancik.com/nerf
NeRF - results
#ISSLearningFest
Photorealistic rendering Models fine physical structures
Source: https://www.matthewtancik.com/nerf
NeRF
• Photorealistic rendering that accounts for lighting/materials/viewpoint
• Works in the real world
#ISSLearningFest
NeRF – Real world results
#ISSLearningFest
Brandenburg GateTrevi Fountain
Source: https://nerf-w.github.io/
NeRF
• Photorealistic rendering that accounts for lighting/materials/viewpoint
• Works in the real world
• Learns a neural representation of the 3D scene
• Useful for varying arbitrary quantities. (e.g. lighting)
• Useful for multi-task learning.
#ISSLearningFest
NeRF - Relighting
#ISSLearningFestSource: https://nerf-w.github.io/
Thank You!
bhav@nuronlabs.com
#ISSLearningFest
The Frontier of Deep Learning in 2020 and Beyond

More Related Content

PDF
The (very) basics of AI for the Radiology resident
PDF
"Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Pr...
PPTX
The Convergence of HPC and Deep Learning
PDF
Data Science Training | Data Science Tutorial | Data Science Certification | ...
PDF
Machine Learning and its Applications
PDF
Ml intro
PPTX
Machine Learning 2 deep Learning: An Intro
PDF
Evolution and Trends in Edge AI Systems and Architectures for the Internet of...
The (very) basics of AI for the Radiology resident
"Enabling Ubiquitous Visual Intelligence Through Deep Learning," a Keynote Pr...
The Convergence of HPC and Deep Learning
Data Science Training | Data Science Tutorial | Data Science Certification | ...
Machine Learning and its Applications
Ml intro
Machine Learning 2 deep Learning: An Intro
Evolution and Trends in Edge AI Systems and Architectures for the Internet of...

What's hot (20)

PDF
Full-Stack Data Science: How to be a One-person Data Team
PPTX
Leveraging Open Source Automated Data Science Tools
PDF
AI and Deep Learning
PPTX
Application of machine learning in industrial applications
PDF
Introduction To Machine Learning | Edureka
PPTX
Machine Learning with GraphLab Create
PDF
10 Lessons Learned from Building Machine Learning Systems
PDF
H2O World - Intro to Data Science with Erin Ledell
PPTX
Vertical is the New Horizontal - MinneAnalytics 2016 Sri Ambati Keynote on AI
PPTX
AI in the Enterprise at Scale
PDF
AI for Software Engineering
PDF
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
PPTX
9/9/16 Top 5 Deep Learning
PPTX
IBM Watson
PDF
AI in healthcare and Automobile Industry using OpenPOWER/IBM POWER9 systems
PPTX
9/23 Top 5 Deep Learning
PDF
Neural network
PDF
“From Inference to Action: AI Beyond Pattern Recognition,” a Keynote Presenta...
PDF
Top 10 Big Data Technologies | Edureka
PPTX
Carmelo Iaria, AI Academy - How The AI Academy is accelerating NLP projects w...
Full-Stack Data Science: How to be a One-person Data Team
Leveraging Open Source Automated Data Science Tools
AI and Deep Learning
Application of machine learning in industrial applications
Introduction To Machine Learning | Edureka
Machine Learning with GraphLab Create
10 Lessons Learned from Building Machine Learning Systems
H2O World - Intro to Data Science with Erin Ledell
Vertical is the New Horizontal - MinneAnalytics 2016 Sri Ambati Keynote on AI
AI in the Enterprise at Scale
AI for Software Engineering
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
9/9/16 Top 5 Deep Learning
IBM Watson
AI in healthcare and Automobile Industry using OpenPOWER/IBM POWER9 systems
9/23 Top 5 Deep Learning
Neural network
“From Inference to Action: AI Beyond Pattern Recognition,” a Keynote Presenta...
Top 10 Big Data Technologies | Edureka
Carmelo Iaria, AI Academy - How The AI Academy is accelerating NLP projects w...
Ad

Similar to The Frontier of Deep Learning in 2020 and Beyond (20)

PDF
Introduction to the Artificial Intelligence and Computer Vision revolution
PDF
SOCIAL DISTANCING MONITORING IN COVID-19 USING DEEP LEARNING
PDF
Empirical AI Research
PDF
Introduction to Deep Learning: Concepts, Architectures, and Applications
PPTX
Deep Learning Jump Start
PDF
Deep learning applications - for fun and profit
PDF
Deep learning and reasoning: Recent advances
PDF
Learning visual representation without human label
PPTX
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
PDF
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
PDF
Directions in machine learning Ceadar webinar
PDF
Brief History of Visual Representation Learning
PDF
Deep learning 1.0 and Beyond, Part 2
PDF
Deep Learning Class #0 - You Can Do It
PDF
DL Classe 0 - You can do it
PDF
Transformers in 2021
PPTX
Deep Learning Explained: The future of Artificial Intelligence and Smart Netw...
PPTX
An Incomplete Introduction to Artificial Intelligence
PDF
Deep Learning for IoT : is there a shallow end of the pool?
PPTX
GDSC Introduction to Deep Learning Workshop
Introduction to the Artificial Intelligence and Computer Vision revolution
SOCIAL DISTANCING MONITORING IN COVID-19 USING DEEP LEARNING
Empirical AI Research
Introduction to Deep Learning: Concepts, Architectures, and Applications
Deep Learning Jump Start
Deep learning applications - for fun and profit
Deep learning and reasoning: Recent advances
Learning visual representation without human label
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
Directions in machine learning Ceadar webinar
Brief History of Visual Representation Learning
Deep learning 1.0 and Beyond, Part 2
Deep Learning Class #0 - You Can Do It
DL Classe 0 - You can do it
Transformers in 2021
Deep Learning Explained: The future of Artificial Intelligence and Smart Netw...
An Incomplete Introduction to Artificial Intelligence
Deep Learning for IoT : is there a shallow end of the pool?
GDSC Introduction to Deep Learning Workshop
Ad

More from NUS-ISS (20)

PDF
Designing Impactful Services and User Experience - Lim Wee Khee
PDF
Upskilling the Evolving Workforce with Digital Fluency for Tomorrow's Challen...
PDF
The Importance of Cybersecurity for Digital Transformation
PDF
Architecting CX Measurement Frameworks and Ensuring CX Metrics are fit for Pu...
PDF
Understanding GenAI/LLM and What is Google Offering - Felix Goh
PDF
Digital Product-Centric Enterprise and Enterprise Architecture - Tan Eng Tsze
PDF
Emerging & Future Technology - How to Prepare for the Next 10 Years of Radica...
PDF
Beyond the Hype: What Generative AI Means for the Future of Work - Damien Cum...
PDF
Supply Chain Security for Containerised Workloads - Lee Chuk Munn
PDF
Future of Learning - Yap Aye Wee.pdf
PDF
Future of Learning - Khoong Chan Meng
PPTX
Site Reliability Engineer (SRE), We Keep The Lights On 24/7
PDF
Product Management in The Trenches for a Cloud Service
PDF
Overview of Data and Analytics Essentials and Foundations
PDF
Predictive Analytics
PDF
Feature Engineering for IoT
PDF
Master of Technology in Software Engineering
PDF
Master of Technology in Enterprise Business Analytics
PDF
Diagnosing Complex Problems Using System Archetypes
PPTX
Satisfying the ‘-ilities’ of an Enterprise Cloud Service
Designing Impactful Services and User Experience - Lim Wee Khee
Upskilling the Evolving Workforce with Digital Fluency for Tomorrow's Challen...
The Importance of Cybersecurity for Digital Transformation
Architecting CX Measurement Frameworks and Ensuring CX Metrics are fit for Pu...
Understanding GenAI/LLM and What is Google Offering - Felix Goh
Digital Product-Centric Enterprise and Enterprise Architecture - Tan Eng Tsze
Emerging & Future Technology - How to Prepare for the Next 10 Years of Radica...
Beyond the Hype: What Generative AI Means for the Future of Work - Damien Cum...
Supply Chain Security for Containerised Workloads - Lee Chuk Munn
Future of Learning - Yap Aye Wee.pdf
Future of Learning - Khoong Chan Meng
Site Reliability Engineer (SRE), We Keep The Lights On 24/7
Product Management in The Trenches for a Cloud Service
Overview of Data and Analytics Essentials and Foundations
Predictive Analytics
Feature Engineering for IoT
Master of Technology in Software Engineering
Master of Technology in Enterprise Business Analytics
Diagnosing Complex Problems Using System Archetypes
Satisfying the ‘-ilities’ of an Enterprise Cloud Service

Recently uploaded (20)

PPTX
The various Industrial Revolutions .pptx
PDF
Unlock new opportunities with location data.pdf
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PDF
A review of recent deep learning applications in wood surface defect identifi...
PDF
STKI Israel Market Study 2025 version august
PDF
Five Habits of High-Impact Board Members
PDF
Getting Started with Data Integration: FME Form 101
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PDF
August Patch Tuesday
PDF
Architecture types and enterprise applications.pdf
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PDF
DP Operators-handbook-extract for the Mautical Institute
PPTX
Web Crawler for Trend Tracking Gen Z Insights.pptx
PDF
sustainability-14-14877-v2.pddhzftheheeeee
The various Industrial Revolutions .pptx
Unlock new opportunities with location data.pdf
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Hindi spoken digit analysis for native and non-native speakers
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
A review of recent deep learning applications in wood surface defect identifi...
STKI Israel Market Study 2025 version august
Five Habits of High-Impact Board Members
Getting Started with Data Integration: FME Form 101
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
August Patch Tuesday
Architecture types and enterprise applications.pdf
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
Zenith AI: Advanced Artificial Intelligence
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
DP Operators-handbook-extract for the Mautical Institute
Web Crawler for Trend Tracking Gen Z Insights.pptx
sustainability-14-14877-v2.pddhzftheheeeee

The Frontier of Deep Learning in 2020 and Beyond

  • 1. The Frontier of Deep Learning in 2020 and Beyond Recent advances, Trends and Opportunities Bhav Ashok #ISSLearningFest
  • 2. Overview 1. Brief history of Deep Learning 2. Trends 1. Move to free data 2. Learn everything 3. Do more with less 3. Future of Deep Learning 1. GPT-3 (Generative-Pretraining-3) 2. NeRF (Neural Radiance Fields) #ISSLearningFest
  • 3. Brief history of Deep Learning How it all began #ISSLearningFest
  • 4. History of deep learning #ISSLearningFest 2012 2016 2020< 2012 Classical ML
  • 5. History of deep learning #ISSLearningFest 2012 2016 2020< 2012 Large-scale Deep Learning ResNet-101 (2015) VGG-16 (2014) AlexNet (2012) Classical ML
  • 6. 2012-2016 • GPU training of deep CNNs (Convolutional Neural Network) − AlexNet wins ImageNet 2012 − First superhuman performance on ImageNet • Deeper CNNs improve performance • More innovation in CNNs − VGGNet in 2014 − ResNet wins ImageNet 2015 • Birth of large-scale deep learning #ISSLearningFest
  • 7. History of deep learning #ISSLearningFest 2012 2016 2020< 2012 (2019) Transformers (2018) NAS/AutoML (2017) AlphaGO (2016) Large-scale Deep Learning ResNet-101 (2015) VGG-16 (2014) AlexNet (2012) Realistic GANs (2019) Classical ML
  • 8. 2016-2020 • Trend 1: Move to free data • Trend 2: Learn everything (AutoML) • Trend 3: Do more with less data • Deep Learning scales on other tasks − Language models: Transformers − Speech synthesis: Tacotron − Deep RL: Alpha GO − Generative models: Hyper-realistic GANs (Generative Adversarial Networks) #ISSLearningFest
  • 9. History of deep learning #ISSLearningFest 2012 2016 2020< 2012 GPT-3 (2020) (2019) Transformers (2018) NAS/AutoML (2017) AlphaGO (2016) Self-driving AR Large-scale Deep Learning ResNet-101 (2015) VGG-16 (2014) AlexNet (2012) Realistic GANs (2019) Classical ML
  • 10. 2020+ • Previous success in deep learning mostly in controlled environments − Web − Games (video and board games) − Academic benchmarks • Real world applications start to mature − Self-driving, GPT-3, AR • Promising research in 3D vision • Few shot learning and domain adaptation. #ISSLearningFest
  • 11. Trend 1: Free data The push for more annotated data at a cheaper cost #ISSLearningFest
  • 12. Motivation • Accuracy scales with data − Unreasonable effectiveness of data revisited (2017) • Problem: Labeling data is expensive − Requires human annotators − Around $0.50 - $10 per image • Problem: User privacy − Majority of training data is generated by consumers − GDPR (2016-2018) #ISSLearningFest
  • 13. Technological advances • Techniques in improving data labelling efficiency • Knowledge distillation • Machines label data for you • 3D to 2D supervision • Label in 3D, $$ in 2D • Synthetic data • Generate data and labels synthetically #ISSLearningFest
  • 14. Knowledge distillation • Use a pre-trained “Teacher” model to label unlabeled data • Train a “Student” model using newly annotated dataset • Machines annotate data instead of humans − Promise: − Free labels on real images − Problems: − Might be noisy and reinforce errors present in Teacher model − Recent advances in self-distillation, Multi-Teacher distillation, Human in the loop (active learning) #ISSLearningFest Teacher Student Fig: “Teacher” distills knowledge into “Student” Image Label
  • 15. 3D to 2D supervision • Reconstruct scene, annotate in 3D, profit in 2D − Cheaper supervision on real 2D images • Problems ⚫ Depends on quality of 3D reconstruction of scene. ⚫ 3D annotation is expensive ⚫ though total cost may still be cheaper than 2D annotation. #ISSLearningFest
  • 16. Rendered annotations w/ real image. Source: http://www.scan-net.org/
  • 17. Synthetic data (Sim2Real) • Create environments and get annotation for free. • Free annotations and unlimited variation for cheap • Problems ⚫ Difficulty in generalizing to real world domain ⚫ Some human input required to generate useful simulations #ISSLearningFest
  • 18. Rendered Image Rendered Depth map Rendered Segmentation annotation Map of scene Source: https://github.com/facebookresearch/House3D
  • 20. Trend 2: Learn everything From data to architectures and optimizers (AutoML). #ISSLearningFest
  • 21. Learn everything • Classical paradigm of Deep Learning • Optimizing all parts of the stack − Architectures − Optimizers − Data augmentation − Learning schedules and more • Also known as AutoML or “Learning to learn” #ISSLearningFest Network Loss function Data Data augmentation Optimizer Classical paradigm of Deep Learning
  • 22. Architectures • People previously believed that human intuition was essential in architecture design • Early architectures followed intuitions from pattern recognition • Reinforcement Learning, Supervised Learning and Evolutionary Algorithms are commonly used in AutoML • Neural Architecture Search (ICLR 2018) • Used over 800 GPUs #ISSLearningFest
  • 23. Neural Architecture Search • Idea: Use reinforcement learning to train a neural network to create a high performing neural network • Reward function: Accuracy of generated architecture • Result: RL agent learns to generate architectures that produce high accuracy on the task. #ISSLearningFest Example of generated architecture
  • 24. Trend 3: Do more with less Semi-supervised learning, self-attention, self-play and more #ISSLearningFest
  • 25. Do more with less ⚫ Self-training ⚫ Learn more from unlabeled data. ⚫ Self-play ⚫ AIs compete amongst themselves to improve. ⚫ Self-attention ⚫ Learn better associations within input data #ISSLearningFest
  • 26. Self-training • Self-training with noisy student − Idea: perform self-distillation on unlabeled data but add noise during training to increase robustness of model. − Current state of the Art on ImageNet #ISSLearningFest Model Step 1: Label example using current model “Cat” Step 2: Train with noisy example + noise Model Train with label from previous step
  • 27. Self-play • AIs compete to maximize Reinforcement Learning reward function • Vital to breakthroughs in − AlphaGO − OpenAI Five #ISSLearningFest AlphaGo vs. AlphaGo Match 41OpenAI Five Self-Play
  • 28. Self-attention • Transformers introduced in paper “Attention is all you need” − Breakthrough in language models − State of the Art on multiple language tasks • Example • “The animal didn't cross the street because it was too tired” • What does “it” refer to? Animal or Street? • Self-attention allows model to look at entire sentence and form associations by training on language understanding tasks. #ISSLearningFest Source: jalammar.github.io/illustrated-transformer
  • 29. Future of Deep Learning Expanding to the real world #ISSLearningFest
  • 30. Real world deep learning • Real world is • Complex • Physical space is in 3D • Applications need to be • Robust and adapt to changing environments • Understand 3D if deployed in physical world • Research • Domain adaptation, few-shot learning • 3D scene understanding #ISSLearningFest
  • 31. Recent breakthroughs • GPT-3 (Generative Pre-Training 3 - 2020) • Adapts to environment using few-shot learning. • Generalizes surprisingly well to many useful applications. • NeRF (Neural Radiance Fields - 2020) • Able to reconstruct and model 3D environments completely within a single neural network. • Qualitative results much better than classical reconstruction methods on real world data. #ISSLearningFest
  • 32. Generative Pre-Training-3 (GPT-3) • Based on transformer model • Uses pretraining and adapts to environment using few-shot learning. • Very very large scale • 175 Billion parameters (>10x more than any previous model) • Trained using over a trillion words • Cost US$ 12 million to train #ISSLearningFest
  • 33. GPT-3 - Writing code #ISSLearningFest
  • 34. GPT-3 – Designing UI #ISSLearningFest Source: https://twitter.com/jsngr/status/1284511080715362304
  • 35. GPT-3 – Learning #ISSLearningFestSource: https://learnfromanyone.com/
  • 36. GPT-3 – Other applications #ISSLearningFestSource: https://learnfromanyone.com/ • Writing stories • Psychotherapy • Food recipes • Medical diagnosis • Compose music • And more
  • 37. Neural Radiance Fields (NeRF) • Training data: Sparse set of images + viewpoints • Note: viewpoints can be recovered using traditional pose estimation techniques so really, this needs only a set of images. • Result: 3D reconstruction and view dependent rendering • Learns a function that maps rays passing through images to rgb + density #ISSLearningFest Source: https://www.matthewtancik.com/nerf Viewing position + angle Image pixels + density
  • 38. NeRF • Photorealistic rendering that accounts for lighting/materials/viewpoint. #ISSLearningFest
  • 39. NeRF – photorealistic rendering #ISSLearningFest Example of classical reconstruction + rendering Source: youtube.com/watch?v=OsZvBEkJ6Vg NeRF rendering Source: https://www.matthewtancik.com/nerf
  • 40. NeRF - results #ISSLearningFest Rendering with specularities Result with fixed viewing position but varying viewing angle Source: https://www.matthewtancik.com/nerf
  • 41. NeRF - results #ISSLearningFest Photorealistic rendering Models fine physical structures Source: https://www.matthewtancik.com/nerf
  • 42. NeRF • Photorealistic rendering that accounts for lighting/materials/viewpoint • Works in the real world #ISSLearningFest
  • 43. NeRF – Real world results #ISSLearningFest Brandenburg GateTrevi Fountain Source: https://nerf-w.github.io/
  • 44. NeRF • Photorealistic rendering that accounts for lighting/materials/viewpoint • Works in the real world • Learns a neural representation of the 3D scene • Useful for varying arbitrary quantities. (e.g. lighting) • Useful for multi-task learning. #ISSLearningFest
  • 45. NeRF - Relighting #ISSLearningFestSource: https://nerf-w.github.io/