SlideShare a Scribd company logo
Soumith Chintala
Facebook AI Research
A dynamic view of the deep learning world
Overview
• What is Torch?
• The Community
• Today’s AI
• What Next?
What is ?
• Interactive Scientific computing framework
What is ?
• Interactive Scientific computing framework
What is ?
• Similar to Matlab / Python+Numpy
What is ?
• Easy integration into and from C
• Example: using CuDNN functions
What is ?
• Strong GPU support
Neural Networks
• nn: neural networks made easy
• building blocks of differentiable modules
autograd by
• Write imperative programs
• Backprop defined for every operation in the language
Distributed Learning
• Multi-GPU and Multi-Node
• distlearn by
• MPI-like model
• scales to a large amount of nodes
Core Philosophy
• Interactive computing
• No compilation time
• Imperative programming
• Easy to understand, think and debug
• Minimal abstraction
• Thinking linearly
• Maximal Flexibility
• No constraints on interfaces or classes
Community
Community
Community
Community
Community
Community
Community
Community
Today’s AI
• Reasonably solved to be used in production
Today’s AI
• Classification and Detection
• in images, videos, volumetric data, sparse datasets
DenseCap: Johnson et. al. https://github.com/jcjohnson/densecap
Today’s AI
• Segmentation
DeepMask: Pinhero et. al.
Today’s AI
• Text Classification (sentiment analysis etc.)
• Text Embeddings
• Graph embeddings
• Machine Translation
• Ads ranking
Today’s AI
static datasets + static model structure
Today’s AI
static datasets + static model structure
model does not change over training time
Today’s AI
static datasets + static model structure
model does not change over training time
offline learning
Active and Future AI
Research
• Agents training in different and new environments
Active and Future AI
Research
• RL Agents in different and new environments
• Example: OpenAI Universe
Active and Future AI
Research
• RL Agents in different and new environments
• Example: OpenAI Universe
• Online Learning
• Example: self-driving cars
Active and Future AI
Research
• RL Agents in different and new environments
• Example: OpenAI Universe
• Online Learning
• Example: self-driving cars
• Dynamic Neural Networks
• self-adding new memory or layers
• changing evaluation path based on inputs
Active and Future AI
Research
• RL Agents in different and new environments
• Example: OpenAI Universe
• Online Learning
• Example: self-driving cars
• Dynamic Neural Networks
• self-adding new memory or layers
• changing evaluation path based on inputs
• Structured Prediction
• Viterbi style decoders
A Next-Gen Framework
• Interop with environments
• Like Universe, VizDoom, etc.
• Easy data loaders
• Rich scientific ecosystem (such as SciPy)
• Neural Networks are not the only methods to be used
• Dynamic Neural Networks
• with no specific or static structure
• Minimal Abstractions
• better debugging and low-level programming in the hands of researcher
• Graph Compilers
• To speed up structured prediction and fuse operations
• Numba, XLA, Theano
Let’s share!
Torch
Caffe TensorFlow
Theano MXNet, etc.
Eigen
TH
CuDNN
MKL
Numba
XLA
NNVM
MPI
NCCL

More Related Content

PDF
Hai Tao at AI Frontiers: Deep Learning For Embedded Vision System
PPTX
Deploying ML models in the enterprise
PDF
Cognitive IoT using DeepLearning on data parallel frameworks like Spark & Flink
PPTX
Inferene trends in industry
PDF
Retrieving Visually-Similar Products for Shopping Recommendations using Spark...
PPTX
Emerging trends in Artificial intelligence - A deeper review
PDF
Dato Keynote
PDF
Artificial intelligence at the edge
Hai Tao at AI Frontiers: Deep Learning For Embedded Vision System
Deploying ML models in the enterprise
Cognitive IoT using DeepLearning on data parallel frameworks like Spark & Flink
Inferene trends in industry
Retrieving Visually-Similar Products for Shopping Recommendations using Spark...
Emerging trends in Artificial intelligence - A deeper review
Dato Keynote
Artificial intelligence at the edge

What's hot (20)

PPTX
Data Science, Machine Learning, and H2O
PDF
AIIA - Charting the Path to Intelligent Operations with Machine Learning - At...
PPTX
Squeezing Deep Learning Into Mobile Phones
PDF
Applying Deep Learning Vision Technology to low-cost/power Embedded Systems
PPTX
AI at Google (30 min)
PPTX
AI | Now + Next
PPTX
Interpretable Machine Learning
PDF
Knowledge Discovery
PDF
Knowledge Discovery in Production
PDF
"Dataflow: Where Power Budgets Are Won and Lost," a Presentation from Movidius
PPTX
Keras: A versatile modeling layer for deep learning
PPTX
Brief Introduction to Deep Learning for Object Recognition Using MATLAB
PDF
NVIDIA GTC 2018 Presentation
PDF
Nitin sharma - Deep Learning Applications to Online Payment Fraud Detection
PPTX
Advanced AI for People in a Hurry
PPTX
InfoEducatie - What is Solution Architecture?
PDF
Recent developments in Deep Learning
PPTX
Danny Bickson - Python based predictive analytics with GraphLab Create
PDF
Exploring the Cloud
PDF
The Data Science Process - Do we need it and how to apply?
Data Science, Machine Learning, and H2O
AIIA - Charting the Path to Intelligent Operations with Machine Learning - At...
Squeezing Deep Learning Into Mobile Phones
Applying Deep Learning Vision Technology to low-cost/power Embedded Systems
AI at Google (30 min)
AI | Now + Next
Interpretable Machine Learning
Knowledge Discovery
Knowledge Discovery in Production
"Dataflow: Where Power Budgets Are Won and Lost," a Presentation from Movidius
Keras: A versatile modeling layer for deep learning
Brief Introduction to Deep Learning for Object Recognition Using MATLAB
NVIDIA GTC 2018 Presentation
Nitin sharma - Deep Learning Applications to Online Payment Fraud Detection
Advanced AI for People in a Hurry
InfoEducatie - What is Solution Architecture?
Recent developments in Deep Learning
Danny Bickson - Python based predictive analytics with GraphLab Create
Exploring the Cloud
The Data Science Process - Do we need it and how to apply?
Ad

Viewers also liked (20)

PDF
Charles Fan at AI Frontiers: The New Era of AI Plus
PDF
Naghi Prasad at AI Frontiers: Building AI systems to automate enterprise proc...
PDF
Junli Gu at AI Frontiers: Autonomous Driving Revolution
PDF
Adam Coates at AI Frontiers: AI for 100 Million People with Deep Learning
PDF
Kevin Shaw at AI Frontiers: AI on the Edge: Bringing Intelligence to Small De...
PDF
Scaling Deep Learning with MXNet
PDF
Intelligent Chatbot on WeChat
PDF
Saket Saurabh at AI Frontiers: Data Operations or: How I Learned to Stop Data...
PDF
Rajat Monga at AI Frontiers: Deep Learning with TensorFlow
PDF
Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
PDF
Jisheng Wang at AI Frontiers: Deep Learning in Security
PDF
Jeff Dean at AI Frontiers: Trends and Developments in Deep Learning Research
PDF
Yangqing Jia at AI Frontiers: Towards Better DL Frameworks
PDF
James Manyika at AI Frontiers: A Future That Works: Automation, Employment, a...
PDF
Nikko Ström at AI Frontiers: Deep Learning in Alexa
PDF
Lukasz Kaiser at AI Frontiers: How Deep Learning Quietly Revolutionized NLP
PDF
Andres Rodriguez at AI Frontiers: Catalyzing Deep Learning's Impact in the En...
PDF
Liu Ren at AI Frontiers: Sensor-aware Augmented Reality
PDF
Alex Smola at AI Frontiers: Scalable Deep Learning Using MXNet
PDF
Soumith Chintala, Artificial Intelligence Research Engineer, Facebook at MLco...
Charles Fan at AI Frontiers: The New Era of AI Plus
Naghi Prasad at AI Frontiers: Building AI systems to automate enterprise proc...
Junli Gu at AI Frontiers: Autonomous Driving Revolution
Adam Coates at AI Frontiers: AI for 100 Million People with Deep Learning
Kevin Shaw at AI Frontiers: AI on the Edge: Bringing Intelligence to Small De...
Scaling Deep Learning with MXNet
Intelligent Chatbot on WeChat
Saket Saurabh at AI Frontiers: Data Operations or: How I Learned to Stop Data...
Rajat Monga at AI Frontiers: Deep Learning with TensorFlow
Li Deng at AI Frontiers: Three Generations of Spoken Dialogue Systems (Bots)
Jisheng Wang at AI Frontiers: Deep Learning in Security
Jeff Dean at AI Frontiers: Trends and Developments in Deep Learning Research
Yangqing Jia at AI Frontiers: Towards Better DL Frameworks
James Manyika at AI Frontiers: A Future That Works: Automation, Employment, a...
Nikko Ström at AI Frontiers: Deep Learning in Alexa
Lukasz Kaiser at AI Frontiers: How Deep Learning Quietly Revolutionized NLP
Andres Rodriguez at AI Frontiers: Catalyzing Deep Learning's Impact in the En...
Liu Ren at AI Frontiers: Sensor-aware Augmented Reality
Alex Smola at AI Frontiers: Scalable Deep Learning Using MXNet
Soumith Chintala, Artificial Intelligence Research Engineer, Facebook at MLco...
Ad

Similar to Soumith Chintala at AI Frontiers: A Dynamic View of the Deep Learning World (20)

PPTX
Introduction to deep learning
PDF
Choosing the best JavaScript framework/library/toolkit
PDF
How to write a web framework
PDF
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
PDF
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
PDF
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
PDF
NumPyCNNAndroid: A Library for Straightforward Implementation of Convolutiona...
PDF
Using Algorithmia to leverage AI and Machine Learning APIs
PPTX
Introduction to deep learning
PDF
Deep Learning with CNTK
PDF
DL4J at Workday Meetup
PDF
Prototyping like it is 2022
PDF
Data Science, Machine Learning and Neural Networks
KEY
Drupal at the EBI
PDF
SciPy Latin America 2019
PPTX
Promises of Deep Learning
PPTX
No BS Guide to Deep Learning in the Enterprise
PPTX
Building High Available and Scalable Machine Learning Applications
PPT
The Economies of Scaling Software
PPTX
Parallel architecture &programming
Introduction to deep learning
Choosing the best JavaScript framework/library/toolkit
How to write a web framework
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
NumPyCNNAndroid: A Library for Straightforward Implementation of Convolutiona...
Using Algorithmia to leverage AI and Machine Learning APIs
Introduction to deep learning
Deep Learning with CNTK
DL4J at Workday Meetup
Prototyping like it is 2022
Data Science, Machine Learning and Neural Networks
Drupal at the EBI
SciPy Latin America 2019
Promises of Deep Learning
No BS Guide to Deep Learning in the Enterprise
Building High Available and Scalable Machine Learning Applications
The Economies of Scaling Software
Parallel architecture &programming

More from AI Frontiers (20)

PPTX
Divya Jain at AI Frontiers : Video Summarization
PPTX
Training at AI Frontiers 2018 - LaiOffer Data Session: How Spark Speedup AI
PDF
Training at AI Frontiers 2018 - LaiOffer Self-Driving-Car-Lecture 1: Heuristi...
PDF
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...
PDF
Training at AI Frontiers 2018 - LaiOffer Self-Driving-Car-lecture 2: Incremen...
PDF
Training at AI Frontiers 2018 - Udacity: Enhancing NLP with Deep Neural Networks
PDF
Training at AI Frontiers 2018 - LaiOffer Self-Driving-Car-Lecture 3: Any-Angl...
PDF
Training at AI Frontiers 2018 - Lukasz Kaiser: Sequence to Sequence Learning ...
PDF
Percy Liang at AI Frontiers : Pushing the Limits of Machine Learning
PDF
Ilya Sutskever at AI Frontiers : Progress towards the OpenAI mission
PDF
Mark Moore at AI Frontiers : Uber Elevate
PPTX
Mario Munich at AI Frontiers : Consumer robotics: embedding affordable AI in ...
PPTX
Arnaud Thiercelin at AI Frontiers : AI in the Sky
PPTX
Anima Anandkumar at AI Frontiers : Modern ML : Deep, distributed, Multi-dimen...
PPTX
Wei Xu at AI Frontiers : Language Learning in an Interactive and Embodied Set...
PPTX
Sumit Gupta at AI Frontiers : AI for Enterprise
PPTX
Yuandong Tian at AI Frontiers : Planning in Reinforcement Learning
PPTX
Alex Ermolaev at AI Frontiers : Major Applications of AI in Healthcare
PPTX
Long Lin at AI Frontiers : AI in Gaming
PDF
Melissa Goldman at AI Frontiers : AI & Finance
Divya Jain at AI Frontiers : Video Summarization
Training at AI Frontiers 2018 - LaiOffer Data Session: How Spark Speedup AI
Training at AI Frontiers 2018 - LaiOffer Self-Driving-Car-Lecture 1: Heuristi...
Training at AI Frontiers 2018 - Ni Lao: Weakly Supervised Natural Language Un...
Training at AI Frontiers 2018 - LaiOffer Self-Driving-Car-lecture 2: Incremen...
Training at AI Frontiers 2018 - Udacity: Enhancing NLP with Deep Neural Networks
Training at AI Frontiers 2018 - LaiOffer Self-Driving-Car-Lecture 3: Any-Angl...
Training at AI Frontiers 2018 - Lukasz Kaiser: Sequence to Sequence Learning ...
Percy Liang at AI Frontiers : Pushing the Limits of Machine Learning
Ilya Sutskever at AI Frontiers : Progress towards the OpenAI mission
Mark Moore at AI Frontiers : Uber Elevate
Mario Munich at AI Frontiers : Consumer robotics: embedding affordable AI in ...
Arnaud Thiercelin at AI Frontiers : AI in the Sky
Anima Anandkumar at AI Frontiers : Modern ML : Deep, distributed, Multi-dimen...
Wei Xu at AI Frontiers : Language Learning in an Interactive and Embodied Set...
Sumit Gupta at AI Frontiers : AI for Enterprise
Yuandong Tian at AI Frontiers : Planning in Reinforcement Learning
Alex Ermolaev at AI Frontiers : Major Applications of AI in Healthcare
Long Lin at AI Frontiers : AI in Gaming
Melissa Goldman at AI Frontiers : AI & Finance

Recently uploaded (20)

PDF
Empathic Computing: Creating Shared Understanding
PDF
Modernizing your data center with Dell and AMD
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
KodekX | Application Modernization Development
PPTX
A Presentation on Artificial Intelligence
PPT
Teaching material agriculture food technology
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
Cloud computing and distributed systems.
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
Empathic Computing: Creating Shared Understanding
Modernizing your data center with Dell and AMD
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Unlocking AI with Model Context Protocol (MCP)
Chapter 3 Spatial Domain Image Processing.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
KodekX | Application Modernization Development
A Presentation on Artificial Intelligence
Teaching material agriculture food technology
Advanced methodologies resolving dimensionality complications for autism neur...
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Mobile App Security Testing_ A Comprehensive Guide.pdf
NewMind AI Monthly Chronicles - July 2025
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Cloud computing and distributed systems.
Reach Out and Touch Someone: Haptics and Empathic Computing

Soumith Chintala at AI Frontiers: A Dynamic View of the Deep Learning World

  • 1. Soumith Chintala Facebook AI Research A dynamic view of the deep learning world
  • 2. Overview • What is Torch? • The Community • Today’s AI • What Next?
  • 3. What is ? • Interactive Scientific computing framework
  • 4. What is ? • Interactive Scientific computing framework
  • 5. What is ? • Similar to Matlab / Python+Numpy
  • 6. What is ? • Easy integration into and from C • Example: using CuDNN functions
  • 7. What is ? • Strong GPU support
  • 8. Neural Networks • nn: neural networks made easy • building blocks of differentiable modules
  • 9. autograd by • Write imperative programs • Backprop defined for every operation in the language
  • 10. Distributed Learning • Multi-GPU and Multi-Node • distlearn by • MPI-like model • scales to a large amount of nodes
  • 11. Core Philosophy • Interactive computing • No compilation time • Imperative programming • Easy to understand, think and debug • Minimal abstraction • Thinking linearly • Maximal Flexibility • No constraints on interfaces or classes
  • 20. Today’s AI • Reasonably solved to be used in production
  • 21. Today’s AI • Classification and Detection • in images, videos, volumetric data, sparse datasets DenseCap: Johnson et. al. https://github.com/jcjohnson/densecap
  • 23. Today’s AI • Text Classification (sentiment analysis etc.) • Text Embeddings • Graph embeddings • Machine Translation • Ads ranking
  • 24. Today’s AI static datasets + static model structure
  • 25. Today’s AI static datasets + static model structure model does not change over training time
  • 26. Today’s AI static datasets + static model structure model does not change over training time offline learning
  • 27. Active and Future AI Research • Agents training in different and new environments
  • 28. Active and Future AI Research • RL Agents in different and new environments • Example: OpenAI Universe
  • 29. Active and Future AI Research • RL Agents in different and new environments • Example: OpenAI Universe • Online Learning • Example: self-driving cars
  • 30. Active and Future AI Research • RL Agents in different and new environments • Example: OpenAI Universe • Online Learning • Example: self-driving cars • Dynamic Neural Networks • self-adding new memory or layers • changing evaluation path based on inputs
  • 31. Active and Future AI Research • RL Agents in different and new environments • Example: OpenAI Universe • Online Learning • Example: self-driving cars • Dynamic Neural Networks • self-adding new memory or layers • changing evaluation path based on inputs • Structured Prediction • Viterbi style decoders
  • 32. A Next-Gen Framework • Interop with environments • Like Universe, VizDoom, etc. • Easy data loaders • Rich scientific ecosystem (such as SciPy) • Neural Networks are not the only methods to be used • Dynamic Neural Networks • with no specific or static structure • Minimal Abstractions • better debugging and low-level programming in the hands of researcher • Graph Compilers • To speed up structured prediction and fuse operations • Numba, XLA, Theano
  • 33. Let’s share! Torch Caffe TensorFlow Theano MXNet, etc. Eigen TH CuDNN MKL Numba XLA NNVM MPI NCCL