LEAPS IN VISUAL COMPUTING
JEN-HSUN HUANG, CO-FOUNDER & CEO | GTC 2015
FOUR ANNOUNCEMENTS
A New GPU and Deep Learning
A Very Fast Box and Deep Learning
Roadmap Reveal and Deep Learning
Self-Driving Cars and Deep Learning
AMAZING YEAR IN VISUAL COMPUTING
© 2015 Industrial Light & Magic. All Rights Reserved.
10X GROWTH IN GPU COMPUTING

Metric                      2008       2015
CUDA Downloads              150,000    3 Million
Academic Papers             4,000      60,000
Universities Teaching       60         800
Supercomputing Teraflops    77         54,000
Tesla GPUs                  6,000      450,000
CUDA Apps                   27         319
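As a quick check on the "10X" headline, the growth factor for each metric can be computed from the 2008 and 2015 figures on the slide. This is a minimal sketch. The dictionary layout and variable names are illustrative and not part of the keynote.

```python
# 2008 and 2015 figures as quoted on the "10X GROWTH IN GPU COMPUTING" slide.
stats = {
    "CUDA Downloads": (150_000, 3_000_000),
    "Academic Papers": (4_000, 60_000),
    "Universities Teaching": (60, 800),
    "Supercomputing Teraflops": (77, 54_000),
    "Tesla GPUs": (6_000, 450_000),
    "CUDA Apps": (27, 319),
}

# Growth factor = 2015 value divided by 2008 value.
growth = {name: y2015 / y2008 for name, (y2008, y2015) in stats.items()}

for name, factor in growth.items():
    print(f"{name:26s} {factor:8.1f}x")
```

By these figures, every metric grew by more than 10x between 2008 and 2015.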
Opening Keynote at GTC 2015: Leaps in Visual Computing
8 Billion Transistors
3,072 CUDA Cores
7 TFLOPS SP / 0.2 TFLOPS DP
12GB Memory
TITAN X
THE WORLD’S FASTEST GPU
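The quoted peak figures can be related to the core count with simple arithmetic. The sketch below assumes the common convention that each CUDA core retires one fused multiply-add per cycle, counted as 2 floating-point operations. That convention and the resulting clock value are assumptions used for illustration; the slide does not state a clock speed.

```python
# Figures quoted on the TITAN X slide.
CUDA_CORES = 3072
PEAK_SP_TFLOPS = 7.0
PEAK_DP_TFLOPS = 0.2

# Assumption (not from the slide): one fused multiply-add per core per cycle,
# counted as 2 floating-point operations.
FLOPS_PER_CORE_PER_CYCLE = 2


def implied_clock_ghz(peak_tflops, cores, flops_per_cycle=FLOPS_PER_CORE_PER_CYCLE):
    """Clock rate (GHz) that would yield the given peak under the assumption above."""
    return peak_tflops * 1e12 / (cores * flops_per_cycle) / 1e9


clock_ghz = implied_clock_ghz(PEAK_SP_TFLOPS, CUDA_CORES)
sp_to_dp_ratio = PEAK_SP_TFLOPS / PEAK_DP_TFLOPS

print(f"Implied clock: {clock_ghz:.2f} GHz")
print(f"SP:DP peak ratio: {sp_to_dp_ratio:.0f}:1")
```

The slide rounds its peak figures, so both results are approximate.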
TITAN X FOR DEEP LEARNING
[Chart: days to train AlexNet on a 16-core Xeon CPU, TITAN, TITAN Black with cuDNN, and TITAN X with cuDNN; axis 0 to 7 days, with a value of ~43 days marked off-scale]
TITAN X: $999
A SHORT HISTORY OF DEEP LEARNING
Convolutional Neural Networks for
Handwritten Digit Recognition
LECUN, BOTTOU, BENGIO, HAFFNER, 1998
ImageNet Classification with NVIDIA GPUs
KRIZHEVSKY, HINTON, ET AL., 2012
[Timeline: 1995 to 2015. Inset chart: ImageNet Challenge accuracy (%), 2010 to 2014, DNN vs. CV, with values 72%, 74%, and 84% marked]
“Delving Deep into Rectifiers: Surpassing
Human-Level Performance on ImageNet Classification”
— Microsoft: 4.94% top-5 error, Feb. 6, 2015
“Deep Image: Scaling up Image Recognition”
— Baidu: 5.98% top-5 error, Jan. 13, 2015
“Batch Normalization: Accelerating Deep Network Training
by Reducing Internal Covariate Shift”
— Google: 4.82% top-5 error, Feb. 11, 2015
IMAGENET CHALLENGE
[Chart: ImageNet Challenge accuracy (%), 2010 to 2014, DNN vs. CV, with values 72%, 74%, and 84% marked]
THE BIG BANG
DEEP LEARNING
VISUALIZED
GPU-ACCELERATED DEEP LEARNING
START-UPS
Detecting Mitosis in Breast Cancer Cells
— IDSIA
Predicting the Toxicity of New Drugs
— Johannes Kepler University
Understanding Gene Mutation to Prevent Disease
— University of Toronto
DEEP LEARNING REVOLUTIONIZING MEDICAL RESEARCH
“Automated Image Captioning with
ConvNets and Recurrent Nets”
—Andrej Karpathy, Fei-Fei Li
DIGITS
DEEP GPU TRAINING SYSTEM
FOR DATA SCIENTISTS
Design DNNs
Visualize activations
Manage multiple trainings
[Diagram: DIGITS software stack]
User interface: Process Data, Configure DNN, Monitor Progress, Visualize Layers
Frameworks: Theano, Torch, Caffe
Libraries: cuDNN, cuBLAS
CUDA
GPU HW: GPU, Multi-GPU, GPU Cluster, Cloud
[Screenshots: DIGITS screens for Process Data, Configure DNN, Monitor Progress, Visualize Layers, and Test Image]
DIGITS DEVBOX
World’s fastest GPU
Max GPU out of a plug
Multi-GPU training & inference
DIGITS DEVBOX — EARLY RESULTS
“I’ve never seen AlexNet
run this fast…TitanX is
a monster, Crazy Fast”
“DIGITS makes it way easier
to design the best network
for the job”
[Chart: Multi-GPU scaling on Torch for AlexNet and VGG; speedup (0x to 4x) on 1, 2, and 4 GPUs]
— Simon Osindero
A.I. Architect
— Soumith Chintala
Research Engineer
DIGITS DEVBOX
Available May 2015
$15,000
GPU ROADMAP
Architectures: Tesla, Fermi, Kepler, Maxwell, Pascal (Mixed Precision, 3D Memory, NVLink), Volta
[Chart: SGEMM/W by year, 2008 to 2018; axis 0 to 72]
Pascal 2x SGEMM/W
[Chart: Frame Buffer Capacity (GB) by year, 2008 to 2018; axis 0 to 60]
Pascal 2.7x Memory Capacity
[Chart: HGEMM/W by year, 2008 to 2018; axis 0 to 144]
Pascal 4x Mixed Precision
[Chart: STREAM GB/s by year, 2008 to 2018; axis 0 to 900]
Pascal 3x Bandwidth
PASCAL 10X MAXWELL

Stage                         Limited by      Pascal feature     Speedup
Convolution (forward)         compute         Mixed Precision    4x (FP16)
Fully connected (forward)     bandwidth       3D Memory          6x
Fully connected (backward)    bandwidth       3D Memory          6x
Convolution (backward)        compute         Mixed Precision    4x
Weight update                 interconnect    NVLink             10x

Summary factors shown on the slide: 5x, 2x
* Very rough estimates
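Per-stage speedups like these combine into an overall training speedup only once each stage is weighted by its share of the original run time. The sketch below applies the standard weighted form of Amdahl's law to the per-stage factors from the slide. The time fractions are hypothetical (an equal 20% split, chosen purely for illustration) and do not come from the keynote. The result is not a reconstruction of how the slide's 10x figure was derived.

```python
# Per-stage speedup factors quoted on the slide (labelled "very rough estimates").
stage_speedup = {
    "conv_forward": 4.0,   # compute-bound, mixed precision (FP16)
    "fc_forward": 6.0,     # bandwidth-bound, 3D memory
    "fc_backward": 6.0,    # bandwidth-bound, 3D memory
    "conv_backward": 4.0,  # compute-bound, mixed precision
    "weight_update": 10.0, # interconnect-bound, NVLink
}

# HYPOTHETICAL share of baseline run time spent in each stage (must sum to 1).
# These fractions are an assumption for illustration, not data from the talk.
time_fraction = {name: 0.2 for name in stage_speedup}


def overall_speedup(fractions, speedups):
    """Weighted Amdahl's law: 1 / sum(f_i / s_i) over all stages."""
    assert abs(sum(fractions.values()) - 1.0) < 1e-9, "fractions must sum to 1"
    new_time = sum(fractions[name] / speedups[name] for name in fractions)
    return 1.0 / new_time


total = overall_speedup(time_fraction, stage_speedup)
print(f"Overall speedup under the assumed time split: {total:.2f}x")
```

Changing the assumed fractions shifts the result toward the factor of whichever stage dominates the run time, so the overall figure is only as reliable as the profile behind it.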
TODAY’S ADAS
[Diagram: SENSE, PLAN, ACT pipeline; components shown: FPGA, CV ASIC, CPU; actions: WARN, BRAKE]
NEXT-GENERATION ADAS
[Diagram: SENSE, PLAN, ACT pipeline; components shown: FPGA, CV ASIC, CPU; actions: WARN, BRAKE, STEER, ACCELERATE]
NVIDIA DRIVE PX SELF-DRIVING CAR COMPUTER
[Diagram: SENSE, PLAN, ACT pipeline; components shown: FPGA, CV ASIC, CPU, DNN; actions: WARN, BRAKE, STEER, ACCELERATE. Inset chart: ImageNet Challenge accuracy (%), 2010 to 2014, DNN vs. CV, with values 72%, 74%, and 84% marked]
DNN-based self-driving robot
Training data by human driver
No hand-coded CV algorithms
PROJECT LEADS
Urs Muller: Chief Architect,
Autonomous Driving, NVIDIA
Yann LeCun: Director,
AI Research, Facebook
PROJECT DAVE — DARPA AUTONOMOUS VEHICLE
DAVE IN ACTION
TRAINING DATA
225K Images
TEST DRIVE
No Training
TEST DRIVE
Partially Trained (52K images)
TEST DRIVE
Fully Trained (225K images)
                         DAVE           AlexNet on DRIVE PX
Number of Connections    3.1 Million    630 Million
Frames / Second          12             184
Connections / Second     38 Million     116 Billion

3,000x Faster
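The "Connections / Second" row follows from multiplying the number of connections by the frame rate, and the "3,000x" figure is the ratio of the two throughputs. A minimal sketch of that arithmetic, using the values quoted on the slide (the function and variable names are illustrative):

```python
def connections_per_second(connections, frames_per_second):
    """Throughput = connections evaluated per frame * frames processed per second."""
    return connections * frames_per_second


# Values quoted on the slide.
dave = connections_per_second(3.1e6, 12)       # DAVE
alexnet = connections_per_second(630e6, 184)   # AlexNet on DRIVE PX

speedup = alexnet / dave

print(f"DAVE:                {dave / 1e6:.1f} million connections/s")
print(f"AlexNet on DRIVE PX: {alexnet / 1e9:.1f} billion connections/s")
print(f"Ratio:               {speedup:.0f}x")
```

The computed throughputs land close to the slide's rounded 38 million and 116 billion, and the ratio is consistent with the rounded "3,000x Faster" claim.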
NVIDIA DRIVE™ PX
SELF-DRIVING CAR COMPUTER
Available May 2015
$10,000
ELON MUSK
LEAPS IN VISUAL COMPUTING
TITAN X
The World’s Fastest GPU
DIGITS DevBox
GPU Deep Learning Platform
Pascal — 10x Maxwell
For Deep Learning
NVIDIA DRIVE PX
Deep Learning Platform for Self-Driving Cars