2018 Intel AI Developer Conference Keynote
2018 Intel AI Developer Conference Keynote
WELCOME TO
Operant
conditioning
1938
Transistor1947
1964
first
neuroscience
department
Intel
Founded
1968
1952
Spiking
Neuron
The
Turing
Machine1936
First
computer
science
department1962
mid 2000s
First1
Billion
transistor
processor
mid 2000s
Deep
learning
prevalence
2018 Intel AI Developer Conference Keynote
2018 Intel AI Developer Conference Keynote
BEST TOOLS TO BUILD TOWARDS
CommunityTools Hardware
Tools
2018 Intel AI Developer Conference Keynote
Convolution
Matrixmultiplication
BatchNorm poolingnormalization
activation
HELPS REALIZE THE INCREDIBLE BENEFITS OF DIRECT OPTIMIZATION
Intel/mkl-dnn
OPEN SOURCE
OPTIMIZING TENSORFLOW
Other names and brands may be claimed as the property of others
OPEN SOURCE COMPILER ENABLING FLEXIBILITY
TO RUN MODELS ACROSS A VARIETY OF
FRAMEWORKS AND HARDWARE
NervanaSystems/ngraph
Other names and brands may be claimed as the property of others
ENABLING DEEP LEARNING TO TAKE ADVANTAGE
OF SCALABLE SPARK AND HADOOP CLUSTERS
Other names and brands may be claimed as the property of others
intel-analytics/BigDL
USING DATA SCIENCE TO
Other names and brands may be claimed as the property of others
USING DATA SCIENCE TO
EmergencyResponse Financialservices MachineVision Cities/transportation
AutonomousVehicles ResponsiveRetail Manufacturing Publicsector
VISUAL INFERENCING AND NEURAL NETWORK OPTIMIZATION
DEPLOY COMPUTER
VISION AND DEEP
LEARNING CAPABILITIES
TO THE EDGE HighPerformance,highEfficiencyfortheedge
Writeonce+scaletoDiverseAccelerators
BroadFrameworksupport
Other names and brands may be claimed as the property of others
VPU = Vision Processing Unit (Movidius)
Tools
Hardware
2018 Intel AI Developer Conference Keynote
Other names and brands may be claimed as the property of others
Source: https://research.fb.com/publications/applied-machine-learning-at-facebook-a-datacenter-infrastructure-perspective/
Other names and brands may be claimed as the property of others
Other names and brands may be claimed as the property of others
AScientificCollaborationbetween
IntelandNovartis
224x224x3
ImageNet
Other names and brands may be claimed as the property of others
Software and workloads used in performance tests may have been optimized for performance only on Intel® microprocessors.
Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other
information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit www.intel.com/benchmarks.
1024x1280x3
26xlarger
Other names and brands may be claimed as the property of others
Software and workloads used in performance tests may have been optimized for performance only on Intel® microprocessors.
Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other
information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit www.intel.com/benchmarks.
224x224x3
ImageNet
26xlargerMultipleobjects
Other names and brands may be claimed as the property of others
Software and workloads used in performance tests may have been optimized for performance only on Intel® microprocessors.
Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other
information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit www.intel.com/benchmarks.
1024x1280x3
Other names and brands may be claimed as the property of others
§ Configuration: CPU: Xeon 6148 @ 2.4GHz, Hyper-threading: Enabled. NIC: Intel® Omni-Path Host Fabric Interface, TensorFlow: v1.7.0, Horovod: 0.12.1, OpenMPI: 3.0.0. OS: CentOS 7.3, OpenMPU 23.0.0, Python 2.7.5
Time to Train to converge to 99% accuracy in model
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer
systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your
contemplated purchases, including the performance of that product when combined with other products. For more complete information visit http://www.intel.com/performance.
MultiscaleConvolutionNeuralNetwork
Intel® MKL/MKL-DNN,
clDNN, DAAL
OptimizedLibraries Intel®Omni-PathArchitecture
ScalingofTimetoTrainIntel® Omni-Path Architecture, Horovod and TensorFlow®
Speedupcomparedtobaseline1.0
measuredintimetotrainin1nodes
1 Node 2 Nodes 4 Nodes 8 Nodes 1 Node 2 Nodes 4 Nodes 8 Nodes
TOTALMEMORYUSED192GB DDR4 PER INTEL® 2S XEON® 6148 PROCESSOR
128.6GB
257.2GB
514.4GB
64.3GB
ARTIFICIAL INTELLIGENCE AND
Other names and brands may be claimed as the property of others
ZIVA VFX
Authoring Tools
Shipped as a
Maya Plugin
DISTRIBUTED TO SERVER FARM FOR SHOT RENDERS
Intel
Paradiso IntelBLAS IntelLapack
ZIVA FEM PHYSICS SOLVER
IntelMKL
Bones/
Muscles Fascia FatandSkin
ZIVA CHARACTERS SIMULATED IN PASSES Charactertransfer/
automation
Volumetriccapture
augmentation
Real-TimeTraining
Embeddedplayersin
anysoftware
Distributedgraphs
NativeNodegraphui
withrobustapi
A.I & M.L
TECHNOLOGIES
DISTRIBUTED
FUNCTIONALITY
AND EMBEDDED PLAYERS
ZIVA INTEGRATION FRAMEWORK
CLOUD SERVICES
TOOLS
ENGINES
Other names and brands may be claimed as the property of others
FLEXIBLE REAL-TIME INFERENCING
2018 Intel AI Developer Conference Keynote
FLEXIBLE REAL-TIME INFERENCING
2018 Intel AI Developer Conference Keynote
•Deploy DNN and Computer Vision at the Edge
•Native FP16 and Fixed Point 8 bit support
•4 TOPS with 1 TOPS of DNN Compute at 1W
2018 Intel AI Developer Conference Keynote
2018 Intel AI Developer Conference Keynote
2018 Intel AI Developer Conference Keynote
2018 Intel AI Developer Conference Keynote
Chip X Lake Crest
Theoretical
Reality
TOPs
0
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer
systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your
contemplated purchases, including the performance of that product when combined with other products. For more complete information visit http://www.intel.com/performance
Chip X GEMM based on DeepBench training data for A(5124, 9124), B(9124,2560) matrix size GEMM operations performing DeepSpeech using FP16+ mixed precision at 27.43 TOPs.
Source: Lake Crest, Based on Intel measurements on limited distribution SDV, General Matrix-Matrix Multiplication; A(1536, 2048), B(2048, 1536).
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer
systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your
contemplated purchases, including the performance of that product when combined with other products. For more complete information visit http://www.intel.com/performance
Chip X GEMM based on DeepBench training data for A(5124, 9124), B(9124,2560) matrix size GEMM operations performing DeepSpeech using FP16+ mixed precision at 27.43 TOPs.
Source: Lake Crest: Based on Intel measurements on limited distribution SDV
1 General Matrix-Matrix Multiplication; A(1536, 2048), B(2048, 1536)
2 Two chip vs. single chip GEMM performance; A(6144, 2048), B(2048, 1536)
3 Full Chip MRB-CHIP MRB data movement using send/recv, Tensor size = (1, 32), average across 50K iterations
MULTI-CHIP
COMMUNICATION3
Power<210W
2.4Tb/s
OFF CHIP BANDWIDTH
<790ns LATENCY
96.4%
GEMM OPERATION
UTILIZATION1
A(1536, 2048)
B(2048, 1536)
MULTI-CHIP
SCALING2
96.2%A(6144, 2048)
B(2048, 1536)
Chip X Lake Crest
Theoretical
Reality
TOPs
0
Chip X Lake Crest Spring Crest (Estimate)
Theoretical
Reality
TOPs
0
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer
systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your
contemplated purchases, including the performance of that product when combined with other products. For more complete information visit http://www.intel.com/performance
Chip X GEMM based on DeepBench training data for A(5124, 9124), B(9124,2560) matrix size GEMM operations performing DeepSpeech using FP16+ mixed precision at 27.43 TOPs.
Source: Lake Crest - Based on Intel measurements on limited distribution SDV
Source: Spring Crest - Intel measurements on simulated product
firstcommercialNNP
Intel®Nervana™
NNPL-1000in 2019
3-4x training performance
of first generation
Lake Crest product
PURPOSE BUILT DESIGN OPTIMIZED ACROSS
MEMORY BANDWIDTH, UTILIZATION, AND POWER
Hardware
Community
PARTNERSHIPS AND
Other names and brands may be claimed as the property of others
Other names and brands may be claimed as the property of others
Machine Learning at Amazon: a long heritage
Person alized
recommen d ation s
Inventin g entirely n ew
c u stomer exp erien c es
F u lfillment au tomation
/ inventor y man agement
Cargo Voic e d riven
interac tion s
Other names and brands may be claimed as the property of others
Fully managed
hosting with auto-
scaling
One-click
deployment
Pre-built notebooks
for common
problems
Built-in, high
performance
algorithms
One-click
training
Hyperparameter
optimization
BUILD TRAIN DEPLOY
A m a z o n S a g e M a k e r
A W S M a c h i n e L e a r n i n g S t a c k
FRAMEWORKS AND
INTERFACES
APPLICATION SERVICES
PLATFORM
SERVICES
KERAS
Frameworks Interfaces
P O L L YR E K O G N I T I O N L E XR E K O G N I T I O N
V I D E O
T R A N S C R I B E T R A N S L A T E C O M P R E H E N D
AMAZON
SAGEMAKER
INFRASTRUCTURE
EC2 GPUs EC2 CPUs IoT Edge
AWS
DEEPLENS
Other names and brands may be claimed as the property of others
Other names and brands may be claimed as the property of others
HeadofDataScience
IntelAI
RL
Coach
Neural
Network
Distiller
NLP
Architect
NervanaSystems/nlp-architect NervanaSystems/coach NervanaSystems/distiller
GOAL IS TO SELECT A SET OF ML PROBLEMS, EACH
DEFINED BY A DATASET AND QUALITY TARGET, THEN
MEASURE THE WALL CLOCK TIME TO TRAIN A MODEL
FOR EACH PROBLEM.
For more information, see: https://mlperf.org/
Other names and brands may be claimed as the property of others
AICHALLENGE.INTEL.COM
*Idea implementation at the Olympic Games subject to approval
2018 Intel AI Developer Conference Keynote
Today’s presentation contains forward-looking statements. All
statements made that are not historical facts are subject to a number of
risks and uncertainties, and actual results may differ materially. Please
refer to our most recent earnings release, Form 10-Q and 10-K filing
available on our website for more information on the risk factors that
could cause actual results to differ.
2018 Intel AI Developer Conference Keynote
2018 Intel AI Developer Conference Keynote

More Related Content

PDF
Increasing Throughput per Node for Content Delivery Networks
PDF
Machine programming
PDF
FPGAs and Machine Learning
PPTX
Altera’s Role In Accelerating the Internet of Things
DOC
Ethernet base divice control
PDF
GTC 2022 Keynote
PDF
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
PDF
The Power of One: Supermicro’s High-Performance Single-Processor Blade Systems
Increasing Throughput per Node for Content Delivery Networks
Machine programming
FPGAs and Machine Learning
Altera’s Role In Accelerating the Internet of Things
Ethernet base divice control
GTC 2022 Keynote
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
The Power of One: Supermicro’s High-Performance Single-Processor Blade Systems

What's hot (17)

PDF
Secure remote protocol for fpga reconfiguration
PPTX
Closed Loop Platform Automation - Tong Zhong & Emma Collins
PDF
29937 845g pb
PPTX
Intel Developer Program
PDF
On-device Motion Tracking for Immersive VR
PDF
Energy Efficient Computing using Dynamic Tuning
DOCX
Fpga implementation of multilayer feed forward neural network architecture us...
PPTX
Closed Loop Network Automation for Optimal Resource Allocation via Reinforcem...
PPTX
Service Assurance Constructs for Achieving Network Transformation - Sunku Ran...
PDF
Overview of HPC Interconnects
PDF
"The Xilinx AI Engine: High Performance with Future-proof Architecture Adapta...
PDF
Intel xeon e5v3 y sdi
PDF
AI in Healh Care using IBM POWER systems
PDF
14 284-291
PPTX
Trends and challenges in IP based SOC design
PDF
Design and Implementation of Quintuple Processor Architecture Using FPGA
PDF
CPU Subsystem Total Power Consumption: Understanding the Factors and Selectin...
Secure remote protocol for fpga reconfiguration
Closed Loop Platform Automation - Tong Zhong & Emma Collins
29937 845g pb
Intel Developer Program
On-device Motion Tracking for Immersive VR
Energy Efficient Computing using Dynamic Tuning
Fpga implementation of multilayer feed forward neural network architecture us...
Closed Loop Network Automation for Optimal Resource Allocation via Reinforcem...
Service Assurance Constructs for Achieving Network Transformation - Sunku Ran...
Overview of HPC Interconnects
"The Xilinx AI Engine: High Performance with Future-proof Architecture Adapta...
Intel xeon e5v3 y sdi
AI in Healh Care using IBM POWER systems
14 284-291
Trends and challenges in IP based SOC design
Design and Implementation of Quintuple Processor Architecture Using FPGA
CPU Subsystem Total Power Consumption: Understanding the Factors and Selectin...
Ad

Similar to 2018 Intel AI Developer Conference Keynote (20)

PDF
AIDC India - AI on IA
PDF
Intel® AI: Reinforcement Learning Coach
PDF
Deep Learning Training at Scale: Spring Crest Deep Learning Accelerator
PDF
AIDC Summit LA- Hands-on Training
PDF
Accelerate Your AI Today
PDF
“Optimization Techniques with Intel’s OpenVINO to Enhance Performance on Your...
PPTX
Gary Brown (Movidius, Intel): Deep Learning in AR: the 3 Year Horizon
PDF
Pedal to the Metal: Accelerating Spark with Silicon Innovation
PDF
Accelerating AI from the Cloud to the Edge
PDF
Intel® Open Image Denoise in Unity*
PDF
Spring Hill (NNP-I 1000): Intel's Data Center Inference Chip
PDF
Software-defined Visualization, High-Fidelity Visualization: OpenSWR and OSPRay
PDF
Driving Industrial InnovationOn the Path to Exascale
PDF
Intel Mobile Launch Information
PDF
[IGC2018] AMD Don Woligroski - WHY Ryzen
PDF
Optimizing Apache Spark Throughput Using Intel Optane and Intel Memory Drive...
PDF
Intel Core X-seires processors
PDF
DPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel Architecture
PDF
How Funcom Increased Play Time in Lego Minifigures by 40%
PDF
O uso de tecnologias Intel na implantação de sistemas de alto desempenho
AIDC India - AI on IA
Intel® AI: Reinforcement Learning Coach
Deep Learning Training at Scale: Spring Crest Deep Learning Accelerator
AIDC Summit LA- Hands-on Training
Accelerate Your AI Today
“Optimization Techniques with Intel’s OpenVINO to Enhance Performance on Your...
Gary Brown (Movidius, Intel): Deep Learning in AR: the 3 Year Horizon
Pedal to the Metal: Accelerating Spark with Silicon Innovation
Accelerating AI from the Cloud to the Edge
Intel® Open Image Denoise in Unity*
Spring Hill (NNP-I 1000): Intel's Data Center Inference Chip
Software-defined Visualization, High-Fidelity Visualization: OpenSWR and OSPRay
Driving Industrial InnovationOn the Path to Exascale
Intel Mobile Launch Information
[IGC2018] AMD Don Woligroski - WHY Ryzen
Optimizing Apache Spark Throughput Using Intel Optane and Intel Memory Drive...
Intel Core X-seires processors
DPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel Architecture
How Funcom Increased Play Time in Lego Minifigures by 40%
O uso de tecnologias Intel na implantação de sistemas de alto desempenho
Ad

More from DESMOND YUEN (20)

PDF
2022-AI-Index-Report_Master.pdf
PDF
Small Is the New Big
PDF
Intel® Blockscale™ ASIC Product Brief
PDF
Cryptography Processing with 3rd Gen Intel Xeon Scalable Processors
PDF
Intel 2021 Product Security Report
PDF
How can regulation keep up as transformation races ahead? 2022 Global regulat...
PDF
NASA Spinoffs Help Fight Coronavirus, Clean Pollution, Grow Food, More
PDF
A Survey on Security and Privacy Issues in Edge Computing-Assisted Internet o...
PDF
PUTTING PEOPLE FIRST: ITS IS SMART COMMUNITIES AND CITIES
PDF
BUILDING AN OPEN RAN ECOSYSTEM FOR EUROPE
PDF
An Introduction to Semiconductors and Intel
PDF
Changing demographics and economic growth bloom
PDF
Intel’s Impacts on the US Economy
PDF
2021 private networks infographics
PDF
Transforming the Modern City with the Intel-based 5G Smart City Road Side Uni...
PDF
3rd Generation Intel® Xeon® Scalable Processor - Achieving 1 Tbps IPsec with ...
PDF
"Life and Learning After One-Hundred Years: Trust Is The Coin Of The Realm."
PDF
Telefónica views on the design, architecture, and technology of 4G/5G Open RA...
PDF
Early Benchmarking Results for Neuromorphic Computing
PDF
Intel 2020 Labs Day Keynote Slides
2022-AI-Index-Report_Master.pdf
Small Is the New Big
Intel® Blockscale™ ASIC Product Brief
Cryptography Processing with 3rd Gen Intel Xeon Scalable Processors
Intel 2021 Product Security Report
How can regulation keep up as transformation races ahead? 2022 Global regulat...
NASA Spinoffs Help Fight Coronavirus, Clean Pollution, Grow Food, More
A Survey on Security and Privacy Issues in Edge Computing-Assisted Internet o...
PUTTING PEOPLE FIRST: ITS IS SMART COMMUNITIES AND CITIES
BUILDING AN OPEN RAN ECOSYSTEM FOR EUROPE
An Introduction to Semiconductors and Intel
Changing demographics and economic growth bloom
Intel’s Impacts on the US Economy
2021 private networks infographics
Transforming the Modern City with the Intel-based 5G Smart City Road Side Uni...
3rd Generation Intel® Xeon® Scalable Processor - Achieving 1 Tbps IPsec with ...
"Life and Learning After One-Hundred Years: Trust Is The Coin Of The Realm."
Telefónica views on the design, architecture, and technology of 4G/5G Open RA...
Early Benchmarking Results for Neuromorphic Computing
Intel 2020 Labs Day Keynote Slides

Recently uploaded (20)

PPTX
Clauses_Part1.hshshpjzjxnznxnxnndndndndndndndnndptx
PDF
20A LG INR18650HJ2 3.6V 2900mAh Battery cells for Power Tools Vacuum Cleaner
PDF
CAB UNIT 1 with computer details details
PPT
System Unit Components and its Functions
PDF
Printing Presentation to show beginners.
PPTX
RTS MASTER DECK_Household Convergence Scorecards. Use this file copy.pptx
PDF
Melt Flow Index Tester from Perfect Group India
PPTX
vortex flow measurement in instrumentation
PPTX
ppt to the world finance to the world in growing
PPTX
SAI-CAO-24CSEN2021-PPT-UNIT-------1.pptx
PDF
Maxon CINEMA 4D 2025 Crack Free Download Latest Version
PPTX
dDifference Beetween Saving slides And Investment Slides.pptx
PPTX
AIR BAG SYStYEM mechanical enginweering.pptx
PDF
2- Physical Layer (06).pdfgshshshbsbshshshhs
PDF
PakistanCoinageAct-906.pdfdbnsshsjjsbsbb
PPTX
Pin configuration and project related to
PDF
Topic-1-Main-Features-of-Data-Processing.pdf
PPTX
最新版新加坡英华美学院毕业证(Informatics毕业证书)原版定制硕士文凭
PPTX
Subordinate_Clauses_BlueGradient_Optimized.pptx
PPTX
Chapter no 8 output devices dpart 2.pptx
Clauses_Part1.hshshpjzjxnznxnxnndndndndndndndnndptx
20A LG INR18650HJ2 3.6V 2900mAh Battery cells for Power Tools Vacuum Cleaner
CAB UNIT 1 with computer details details
System Unit Components and its Functions
Printing Presentation to show beginners.
RTS MASTER DECK_Household Convergence Scorecards. Use this file copy.pptx
Melt Flow Index Tester from Perfect Group India
vortex flow measurement in instrumentation
ppt to the world finance to the world in growing
SAI-CAO-24CSEN2021-PPT-UNIT-------1.pptx
Maxon CINEMA 4D 2025 Crack Free Download Latest Version
dDifference Beetween Saving slides And Investment Slides.pptx
AIR BAG SYStYEM mechanical enginweering.pptx
2- Physical Layer (06).pdfgshshshbsbshshshhs
PakistanCoinageAct-906.pdfdbnsshsjjsbsbb
Pin configuration and project related to
Topic-1-Main-Features-of-Data-Processing.pdf
最新版新加坡英华美学院毕业证(Informatics毕业证书)原版定制硕士文凭
Subordinate_Clauses_BlueGradient_Optimized.pptx
Chapter no 8 output devices dpart 2.pptx

2018 Intel AI Developer Conference Keynote

  • 7. BEST TOOLS TO BUILD TOWARDS
  • 11. Convolution Matrixmultiplication BatchNorm poolingnormalization activation HELPS REALIZE THE INCREDIBLE BENEFITS OF DIRECT OPTIMIZATION Intel/mkl-dnn OPEN SOURCE
  • 12. OPTIMIZING TENSORFLOW Other names and brands may be claimed as the property of others
  • 13. OPEN SOURCE COMPILER ENABLING FLEXIBILITY TO RUN MODELS ACROSS A VARIETY OF FRAMEWORKS AND HARDWARE NervanaSystems/ngraph
  • 14. Other names and brands may be claimed as the property of others
  • 15. ENABLING DEEP LEARNING TO TAKE ADVANTAGE OF SCALABLE SPARK AND HADOOP CLUSTERS Other names and brands may be claimed as the property of others intel-analytics/BigDL
  • 16. USING DATA SCIENCE TO Other names and brands may be claimed as the property of others
  • 18. EmergencyResponse Financialservices MachineVision Cities/transportation AutonomousVehicles ResponsiveRetail Manufacturing Publicsector
  • 19. VISUAL INFERENCING AND NEURAL NETWORK OPTIMIZATION DEPLOY COMPUTER VISION AND DEEP LEARNING CAPABILITIES TO THE EDGE HighPerformance,highEfficiencyfortheedge Writeonce+scaletoDiverseAccelerators BroadFrameworksupport Other names and brands may be claimed as the property of others VPU = Vision Processing Unit (Movidius)
  • 20. Tools
  • 23. Other names and brands may be claimed as the property of others Source: https://research.fb.com/publications/applied-machine-learning-at-facebook-a-datacenter-infrastructure-perspective/
  • 24. Other names and brands may be claimed as the property of others
  • 25. Other names and brands may be claimed as the property of others AScientificCollaborationbetween IntelandNovartis
  • 26. 224x224x3 ImageNet Other names and brands may be claimed as the property of others Software and workloads used in performance tests may have been optimized for performance only on Intel® microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit www.intel.com/benchmarks.
  • 27. 1024x1280x3 26xlarger Other names and brands may be claimed as the property of others Software and workloads used in performance tests may have been optimized for performance only on Intel® microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit www.intel.com/benchmarks. 224x224x3 ImageNet
  • 28. 26xlargerMultipleobjects Other names and brands may be claimed as the property of others Software and workloads used in performance tests may have been optimized for performance only on Intel® microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit www.intel.com/benchmarks. 1024x1280x3
  • 29. Other names and brands may be claimed as the property of others § Configuration: CPU: Xeon 6148 @ 2.4GHz, Hyper-threading: Enabled. NIC: Intel® Omni-Path Host Fabric Interface, TensorFlow: v1.7.0, Horovod: 0.12.1, OpenMPI: 3.0.0. OS: CentOS 7.3, OpenMPU 23.0.0, Python 2.7.5 Time to Train to converge to 99% accuracy in model Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit http://www.intel.com/performance. MultiscaleConvolutionNeuralNetwork Intel® MKL/MKL-DNN, clDNN, DAAL OptimizedLibraries Intel®Omni-PathArchitecture ScalingofTimetoTrainIntel® Omni-Path Architecture, Horovod and TensorFlow® Speedupcomparedtobaseline1.0 measuredintimetotrainin1nodes 1 Node 2 Nodes 4 Nodes 8 Nodes 1 Node 2 Nodes 4 Nodes 8 Nodes TOTALMEMORYUSED192GB DDR4 PER INTEL® 2S XEON® 6148 PROCESSOR 128.6GB 257.2GB 514.4GB 64.3GB
  • 31. Other names and brands may be claimed as the property of others
  • 32. ZIVA VFX Authoring Tools Shipped as a Maya Plugin DISTRIBUTED TO SERVER FARM FOR SHOT RENDERS Intel Paradiso IntelBLAS IntelLapack ZIVA FEM PHYSICS SOLVER IntelMKL Bones/ Muscles Fascia FatandSkin ZIVA CHARACTERS SIMULATED IN PASSES Charactertransfer/ automation Volumetriccapture augmentation Real-TimeTraining Embeddedplayersin anysoftware Distributedgraphs NativeNodegraphui withrobustapi A.I & M.L TECHNOLOGIES DISTRIBUTED FUNCTIONALITY AND EMBEDDED PLAYERS ZIVA INTEGRATION FRAMEWORK CLOUD SERVICES TOOLS ENGINES Other names and brands may be claimed as the property of others
  • 37. •Deploy DNN and Computer Vision at the Edge •Native FP16 and Fixed Point 8 bit support •4 TOPS with 1 TOPS of DNN Compute at 1W
  • 42. Chip X Lake Crest Theoretical Reality TOPs 0 Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit http://www.intel.com/performance Chip X GEMM based on DeepBench training data for A(5124, 9124), B(9124,2560) matrix size GEMM operations performing DeepSpeech using FP16+ mixed precision at 27.43 TOPs. Source: Lake Crest, Based on Intel measurements on limited distribution SDV, General Matrix-Matrix Multiplication; A(1536, 2048), B(2048, 1536).
  • 43. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit http://www.intel.com/performance Chip X GEMM based on DeepBench training data for A(5124, 9124), B(9124,2560) matrix size GEMM operations performing DeepSpeech using FP16+ mixed precision at 27.43 TOPs. Source: Lake Crest: Based on Intel measurements on limited distribution SDV 1 General Matrix-Matrix Multiplication; A(1536, 2048), B(2048, 1536) 2 Two chip vs. single chip GEMM performance; A(6144, 2048), B(2048, 1536) 3 Full Chip MRB-CHIP MRB data movement using send/recv, Tensor size = (1, 32), average across 50K iterations MULTI-CHIP COMMUNICATION3 Power<210W 2.4Tb/s OFF CHIP BANDWIDTH <790ns LATENCY 96.4% GEMM OPERATION UTILIZATION1 A(1536, 2048) B(2048, 1536) MULTI-CHIP SCALING2 96.2%A(6144, 2048) B(2048, 1536) Chip X Lake Crest Theoretical Reality TOPs 0
  • 44. Chip X Lake Crest Spring Crest (Estimate) Theoretical Reality TOPs 0 Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit http://www.intel.com/performance Chip X GEMM based on DeepBench training data for A(5124, 9124), B(9124,2560) matrix size GEMM operations performing DeepSpeech using FP16+ mixed precision at 27.43 TOPs. Source: Lake Crest - Based on Intel measurements on limited distribution SDV Source: Spring Crest - Intel measurements on simulated product
  • 45. firstcommercialNNP Intel®Nervana™ NNPL-1000in 2019 3-4x training performance of first generation Lake Crest product PURPOSE BUILT DESIGN OPTIMIZED ACROSS MEMORY BANDWIDTH, UTILIZATION, AND POWER
  • 49. Other names and brands may be claimed as the property of others
  • 50. Other names and brands may be claimed as the property of others
  • 51. Machine Learning at Amazon: a long heritage Person alized recommen d ation s Inventin g entirely n ew c u stomer exp erien c es F u lfillment au tomation / inventor y man agement Cargo Voic e d riven interac tion s Other names and brands may be claimed as the property of others
  • 52. Fully managed hosting with auto- scaling One-click deployment Pre-built notebooks for common problems Built-in, high performance algorithms One-click training Hyperparameter optimization BUILD TRAIN DEPLOY A m a z o n S a g e M a k e r
  • 53. A W S M a c h i n e L e a r n i n g S t a c k FRAMEWORKS AND INTERFACES APPLICATION SERVICES PLATFORM SERVICES KERAS Frameworks Interfaces P O L L YR E K O G N I T I O N L E XR E K O G N I T I O N V I D E O T R A N S C R I B E T R A N S L A T E C O M P R E H E N D AMAZON SAGEMAKER INFRASTRUCTURE EC2 GPUs EC2 CPUs IoT Edge AWS DEEPLENS
  • 54. Other names and brands may be claimed as the property of others
  • 55. Other names and brands may be claimed as the property of others
  • 58. GOAL IS TO SELECT A SET OF ML PROBLEMS, EACH DEFINED BY A DATASET AND QUALITY TARGET, THEN MEASURE THE WALL CLOCK TIME TO TRAIN A MODEL FOR EACH PROBLEM. For more information, see: https://mlperf.org/ Other names and brands may be claimed as the property of others
  • 59. AICHALLENGE.INTEL.COM *Idea implementation at the Olympic Games subject to approval
  • 61. Today’s presentation contains forward-looking statements. All statements made that are not historical facts are subject to a number of risks and uncertainties, and actual results may differ materially. Please refer to our most recent earnings release, Form 10-Q and 10-K filing available on our website for more information on the risk factors that could cause actual results to differ.