SlideShare a Scribd company logo
DPDK’s Best Kept Secret:
Micro-benchmarks
M	Jay	
Muthurajan.Jayakumar@intel.com
DPDK	Summit	- San	Jose	2017
2
Optimization Notice: Intel’s compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel
microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality,
or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with
Intel microprocessors. Certain optimizations not specifc to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and
Reference Guides for more information regarding the specifc instruction sets covered by this notice. Cost reduction scenarios described are intended as examples of
how a given Intel- based product, in the specifed circumstances and confgurations, may affect future costs and provide cost savings. Circumstances will vary. Intel does
not guarantee any costs or cost reduction. Intel technologies’ features and benefts depend on system confguration and may require enabled hardware, software or servic
activation. Performance varies depending on system confguration. No computer system can be absolutely secure. Check with your system manufacturer or retailer or
learn more at https://networkbuilders.intel.com/network-technologies/intelselectfasttrackkit.
No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document.
Intel disclaims all express and implied warranties, including without limitation, the implied warranties of merchantability, fitness for a particular purpose, and non-
infringement, as well as any warranty arising from course of performance, course of dealing, or usage in trade.
This document contains information on products, services and/or processes in development. All information provided here is subject to change without notice. Contact
your Intel representative to obtain the latest forecast, schedule, specifications and roadmaps.
The products and services described may contain defects or errors known as errata which may cause deviations from published specifications. Current characterized
errata are available on request. Copies of documents which have an order number and are referenced in this document may be obtained by calling 1-800-548-4725 or
by visiting www.intel.com/design/literature.htm.
© 2017 Intel Corporation. Intel, the Intel logo, and Xeon are trademarks of Intel Corporation or its subsidiaries in the U.S. and/or other countries.
*Other names and brands may be claimed as the property of others.
Legal Information
Agenda
u Why	should	I	care	about	DPDK	Micro-benchmarks?
u What	do	they	benchmark?
u How	do	I	run	them?
Not all slots are made equal
Not all slots
are made
equal !
Ensure that you have plugged in your NIC card in most optimal slot
How many lcores, you think, are there in this 2 socket
server?
More
than 100
lcores?
96
lcores?
64
lcores?
6
CPU 0 CPU 1
Question: What can be Improved here?
7
I/O Plugged in CPU1’s Slot
How much memory do you
see in CPU1 node?
ZERO !
CPU 0 CPU 1
CPU 0 has only One
Channel memory
populated.
Improvements -n 4
In Which Socket lcore # 50 resides? Socket 0 or Socket
1?
u Question:	
u In	which	socket	you	think	lcore#	50	resides?	– socket	0?		Or	socket	1?
u Assume	NIC	is	Plugged	in	socket	0
u Will	the	performance	be	best	or	sub-optimal?
More
than 100
lcores
Socket
0?
Socket
1?
Why Should I Care About DPDK Micro-benchmarks?
u We	thought	lcore #	50	resides	in	socket	0.
u But		actually,	you	can	see	it	is	in	socket	1.
u So,	NIC	in	socket	0	is	actually	sub-optimal.
u How	to	quantitatively	ensure	that	system	is	set	for	optimal	performance?
QUIZ:
Cores Within A Socket – All In Same
Loop?
Demo
Cores Within A Socket – Not equal
proximity
Prior to application level
benchmarking..
u Without	tightening	these,	if	you	start	developing	your	application…
u And	on	top	of	that,	if	you	start	measuring	application	level	performance	
u Root	cause	analysis	is	made	unnecessarily	complex
u
u Instead…	what	if	..	
u What	if	you	can	do	basic	benchmarking	of	key	performant	elements	/	ops	
u You	will	build	strong	foundation	first	
u Will	help	you	develop	Applications	confidently	towards	overall	higher	performance
What Objects, What Operations to
benchmark?
u In	other	words,	what	are	the	key	high	performant	objects and	operations?
u Objects:
u Ring	
u Mem	pool
u Mbuf
u Operations:
u Mem	copy	
u Hash	Operations
u Flow	Classification
Test_hash_multiwriter_main( )
Hash – Multi-writer – Transactional
Memory
Tests: Ring, PMD, Table
Router, Memcpy, Hash
Tests: Crypto, Event, Flow Classify
Mempool
SPSC MPMC – Time Taken
Call To Action:
Where To Find Them & How It
Measures?
Optimization Notice
22
Optimization Notice
Intel’s compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that
are not unique to Intel microprocessors. These optimizations include SSE2®, SSE3, and SSSE3 instruction sets and
other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on
microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended
for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for
Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information
regarding the specific instruction sets covered by this notice.
Notice revision #20110804
Questions?
M	Jay
Muthurajan.Jayakumar@intel.com

More Related Content

PDF
LF_DPDK17_Making networking apps scream on Windows with DPDK
PDF
LF_DPDK17_Enabling hardware acceleration in DPDK data plane applications
PDF
LF_DPDK_Accelerate storage service via SPDK
PDF
LF_DPDK17_Reducing Barriers to Adoption - Making DPDK Easier to Integrate int...
PDF
LF_DPDK17_The Path to Data Plane Microservices
PDF
2 new hw_features_cat_cod_etc
PDF
QATCodec: past, present and future
PDF
4 dpdk roadmap(1)
LF_DPDK17_Making networking apps scream on Windows with DPDK
LF_DPDK17_Enabling hardware acceleration in DPDK data plane applications
LF_DPDK_Accelerate storage service via SPDK
LF_DPDK17_Reducing Barriers to Adoption - Making DPDK Easier to Integrate int...
LF_DPDK17_The Path to Data Plane Microservices
2 new hw_features_cat_cod_etc
QATCodec: past, present and future
4 dpdk roadmap(1)

What's hot (20)

PDF
8 intel network builders overview
PDF
3 additional dpdk_theory(1)
PDF
Intel IT Experts Tour Cyber Security - Matthew Rosenquist 2013
PDF
TDC2019 Intel Software Day - Inferencia de IA em edge devices
PDF
Accelerating Virtual Machine Access with the Storage Performance Development ...
PDF
Disrupting the Data Center: Unleashing the Digital Services Economy
PDF
Making Networking Apps Scream on Windows with DPDK
PPTX
ITT - Tech CNS Captstone Project
PDF
Achieve Unconstrained Collaboration in a Digital World
PDF
Overcoming Scaling Challenges in MongoDB Deployments with SSD
PDF
TDC2018SP | Trilha IA - Inteligencia Artificial na Arquitetura Intel
PPTX
The Greenway Bank Presentation
PPTX
ITT CNS Capstone Project
PPT
Network Developement Capstone Project 2009 Sping Quarter
PDF
Новые технологии Intel в центрах обработки данных
PPT
Capstone Presentation For Five Rivers Medical Centers
PDF
Intel® QuickAssist Technology (Intel® QAT) and OpenSSL-1.1.0: Performance
PPTX
N(ot)-o(nly)-(Ha)doop - the DAG showdown
PDF
Architecting for Hyper-Scale Datacenter Efficiency
PPTX
Gary Brown (Movidius, Intel): Deep Learning in AR: the 3 Year Horizon
8 intel network builders overview
3 additional dpdk_theory(1)
Intel IT Experts Tour Cyber Security - Matthew Rosenquist 2013
TDC2019 Intel Software Day - Inferencia de IA em edge devices
Accelerating Virtual Machine Access with the Storage Performance Development ...
Disrupting the Data Center: Unleashing the Digital Services Economy
Making Networking Apps Scream on Windows with DPDK
ITT - Tech CNS Captstone Project
Achieve Unconstrained Collaboration in a Digital World
Overcoming Scaling Challenges in MongoDB Deployments with SSD
TDC2018SP | Trilha IA - Inteligencia Artificial na Arquitetura Intel
The Greenway Bank Presentation
ITT CNS Capstone Project
Network Developement Capstone Project 2009 Sping Quarter
Новые технологии Intel в центрах обработки данных
Capstone Presentation For Five Rivers Medical Centers
Intel® QuickAssist Technology (Intel® QAT) and OpenSSL-1.1.0: Performance
N(ot)-o(nly)-(Ha)doop - the DAG showdown
Architecting for Hyper-Scale Datacenter Efficiency
Gary Brown (Movidius, Intel): Deep Learning in AR: the 3 Year Horizon
Ad

Viewers also liked (9)

PDF
LF_DPDK17_DPDK support for new hardware offloads
PDF
LF_DPDK17_Integrating and using DPDK with Open vSwitch
PDF
LF_DPDK17_ OpenVswitch hardware offload over DPDK
PDF
LF_DPDK17_Lagopus Router
PDF
LF_DPDK17_testpmd: swissknife for NFV
PDF
LF_DPDK17_Make DPDK's software traffic manager a deployable solution for vBNG
PDF
LF_DPDK17_DPDK Membership Library
PDF
LF_DPDK17_Accelerating NFV with VMware's Enhanced Network Stack (ENS) and Int...
PDF
LF_DPDK_Mellanox bifurcated driver model
LF_DPDK17_DPDK support for new hardware offloads
LF_DPDK17_Integrating and using DPDK with Open vSwitch
LF_DPDK17_ OpenVswitch hardware offload over DPDK
LF_DPDK17_Lagopus Router
LF_DPDK17_testpmd: swissknife for NFV
LF_DPDK17_Make DPDK's software traffic manager a deployable solution for vBNG
LF_DPDK17_DPDK Membership Library
LF_DPDK17_Accelerating NFV with VMware's Enhanced Network Stack (ENS) and Int...
LF_DPDK_Mellanox bifurcated driver model
Ad

Similar to LF_DPDK17_DPDK's best kept secret – Micro-benchmark performance tests (20)

PDF
HPC Facility Designing for next generation HPC systems Ram Nagappan Intel Final
PDF
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
PDF
High Performance Computing: The Essential tool for a Knowledge Economy
PDF
O uso de tecnologias Intel na implantação de sistemas de alto desempenho
PDF
Lynn Comp - Intel Big Data & Cloud Summit 2013 (2)
PDF
Crooke CWF Keynote FINAL final platinum
PDF
Microsoft Build 2019- Intel AI Workshop
PDF
Explore, design and implement threading parallelism with Intel® Advisor XE
PDF
Driving Industrial InnovationOn the Path to Exascale
PDF
ONS 2018 LA - Intel Tutorial: Cloud Native to NFV - Alon Bernstein, Cisco & K...
PPTX
Performance out of the box developers
PPTX
Intel® Select Solutions for the Network
PDF
AI & Computer Vision (OpenVINO) - CPBR12
PDF
Intel NFVi Enabling Kit Demo/Lab
PDF
DPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel Architecture
PDF
Accelerate Ceph performance via SPDK related techniques
PDF
AIDC Summit LA- Hands-on Training
PDF
DUG'20: 01 - Welcome & DAOS Update
PPTX
E5 Intel Xeon Processor E5 Family Making the Business Case
PDF
“Getting Efficient DNN Inference Performance: Is It Really About the TOPS?,” ...
HPC Facility Designing for next generation HPC systems Ram Nagappan Intel Final
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
High Performance Computing: The Essential tool for a Knowledge Economy
O uso de tecnologias Intel na implantação de sistemas de alto desempenho
Lynn Comp - Intel Big Data & Cloud Summit 2013 (2)
Crooke CWF Keynote FINAL final platinum
Microsoft Build 2019- Intel AI Workshop
Explore, design and implement threading parallelism with Intel® Advisor XE
Driving Industrial InnovationOn the Path to Exascale
ONS 2018 LA - Intel Tutorial: Cloud Native to NFV - Alon Bernstein, Cisco & K...
Performance out of the box developers
Intel® Select Solutions for the Network
AI & Computer Vision (OpenVINO) - CPBR12
Intel NFVi Enabling Kit Demo/Lab
DPDK Summit - 08 Sept 2014 - Intel - Networking Workloads on Intel Architecture
Accelerate Ceph performance via SPDK related techniques
AIDC Summit LA- Hands-on Training
DUG'20: 01 - Welcome & DAOS Update
E5 Intel Xeon Processor E5 Family Making the Business Case
“Getting Efficient DNN Inference Performance: Is It Really About the TOPS?,” ...

More from LF_DPDK (18)

PDF
LF_DPDK17_Event Adapters - Connecting Devices to Eventdev
PDF
LF_DPDK17_OpenNetVM: A high-performance NFV platforms to meet future communic...
PDF
LF_DPDK17_DPDK on Microsoft Azure
PDF
LF_DPDK17_VPP Host Stack
PDF
LF_DPDK17_Accelerating Packet Processing with FPGA NICs
PDF
LF_DPDK17_rte_security: enhancing IPSEC offload
PDF
LF_DPDK17_Serverless DPDK - How SmartNIC resident DPDK Accelerates Packet Pro...
PDF
LF_DPDK17_Flexible and Extensible support for new protocol processing with DP...
PDF
LF_DPDK17_rte_raw_device: implementing programmable accelerators using generi...
PDF
LF_DPDK17_Technical Roadmap
PDF
LF_DPDK17_Abstract APIs for DPDK and ODP
PDF
LF_DPDK17_DPDK with KNI – Pushing the Performance of an SDWAN Gateway to High...
PDF
LF_DPDK17_mediated devices: better userland IO
PDF
LF_DPDK17_Enhanced Memory Management
PDF
LF_DPDK17_SafetyOrange - a tiny server class multi-purpose box with DPDK
PDF
LF_DPDK17_Reflections on Mirroring With DPDK
PDF
LF_DPDK17_Implementation and Testing of Soft Patch Panel
PDF
LF_DPDK17_Accelerating P4-based Dataplane with DPDK
LF_DPDK17_Event Adapters - Connecting Devices to Eventdev
LF_DPDK17_OpenNetVM: A high-performance NFV platforms to meet future communic...
LF_DPDK17_DPDK on Microsoft Azure
LF_DPDK17_VPP Host Stack
LF_DPDK17_Accelerating Packet Processing with FPGA NICs
LF_DPDK17_rte_security: enhancing IPSEC offload
LF_DPDK17_Serverless DPDK - How SmartNIC resident DPDK Accelerates Packet Pro...
LF_DPDK17_Flexible and Extensible support for new protocol processing with DP...
LF_DPDK17_rte_raw_device: implementing programmable accelerators using generi...
LF_DPDK17_Technical Roadmap
LF_DPDK17_Abstract APIs for DPDK and ODP
LF_DPDK17_DPDK with KNI – Pushing the Performance of an SDWAN Gateway to High...
LF_DPDK17_mediated devices: better userland IO
LF_DPDK17_Enhanced Memory Management
LF_DPDK17_SafetyOrange - a tiny server class multi-purpose box with DPDK
LF_DPDK17_Reflections on Mirroring With DPDK
LF_DPDK17_Implementation and Testing of Soft Patch Panel
LF_DPDK17_Accelerating P4-based Dataplane with DPDK

Recently uploaded (20)

PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Modernizing your data center with Dell and AMD
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
KodekX | Application Modernization Development
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Encapsulation theory and applications.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
A Presentation on Artificial Intelligence
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
Cloud computing and distributed systems.
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Modernizing your data center with Dell and AMD
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
KodekX | Application Modernization Development
Spectral efficient network and resource selection model in 5G networks
Mobile App Security Testing_ A Comprehensive Guide.pdf
Encapsulation theory and applications.pdf
Network Security Unit 5.pdf for BCA BBA.
NewMind AI Monthly Chronicles - July 2025
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
The AUB Centre for AI in Media Proposal.docx
A Presentation on Artificial Intelligence
Digital-Transformation-Roadmap-for-Companies.pptx
20250228 LYD VKU AI Blended-Learning.pptx
Encapsulation_ Review paper, used for researhc scholars
Building Integrated photovoltaic BIPV_UPV.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Cloud computing and distributed systems.
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...

LF_DPDK17_DPDK's best kept secret – Micro-benchmark performance tests

  • 1. DPDK’s Best Kept Secret: Micro-benchmarks M Jay Muthurajan.Jayakumar@intel.com DPDK Summit - San Jose 2017
  • 2. 2 Optimization Notice: Intel’s compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specifc to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specifc instruction sets covered by this notice. Cost reduction scenarios described are intended as examples of how a given Intel- based product, in the specifed circumstances and confgurations, may affect future costs and provide cost savings. Circumstances will vary. Intel does not guarantee any costs or cost reduction. Intel technologies’ features and benefts depend on system confguration and may require enabled hardware, software or servic activation. Performance varies depending on system confguration. No computer system can be absolutely secure. Check with your system manufacturer or retailer or learn more at https://networkbuilders.intel.com/network-technologies/intelselectfasttrackkit. No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document. Intel disclaims all express and implied warranties, including without limitation, the implied warranties of merchantability, fitness for a particular purpose, and non- infringement, as well as any warranty arising from course of performance, course of dealing, or usage in trade. This document contains information on products, services and/or processes in development. All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest forecast, schedule, specifications and roadmaps. The products and services described may contain defects or errors known as errata which may cause deviations from published specifications. Current characterized errata are available on request. Copies of documents which have an order number and are referenced in this document may be obtained by calling 1-800-548-4725 or by visiting www.intel.com/design/literature.htm. © 2017 Intel Corporation. Intel, the Intel logo, and Xeon are trademarks of Intel Corporation or its subsidiaries in the U.S. and/or other countries. *Other names and brands may be claimed as the property of others. Legal Information
  • 4. Not all slots are made equal Not all slots are made equal ! Ensure that you have plugged in your NIC card in most optimal slot
  • 5. How many lcores, you think, are there in this 2 socket server? More than 100 lcores? 96 lcores? 64 lcores?
  • 6. 6 CPU 0 CPU 1 Question: What can be Improved here?
  • 7. 7 I/O Plugged in CPU1’s Slot How much memory do you see in CPU1 node? ZERO ! CPU 0 CPU 1 CPU 0 has only One Channel memory populated. Improvements -n 4
  • 8. In Which Socket lcore # 50 resides? Socket 0 or Socket 1? u Question: u In which socket you think lcore# 50 resides? – socket 0? Or socket 1? u Assume NIC is Plugged in socket 0 u Will the performance be best or sub-optimal? More than 100 lcores Socket 0? Socket 1?
  • 9. Why Should I Care About DPDK Micro-benchmarks? u We thought lcore # 50 resides in socket 0. u But actually, you can see it is in socket 1. u So, NIC in socket 0 is actually sub-optimal. u How to quantitatively ensure that system is set for optimal performance?
  • 10. QUIZ: Cores Within A Socket – All In Same Loop?
  • 11. Demo
  • 12. Cores Within A Socket – Not equal proximity
  • 13. Prior to application level benchmarking.. u Without tightening these, if you start developing your application… u And on top of that, if you start measuring application level performance u Root cause analysis is made unnecessarily complex u u Instead… what if .. u What if you can do basic benchmarking of key performant elements / ops u You will build strong foundation first u Will help you develop Applications confidently towards overall higher performance
  • 14. What Objects, What Operations to benchmark? u In other words, what are the key high performant objects and operations? u Objects: u Ring u Mem pool u Mbuf u Operations: u Mem copy u Hash Operations u Flow Classification
  • 15. Test_hash_multiwriter_main( ) Hash – Multi-writer – Transactional Memory
  • 18. Tests: Crypto, Event, Flow Classify
  • 20. SPSC MPMC – Time Taken
  • 21. Call To Action: Where To Find Them & How It Measures?
  • 22. Optimization Notice 22 Optimization Notice Intel’s compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2®, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice. Notice revision #20110804