SlideShare a Scribd company logo
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may
be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without any notice. Copyright © 2010, Intel Corporation.
Optimizing Hadoop* Workloads
Nurcan Coskun, Ph.D.
Intel Software & Solutions Group
October 12, 2010
Acknowledgements to Jason Dai, Intel SSG, for
of the test results and optimization techniques
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
2
Legal Disclaimers
Disclaimers & Legal Notices
THE INFORMATION IS FURNISHED FOR INFORMATIONAL USE ONLY, IS SUBJECT TO CHANGE WITHOUT NOTICE, AND SHOULD
NOT BE CONSTRUED AS A COMMITMENT BY INTEL CORPORATION. INTEL CORPORATION ASSUMES NO RESPONSIBILITY OR
LIABILITY FOR ANY ERRORS OR INACCURACIES THAT MAY APPEAR IN THIS DOCUMENT OR ANY SOFTWARE THAT MAY BE
PROVIDED IN ASSOCIATION WITH THIS DOCUMENT. THIS INFORMATION IS PROVIDED "AS IS" AND INTEL DISCLAIMS ANY
EXPRESS OR IMPLIED WARRANTY, RELATING TO THE USE OF THIS INFORMATION INCLUDING WARRANTIES RELATING TO
FITNESS FOR A PARTICULAR PURPOSE, COMPLIANCE WITH A SPECIFICATION OR STANDARD, MERCHANTABILITY OR
NONINFRINGEMENT.
Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate
performance of Intel products as measured by those tests. Any difference in system hardware or software design or
configuration may affect actual performance. Buyers should consult other sources of information to evaluate the performance
of systems or components they are considering purchasing. For more information on performance tests and on the
performance of Intel products, visit Intel Performance Benchmark Limitations
INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL® PRODUCTS. NO LICENSE, EXPRESS OR
IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT
AS PROVIDED IN INTEL'S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY
WHATSOEVER, AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL
PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY,
OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. UNLESS OTHERWISE AGREED
IN WRITING BY INTEL, THE INTEL PRODUCTS ARE NOT DESIGNED NOR INTENDED FOR ANY APPLICATION IN WHICH THE
FAILURE OF THE INTEL PRODUCT COULD CREATE A SITUATION WHERE PERSONAL INJURY OR DEATH MAY OCCUR.
Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the
absence or characteristics of any features or instructions marked "reserved" or "undefined." Intel reserves these for future
definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The
information here is subject to change without notice. Do not finalize a design with this information. The products described in
this document may contain design defects or errors known as errata which may cause the product to deviate from published
specifications. Current characterized errata are available on request. Contact your local Intel sales office or your distributor to
obtain the latest specifications and before placing your product order. Copies of documents which have an order number and
are referenced in this document, or other Intel literature, may be obtained by calling 1-800-548-4725, or by visiting Intel's
Web Site http://www.intel.com/.
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
3
Why Optimize Hadoop Deployments?
Handle
More
Data
At
Lower
Cost
In
Less
Time
With
Less
Power
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
4
Workload traits drive optimization approach
4
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
5
Where to Optimize ?
Hardware Hadoop / HDFS Software
Equipment, Settings Version, Settings OS, JVM, Settings
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
6
Server Considerations
2 Socket Systems
with Intel®
Xeon® Processor
5600 series
Sweet spot for
performance,
efficiency, cost
12-24 GB DDR3 CPU intense or
HBASE may require
more.
4-6 1TB SATA
HDD 7200
Pure I/O workloads
may require more
1-2GB Ethernet Channel bonding for
increased throughput
Energy efficient
components
Gold certified power
supplies, efficient
fans, low power
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
7
Processor Choice Matters
Faster
Handles More Data
More Energy Efficient
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
8
Processor Choice Impacts Speed
Data Source: Intel internal measurements. Hadoop 0.19.1 results as of September 20, 2009 and Hadoop 0.20.2 results as of August 8, 2010.
Hardware configurations are on slide 21. Performance tests and ratings are measured using specific computer systems and/or components and reflect
the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration
may affect actual performance.
Last Year This Year
36%
Up to
faster
29%
Up to
faster
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
9
Processor Choice Impacts Throughput
• Throughput = # of tasks
completed / minute when
cluster is at 100% utilization
• Intel Xeon processor 5600
provides up to 30% more
throughput than 5500 series1
Data Source: Intel internal measurements by using Hadoop 0.20.2 as of August 8, 2010.
Hardware configurations are on slide 22. Performance tests and ratings are measured using specific computer systems and/or components and reflect
the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration
may affect actual performance.
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
Turn on Intel® Hyper-threading Technology
Intel® Hyper-threading
Technology
Increases performance for threaded
applications delivering greater throughput
and responsiveness
Up to 28% better performance1
1
Data Source: Intel internal measurements by using Hadoop 0.20.2 as of August 8, 2010.
Hardware configurations are on slide 23. Performance tests and ratings are measured using specific computer systems and/or components and reflect
the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration
may affect actual performance.
0
0.2
0.4
0.6
0.8
1
1.2
1.4
Terasort Wordcount
SMT OFF
SMT ON
Job Running Time
(Lower values are better)
10
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
Memory
• Equipping 1~3 GB of RAM per CPU core
• ECC memory is highly recommended1, to detect and correct errors
introduced during storage transmission of data.
Hard drives
• Run in AHCI mode with NCQ enabled to improve multiple
simultaneous Read/Write performance
• Enable hard drive’s write cache
1. See in the discussion mail list http://mail-archives.apache.org/mod_mbox/hadoop-core-
dev/200705.mbox/%3C465C3065.09050501@dragonflymc.com%3E
Memory & Storage
11
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
• 1-2 x 1Gigabit Ethernet per node
• Ensure multiple RX/TX queue support for multi-core processors
• Enable channel bonding to resolve network-bound workloads if needed
– E.g., Improves sort workload by 30% in job running time
1
NetworkingMap/ReduceTasks
bootstrap
map
shuffle
sort
reduce
idle
0%
20%
40%
60%
80%
100%
CPUUtilization
idle
wait I/O
system
user
0%
20%
40%
60%
80%
100%
DiskUtilization
disk
0%
20%
40%
60%
80%
100%
Network
Utilization
network
Map/ReduceTasks
bootstrap
map
shuffle
sort
reduce
idle
0%
20%
40%
60%
80%
100%
CPUUtilization
idle
wait I/O
system
user
0%
20%
40%
60%
80%
100%
DiskUtilization
disk
0%
20%
40%
60%
80%
100%
Network
Utilization
network
I/O improves
substantially
Sort – no channel bonding Sort – channel bonding
100% network
utilization without
channel bonding
12
1
Data Source: Intel internal measurements by using Hadoop 0.20.2 as of August 8, 2010. Hardware configurations are on slide 24. Performance tests
and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured
by those tests. Any difference in system hardware or software design or configuration may affect actual performance.
time time
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
Hadoop Disk Drives
13
Doubling disk drives  >2x Speedup
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
14
OS
•Use a Linux* distribution based on kernel version 2.6.30 or
later because of the optimizations included for energy and
threading efficiency
– For Example: energy consumption can be up to 60 percent (42 watts)
higher at idle for each server using older versions of Linux
•Optimize Linux* configurations
– Linux open file descriptor limit using /etc/security/limits.conf
• Default 1024 is too low for Hadoop daemon, and try to increase to approximately 64,000
– In kernel 2.6.28, epoll file descriptor limit using /etc/sysctl.conf
• Default 128 is too low for Hadoop daemon, and try to increase to approximately 4096
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
15
JVM
JVM (set in hadoop-env.sh)
• Prefer Sun Hotspot Java Runtime Environment
• Prefer 1.6 update 14 or later 64-bit version JVM
• “-server” option
– Recommend for Hadoop framework processes (E.g., JobTracker, Namenode), targeting at the production
deployments
• Specific GC related options for framework process
– E.g., Using parallel GC algorithm -XX:ParallelGCThreads=8 -XX:+UseConcMarkSweepGC
• Set the parameter java.net.preferIPv4Stack to true as well.
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
16
Choosing a proper codec for your IO intensive workloads
Data Compression
• Compress data wherever possible
• Reduces storage footprint
• Speed I/O bound workloads
• Set mapred.output.compress
and/or
mapred.compress.map.output to
be true
• Consider LZO format
• Terasort with LZO compression:
• 60% faster than uncompressed
• 56% faster than zlib
Data Source: Intel internal measurements by using Hadoop 0.20.2 as of August 8, 2010. Hardware configurations are on slide 25. Performance tests
and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured
by those tests. Any difference in system hardware or software design or configuration may affect actual performance.
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
17
Hadoop Configuration Tuning
1. Increase DFS block size
• dfs.block.size
– HDFS file block size, to use larger block size (such as 128M or 256M) for large file system.
• E.g., Increasing block size from 128M to 256M saves Terasort running time by 7%
2. Supply enough handlers (HDFS)
– dfs.datanode.max.xcievers
• The maximum number of threads that can be connected to a data node simultaneously, set
larger number (e.g., 2048) rather than the default value 256.
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
18
For More Detail – See Intel’s Recent Paper
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
19
Summary
• Tune and optimize Hadoop case by case
• Most of the Hadoop applications are data-intensive
• Tune your IO related application subsystem first
• Processor choice matters:
• X5670 (Westmere) shows 20-40% improvement for CPU-intensive workloads over
X5570 (Nehalem)1
• For I/O Intensive workloads – consider scaling HDD with core count
• Performance tuning tips:
• Channel bonding can reduce the network bottleneck for I/O intensive workloads
• Using larger DFS block size decreases task overhead
• Enabling HT shows gains up to 28% for CPU intensive workloads2
• Using LZO can significantly improve TeraSort results
1,2
Data Source: Intel internal measurements by using Hadoop 0.20.2 as of August 8, 2010.
Hardware configurations (speed test) are on slide 21. Hardware configurations (HyperThreading) are on slide 23. Performance tests and ratings are
measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those
tests. Any difference in system hardware or software design or configuration may affect actual performance.
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
20
Backup
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
Cluster Configurations Information
(Slide: “Processor Choice Impacts Speed”)
Source: Intel internal measurement as of September 19 2009 running Hadoop*, WordCount, and TeraSort
Intel® Xeon® X5460-based server
Processor: Dual-socket quad-core Intel® Xeon® X5460 3.16GHz
Processor Memory: 16GB (DDR2 FBDIM ECC 667MHz) RAM
Storage: 1 X 300GB 15K RPM SAS disk for system and log files, 4 X 1TB 7200RPM SATA for HDFS and intermediate results
Network: 1 Gigabit Ethernet NIC
BIOS: BIOS version S5000.86B.10.60.0091.100920081631EIST (Enhanced Intel SpeedStep Technology) disabled both hardware prefetcher and
adjacent cache-line, prefetch disable
Intel® Xeon® X5570-based server
Processor: Dual-socket quad-core Intel® Xeon® X5570 2.93GHz
Processor Memory: 16GB (DDR3 ECC 1333MHz) RAM
Storage: 1 X 1TB 7200RPM SATA for system and log files, 4 X 1TB 7200RPM SATA for HDFS and intermediate results
Network: 1 Gigabit Ethernet NIC
BIOS: BIOS version 4.6.3 Both EIST (Enhanced Intel SpeedStep Technology) and Turbo mode disabled both hardware prefetcher and adjacent cache-
line prefetch enabled, SMT (Simultaneous MultiThreading), enabled (Disabling hardware prefetcher and adjacent cache-line prefetch helps improve
Hadoop performance on Xeon X5460 server according to our benchmarking.)
Source: Intel internal measurement as of August 8, 2010 running Hadoop* WordCount and TeraSort.
Results: WordCount single job running time was 407 seconds on the Xeon® 5500® processor series and 289 seconds on the Intel® Xeon® 5600
processor series. TeraSort single job running time was 2,541 seconds on the Xeon processor 5500 series and 2,182 seconds on the Intel Xeon processor
5600 series.
Hardware, cluster configuration, and settings were as follows:
(1 Namenode/JobTracker + 5 DataNode/TaskTracker, each has two port 1 GbE connectivity to a single GbE switch with channel bonding enabled.) Intel
Xeon processor 5600 series servers: HP ProLiant* z6000 G6 Server with 2x Intel® Xeon® processor X5670 2.93 GHz (12 cores per node), 24 GB DDR3
RAM, 6 SATA disks per node (All six for HDFS and intermediate results, sharing one for system and log files with isolated partition). Both EIST
(Enhanced Intel® SpeedStep Technology) and Turbo mode disabled. Both hardware prefetcher and adjacent cache-line prefetch enabled. Intel® Hyper-
Threading Technology enabled. Intel Xeon processor 5500 series servers: HP ProLiant z6000 G6 Server with 2x Intel® Xeon® processor X5570 2.93
GHz (8 cores per node), 24 GB DDR3 RAM, 6 SATA disks per node (All six for HDFS and intermediate results, sharing one for system and log files with
isolated partition). Both EIST (Enhanced Intel® SpeedStep Technology) and Turbo mode disabled. Both hardware prefetcher and adjacent cache-line
prefetch enabled. Intel Hyper-Threading Technology enabled. Software: Red Hat Enterprise Linux* 5.4 (with kernel 2.6.30x86_64). Ext4 file system,
mounted with “noatime,nodiratime” options.). Sun JVM 1.6 (Java* version 1.6.0_14 Java SE Runtime Environment Java HotSpot* 64-bit server virtual
machine). Cloudera distribution of Hadoop [hadoop-0.20.2-CDH3 beta 2 (hadoop patch level 320)].
21
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
Cluster Configurations Information
(Slide: “Processor Choice Impacts Throughput”)
Source: Intel internal measurement as of August 8, 2010 running Hadoop* WordCount and TeraSort.
Results: Total completed tasks per minute of WordCount over Intel® Xeon® processor 5500 series
was approximately 71.58, and over Intel® Xeon® process 5600 series was approximately 93.22.
Hardware, cluster configuration, and settings were as follows:
(1 Namenode/JobTracker + 5 DataNode/TaskTracker, each has two port 1 GbE connectivity to a
single GbE switch with channel bonding enabled.) Intel Xeon processor 5600 series servers: HP
ProLiant* z6000 G6 Server with 2x Intel Xeon processor X5670 2.93 GHz (12 cores per node), 24 GB
DDR3 RAM, 6 SATA disks per node (All six for HDFS and intermediate results, sharing one for system
and log files with isolated partition). Both EIST (Enhanced Intel® SpeedStep Technology) and Turbo
mode disabled. Both hardware prefetcher and adjacent cache-line prefetch enabled. Intel® Hyper-
Threading Technology enabled. Intel Xeon processor 5500 series servers: HP ProLiant z6000 G6
Server with 2x Intel Xeon processor X5570 2.93 GHz (8 cores per node), 24 GB DDR3 RAM, 6 SATA
disks per node (All six for HDFS and intermediate results, sharing one for system and log files with
isolated partition). Both EIST (Enhanced Intel® SpeedStep Technology) and Turbo mode disabled.
Both hardware prefetcher and adjacent cache-line prefetch enabled. Intel Hyper-Threading
Technology enabled. Software: Red Hat Enterprise Linux* 5.4 (with kernel 2.6.30x86_64). Ext4 file
system, mounted with “noatime,nodiratime” options.). Sun JVM 1.6 (Java* version 1.6.0_14 Java SE
Runtime Environment Java HotSpot* 64-bit server virtual machine). Cloudera distribution of Hadoop
[hadoop-0.20.2-CDH3 beta 2 (hadoop patch level 320)]
22
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
Cluster Configurations Information
(Slide: “Intel® Hyper-threading Technology”)
Source: Intel internal measurement as of August 8, 2010 based on the following cluster and server
configuration: 6 nodes (1 NameNode/JobTracker, 5 DataNode/TaskTracker) in each, configured with
2GbE connectivity to each server. Intel® Xeon® processor 5600 series servers: HP ProLiant* z6000
G6 Server 2 x Intel® Xeon® processor X5670 2.93 GHz (12 cores per node), 24 GB DDR3 RAM, 6
SATA disks per node (All six for HDFS and intermediate results, sharing one for file system and log
files with isolated partition). Both EIST (Enhanced Intel® SpeedStep Technology) and Turbo mode
disabled. Both hardware prefetcher and adjacent cache-line prefetch enabled. Intel® Hyper-
Threading Technology (Intel® HT Technology) requires a computer system with an Intel® processor
supporting Intel HT Technology and an Intel HT Technology-enabled chipset, BIOS, and operating
system. Performance will vary depending on the specific hardware and software you use.
See www.intel.com/products/ht/hyperthreading_more.htm for more information including details on
which processors support Intel HT Technology.
23
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
Cluster Configurations Information
(Slide: “Networking”)
24
Source: Intel internal measurement as of August 8, 2010 running Hadoop* WordCount and TeraSort.
Hardware, cluster configuration, and settings were as follows:
(1 Namenode/JobTracker + 5 DataNode/TaskTracker, each has two port 1 GbE connectivity to a single
GbE switch with channel bonding enabled.) Intel Xeon processor 5600 series servers: HP ProLiant*
z6000 G6 Server with 2x Intel Xeon processor X5670 2.93 GHz (12 cores per node), 24 GB DDR3
RAM, 6 SATA disks per node (All six for HDFS and intermediate results, sharing one for system and
log files with isolated partition). Both EIST (Enhanced Intel® SpeedStep Technology) and Turbo mode
disabled. Both hardware prefetcher and adjacent cache-line prefetch enabled. Intel® Hyper-Threading
Technology enabled. Intel Xeon processor 5500 series servers: HP ProLiant z6000 G6 Server with 2x
Intel Xeon processor X5570 2.93 GHz (8 cores per node), 24 GB DDR3 RAM, 6 SATA disks per node
(All six for HDFS and intermediate results, sharing one for system and log files with isolated
partition). Both EIST (Enhanced Intel® SpeedStep Technology) and Turbo mode disabled. Both
hardware prefetcher and adjacent cache-line prefetch enabled. Intel Hyper-Threading Technology
enabled. Software: Red Hat Enterprise Linux* 5.4 (with kernel 2.6.30x86_64). Ext4 file system,
mounted with “noatime,nodiratime” options.). Sun JVM 1.6 (Java* version 1.6.0_14 Java SE Runtime
Environment Java HotSpot* 64-bit server virtual machine). Cloudera distribution of Hadoop [hadoop-
0.20.2-CDH3 beta 2 (hadoop patch level 320)]
Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands
may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation.
Cluster Configurations Information
(Slide: “Data Compression”)
Source: Intel internal measurement as of August 8,2010 running Hadoop* TeraSort.
Results: TeraSort single job running time was 1477 seconds without compression, 1256 seconds with
default(zlib) compression, and 586 seconds with LZO compression.
Hardware, cluster configuration, and settings were as follows: (1 NameNode/JobTracker + 32
DataNode/TaskTracker; each has 1 port 1 GbE connectivity to a single GbE switch) Intel Xeon
processor 5500 series servers: 2x Xeon processor X5570 2.93 GHz (8 cores per node), 24 GB DDR3
RAM, 4 SATA disks per node (All 4 for HDFS and intermediate results, sharing 1 for system and log
files with isolated partition). Both EIST (Enhanced Intel® SpeedStep Technology) and Turbo mode
disabled. Both hardware prefetcher and adjacent cache-line prefetch enabled. Intel Hyper-Threading
Technology enabled. Software: Red Hat Enterprise Linux* 5.4 (with kernel 2.6.30.10 x86_64). Ext3
filesystem, mounted with “noatime,nodiratime” options.). Sun JVM 1.6 (Java* version 1.6.0_14 Java
SE Runtime Environment Java HotSpot* 64-bit server virtual machine). Hadoop 0.20.1 version
25

More Related Content

PDF
Software-defined Visualization, High-Fidelity Visualization: OpenSWR and OSPRay
PDF
Embree Ray Tracing Kernels
PDF
How Funcom Increased Play Time in Lego Minifigures by 40%
PDF
Austin Cherian: Big data and HPC technologies - intel
PDF
Crooke CWF Keynote FINAL final platinum
PDF
OIT to Volumetric Shadow Mapping, 101 Uses for Raster-Ordered Views using Dir...
PPTX
Ready access to high performance Python with Intel Distribution for Python 2018
Software-defined Visualization, High-Fidelity Visualization: OpenSWR and OSPRay
Embree Ray Tracing Kernels
How Funcom Increased Play Time in Lego Minifigures by 40%
Austin Cherian: Big data and HPC technologies - intel
Crooke CWF Keynote FINAL final platinum
OIT to Volumetric Shadow Mapping, 101 Uses for Raster-Ordered Views using Dir...
Ready access to high performance Python with Intel Distribution for Python 2018

Similar to Intel - Nurcan Coskun - Hadoop World 2010 (20)

PDF
Hw09 Optimizing Hadoop Deployments
PDF
Hw09 Optimizing Hadoop Deployments
PDF
High Performance Computing: The Essential tool for a Knowledge Economy
PPTX
E5 Intel Xeon Processor E5 Family Making the Business Case
PDF
Intel Technologies for High Performance Computing
PDF
Lynn Comp - Intel Big Data & Cloud Summit 2013 (2)
PDF
Driving Industrial InnovationOn the Path to Exascale
PDF
Intel HPC Update
PDF
O uso de tecnologias Intel na implantação de sistemas de alto desempenho
PDF
Xeon E5 Making the Business Case PowerPoint
PDF
Improving the performance of OpenSubdiv* on Intel Architecture
PDF
TDC2018SP | Trilha IA - Inteligencia Artificial na Arquitetura Intel
PDF
Intel: How to Use Alluxio to Accelerate BigData Analytics on the Cloud and Ne...
PPTX
Intel® Select Solutions for the Network
PDF
Intel® Xeon® Processor E7-8800/4800 v4 EAMG 2.0
PDF
TDC2017 | São Paulo - Trilha Machine Learning How we figured out we had a SRE...
PDF
Tendências da junção entre Big Data Analytics, Machine Learning e Supercomput...
PDF
Intel NFVi Enabling Kit Demo/Lab
PDF
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
PDF
4 dpdk roadmap(1)
Hw09 Optimizing Hadoop Deployments
Hw09 Optimizing Hadoop Deployments
High Performance Computing: The Essential tool for a Knowledge Economy
E5 Intel Xeon Processor E5 Family Making the Business Case
Intel Technologies for High Performance Computing
Lynn Comp - Intel Big Data & Cloud Summit 2013 (2)
Driving Industrial InnovationOn the Path to Exascale
Intel HPC Update
O uso de tecnologias Intel na implantação de sistemas de alto desempenho
Xeon E5 Making the Business Case PowerPoint
Improving the performance of OpenSubdiv* on Intel Architecture
TDC2018SP | Trilha IA - Inteligencia Artificial na Arquitetura Intel
Intel: How to Use Alluxio to Accelerate BigData Analytics on the Cloud and Ne...
Intel® Select Solutions for the Network
Intel® Xeon® Processor E7-8800/4800 v4 EAMG 2.0
TDC2017 | São Paulo - Trilha Machine Learning How we figured out we had a SRE...
Tendências da junção entre Big Data Analytics, Machine Learning e Supercomput...
Intel NFVi Enabling Kit Demo/Lab
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
4 dpdk roadmap(1)
Ad

More from Cloudera, Inc. (20)

PPTX
Partner Briefing_January 25 (FINAL).pptx
PPTX
Cloudera Data Impact Awards 2021 - Finalists
PPTX
2020 Cloudera Data Impact Awards Finalists
PPTX
Edc event vienna presentation 1 oct 2019
PPTX
Machine Learning with Limited Labeled Data 4/3/19
PPTX
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
PPTX
Introducing Cloudera DataFlow (CDF) 2.13.19
PPTX
Introducing Cloudera Data Science Workbench for HDP 2.12.19
PPTX
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
PPTX
Leveraging the cloud for analytics and machine learning 1.29.19
PPTX
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
PPTX
Leveraging the Cloud for Big Data Analytics 12.11.18
PPTX
Modern Data Warehouse Fundamentals Part 3
PPTX
Modern Data Warehouse Fundamentals Part 2
PPTX
Modern Data Warehouse Fundamentals Part 1
PPTX
Extending Cloudera SDX beyond the Platform
PPTX
Federated Learning: ML with Privacy on the Edge 11.15.18
PPTX
Analyst Webinar: Doing a 180 on Customer 360
PPTX
Build a modern platform for anti-money laundering 9.19.18
PPTX
Introducing the data science sandbox as a service 8.30.18
Partner Briefing_January 25 (FINAL).pptx
Cloudera Data Impact Awards 2021 - Finalists
2020 Cloudera Data Impact Awards Finalists
Edc event vienna presentation 1 oct 2019
Machine Learning with Limited Labeled Data 4/3/19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Leveraging the cloud for analytics and machine learning 1.29.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Leveraging the Cloud for Big Data Analytics 12.11.18
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 1
Extending Cloudera SDX beyond the Platform
Federated Learning: ML with Privacy on the Edge 11.15.18
Analyst Webinar: Doing a 180 on Customer 360
Build a modern platform for anti-money laundering 9.19.18
Introducing the data science sandbox as a service 8.30.18
Ad

Recently uploaded (20)

PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Empathic Computing: Creating Shared Understanding
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Machine learning based COVID-19 study performance prediction
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Cloud computing and distributed systems.
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPT
Teaching material agriculture food technology
PDF
KodekX | Application Modernization Development
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Electronic commerce courselecture one. Pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Building Integrated photovoltaic BIPV_UPV.pdf
NewMind AI Weekly Chronicles - August'25 Week I
Empathic Computing: Creating Shared Understanding
Network Security Unit 5.pdf for BCA BBA.
NewMind AI Monthly Chronicles - July 2025
Encapsulation_ Review paper, used for researhc scholars
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Machine learning based COVID-19 study performance prediction
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
Cloud computing and distributed systems.
Mobile App Security Testing_ A Comprehensive Guide.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Teaching material agriculture food technology
KodekX | Application Modernization Development
The AUB Centre for AI in Media Proposal.docx
Electronic commerce courselecture one. Pdf

Intel - Nurcan Coskun - Hadoop World 2010

  • 1. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without any notice. Copyright © 2010, Intel Corporation. Optimizing Hadoop* Workloads Nurcan Coskun, Ph.D. Intel Software & Solutions Group October 12, 2010 Acknowledgements to Jason Dai, Intel SSG, for of the test results and optimization techniques
  • 2. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. 2 Legal Disclaimers Disclaimers & Legal Notices THE INFORMATION IS FURNISHED FOR INFORMATIONAL USE ONLY, IS SUBJECT TO CHANGE WITHOUT NOTICE, AND SHOULD NOT BE CONSTRUED AS A COMMITMENT BY INTEL CORPORATION. INTEL CORPORATION ASSUMES NO RESPONSIBILITY OR LIABILITY FOR ANY ERRORS OR INACCURACIES THAT MAY APPEAR IN THIS DOCUMENT OR ANY SOFTWARE THAT MAY BE PROVIDED IN ASSOCIATION WITH THIS DOCUMENT. THIS INFORMATION IS PROVIDED "AS IS" AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO THE USE OF THIS INFORMATION INCLUDING WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, COMPLIANCE WITH A SPECIFICATION OR STANDARD, MERCHANTABILITY OR NONINFRINGEMENT. Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. Buyers should consult other sources of information to evaluate the performance of systems or components they are considering purchasing. For more information on performance tests and on the performance of Intel products, visit Intel Performance Benchmark Limitations INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL® PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL'S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. UNLESS OTHERWISE AGREED IN WRITING BY INTEL, THE INTEL PRODUCTS ARE NOT DESIGNED NOR INTENDED FOR ANY APPLICATION IN WHICH THE FAILURE OF THE INTEL PRODUCT COULD CREATE A SITUATION WHERE PERSONAL INJURY OR DEATH MAY OCCUR. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked "reserved" or "undefined." Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The information here is subject to change without notice. Do not finalize a design with this information. The products described in this document may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing your product order. Copies of documents which have an order number and are referenced in this document, or other Intel literature, may be obtained by calling 1-800-548-4725, or by visiting Intel's Web Site http://www.intel.com/.
  • 3. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. 3 Why Optimize Hadoop Deployments? Handle More Data At Lower Cost In Less Time With Less Power
  • 4. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. 4 Workload traits drive optimization approach 4
  • 5. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. 5 Where to Optimize ? Hardware Hadoop / HDFS Software Equipment, Settings Version, Settings OS, JVM, Settings
  • 6. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. 6 Server Considerations 2 Socket Systems with Intel® Xeon® Processor 5600 series Sweet spot for performance, efficiency, cost 12-24 GB DDR3 CPU intense or HBASE may require more. 4-6 1TB SATA HDD 7200 Pure I/O workloads may require more 1-2GB Ethernet Channel bonding for increased throughput Energy efficient components Gold certified power supplies, efficient fans, low power
  • 7. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. 7 Processor Choice Matters Faster Handles More Data More Energy Efficient
  • 8. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. 8 Processor Choice Impacts Speed Data Source: Intel internal measurements. Hadoop 0.19.1 results as of September 20, 2009 and Hadoop 0.20.2 results as of August 8, 2010. Hardware configurations are on slide 21. Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. Last Year This Year 36% Up to faster 29% Up to faster
  • 9. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. 9 Processor Choice Impacts Throughput • Throughput = # of tasks completed / minute when cluster is at 100% utilization • Intel Xeon processor 5600 provides up to 30% more throughput than 5500 series1 Data Source: Intel internal measurements by using Hadoop 0.20.2 as of August 8, 2010. Hardware configurations are on slide 22. Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance.
  • 10. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. Turn on Intel® Hyper-threading Technology Intel® Hyper-threading Technology Increases performance for threaded applications delivering greater throughput and responsiveness Up to 28% better performance1 1 Data Source: Intel internal measurements by using Hadoop 0.20.2 as of August 8, 2010. Hardware configurations are on slide 23. Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. 0 0.2 0.4 0.6 0.8 1 1.2 1.4 Terasort Wordcount SMT OFF SMT ON Job Running Time (Lower values are better) 10
  • 11. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. Memory • Equipping 1~3 GB of RAM per CPU core • ECC memory is highly recommended1, to detect and correct errors introduced during storage transmission of data. Hard drives • Run in AHCI mode with NCQ enabled to improve multiple simultaneous Read/Write performance • Enable hard drive’s write cache 1. See in the discussion mail list http://mail-archives.apache.org/mod_mbox/hadoop-core- dev/200705.mbox/%3C465C3065.09050501@dragonflymc.com%3E Memory & Storage 11
  • 12. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. • 1-2 x 1Gigabit Ethernet per node • Ensure multiple RX/TX queue support for multi-core processors • Enable channel bonding to resolve network-bound workloads if needed – E.g., Improves sort workload by 30% in job running time 1 NetworkingMap/ReduceTasks bootstrap map shuffle sort reduce idle 0% 20% 40% 60% 80% 100% CPUUtilization idle wait I/O system user 0% 20% 40% 60% 80% 100% DiskUtilization disk 0% 20% 40% 60% 80% 100% Network Utilization network Map/ReduceTasks bootstrap map shuffle sort reduce idle 0% 20% 40% 60% 80% 100% CPUUtilization idle wait I/O system user 0% 20% 40% 60% 80% 100% DiskUtilization disk 0% 20% 40% 60% 80% 100% Network Utilization network I/O improves substantially Sort – no channel bonding Sort – channel bonding 100% network utilization without channel bonding 12 1 Data Source: Intel internal measurements by using Hadoop 0.20.2 as of August 8, 2010. Hardware configurations are on slide 24. Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. time time
  • 13. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. Hadoop Disk Drives 13 Doubling disk drives  >2x Speedup
  • 14. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. 14 OS •Use a Linux* distribution based on kernel version 2.6.30 or later because of the optimizations included for energy and threading efficiency – For Example: energy consumption can be up to 60 percent (42 watts) higher at idle for each server using older versions of Linux •Optimize Linux* configurations – Linux open file descriptor limit using /etc/security/limits.conf • Default 1024 is too low for Hadoop daemon, and try to increase to approximately 64,000 – In kernel 2.6.28, epoll file descriptor limit using /etc/sysctl.conf • Default 128 is too low for Hadoop daemon, and try to increase to approximately 4096
  • 15. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. 15 JVM JVM (set in hadoop-env.sh) • Prefer Sun Hotspot Java Runtime Environment • Prefer 1.6 update 14 or later 64-bit version JVM • “-server” option – Recommend for Hadoop framework processes (E.g., JobTracker, Namenode), targeting at the production deployments • Specific GC related options for framework process – E.g., Using parallel GC algorithm -XX:ParallelGCThreads=8 -XX:+UseConcMarkSweepGC • Set the parameter java.net.preferIPv4Stack to true as well.
  • 16. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. 16 Choosing a proper codec for your IO intensive workloads Data Compression • Compress data wherever possible • Reduces storage footprint • Speed I/O bound workloads • Set mapred.output.compress and/or mapred.compress.map.output to be true • Consider LZO format • Terasort with LZO compression: • 60% faster than uncompressed • 56% faster than zlib Data Source: Intel internal measurements by using Hadoop 0.20.2 as of August 8, 2010. Hardware configurations are on slide 25. Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance.
  • 17. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. 17 Hadoop Configuration Tuning 1. Increase DFS block size • dfs.block.size – HDFS file block size, to use larger block size (such as 128M or 256M) for large file system. • E.g., Increasing block size from 128M to 256M saves Terasort running time by 7% 2. Supply enough handlers (HDFS) – dfs.datanode.max.xcievers • The maximum number of threads that can be connected to a data node simultaneously, set larger number (e.g., 2048) rather than the default value 256.
  • 18. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. 18 For More Detail – See Intel’s Recent Paper
  • 19. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. 19 Summary • Tune and optimize Hadoop case by case • Most of the Hadoop applications are data-intensive • Tune your IO related application subsystem first • Processor choice matters: • X5670 (Westmere) shows 20-40% improvement for CPU-intensive workloads over X5570 (Nehalem)1 • For I/O Intensive workloads – consider scaling HDD with core count • Performance tuning tips: • Channel bonding can reduce the network bottleneck for I/O intensive workloads • Using larger DFS block size decreases task overhead • Enabling HT shows gains up to 28% for CPU intensive workloads2 • Using LZO can significantly improve TeraSort results 1,2 Data Source: Intel internal measurements by using Hadoop 0.20.2 as of August 8, 2010. Hardware configurations (speed test) are on slide 21. Hardware configurations (HyperThreading) are on slide 23. Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance.
  • 20. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. 20 Backup
  • 21. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. Cluster Configurations Information (Slide: “Processor Choice Impacts Speed”) Source: Intel internal measurement as of September 19 2009 running Hadoop*, WordCount, and TeraSort Intel® Xeon® X5460-based server Processor: Dual-socket quad-core Intel® Xeon® X5460 3.16GHz Processor Memory: 16GB (DDR2 FBDIM ECC 667MHz) RAM Storage: 1 X 300GB 15K RPM SAS disk for system and log files, 4 X 1TB 7200RPM SATA for HDFS and intermediate results Network: 1 Gigabit Ethernet NIC BIOS: BIOS version S5000.86B.10.60.0091.100920081631EIST (Enhanced Intel SpeedStep Technology) disabled both hardware prefetcher and adjacent cache-line, prefetch disable Intel® Xeon® X5570-based server Processor: Dual-socket quad-core Intel® Xeon® X5570 2.93GHz Processor Memory: 16GB (DDR3 ECC 1333MHz) RAM Storage: 1 X 1TB 7200RPM SATA for system and log files, 4 X 1TB 7200RPM SATA for HDFS and intermediate results Network: 1 Gigabit Ethernet NIC BIOS: BIOS version 4.6.3 Both EIST (Enhanced Intel SpeedStep Technology) and Turbo mode disabled both hardware prefetcher and adjacent cache- line prefetch enabled, SMT (Simultaneous MultiThreading), enabled (Disabling hardware prefetcher and adjacent cache-line prefetch helps improve Hadoop performance on Xeon X5460 server according to our benchmarking.) Source: Intel internal measurement as of August 8, 2010 running Hadoop* WordCount and TeraSort. Results: WordCount single job running time was 407 seconds on the Xeon® 5500® processor series and 289 seconds on the Intel® Xeon® 5600 processor series. TeraSort single job running time was 2,541 seconds on the Xeon processor 5500 series and 2,182 seconds on the Intel Xeon processor 5600 series. Hardware, cluster configuration, and settings were as follows: (1 Namenode/JobTracker + 5 DataNode/TaskTracker, each has two port 1 GbE connectivity to a single GbE switch with channel bonding enabled.) Intel Xeon processor 5600 series servers: HP ProLiant* z6000 G6 Server with 2x Intel® Xeon® processor X5670 2.93 GHz (12 cores per node), 24 GB DDR3 RAM, 6 SATA disks per node (All six for HDFS and intermediate results, sharing one for system and log files with isolated partition). Both EIST (Enhanced Intel® SpeedStep Technology) and Turbo mode disabled. Both hardware prefetcher and adjacent cache-line prefetch enabled. Intel® Hyper- Threading Technology enabled. Intel Xeon processor 5500 series servers: HP ProLiant z6000 G6 Server with 2x Intel® Xeon® processor X5570 2.93 GHz (8 cores per node), 24 GB DDR3 RAM, 6 SATA disks per node (All six for HDFS and intermediate results, sharing one for system and log files with isolated partition). Both EIST (Enhanced Intel® SpeedStep Technology) and Turbo mode disabled. Both hardware prefetcher and adjacent cache-line prefetch enabled. Intel Hyper-Threading Technology enabled. Software: Red Hat Enterprise Linux* 5.4 (with kernel 2.6.30x86_64). Ext4 file system, mounted with “noatime,nodiratime” options.). Sun JVM 1.6 (Java* version 1.6.0_14 Java SE Runtime Environment Java HotSpot* 64-bit server virtual machine). Cloudera distribution of Hadoop [hadoop-0.20.2-CDH3 beta 2 (hadoop patch level 320)]. 21
  • 22. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. Cluster Configurations Information (Slide: “Processor Choice Impacts Throughput”) Source: Intel internal measurement as of August 8, 2010 running Hadoop* WordCount and TeraSort. Results: Total completed tasks per minute of WordCount over Intel® Xeon® processor 5500 series was approximately 71.58, and over Intel® Xeon® process 5600 series was approximately 93.22. Hardware, cluster configuration, and settings were as follows: (1 Namenode/JobTracker + 5 DataNode/TaskTracker, each has two port 1 GbE connectivity to a single GbE switch with channel bonding enabled.) Intel Xeon processor 5600 series servers: HP ProLiant* z6000 G6 Server with 2x Intel Xeon processor X5670 2.93 GHz (12 cores per node), 24 GB DDR3 RAM, 6 SATA disks per node (All six for HDFS and intermediate results, sharing one for system and log files with isolated partition). Both EIST (Enhanced Intel® SpeedStep Technology) and Turbo mode disabled. Both hardware prefetcher and adjacent cache-line prefetch enabled. Intel® Hyper- Threading Technology enabled. Intel Xeon processor 5500 series servers: HP ProLiant z6000 G6 Server with 2x Intel Xeon processor X5570 2.93 GHz (8 cores per node), 24 GB DDR3 RAM, 6 SATA disks per node (All six for HDFS and intermediate results, sharing one for system and log files with isolated partition). Both EIST (Enhanced Intel® SpeedStep Technology) and Turbo mode disabled. Both hardware prefetcher and adjacent cache-line prefetch enabled. Intel Hyper-Threading Technology enabled. Software: Red Hat Enterprise Linux* 5.4 (with kernel 2.6.30x86_64). Ext4 file system, mounted with “noatime,nodiratime” options.). Sun JVM 1.6 (Java* version 1.6.0_14 Java SE Runtime Environment Java HotSpot* 64-bit server virtual machine). Cloudera distribution of Hadoop [hadoop-0.20.2-CDH3 beta 2 (hadoop patch level 320)] 22
  • 23. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. Cluster Configurations Information (Slide: “Intel® Hyper-threading Technology”) Source: Intel internal measurement as of August 8, 2010 based on the following cluster and server configuration: 6 nodes (1 NameNode/JobTracker, 5 DataNode/TaskTracker) in each, configured with 2GbE connectivity to each server. Intel® Xeon® processor 5600 series servers: HP ProLiant* z6000 G6 Server 2 x Intel® Xeon® processor X5670 2.93 GHz (12 cores per node), 24 GB DDR3 RAM, 6 SATA disks per node (All six for HDFS and intermediate results, sharing one for file system and log files with isolated partition). Both EIST (Enhanced Intel® SpeedStep Technology) and Turbo mode disabled. Both hardware prefetcher and adjacent cache-line prefetch enabled. Intel® Hyper- Threading Technology (Intel® HT Technology) requires a computer system with an Intel® processor supporting Intel HT Technology and an Intel HT Technology-enabled chipset, BIOS, and operating system. Performance will vary depending on the specific hardware and software you use. See www.intel.com/products/ht/hyperthreading_more.htm for more information including details on which processors support Intel HT Technology. 23
  • 24. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. Cluster Configurations Information (Slide: “Networking”) 24 Source: Intel internal measurement as of August 8, 2010 running Hadoop* WordCount and TeraSort. Hardware, cluster configuration, and settings were as follows: (1 Namenode/JobTracker + 5 DataNode/TaskTracker, each has two port 1 GbE connectivity to a single GbE switch with channel bonding enabled.) Intel Xeon processor 5600 series servers: HP ProLiant* z6000 G6 Server with 2x Intel Xeon processor X5670 2.93 GHz (12 cores per node), 24 GB DDR3 RAM, 6 SATA disks per node (All six for HDFS and intermediate results, sharing one for system and log files with isolated partition). Both EIST (Enhanced Intel® SpeedStep Technology) and Turbo mode disabled. Both hardware prefetcher and adjacent cache-line prefetch enabled. Intel® Hyper-Threading Technology enabled. Intel Xeon processor 5500 series servers: HP ProLiant z6000 G6 Server with 2x Intel Xeon processor X5570 2.93 GHz (8 cores per node), 24 GB DDR3 RAM, 6 SATA disks per node (All six for HDFS and intermediate results, sharing one for system and log files with isolated partition). Both EIST (Enhanced Intel® SpeedStep Technology) and Turbo mode disabled. Both hardware prefetcher and adjacent cache-line prefetch enabled. Intel Hyper-Threading Technology enabled. Software: Red Hat Enterprise Linux* 5.4 (with kernel 2.6.30x86_64). Ext4 file system, mounted with “noatime,nodiratime” options.). Sun JVM 1.6 (Java* version 1.6.0_14 Java SE Runtime Environment Java HotSpot* 64-bit server virtual machine). Cloudera distribution of Hadoop [hadoop- 0.20.2-CDH3 beta 2 (hadoop patch level 320)]
  • 25. Intel and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Other names and brands may be claimed as the property of others. All products, dates, and figures are preliminary and are subject to change without notice. Copyright © 2010, Intel Corporation. Cluster Configurations Information (Slide: “Data Compression”) Source: Intel internal measurement as of August 8,2010 running Hadoop* TeraSort. Results: TeraSort single job running time was 1477 seconds without compression, 1256 seconds with default(zlib) compression, and 586 seconds with LZO compression. Hardware, cluster configuration, and settings were as follows: (1 NameNode/JobTracker + 32 DataNode/TaskTracker; each has 1 port 1 GbE connectivity to a single GbE switch) Intel Xeon processor 5500 series servers: 2x Xeon processor X5570 2.93 GHz (8 cores per node), 24 GB DDR3 RAM, 4 SATA disks per node (All 4 for HDFS and intermediate results, sharing 1 for system and log files with isolated partition). Both EIST (Enhanced Intel® SpeedStep Technology) and Turbo mode disabled. Both hardware prefetcher and adjacent cache-line prefetch enabled. Intel Hyper-Threading Technology enabled. Software: Red Hat Enterprise Linux* 5.4 (with kernel 2.6.30.10 x86_64). Ext3 filesystem, mounted with “noatime,nodiratime” options.). Sun JVM 1.6 (Java* version 1.6.0_14 Java SE Runtime Environment Java HotSpot* 64-bit server virtual machine). Hadoop 0.20.1 version 25