SlideShare a Scribd company logo
Tuning Java Servers
Srinath Perera
Outline
 With size of systems growing, Performance
become increasingly critical for both R&D as
well as support
o Less performance => more machines => lot of
Devops costs and potential for failures
 We will do
o Bit of Theory
o Then to Profiling
 Will discuss different profiling views, and most
importantly when to use them.
Based on article
http://www.infoq.com/articles/Tuning-Java-Servers
Big Pic
What is Performance Tuning?
 Get maximum out of the machine
o Max Throughput
o Min Latency
 Often this become a tradeoff, so you need a
balance
o General case is to get max throughput within given
latency limits. (E.g. max throughput with latency
less than 100ms.)
Lets understand what they are
 What is throughput?
o Number of messages server can process per second
o Measured as
the number of messages processed/ time to process messages
 What is latency?
o End to end time for a server to process an message
 We discuss performance as relationship
between three variables: latency,
concurrency, throughput
What Limits Performance?
 Lack of resources
 Bottleneck – most scarce resource decides the
performance
o CPU bound
o Disk bound
o Bandwidth bound
o Latency bound
o Memory bound
Performance and Bottlenecks
 Think of performance as a water pipe system
 Bottle neck decides overall throughput
 Latency problems often caused at the same
bottlenecks due to contention
 Potential Bottlenecks (Resources)
o CPU
o Disk
o Network
o Memory
- Available Memory
- Heap Size
- GC (Java) – will eat in CPU
Performance of a Server
Little’s Law
 Bound your queue’s (reject too much requests
to give better response time)
Amdhal’s Law
Tuning Java Servers
Performance Numbers (based on Jeff
Dean’s numbers )
Latency / time
for 1 memory
Op
If Memory
access is a
Second
L1 cache reference 0.05 1/20th sec
Main memory reference 1 1 sec
Send 2K bytes over 1 Gbps network 200 3 min
Read 1 MB sequentially from memory 2500 41 min
Disk seek 1*10^5 27 hours
Read 1 MB sequentially from disk 2*10^5 2 days
Send packet CA->Netherlands->CA 1.5*10^6 17 days
See http://tapestryjava.blogspot.com/2012/06/latency-numbers-every-
programmer-should.html
Performance Numbers (based on Jeff
Dean’s numbers )
Latency / time
for 1 memory
Op
If Memory
access is a
Second
L1 cache reference 0.05 1/20th sec
Main memory reference 1 1 sec
Send 2K bytes over 1 Gbps network 200 3 min
Read 1 MB sequentially from memory 2500 41 min
Disk seek 1*10^5 27 hours
Read 1 MB sequentially from disk 2*10^5 2 days
Send packet CA->Netherlands->CA 1.5*10^6 17 days
See http://tapestryjava.blogspot.com/2012/06/latency-numbers-every-
programmer-should.html
Latency Lags Bandwidth
 Observation in prof.
Patterson’s Keynote at 2004
 Bandwidth improves, but
not latency
 Same holds now, and the
gap is widening with new
systems
Handling Speed Differences in
Memory Hierarchy
1. Caching
o E.g. Processor caches, file
cache, disk cache, permission
cache
2. Replication
o E.g. RAID, Content Distribution
Networks (CDN), Web Cache
3. Prediction – Predict what data
will be needed and prefect
o Tradeoff bandwidth
o E.g. disk caches, Google Earth
Potential Performance Problems
 Low performance
o Low Throughput
o High latency
 Memory leaks
 File, Thread, Connection Leaks
Warning
 Many think performance is due to CPU
 People forget network and Disk (most our
workloads are network bound)
o That’s why CPU 10% but machine is loaded (load
average 40)
 Need to look at all
 First step is find the bottleneck
Where to start? Load Average
 Most start by looking at CPU usage
 That does not reflect non-CPU load
 Load average is much better
o Represent the size of the process queue
o Can found via top (1m, 5m, 15m)
o Goes high when CPU, network, disk, memory etc.,
are limited
o Normal value = number of cores, 4X number of
cores means high
o 10X core size means machine is unusable
Read
• http://prutser.wordpress.com/2012/04/23/understanding-linux-load-average-
part-1/ also part2 and part 3
Profiling Tools
 First step is connect through JConsole, and get
basic idea.
 Also monitor the load average, CPU usage, and
IO operations (via top or Sar)
 Then you can switch to tools like JProfiler,
Yourkit, Java Flight Recorder
 Important views
o CPU views (bottlenecks)
o Thread views and monitor views
o Memory views (GC, allocations, and memory leaks)
o IO and JDBC tracing
Top
JConsole
Java Flight Recorder
 Former Oracle Jrocket Mission control
 Now included with JDK 7 update 44+
 Nice tool, very fast, you could run it in
production
 Can start via $JAVA_HOME/bin/jmc
 Profiling JVM need to have following java
option at start
o -XX:+UnlockCommercialFeatures -XX:+FlightRecorder
Tuning Java Servers
Hands On
 Code and scripts from
https://github.com/srinathperera/JavaProfilingTalk
 Run ant to build the service
 Download and unzip WSO2 AS. Run wso2server.sh to
start
 Go to http:// 127.0.0.1:9443/carbon and login via
admin/ admin
 Deploy the aar by going to Services->Add->AAR
Service
 Download jmeter, unzip, and run bin/jmeter
 Run Jmeter scripts to recreate performance problems
Get started with JProfiler
 Download install – work with linux, windows,
and Mac
 New Session, then configure
o Same Machine connect
o Remote connect
- Java -
agentpath:/Applications/jprofiler7/bin/macos/libjprofil
erti.jnilib=port=8849 
o Offline profile
 Adjust filters to org.wos2, org.apache,
java.util. and remove excludes (can change
this)
 Connect
JProfiler
 Instrumentation vs. Sampling
 Jpenable let you connect to an running server
remotely
Problem 1: Tuning for Throughput
 Is machine loaded (checked via load average)?
o If you are putting small load, throughout will be
less
 Try increasing concurrency (load)
Problem 1: Tuning for.. (Contd)
 If load average low
o Check lock profile
o Try tuning threads
- If server set to small number of threads, that will limit
performance
o Ensure network not saturated
- Try transfer a large file
Problem 1: Tuning for.. (Contd)
 If high load average
o Is machine loaded by something else?
o Check CPU (top)
- If high check CPU profile
- If GC percentage > 10% tune GC
o Check network
o Check Disk
o Check is Machine paging
CPU Profile
CPU Profile
 First check hotspots
 Then call tree
o show inherent time as oppose to cumulative time
(Call graph)
 Make sure what takes CPU is where real work is
done
 CPU1 and CPU2 samples
Network/ Disk Profile
 Enable Socket or Servlet (or Files) via Probes
 Hotspots
 Read/Write via telematics
 Samples tooManyIOPSCall.jmx
tooManyNetowrkCalls.jmx
Lock Profile
 Threads view (History and telemetric view)
 Monitor view
 bockedThreadsCall.jmx sample
Problem 2: Tuning for Latency
 Look for bottlenecks – same causes add latency
due to queuing
o Try to find any queue are growing
 Check thread views and lock views
o As discussed in Lock profile
 All IO (both disk and network) are expensive
o Network and Disk Profiles
 How to find?
o Call Graph
o Put system outs and check
Problem 3: Finding Memory Leaks
 Use Eclipse Memory Analyzer
 Go to JProfiler if it is hard to pin down
 Support mark and monitoring memory in
running system
 Can also analyze a memory dump
o Select the objects
o Checking incoming references (cumulative)
o Reference Graph can also help
Eclipse Memory Analyzer
 When OOM, WSO2 servers create a dump
 Open with eclipse memory analyzer
Memory leaks with Jprofiler
Allocation view
 Use to find where objects are created
GC Tuning
 If time spent on GC > 10%, then should tune GC
 “gc overhead limit exceeded” – means > 90%
time spent on GC
GC Tuning
 If time spent on GC > 10%, then should tune GC
 “gc overhead limit exceeded” – means > 90%
time spent on GC
 Consider using allocation views to reduce
objects created
Problem 4: Finding Thread Leaks
 Check the number of threads in thread graph
 Check which thread are created
 Thread dumps
o jstack <pid> >> threaddumps.log
o Jprofiler can get thread dumps as well
 Take thread dumps at different times
Thread Views
Problem 3: Connection Leaks
 Exhausted pools, file descriptor limits
 Check at finally points and in catch clauses
Tuning without an GUI
 Create a JProfile session and configure the triggers.
o Need Periodic triggers Copy local .jprofiler7/config.xml and
JProfiler binaries or agent code to the remote machine
 Add following before the java command.
o -
agentpath:JPROF_LOCATION/bin/macos/libjprofilerti.jnilib=offline,id=S
ESSION_ID,config=/Users/srinath/.jprofiler7/config.xml
 Here replace the configuration file and JProfiler location with
your machine's values.
o SESSION_ID is the session ID of the session you created with JProfiler UI.
Here ~./jprofiler7/config.xml file has settings for all sessions creates by
jprofiler, and it will pick up the right value. If you are profiling in a
remote machine copy your local configs to the remote host.
 Start and run the program, and it will print the following
 JProfiler> Using JVMTI
 ..
 Open the snapshot file and analyze using JProfiler UI.
Tuning
 OS Level Tuning
o For example,
http://www.lognormal.com/blog/2012/09/27/linu
x-tcpip-tuning/
o Generally available with the product
o It depends on OS etc.
 Product Tuning
o Thread pools sizes mainly, there may be others
 Application Tuning
o This is your application, you need to profile
Conclusion
 Know the goals
 Find where the bottleneck using Load average and
other OS tools
 Zoom in using profiler
 Know profile views and when to use them
 Details are important, so is the big picture
 Think .. Think ..
References
 Responding in a Timely Manner,
https://www.youtube.com/watch?v=q_DCipkM
sy0
 Tuning Java Servers -
http://www.infoq.com/articles/Tuning-Java-
Servers
 Thinking Clearly about Performance,
http://queue.acm.org/detail.cfm?id=1854041#
content-comments
Questions?

More Related Content

PDF
ACM DEBS 2015: Realtime Streaming Analytics Patterns
PPTX
ICTER 2014 Invited Talk: Large Scale Data Processing in the Real World: from ...
PDF
DEBS 2015 Tutorial : Patterns for Realtime Streaming Analytics
PPTX
View, Act, and React: Shaping Business Activity with Analytics, BigData Queri...
PPTX
Solving DEBS Grand Challenge with WSO2 CEP
PDF
Patterns of Streaming Applications
PPT
Scalable Realtime Analytics with declarative SQL like Complex Event Processin...
PDF
Strata NYC 2015: Sketching Big Data with Spark: randomized algorithms for lar...
ACM DEBS 2015: Realtime Streaming Analytics Patterns
ICTER 2014 Invited Talk: Large Scale Data Processing in the Real World: from ...
DEBS 2015 Tutorial : Patterns for Realtime Streaming Analytics
View, Act, and React: Shaping Business Activity with Analytics, BigData Queri...
Solving DEBS Grand Challenge with WSO2 CEP
Patterns of Streaming Applications
Scalable Realtime Analytics with declarative SQL like Complex Event Processin...
Strata NYC 2015: Sketching Big Data with Spark: randomized algorithms for lar...

What's hot (20)

PDF
Albert Bifet – Apache Samoa: Mining Big Data Streams with Apache Flink
PPTX
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
PDF
Deep Learning with MXNet - Dmitry Larko
PDF
Snorkel: Dark Data and Machine Learning with Christopher Ré
PDF
PyData 2015 Keynote: "A Systems View of Machine Learning"
PPTX
Building High Available and Scalable Machine Learning Applications
PDF
Data Science with Spark
PPT
Strata 2014 Talk:Tracking a Soccer Game with Big Data
PPTX
Mining data streams
PDF
Cloud-based Data Stream Processing
PPTX
A Scaleable Implementation of Deep Learning on Spark -Alexander Ulanov
PPTX
Deep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim Hunter
PDF
Advanced Data Science on Spark-(Reza Zadeh, Stanford)
PPTX
Machine Learning with Spark
PPTX
Streaming Algorithms
PPTX
IEEE Cloud 2012: Clouds Hands-On Tutorial
PPTX
TensorFrames: Google Tensorflow on Apache Spark
PDF
High Performance Machine Learning in R with H2O
PDF
A Practical Guide to Anomaly Detection for DevOps
PDF
Spark Meetup @ Netflix, 05/19/2015
Albert Bifet – Apache Samoa: Mining Big Data Streams with Apache Flink
A full Machine learning pipeline in Scikit-learn vs in scala-Spark: pros and ...
Deep Learning with MXNet - Dmitry Larko
Snorkel: Dark Data and Machine Learning with Christopher Ré
PyData 2015 Keynote: "A Systems View of Machine Learning"
Building High Available and Scalable Machine Learning Applications
Data Science with Spark
Strata 2014 Talk:Tracking a Soccer Game with Big Data
Mining data streams
Cloud-based Data Stream Processing
A Scaleable Implementation of Deep Learning on Spark -Alexander Ulanov
Deep-Dive into Deep Learning Pipelines with Sue Ann Hong and Tim Hunter
Advanced Data Science on Spark-(Reza Zadeh, Stanford)
Machine Learning with Spark
Streaming Algorithms
IEEE Cloud 2012: Clouds Hands-On Tutorial
TensorFrames: Google Tensorflow on Apache Spark
High Performance Machine Learning in R with H2O
A Practical Guide to Anomaly Detection for DevOps
Spark Meetup @ Netflix, 05/19/2015
Ad

Viewers also liked (6)

PPT
Java application server in the cloud
PDF
Gradle build automation tool
PDF
Enterprise Integration Patterns with Apache Camel
PPTX
Memory Management: What You Need to Know When Moving to Java 8
PDF
Blazing Performance with Flame Graphs
PDF
Introducing Ballerina
Java application server in the cloud
Gradle build automation tool
Enterprise Integration Patterns with Apache Camel
Memory Management: What You Need to Know When Moving to Java 8
Blazing Performance with Flame Graphs
Introducing Ballerina
Ad

Similar to Tuning Java Servers (20)

PPTX
Application Profiling for Memory and Performance
PDF
Application Profiling for Memory and Performance
PDF
How To Get The Most Out Of Your Hibernate, JBoss EAP 7 Application (Ståle Ped...
PPT
Performance Analysis of Idle Programs
PDF
Java Performance & Profiling
PDF
The Diabolical Developers Guide to Performance Tuning
PDF
Slices Of Performance in Java - Oleksandr Bodnar
PDF
Oracle Soa Suite 11g Performance Tuning Cookbook Osama Oransa
PDF
Java Performance and Profiling
PDF
Java Performance and Using Java Flight Recorder
PDF
Software Profiling: Java Performance, Profiling and Flamegraphs
PDF
Adtech x Scala x Performance tuning
PDF
Adtech scala-performance-tuning-150323223738-conversion-gate01
PDF
Java Performance Engineer's Survival Guide
PDF
Software Profiling: Understanding Java Performance and how to profile in Java
PPTX
Java performance tuning
PDF
Tools and Tips to Diagnose Performance Issues
PPT
Best Practices for performance evaluation and diagnosis of Java Applications ...
ODP
Jvm tuning in a rush! - Lviv JUG
PDF
Door to perfomance testing
Application Profiling for Memory and Performance
Application Profiling for Memory and Performance
How To Get The Most Out Of Your Hibernate, JBoss EAP 7 Application (Ståle Ped...
Performance Analysis of Idle Programs
Java Performance & Profiling
The Diabolical Developers Guide to Performance Tuning
Slices Of Performance in Java - Oleksandr Bodnar
Oracle Soa Suite 11g Performance Tuning Cookbook Osama Oransa
Java Performance and Profiling
Java Performance and Using Java Flight Recorder
Software Profiling: Java Performance, Profiling and Flamegraphs
Adtech x Scala x Performance tuning
Adtech scala-performance-tuning-150323223738-conversion-gate01
Java Performance Engineer's Survival Guide
Software Profiling: Understanding Java Performance and how to profile in Java
Java performance tuning
Tools and Tips to Diagnose Performance Issues
Best Practices for performance evaluation and diagnosis of Java Applications ...
Jvm tuning in a rush! - Lviv JUG
Door to perfomance testing

More from Srinath Perera (20)

PDF
Book: Software Architecture and Decision-Making
PDF
Data science Applications in the Enterprise
PDF
An Introduction to APIs
PDF
An Introduction to Blockchain for Finance Professionals
PDF
AI in the Real World: Challenges, and Risks and how to handle them?
PDF
Healthcare + AI: Use cases & Challenges
PDF
How would AI shape Future Integrations?
PDF
The Role of Blockchain in Future Integrations
PDF
Future of Serverless
PDF
Blockchain: Where are we? Where are we going?
PDF
Few thoughts about Future of Blockchain
PDF
A Visual Canvas for Judging New Technologies
PDF
Privacy in Bigdata Era
PDF
Blockchain, Impact, Challenges, and Risks
PPTX
Today's Technology and Emerging Technology Landscape
PDF
An Emerging Technologies Timeline
PDF
The Rise of Streaming SQL and Evolution of Streaming Applications
PDF
Analytics and AI: The Good, the Bad and the Ugly
PDF
Transforming a Business Through Analytics
PDF
SoC Keynote:The State of the Art in Integration Technology
Book: Software Architecture and Decision-Making
Data science Applications in the Enterprise
An Introduction to APIs
An Introduction to Blockchain for Finance Professionals
AI in the Real World: Challenges, and Risks and how to handle them?
Healthcare + AI: Use cases & Challenges
How would AI shape Future Integrations?
The Role of Blockchain in Future Integrations
Future of Serverless
Blockchain: Where are we? Where are we going?
Few thoughts about Future of Blockchain
A Visual Canvas for Judging New Technologies
Privacy in Bigdata Era
Blockchain, Impact, Challenges, and Risks
Today's Technology and Emerging Technology Landscape
An Emerging Technologies Timeline
The Rise of Streaming SQL and Evolution of Streaming Applications
Analytics and AI: The Good, the Bad and the Ugly
Transforming a Business Through Analytics
SoC Keynote:The State of the Art in Integration Technology

Recently uploaded (20)

PDF
Nekopoi APK 2025 free lastest update
PDF
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
PDF
System and Network Administration Chapter 2
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PDF
Understanding Forklifts - TECH EHS Solution
PPT
Introduction Database Management System for Course Database
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PDF
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
PDF
2025 Textile ERP Trends: SAP, Odoo & Oracle
PPTX
Odoo POS Development Services by CandidRoot Solutions
PDF
How to Migrate SBCGlobal Email to Yahoo Easily
PPTX
history of c programming in notes for students .pptx
PPTX
CHAPTER 2 - PM Management and IT Context
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
PPTX
ai tools demonstartion for schools and inter college
PPTX
ManageIQ - Sprint 268 Review - Slide Deck
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 41
PPTX
Online Work Permit System for Fast Permit Processing
PPTX
ISO 45001 Occupational Health and Safety Management System
Nekopoi APK 2025 free lastest update
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
System and Network Administration Chapter 2
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
Understanding Forklifts - TECH EHS Solution
Introduction Database Management System for Course Database
Navsoft: AI-Powered Business Solutions & Custom Software Development
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
VVF-Customer-Presentation2025-Ver1.9.pptx
2025 Textile ERP Trends: SAP, Odoo & Oracle
Odoo POS Development Services by CandidRoot Solutions
How to Migrate SBCGlobal Email to Yahoo Easily
history of c programming in notes for students .pptx
CHAPTER 2 - PM Management and IT Context
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
ai tools demonstartion for schools and inter college
ManageIQ - Sprint 268 Review - Slide Deck
Internet Downloader Manager (IDM) Crack 6.42 Build 41
Online Work Permit System for Fast Permit Processing
ISO 45001 Occupational Health and Safety Management System

Tuning Java Servers

  • 2. Outline  With size of systems growing, Performance become increasingly critical for both R&D as well as support o Less performance => more machines => lot of Devops costs and potential for failures  We will do o Bit of Theory o Then to Profiling  Will discuss different profiling views, and most importantly when to use them.
  • 5. What is Performance Tuning?  Get maximum out of the machine o Max Throughput o Min Latency  Often this become a tradeoff, so you need a balance o General case is to get max throughput within given latency limits. (E.g. max throughput with latency less than 100ms.)
  • 6. Lets understand what they are  What is throughput? o Number of messages server can process per second o Measured as the number of messages processed/ time to process messages  What is latency? o End to end time for a server to process an message  We discuss performance as relationship between three variables: latency, concurrency, throughput
  • 7. What Limits Performance?  Lack of resources  Bottleneck – most scarce resource decides the performance o CPU bound o Disk bound o Bandwidth bound o Latency bound o Memory bound
  • 8. Performance and Bottlenecks  Think of performance as a water pipe system  Bottle neck decides overall throughput  Latency problems often caused at the same bottlenecks due to contention  Potential Bottlenecks (Resources) o CPU o Disk o Network o Memory - Available Memory - Heap Size - GC (Java) – will eat in CPU
  • 10. Little’s Law  Bound your queue’s (reject too much requests to give better response time)
  • 13. Performance Numbers (based on Jeff Dean’s numbers ) Latency / time for 1 memory Op If Memory access is a Second L1 cache reference 0.05 1/20th sec Main memory reference 1 1 sec Send 2K bytes over 1 Gbps network 200 3 min Read 1 MB sequentially from memory 2500 41 min Disk seek 1*10^5 27 hours Read 1 MB sequentially from disk 2*10^5 2 days Send packet CA->Netherlands->CA 1.5*10^6 17 days See http://tapestryjava.blogspot.com/2012/06/latency-numbers-every- programmer-should.html
  • 14. Performance Numbers (based on Jeff Dean’s numbers ) Latency / time for 1 memory Op If Memory access is a Second L1 cache reference 0.05 1/20th sec Main memory reference 1 1 sec Send 2K bytes over 1 Gbps network 200 3 min Read 1 MB sequentially from memory 2500 41 min Disk seek 1*10^5 27 hours Read 1 MB sequentially from disk 2*10^5 2 days Send packet CA->Netherlands->CA 1.5*10^6 17 days See http://tapestryjava.blogspot.com/2012/06/latency-numbers-every- programmer-should.html
  • 15. Latency Lags Bandwidth  Observation in prof. Patterson’s Keynote at 2004  Bandwidth improves, but not latency  Same holds now, and the gap is widening with new systems
  • 16. Handling Speed Differences in Memory Hierarchy 1. Caching o E.g. Processor caches, file cache, disk cache, permission cache 2. Replication o E.g. RAID, Content Distribution Networks (CDN), Web Cache 3. Prediction – Predict what data will be needed and prefect o Tradeoff bandwidth o E.g. disk caches, Google Earth
  • 17. Potential Performance Problems  Low performance o Low Throughput o High latency  Memory leaks  File, Thread, Connection Leaks
  • 18. Warning  Many think performance is due to CPU  People forget network and Disk (most our workloads are network bound) o That’s why CPU 10% but machine is loaded (load average 40)  Need to look at all  First step is find the bottleneck
  • 19. Where to start? Load Average  Most start by looking at CPU usage  That does not reflect non-CPU load  Load average is much better o Represent the size of the process queue o Can found via top (1m, 5m, 15m) o Goes high when CPU, network, disk, memory etc., are limited o Normal value = number of cores, 4X number of cores means high o 10X core size means machine is unusable Read • http://prutser.wordpress.com/2012/04/23/understanding-linux-load-average- part-1/ also part2 and part 3
  • 20. Profiling Tools  First step is connect through JConsole, and get basic idea.  Also monitor the load average, CPU usage, and IO operations (via top or Sar)  Then you can switch to tools like JProfiler, Yourkit, Java Flight Recorder  Important views o CPU views (bottlenecks) o Thread views and monitor views o Memory views (GC, allocations, and memory leaks) o IO and JDBC tracing
  • 21. Top
  • 23. Java Flight Recorder  Former Oracle Jrocket Mission control  Now included with JDK 7 update 44+  Nice tool, very fast, you could run it in production  Can start via $JAVA_HOME/bin/jmc  Profiling JVM need to have following java option at start o -XX:+UnlockCommercialFeatures -XX:+FlightRecorder
  • 25. Hands On  Code and scripts from https://github.com/srinathperera/JavaProfilingTalk  Run ant to build the service  Download and unzip WSO2 AS. Run wso2server.sh to start  Go to http:// 127.0.0.1:9443/carbon and login via admin/ admin  Deploy the aar by going to Services->Add->AAR Service  Download jmeter, unzip, and run bin/jmeter  Run Jmeter scripts to recreate performance problems
  • 26. Get started with JProfiler  Download install – work with linux, windows, and Mac  New Session, then configure o Same Machine connect o Remote connect - Java - agentpath:/Applications/jprofiler7/bin/macos/libjprofil erti.jnilib=port=8849 o Offline profile  Adjust filters to org.wos2, org.apache, java.util. and remove excludes (can change this)  Connect
  • 27. JProfiler  Instrumentation vs. Sampling  Jpenable let you connect to an running server remotely
  • 28. Problem 1: Tuning for Throughput  Is machine loaded (checked via load average)? o If you are putting small load, throughout will be less  Try increasing concurrency (load)
  • 29. Problem 1: Tuning for.. (Contd)  If load average low o Check lock profile o Try tuning threads - If server set to small number of threads, that will limit performance o Ensure network not saturated - Try transfer a large file
  • 30. Problem 1: Tuning for.. (Contd)  If high load average o Is machine loaded by something else? o Check CPU (top) - If high check CPU profile - If GC percentage > 10% tune GC o Check network o Check Disk o Check is Machine paging
  • 32. CPU Profile  First check hotspots  Then call tree o show inherent time as oppose to cumulative time (Call graph)  Make sure what takes CPU is where real work is done  CPU1 and CPU2 samples
  • 33. Network/ Disk Profile  Enable Socket or Servlet (or Files) via Probes  Hotspots  Read/Write via telematics  Samples tooManyIOPSCall.jmx tooManyNetowrkCalls.jmx
  • 34. Lock Profile  Threads view (History and telemetric view)  Monitor view  bockedThreadsCall.jmx sample
  • 35. Problem 2: Tuning for Latency  Look for bottlenecks – same causes add latency due to queuing o Try to find any queue are growing  Check thread views and lock views o As discussed in Lock profile  All IO (both disk and network) are expensive o Network and Disk Profiles  How to find? o Call Graph o Put system outs and check
  • 36. Problem 3: Finding Memory Leaks  Use Eclipse Memory Analyzer  Go to JProfiler if it is hard to pin down  Support mark and monitoring memory in running system  Can also analyze a memory dump o Select the objects o Checking incoming references (cumulative) o Reference Graph can also help
  • 37. Eclipse Memory Analyzer  When OOM, WSO2 servers create a dump  Open with eclipse memory analyzer
  • 38. Memory leaks with Jprofiler
  • 39. Allocation view  Use to find where objects are created
  • 40. GC Tuning  If time spent on GC > 10%, then should tune GC  “gc overhead limit exceeded” – means > 90% time spent on GC
  • 41. GC Tuning  If time spent on GC > 10%, then should tune GC  “gc overhead limit exceeded” – means > 90% time spent on GC  Consider using allocation views to reduce objects created
  • 42. Problem 4: Finding Thread Leaks  Check the number of threads in thread graph  Check which thread are created  Thread dumps o jstack <pid> >> threaddumps.log o Jprofiler can get thread dumps as well  Take thread dumps at different times
  • 44. Problem 3: Connection Leaks  Exhausted pools, file descriptor limits  Check at finally points and in catch clauses
  • 45. Tuning without an GUI  Create a JProfile session and configure the triggers. o Need Periodic triggers Copy local .jprofiler7/config.xml and JProfiler binaries or agent code to the remote machine  Add following before the java command. o - agentpath:JPROF_LOCATION/bin/macos/libjprofilerti.jnilib=offline,id=S ESSION_ID,config=/Users/srinath/.jprofiler7/config.xml  Here replace the configuration file and JProfiler location with your machine's values. o SESSION_ID is the session ID of the session you created with JProfiler UI. Here ~./jprofiler7/config.xml file has settings for all sessions creates by jprofiler, and it will pick up the right value. If you are profiling in a remote machine copy your local configs to the remote host.  Start and run the program, and it will print the following  JProfiler> Using JVMTI  ..  Open the snapshot file and analyze using JProfiler UI.
  • 46. Tuning  OS Level Tuning o For example, http://www.lognormal.com/blog/2012/09/27/linu x-tcpip-tuning/ o Generally available with the product o It depends on OS etc.  Product Tuning o Thread pools sizes mainly, there may be others  Application Tuning o This is your application, you need to profile
  • 47. Conclusion  Know the goals  Find where the bottleneck using Load average and other OS tools  Zoom in using profiler  Know profile views and when to use them  Details are important, so is the big picture  Think .. Think ..
  • 48. References  Responding in a Timely Manner, https://www.youtube.com/watch?v=q_DCipkM sy0  Tuning Java Servers - http://www.infoq.com/articles/Tuning-Java- Servers  Thinking Clearly about Performance, http://queue.acm.org/detail.cfm?id=1854041# content-comments