SlideShare a Scribd company logo
Ole Wegen, Matthias Trapp, Jürgen Döllner
Hasso Plattner Institute, Faculty of Digital Engineering,
University of Potsdam, Germany
Sebastian Pasewaldt
Digital Masterpieces GmbH
Germany
In Cooperation with:
Funded by:
Performance Evaluation and Comparison of Service-based Image Processing based on Software Rendering
Performance Evaluation and Comparison of Service-based Image Processing based on Software Rendering
Image processing is a common task
Observations:
1. Increased usage of mobile hardware
2. Increase in cloud-processing capabilities and infrastructure
3. Increase of network throughput
Service-based provisioning of image processing functionality
for web-based or mobile applications:
▪ Accessible from anywhere
▪ Useable with any device (hardware independent)
Important aspects to account for:
1. Processing implementations often rely on hardware-acceleration (e.g. GPUs)
2. Scalability as a crucial factor when using a service-based approach
These requirements are often accompanied with high financial costs:
• Dedicated server hardware w.r.t. specs
• Limited availability
Software rendering (SWR) can reduce these costs:
• Rendering performed entirely on the CPU (affordable)
• Enables execution of GPU-based programs on hosts without GPU support
SWR was investigated for:
▪ Ma and Parker, 2001:
Visualizing large-scale datasets
▪ Mileff and Dudra, 2013:
Texture rendering
▪ Hayashi et al., 2018:
In-situ visualization for volume rendering system
Contribution A: SWR for Service-based image processing
Contribution B: Performance comparison between dedicated GPU server vs. SWR server:
▪ Main classes of image processing techniques
(point and neighbourhood, single vs. multipass)
▪ Different image resolutions
▪ Various server configurations
Input
Output
Mesa3D:
▪ 3D Graphics Library implementing
graphics API specifications
▪ Supports OpenGL, Vulkan and others
▪ Commonly used for SWR
Gallium3D:
▪ API to support driver development
▪ Abstracting from graphics API
▪ Abstracting from operating system
Available Gallium3D Driver:
▪ softpipe
▪ LLVMpipe
▪ openSWR
https://en.wikipedia.org/wiki/Mesa_(computer_graphics)
Test machine:
▪ Intel Core i5-8400
▪ 6 Cores at 2.8 GHz
▪ 16 GB DDR4 RAM
Operation: Morphological Closing (Kernel size 3)
Two Docker Containers:
▪ The instance of image processor
▪ A NodeJS server exposing a REST [Winkler and Schlesiger, 2013] interface for communication
▪ Communication between these containers through WebSockets
▪ Processing Techniques:
• Color Invert (A)
• Point-based
• Single Pass
• Morphological Closing (B)
• Neighbourhood-based
• Separated Passes
• Tested kernel sizes: 3, 14, 90
• Oilpaint (C)
• Multipass, Neighbourhood-based
• # Passes: 18
▪ Tested spatial resolutions:
• 1280 x 720 (HD)
• 1920 x 1080 (FHD)
• 2560 x 1440 (QHD)
• 3840 x 2160 (4K)
A
B C
“[…] each vCPU in an Amazon EC2 instance is a hyperthread of an Intel Xeon CPU core.”
Demystifying the Number of vCPUs for Optimal Workload Performance, Amazon, Sept. 2018
https://d1.awsstatic.com/whitepapers/Demystifying_vCPUs.pdf
GPU Server
EC2 t2.large
Amazon Elastic Cloud
EC2 c4.4xlarge EC2 c5.18xlarge
CPU Intel Xeon
3.5 GHz
Intel Xeon
3.0 GHz
Intel Xeon
2.9 GHz
Intel Xeon Platinum
3.0 GHz
# Cores/vCPUs 8 Cores 2 vCPUs 16 vCPUs 72 vCPUs
RAM 64 GB RAM 8 GB RAM 30 GB RAM 144 GB RAM
GPU NVIDIA Quadro M6000
24 GB
None None None
Test Procedure:
1. Six measurements for each combination of resolution and processing technique
2. Discarding the first measurement (setup costs of image processor)
3. Averaging the remaining five
GPU Server
EC2 t2.large
Amazon Elastic Cloud
EC2 c4.4xlarge EC2 c5.18xlarge
CPU Intel Xeon
3.5 GHz
Intel Xeon
3.0 GHz
Intel Xeon
2.9 GHz
Intel Xeon Platinum
3.0 GHz
# Cores/vCPUs 8 Cores 2 vCPUs 16 vCPUs 72 vCPUs
RAM 64 GB RAM 8 GB RAM 30 GB RAM 144 GB RAM
GPU NVIDIA Quadro M6000
24 GB
None None None
Performance Evaluation and Comparison of Service-based Image Processing based on Software Rendering
▪ GPU-based rendering
is significant faster
▪ Same relations between
processing techniques
▪ For Invert, software
rendering is faster
(maybe due to no
RAM-VRAM bus
transfer costs)
▪ For complex techniques:
the greater the resolution
the faster is GPU-based
rendering compared to
software rendering
▪ For simple techniques:
stable speed factor
▪ Bend in the curve for FHD
Speed factor = duration SWR / duration GPU
▪t2.large instance:
• 2 vCPUs
• 8 GB RAM
c4.large instance:
• 16 vCPUs
• 30 GB RAM
More vCPUs reduce
processing time
t2.large instance:
▪ 2 vCPUs
▪ 8 GB RAM
c4.large instance:
▪ 16 vCPUs
▪ 30 GB RAM
Performance Evaluation and Comparison of Service-based Image Processing based on Software Rendering
There probably exists
an upper bound but
none could be observed
Without GPU:
With GPU:
Instance type t2.large c4.4xlarge c5.18xlarge
Number of vCPUs 2 16 (x8) 72 (x36)
USD per hour 0.1008 0.905 (x9) 3.456 (x34)
Instance type g3.4xlarge
GPU Nvidia Tesla M60
8 GB
USD per hour 1.21
Run-time performance cost are mainly determined by:
1. Complexity of the processing technique
2. Resolution of the input image
3. Number of virtual CPUs
The performance penalty can be attenuated
by increasing the number of vCPUs/Threads
Software rendering is a suitable approach for
reducing the financial costs for a scalable web-based provisioning of
image processing, but it comes with costs regarding performance
Contact:
▪ ole.wegen@student.hpi.de | trapp@hpi.de | doellner@hpi.de
▪ sebastian.pasewaldt@digitalmasterpieces.de
Funded by (01IS15041):
In Cooperation with:

More Related Content

PDF
GTC 2013 Jen-Hsun Huang Keynote
PDF
Ivo Pavlik - thesis (print version)
PDF
Image Processing Application on Graphics processors
PPT
Current Trends in HPC
PDF
The road to multi/many core computing
PDF
The Rise of Parallel Computing
PPTX
Graphics processing unit ppt
PDF
Compressing of Magnetic Resonance Images with Cuda
GTC 2013 Jen-Hsun Huang Keynote
Ivo Pavlik - thesis (print version)
Image Processing Application on Graphics processors
Current Trends in HPC
The road to multi/many core computing
The Rise of Parallel Computing
Graphics processing unit ppt
Compressing of Magnetic Resonance Images with Cuda

Similar to Performance Evaluation and Comparison of Service-based Image Processing based on Software Rendering (20)

PDF
GPU Technology Conference 2014 Keynote
PDF
DX12 & Vulkan: Dawn of a New Generation of Graphics APIs
PDF
Newbie’s guide to_the_gpgpu_universe
PDF
Bitfusion Nimbix Dev Summit Heterogeneous Architectures
PDF
Keynote (Johan Andersson) - Mantle for Developers - by Johan Andersson, Techn...
PPTX
Gpu with cuda architecture
PDF
Computing using GPUs
PDF
Hardware & Software Platforms for HPC, AI and ML
PDF
GPU Virtualization on VMware's Hosted I/O Architecture
PDF
thesis
PDF
N A G P A R I S280101
PPTX
Mantle for Developers
PPTX
GPU Computing: A brief overview
PDF
Architecture exploration of recent GPUs to analyze the efficiency of hardware...
PDF
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
PDF
FPGA Hardware Accelerator for Machine Learning
PPT
Vpu technology &gpgpu computing
PPT
Vpu technology &gpgpu computing
PPT
Vpu technology &gpgpu computing
PPTX
Cuda meetup presentation 5
GPU Technology Conference 2014 Keynote
DX12 & Vulkan: Dawn of a New Generation of Graphics APIs
Newbie’s guide to_the_gpgpu_universe
Bitfusion Nimbix Dev Summit Heterogeneous Architectures
Keynote (Johan Andersson) - Mantle for Developers - by Johan Andersson, Techn...
Gpu with cuda architecture
Computing using GPUs
Hardware & Software Platforms for HPC, AI and ML
GPU Virtualization on VMware's Hosted I/O Architecture
thesis
N A G P A R I S280101
Mantle for Developers
GPU Computing: A brief overview
Architecture exploration of recent GPUs to analyze the efficiency of hardware...
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
FPGA Hardware Accelerator for Machine Learning
Vpu technology &gpgpu computing
Vpu technology &gpgpu computing
Vpu technology &gpgpu computing
Cuda meetup presentation 5

More from Matthias Trapp (20)

PDF
Interactive Control over Temporal Consistency while Stylizing Video Streams
PDF
A Framework for Art-directed Augmentation of Human Motion in Videos on Mobile...
PDF
A Framework for Interactive 3D Photo Stylization Techniques on Mobile Devices
PDF
ALIVE-Adaptive Chromaticity for Interactive Low-light Image and Video Enhance...
PDF
A Service-based Preset Recommendation System for Image Stylization Applications
PDF
Design Space of Geometry-based Image Abstraction Techniques with Vectorizatio...
PDF
A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...
PDF
Efficient GitHub Crawling using the GraphQL API
PDF
CodeCV - Mining Expertise of GitHub Users from Coding Activities - Online.pdf
PDF
Non-Photorealistic Rendering of 3D Point Clouds for Cartographic Visualization
PDF
TWIN4ROAD - Erfassung Analyse und Auswertung mobiler Multi Sensorik im Strass...
PDF
Interactive Close-Up Rendering for Detail+Overview Visualization of 3D Digita...
PDF
Web-based and Mobile Provisioning of Virtual 3D Reconstructions
PDF
Visualization of Knowledge Distribution across Development Teams using 2.5D S...
PDF
Real-time Screen-space Geometry Draping for 3D Digital Terrain Models
PDF
FERMIUM - A Framework for Real-time Procedural Point Cloud Animation & Morphing
PDF
Interactive Editing of Signed Distance Fields
PDF
Integration of Image Processing Techniques into the Unity Game Engine
PDF
Interactive GPU-based Image Deformation for Mobile Devices
PDF
Interactive Photo Editing on Smartphones via Intrinsic Decomposition
Interactive Control over Temporal Consistency while Stylizing Video Streams
A Framework for Art-directed Augmentation of Human Motion in Videos on Mobile...
A Framework for Interactive 3D Photo Stylization Techniques on Mobile Devices
ALIVE-Adaptive Chromaticity for Interactive Low-light Image and Video Enhance...
A Service-based Preset Recommendation System for Image Stylization Applications
Design Space of Geometry-based Image Abstraction Techniques with Vectorizatio...
A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...
Efficient GitHub Crawling using the GraphQL API
CodeCV - Mining Expertise of GitHub Users from Coding Activities - Online.pdf
Non-Photorealistic Rendering of 3D Point Clouds for Cartographic Visualization
TWIN4ROAD - Erfassung Analyse und Auswertung mobiler Multi Sensorik im Strass...
Interactive Close-Up Rendering for Detail+Overview Visualization of 3D Digita...
Web-based and Mobile Provisioning of Virtual 3D Reconstructions
Visualization of Knowledge Distribution across Development Teams using 2.5D S...
Real-time Screen-space Geometry Draping for 3D Digital Terrain Models
FERMIUM - A Framework for Real-time Procedural Point Cloud Animation & Morphing
Interactive Editing of Signed Distance Fields
Integration of Image Processing Techniques into the Unity Game Engine
Interactive GPU-based Image Deformation for Mobile Devices
Interactive Photo Editing on Smartphones via Intrinsic Decomposition

Recently uploaded (20)

PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PPT
Teaching material agriculture food technology
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
Spectroscopy.pptx food analysis technology
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
Machine Learning_overview_presentation.pptx
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Encapsulation_ Review paper, used for researhc scholars
Mobile App Security Testing_ A Comprehensive Guide.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Dropbox Q2 2025 Financial Results & Investor Presentation
Reach Out and Touch Someone: Haptics and Empathic Computing
Group 1 Presentation -Planning and Decision Making .pptx
Assigned Numbers - 2025 - Bluetooth® Document
Diabetes mellitus diagnosis method based random forest with bat algorithm
Building Integrated photovoltaic BIPV_UPV.pdf
Accuracy of neural networks in brain wave diagnosis of schizophrenia
Teaching material agriculture food technology
MIND Revenue Release Quarter 2 2025 Press Release
“AI and Expert System Decision Support & Business Intelligence Systems”
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Spectroscopy.pptx food analysis technology
Digital-Transformation-Roadmap-for-Companies.pptx
20250228 LYD VKU AI Blended-Learning.pptx
Machine Learning_overview_presentation.pptx
Profit Center Accounting in SAP S/4HANA, S4F28 Col11

Performance Evaluation and Comparison of Service-based Image Processing based on Software Rendering

  • 1. Ole Wegen, Matthias Trapp, Jürgen Döllner Hasso Plattner Institute, Faculty of Digital Engineering, University of Potsdam, Germany Sebastian Pasewaldt Digital Masterpieces GmbH Germany In Cooperation with: Funded by:
  • 4. Image processing is a common task Observations: 1. Increased usage of mobile hardware 2. Increase in cloud-processing capabilities and infrastructure 3. Increase of network throughput Service-based provisioning of image processing functionality for web-based or mobile applications: ▪ Accessible from anywhere ▪ Useable with any device (hardware independent)
  • 5. Important aspects to account for: 1. Processing implementations often rely on hardware-acceleration (e.g. GPUs) 2. Scalability as a crucial factor when using a service-based approach These requirements are often accompanied with high financial costs: • Dedicated server hardware w.r.t. specs • Limited availability Software rendering (SWR) can reduce these costs: • Rendering performed entirely on the CPU (affordable) • Enables execution of GPU-based programs on hosts without GPU support
  • 6. SWR was investigated for: ▪ Ma and Parker, 2001: Visualizing large-scale datasets ▪ Mileff and Dudra, 2013: Texture rendering ▪ Hayashi et al., 2018: In-situ visualization for volume rendering system Contribution A: SWR for Service-based image processing Contribution B: Performance comparison between dedicated GPU server vs. SWR server: ▪ Main classes of image processing techniques (point and neighbourhood, single vs. multipass) ▪ Different image resolutions ▪ Various server configurations
  • 8. Mesa3D: ▪ 3D Graphics Library implementing graphics API specifications ▪ Supports OpenGL, Vulkan and others ▪ Commonly used for SWR Gallium3D: ▪ API to support driver development ▪ Abstracting from graphics API ▪ Abstracting from operating system Available Gallium3D Driver: ▪ softpipe ▪ LLVMpipe ▪ openSWR https://en.wikipedia.org/wiki/Mesa_(computer_graphics)
  • 9. Test machine: ▪ Intel Core i5-8400 ▪ 6 Cores at 2.8 GHz ▪ 16 GB DDR4 RAM Operation: Morphological Closing (Kernel size 3)
  • 10. Two Docker Containers: ▪ The instance of image processor ▪ A NodeJS server exposing a REST [Winkler and Schlesiger, 2013] interface for communication ▪ Communication between these containers through WebSockets
  • 11. ▪ Processing Techniques: • Color Invert (A) • Point-based • Single Pass • Morphological Closing (B) • Neighbourhood-based • Separated Passes • Tested kernel sizes: 3, 14, 90 • Oilpaint (C) • Multipass, Neighbourhood-based • # Passes: 18 ▪ Tested spatial resolutions: • 1280 x 720 (HD) • 1920 x 1080 (FHD) • 2560 x 1440 (QHD) • 3840 x 2160 (4K) A B C
  • 12. “[…] each vCPU in an Amazon EC2 instance is a hyperthread of an Intel Xeon CPU core.” Demystifying the Number of vCPUs for Optimal Workload Performance, Amazon, Sept. 2018 https://d1.awsstatic.com/whitepapers/Demystifying_vCPUs.pdf GPU Server EC2 t2.large Amazon Elastic Cloud EC2 c4.4xlarge EC2 c5.18xlarge CPU Intel Xeon 3.5 GHz Intel Xeon 3.0 GHz Intel Xeon 2.9 GHz Intel Xeon Platinum 3.0 GHz # Cores/vCPUs 8 Cores 2 vCPUs 16 vCPUs 72 vCPUs RAM 64 GB RAM 8 GB RAM 30 GB RAM 144 GB RAM GPU NVIDIA Quadro M6000 24 GB None None None
  • 13. Test Procedure: 1. Six measurements for each combination of resolution and processing technique 2. Discarding the first measurement (setup costs of image processor) 3. Averaging the remaining five GPU Server EC2 t2.large Amazon Elastic Cloud EC2 c4.4xlarge EC2 c5.18xlarge CPU Intel Xeon 3.5 GHz Intel Xeon 3.0 GHz Intel Xeon 2.9 GHz Intel Xeon Platinum 3.0 GHz # Cores/vCPUs 8 Cores 2 vCPUs 16 vCPUs 72 vCPUs RAM 64 GB RAM 8 GB RAM 30 GB RAM 144 GB RAM GPU NVIDIA Quadro M6000 24 GB None None None
  • 15. ▪ GPU-based rendering is significant faster ▪ Same relations between processing techniques ▪ For Invert, software rendering is faster (maybe due to no RAM-VRAM bus transfer costs)
  • 16. ▪ For complex techniques: the greater the resolution the faster is GPU-based rendering compared to software rendering ▪ For simple techniques: stable speed factor ▪ Bend in the curve for FHD Speed factor = duration SWR / duration GPU
  • 17. ▪t2.large instance: • 2 vCPUs • 8 GB RAM c4.large instance: • 16 vCPUs • 30 GB RAM
  • 18. More vCPUs reduce processing time t2.large instance: ▪ 2 vCPUs ▪ 8 GB RAM c4.large instance: ▪ 16 vCPUs ▪ 30 GB RAM
  • 20. There probably exists an upper bound but none could be observed
  • 21. Without GPU: With GPU: Instance type t2.large c4.4xlarge c5.18xlarge Number of vCPUs 2 16 (x8) 72 (x36) USD per hour 0.1008 0.905 (x9) 3.456 (x34) Instance type g3.4xlarge GPU Nvidia Tesla M60 8 GB USD per hour 1.21
  • 22. Run-time performance cost are mainly determined by: 1. Complexity of the processing technique 2. Resolution of the input image 3. Number of virtual CPUs The performance penalty can be attenuated by increasing the number of vCPUs/Threads Software rendering is a suitable approach for reducing the financial costs for a scalable web-based provisioning of image processing, but it comes with costs regarding performance
  • 23. Contact: ▪ ole.wegen@student.hpi.de | trapp@hpi.de | doellner@hpi.de ▪ sebastian.pasewaldt@digitalmasterpieces.de Funded by (01IS15041): In Cooperation with: