SlideShare a Scribd company logo
Productive OpenCL
with Intel Xeon Phi Coprocessors and ArrayFire
by John Melonakos
Why Software Libraries Are Great
 Reduce many lines of code to one line
 Obsessively tuned by experts; faster than DIY
 Well-tested and maintained
 Continuously improving
12 Minutes of Library Love
 Dusty servers = bad, very bad
 Software struggles lead to
under utilization
 Everyone loses
Five Influencers (besides price)
Portability Scalability Community
ProgrammabilityPerformance
Faster
Time-
consuming
SSE or
AVX
Slower
Easy-to-use
Performance & Programmability
Faster
Time-
consuming
Writing
Kernels
SSE or
AVX
Slower
Easy-to-use
Performance & Programmability
Faster
Time-
consuming
Writing
Kernels
Compiler
Directives
SSE or
AVX
Slower
Easy-to-use
Performance & Programmability
Faster
Time-
consuming
Writing
Kernels
Using
Libraries
Compiler
Directives
SSE or
AVX
Slower
Easy-to-use
Performance & Programmability
Portability
 Flavors of portability
HW options
 CPU fallback
High-performance mobile computing
 Libraries can provide portability
Scalability
 Always start with one device
 Potential headaches of adding devices
 Performance hit
Development complexity
 Libraries can make scaling easy
Community
 What do you do when bugs arise?
 Continuous refinement
 Someone to answer questions
 Libraries can have great community support
Benefits of Using a Library
Development
Documentation
Test and QA
Maintenance
Porting
TIME
COST
TIME
COST
ArrayFire eliminates
hidden costs of software
development
Pain Pleasure
Beware of Bad Libraries
 Bad libraries are a time sink
 Ruin the “library” reputation
Here’s One Good Library
Performance & Programmability
 Super easy to program
 Highly optimized
Portability
Scalability
 Multi-Device is 1-line of code
array *y = new array[n];
for (int i = 0; i < n; ++i) {
deviceset(i); // change device
array x = randu(5,5); // add work to device’s queue
y[i] = fft(x); // more work in queue
}
// all devices are now computing simultaneously
Community
 Over 8,000 posts at
http://forums.accelereyes.com
 Nightly library update releases
 Stable releases a few times a year
 v2.0 coming at the end of summer
Graphics Functions
#include <arrayfire.h>
using namespace af;
int main() {
// random 3d surface
const int n = 256;
while (1) {
array x = randu(n,n);
// 3d surface plot
surface(x);
}
return 0;
}
GFOR Parallel Loops
BA(,,1:3)C(,,1:3)
*=
*=
*=
gfor (array i, 3)
C(span,span,i) = A(span,span,i) * B;
Parallel matrix multiplications (1 kernel launch)
Software Shop for CUDA & OpenCL
 Two ways to work with us:
Use
 Hire our CUDA & OpenCL developers
Code development; CUDA & OpenCL training
Take Home Messages
 Free version of ArrayFire
http://accelereyes.com
 Need acceleration? Try a library!
Productive OpenCL
with Intel Xeon Phi Coprocessors and ArrayFire
by John Melonakos

More Related Content

KEY
Managing Eclipse Preferences for Teams (EclipseCon 2011)
PDF
Axemblr Provisionr 0.3.x Overview
PDF
Beefing Up AIR - FITC AMS 2012
PDF
Puppet at Spotify
PDF
Terraform
PDF
Cloudformation vs terraform_vs_ansible
PDF
AWS Connect 2017 - Container (feat. AWS)
PDF
OpenIO Summit'17 - ARM, Object Storage and more
Managing Eclipse Preferences for Teams (EclipseCon 2011)
Axemblr Provisionr 0.3.x Overview
Beefing Up AIR - FITC AMS 2012
Puppet at Spotify
Terraform
Cloudformation vs terraform_vs_ansible
AWS Connect 2017 - Container (feat. AWS)
OpenIO Summit'17 - ARM, Object Storage and more

What's hot (19)

PPTX
Infrastructure as code, using Terraform
PPTX
Automation in VLSI related tasks.
PDF
Adobe AIR for mobile games
PPTX
Doing Azure With PowerShell
PPTX
Shelly cloud & heroku & engineyard. Pros & Cons
PPTX
Big Boards & docker
TXT
TXT
PDF
Aws certified solutions architect associate exam dumps
TXT
hosting
PDF
Delphi L01 Intro
PPTX
Wso2 con raspberry-pi-cluster
PPTX
Storage Is Not Virtualized Enough - part 1
PDF
Embedded JavaScript (FluentConf 2014)
TXT
Hosts
TXT
Hosts
ODP
Overseer: Low-Level Hardware Monitoring and Management for Java
Infrastructure as code, using Terraform
Automation in VLSI related tasks.
Adobe AIR for mobile games
Doing Azure With PowerShell
Shelly cloud & heroku & engineyard. Pros & Cons
Big Boards & docker
Aws certified solutions architect associate exam dumps
hosting
Delphi L01 Intro
Wso2 con raspberry-pi-cluster
Storage Is Not Virtualized Enough - part 1
Embedded JavaScript (FluentConf 2014)
Hosts
Hosts
Overseer: Low-Level Hardware Monitoring and Management for Java
Ad

Similar to Productive OpenCL with Intel Xeon Phi Coprocessors (20)

PPTX
.Net Core 1.0 vs .NET Framework
PDF
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
PPTX
Introduction to Phoenix Framework (Elixir) 2016-01-07
PDF
(Costless) Software Abstractions for Parallel Architectures
PPT
The Nuxeo Way: leveraging open source to build a world-class ECM platform
PPT
Track A-Compilation guiding and adjusting - IBM
PPTX
Fedora os presentation
PPT
Petapath HP Cast 12 - Programming for High Performance Accelerated Systems
PPT
SynapseIndia java and .net development
PPT
basic_java.ppt
PPT
Visual Studio .NET2010
PDF
TechWiseTV Workshop: Catalyst Switching Programmability
PPTX
Next .NET and C#
PPTX
Overview of VS2010 and .NET 4.0
PDF
Minko - Targeting Flash/Stage3D with C++ and GLSL
PDF
Aggregate Programming in Scala
PDF
LCNA14: Why Use Xen for Large Scale Enterprise Deployments? - Konrad Rzeszute...
PPT
JavaHTP7e_0101_DDP.ppt
PPTX
Docker - Demo on PHP Application deployment
DOC
RResume
.Net Core 1.0 vs .NET Framework
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
Introduction to Phoenix Framework (Elixir) 2016-01-07
(Costless) Software Abstractions for Parallel Architectures
The Nuxeo Way: leveraging open source to build a world-class ECM platform
Track A-Compilation guiding and adjusting - IBM
Fedora os presentation
Petapath HP Cast 12 - Programming for High Performance Accelerated Systems
SynapseIndia java and .net development
basic_java.ppt
Visual Studio .NET2010
TechWiseTV Workshop: Catalyst Switching Programmability
Next .NET and C#
Overview of VS2010 and .NET 4.0
Minko - Targeting Flash/Stage3D with C++ and GLSL
Aggregate Programming in Scala
LCNA14: Why Use Xen for Large Scale Enterprise Deployments? - Konrad Rzeszute...
JavaHTP7e_0101_DDP.ppt
Docker - Demo on PHP Application deployment
RResume
Ad

More from Intel IT Center (20)

PDF
AI Crash Course- Supercomputing
PPTX
FPGA Inference - DellEMC SURFsara
PDF
High Memory Bandwidth Demo @ One Intel Station
PDF
INFOGRAPHIC: Advantages of Intel vs. IBM Power on SAP HANA solutions
PDF
Disrupt Hackers With Robust User Authentication
PDF
Strengthen Your Enterprise Arsenal Against Cyber Attacks With Hardware-Enhanc...
PDF
Harness Digital Disruption to Create 2022’s Workplace Today
PPTX
Don't Rely on Software Alone. Protect Endpoints with Hardware-Enhanced Security.
PDF
Achieve Unconstrained Collaboration in a Digital World
PDF
Intel® Xeon® Scalable Processors Enabled Applications Marketing Guide
PDF
#NABshow: National Association of Broadcasters 2017 Super Session Presentatio...
PPTX
Identity Protection for the Digital Age
PDF
Three Steps to Making a Digital Workplace a Reality
PDF
Three Steps to Making The Digital Workplace a Reality - by Intel’s Chad Const...
PDF
Intel® Xeon® Processor E7-8800/4800 v4 EAMG 2.0
PDF
Intel® Xeon® Processor E5-2600 v4 Enterprise Database Applications Showcase
PDF
Intel® Xeon® Processor E5-2600 v4 Core Business Applications Showcase
PDF
Intel® Xeon® Processor E5-2600 v4 Financial Security Applications Showcase
PDF
Intel® Xeon® Processor E5-2600 v4 Telco Cloud Digital Applications Showcase
PDF
Intel® Xeon® Processor E5-2600 v4 Tech Computing Applications Showcase
AI Crash Course- Supercomputing
FPGA Inference - DellEMC SURFsara
High Memory Bandwidth Demo @ One Intel Station
INFOGRAPHIC: Advantages of Intel vs. IBM Power on SAP HANA solutions
Disrupt Hackers With Robust User Authentication
Strengthen Your Enterprise Arsenal Against Cyber Attacks With Hardware-Enhanc...
Harness Digital Disruption to Create 2022’s Workplace Today
Don't Rely on Software Alone. Protect Endpoints with Hardware-Enhanced Security.
Achieve Unconstrained Collaboration in a Digital World
Intel® Xeon® Scalable Processors Enabled Applications Marketing Guide
#NABshow: National Association of Broadcasters 2017 Super Session Presentatio...
Identity Protection for the Digital Age
Three Steps to Making a Digital Workplace a Reality
Three Steps to Making The Digital Workplace a Reality - by Intel’s Chad Const...
Intel® Xeon® Processor E7-8800/4800 v4 EAMG 2.0
Intel® Xeon® Processor E5-2600 v4 Enterprise Database Applications Showcase
Intel® Xeon® Processor E5-2600 v4 Core Business Applications Showcase
Intel® Xeon® Processor E5-2600 v4 Financial Security Applications Showcase
Intel® Xeon® Processor E5-2600 v4 Telco Cloud Digital Applications Showcase
Intel® Xeon® Processor E5-2600 v4 Tech Computing Applications Showcase

Recently uploaded (20)

PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Encapsulation theory and applications.pdf
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
MYSQL Presentation for SQL database connectivity
PDF
KodekX | Application Modernization Development
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Electronic commerce courselecture one. Pdf
PPTX
sap open course for s4hana steps from ECC to s4
PPTX
Spectroscopy.pptx food analysis technology
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Approach and Philosophy of On baking technology
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Cloud computing and distributed systems.
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Encapsulation theory and applications.pdf
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
MYSQL Presentation for SQL database connectivity
KodekX | Application Modernization Development
“AI and Expert System Decision Support & Business Intelligence Systems”
Dropbox Q2 2025 Financial Results & Investor Presentation
MIND Revenue Release Quarter 2 2025 Press Release
Electronic commerce courselecture one. Pdf
sap open course for s4hana steps from ECC to s4
Spectroscopy.pptx food analysis technology
The Rise and Fall of 3GPP – Time for a Sabbatical?
Approach and Philosophy of On baking technology
Chapter 3 Spatial Domain Image Processing.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Diabetes mellitus diagnosis method based random forest with bat algorithm
Cloud computing and distributed systems.
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
Advanced methodologies resolving dimensionality complications for autism neur...

Productive OpenCL with Intel Xeon Phi Coprocessors