Alveo Products Marketplace
Expertise Software Services
Alveo Solutions
Expertise in Alveo FPGA programming
PCIe
x86 CPU
Host
Application
Runtime and Drivers
Acceleration API
FPGA
Accelerated
Functions
DMA Engine
AXI Interfaces
byteLAKE’s
Solutions
Xilinx
Acceleration
Platform
C/C++ code
with
OpenCL API calls
C/C++
or
OpenCL C
FPGACPU
2
More at: byteLAKE.com/en/Alveo
• Xilinx pioneered C to FPGA compilation technology (aka “HLS”) in 2011
3
Source code in C, C++ or OpenCL
loop_main:for(int j=0;j<NUM_SIMGROUPS;j+=2) {
loop_share:for(uint k=0;k<NUM_SIMS;k++) {
loop_parallel:for(int i=0;i<NUM_RNGS;i++) {
mt_rng[i].BOX_MULLER(&num1[i][k],&num2[i][k],ratio4,ratio3);
float payoff1 = expf(num1[i][k])-1.0f;
float payoff2 = expf(num2[i][k])-1.0f;
if(num1[i][k]>0.0f)
pCall1[i][k]+= payoff1;
else
pPut1[i][k]-=payoff1;
if(num2[i][k]>0.0f)
pCall2[i][k]+=payoff2;
else
pPut2[i][k]-=payoff2;
}
}
}
FPGACompile
Xilinx FPGAs highlights
4
• No predefined instruction set or underlying architecture
• Developers customize the architecture to their needs
– Custom data paths
– Custom bit-width
– Custom memory hierarchies
• Excels at all types of parallelism
– Deeply pipelined (e.g. Video codecs)
– Bit manipulations (e.g. AES, SHA)
– Wide data path (e.g. DNN)
– Custom memory hierarchy (e.g. Data analytics)
• Adapts to evolving algorithms and workload needs
FPGAs - the Ultimate Parallel Processing Device
• Compute domain divided
into sub-domains
• Host sends data to the FPGA
global memory
• Host calls kernels to execute them on
FPGA (kernel is called many times)
• Each kernel call represents
a single time step
• FPGA sends the output array
back to the host
Typical Architecture
More at: byteLAKE.com/en/PPAM19
• Kernel is distributed
into 4 SLRs
• Each sub-domain is
allocated in different
memory bank
• Data transfer occurs
between neighboring
memory banks
Example processing
SLR0
Kernel_A
SLR1
Kernel_B
SLR2
Kernel_C
SLR3
Kernel_D
Kernel
Bank0 Bank1
Bank2 Bank3
Sub-domain Sub-domain
Sub-domain Sub-domain
19
Case study: CFD Kernels adaptation
Typical CFD workflow
From CAD to MESH…
(meshing)
Image source: https://www.openfoam.com/products/visualcfd.php
…to CFD simulation and visualization.
• MESH conversion (input)
• byteLAKE’s CFD Kernels
• Data output for visualization
upto5%
ofsimulationtime
major
workload
OPENFOAM® is a registered trademark of ESI Group. This offering is not approved or endorsed by ESI Group, the producer of the OpenFOAM software and owner of the OPENFOAM® and OpenCFD® trademarks.
byteLAKE created set of highly optimized CFD kernels
for Xilinx Alveo Datacentre accelerator cards
–Advection (movement of some material, dissolved or
suspended in the fluid)
–Pseudo velocity (approximation of the relative velocity)
–Divergence (measures how much of fluid is flowing into/ out
of a certain point in a vector field)
–Thomas algorithm (simplified form of Gaussian
elimination for tridiagonal system of equations)
8Download solution description:
bytelake.com/en/download/2716/
CFD Kernels
CFD acceleration with Alveo FPGA
9
More at: byteLAKE.com/en/FPGA
MPDATA Accelerated
CFD / Advection algorithm optimized for heterogeneous
computing.
CFD
Computational Fluid Dynamics
• Numerical analysis and algorithms
to solve fluid flows problems
–how liquids and gases flow
and interact with surfaces
• Widely used across industries:
–automotive, chemical, aerospace,
biomedical, power and energy, and
construction etc.
• Typical applications
–weather simulations,
–aerodynamic characteristics modelling and
optimization,
–petroleum mass flow rate assessment
11
• MPDATA
(Multidimensional Positive Definite Advection Transport Algorithm)
– main part of the dynamic core of the Eulerian/
semi-Lagrangian (EULAG) model
– EULAG (MPDATA+elliptic solver) is the established computational model,
developed for simulating thermo-fluid flows across a wide range of scales
and physical scenarios
– currently, this model is being implemented as the new dynamic core of the COSMO
(Consortium for Small-scale Modeling) weather prediction framework
– advection (together with the elliptic solver) is a key part of many frameworks that allow
users to implement their simulations
• Advection
– movement of some material (dissolved or suspended) in the fluid.
Algorithm: Advection (MPDATA)
General Information
• Easy to integrate
– Can work as a standalone application or be called as a function via our dedicated interface
(e.g. can be called as a function with input and output arrays)
– Compatible with frameworks like TensorFlow for integrating deep learning with CFD codes
• Easy to visualize the results
– Results can be stored in a raw format as a binary file of the output arrays or converted via
byteLAKE tools to a ParaView format
• See benefits already in 1-node HPC configurations
– Strongly adapted to Alveo U250, were single card supports the max size of arrays: 2,1 Gcells
(max compute domain: 1264 x 1264 x 1264) ~ 60 GB
• Scalable to many cards per node and many nodes
Algorithm: Advection (MPDATA)
byteLAKE’s implementation compatibility
• First-order-accurate step of the advection scheme.
Second-order is an option.
• Input data
– Array X – non-diffusive quantity
(e.g. temperature of water vapor, ice, precipitation, etc.)
– Arrays V1, V2, V3 - each of them stores the velocity vectors in one direction
– (optional) Arrays Fi, Fe - implosion and explosion forces acting on a structure of X
– (optional) Array D with density
– (optional) Array rho which defines an interface for the coupling of COSMO and EULAG dynamic core
(used to provide the transformation of the X variable)
– DT – time step (scalar)
• Output data
– single X array that was updated in the given time step
Algorithm: Advection (MPDATA)
Technical Information
• Applications include
– To characterize the sub-grid scales effect in global numerical simulations
of turbulent stellar interiors
– To compare anelastic and compressible convection-permitting weather forecasts
for the Alpine region
– Modeling the prediction of forest fire spread
– Flood simulations
– Biomechanical modeling of brain injuries within the Voigt model
(a linear system of differential equations where the motion of the brain tissue depends
merely on the balance between viscous and elastic forces)
– Simulation gravity wave turbulence in the Earth's atmosphere
– Simulation of geophysical turbulence in the Earth's atmosphere
– Ocean modeling: simulation of three-dimensional solitary wave generation and
propagation using EULAG coupled to the barotropic NCOM (Navy Coastal Ocean Model)
tidal model
15
Applications of Advection (MPDATA)
• Applications include cont.
– Oil and Gas: provides a significant return on investment (ROI) in seismic analysis,
reservoir modelling and basin modelling. Used also to monitor drilling and seismic data
to optimize drilling trajectories and minimize environmental risk.
– AgriTech: models to track and predict various environmental impacts on crop yield such
as weather changes. For example, daily weather predictions can be customized based on
the needs of each client and range from hyperlocal to global.
• Example adopters
– Poznan Supercomputing and Networking Center, Poland: prognosis of air pollution
– European Centre for Medium-Range Weather Forecasts, UK: weather forecast
– Institute of Meteorology and Water Management, Poland: weather forecasts
– German Aerospace Center: aeronautics, transport and energy areas
– University of Cape Town, RPA: weather simulation
– Montreal University: weather simulation
– Warsaw University: ocean simulation
Applications of Advection (MPDATA), cont.
Full list
Algorithm: Advection (MPDATA)
Alveo FPGA Benchmark
INTEL XEON
E5-2995
INTEL XEON
E5-2995
INTEL XEON
GOLD 6148
INTEL XEON
PLATINUM
8168
XILINX
ALVEO U250
Performance (the higher
the better)
INTEL XEON
E5-2995
INTEL XEON
E5-2995
INTEL XEON
GOLD 6148
INTEL XEON
PLATINUM
8168
XILINX
ALVEO U250
Energy (the lower the
better) INTEL XEON
E5-2995
INTEL XEON
E5-2995
INTEL XEON
GOLD 6148
INTEL XEON
PLATINUM
8168
XILINX
ALVEO U250
Performance/W (the
higher the better)
More at: byteLAKE.com/en/FPGA
Explore byteLAKE’s CFD Suite
www.byteLAKE.com/en/CFDSuite
byteLAKE’s
CFD Suite
AI for CFD
AI
• highly optimized AI engines to analyze
text, image, video, sound and time
series data.
• Detecting shapes & patterns.
• Complex tasks automation.
• IoT/ edge, Cloud, on-premise.
HPC
• accelerating time to results and
adapting complex algorithms to GPU,
FPGA, many-CPU architectures.
• From single nodes to clusters.
Meet byteLAKE
AI and HPC Experts Your software partner
for AI & HPC projects
Experts in adapting
& optimizing
software for
Select Products
AI for CFD.
Ultra fast results,
radically lower TCO.
New possibilities.
Objects Detection
Edge AI and real time
computer vision.
56x faster AI training.
R&I • R&D • Licensing
HPC at byteLAKE
Accelerating time to results and adapting complex algorithms
to GPU, FPGA, many-CPU architectures.
Unleashing the power:
• selecting the right programming model to a given problem
(task parallelism, data parallelism, mixture of these two)
• providing the right balance between CPUs and GPUs/FPGAs
• optimizing data transfers between host memory and accelerators
• code adaptation to a variety of computing platforms
Bottom line: lowering TCO thru various optimizations
(performance, energy efficiency, accuracy of calculations)
More at: byteLAKE.com/en/HPC
Making the most of the hardware:
• Speedup: accelerating time to results for complex algorithms
• Green Computing: optimizing algorithms to reduce energy consumption
• Scalability: from single nodes to clusters
Products and Services
Cognitive AutomationEdge AI
Services
HPC
Products
CFD Suite
brainello
Ewa Guard
Federated Learning
Green Computing
(FPGA, GPU)
Intelligent
Restaurant
Incubation
byteLAKE among top AI companies in Poland!
"It contains information on practically all meaningful
companies operating in Poland which offer services or
products in the field of modern technologies. We believe this map
will be necessary to help both domestic and international
investors looking for interesting projects in Poland.",
Aleksander Kutela, President of Digital Poland Foundation

More Related Content

PDF
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
PDF
Dell NVIDIA AI Roadshow - South Western Ontario
PPTX
AI Solutions in Manufacturing
PPTX
Dive into H2O: NYC
PDF
Hpe partner summit proposal 2017
PDF
ML Model Deployment and Scoring on the Edge with Automatic ML & DF
PDF
Your AI Transformation
PDF
Device to Intelligence, IOT and Big Data in Oracle
AI for Manufacturing (Machine Vision, Edge AI, Federated Learning)
Dell NVIDIA AI Roadshow - South Western Ontario
AI Solutions in Manufacturing
Dive into H2O: NYC
Hpe partner summit proposal 2017
ML Model Deployment and Scoring on the Edge with Automatic ML & DF
Your AI Transformation
Device to Intelligence, IOT and Big Data in Oracle

What's hot (20)

PPTX
Getting Your Supply Chain Back on Track with AI
PDF
AI Foundations Course Module 1 - An AI Transformation Journey
PDF
Vertex Perspectives | AI Optimized Chipsets | Part III
PPTX
Powering the Internet of Things with Apache Hadoop
PPTX
AI in the Enterprise at Scale
PDF
Graphcore presenting at Project Juno Machine Intelligence Showcase
PPTX
Introduction & Hands-on with H2O Driverless AI
PDF
Accelerating AI Adoption with Partners
PDF
Vertex perspectives artificial intelligence
PPTX
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
PDF
Design advantages of Hadoop ETL offload with the Intel processor-powered Dell...
PDF
Gtc2013 recap
PPTX
Scaling & Managing Production Deployments with H2O ModelOps
PPTX
TechWiseTV Workshop: Improving Performance and Agility with Cisco HyperFlex
PDF
Logitech Accelerates Cloud Analytics Using Data Virtualization by Avinash Des...
PDF
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
PDF
Deep Learning In Enterprise
PPTX
Advanced Techniques to Accelerate Model Tuning | Software for AI Optimization...
PDF
Dell Digital Transformation Through AI and Data Analytics Webinar
PDF
AIOps: Anomalies Detection of Distributed Traces
Getting Your Supply Chain Back on Track with AI
AI Foundations Course Module 1 - An AI Transformation Journey
Vertex Perspectives | AI Optimized Chipsets | Part III
Powering the Internet of Things with Apache Hadoop
AI in the Enterprise at Scale
Graphcore presenting at Project Juno Machine Intelligence Showcase
Introduction & Hands-on with H2O Driverless AI
Accelerating AI Adoption with Partners
Vertex perspectives artificial intelligence
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
Design advantages of Hadoop ETL offload with the Intel processor-powered Dell...
Gtc2013 recap
Scaling & Managing Production Deployments with H2O ModelOps
TechWiseTV Workshop: Improving Performance and Agility with Cisco HyperFlex
Logitech Accelerates Cloud Analytics Using Data Virtualization by Avinash Des...
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
Deep Learning In Enterprise
Advanced Techniques to Accelerate Model Tuning | Software for AI Optimization...
Dell Digital Transformation Through AI and Data Analytics Webinar
AIOps: Anomalies Detection of Distributed Traces
Ad

Similar to byteLAKE's Alveo FPGA Solutions (20)

PDF
byteLAKE's expertise across NVIDIA architectures and configurations
PDF
The CAOS framework: democratize the acceleration of compute intensive applica...
PPTX
LEGaTO: Software Stack Runtimes
PPTX
Introduction to FPGA acceleration
PPTX
Programmable Exascale Supercomputer
PDF
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
PPT
Short.course.introduction.to.vhdl for beginners
PDF
Tieng Nguyen resume
PPTX
SS-CPSIoT 2023_Kevin Mika and Piotr Zierhoffer presentation
PDF
Barcelona Supercomputing Center, Generador de Riqueza
PPTX
PREP_ASIC.pptx KS KKA SPNNDPS FK KMAKDK D
PDF
The Principle Of Ultrasound Imaging System
PDF
The CAOS framework: Democratize the acceleration of compute intensive applica...
PPTX
Exascale Capabl
PDF
Summer training vhdl
PPTX
Summer training vhdl
PPTX
DATE 2020: Design, Automation and Test in Europe Conference
PPTX
Onnc intro
PDF
Nt1310 Unit 5 Algorithm
DOCX
CV-RENJINIK-27062016
byteLAKE's expertise across NVIDIA architectures and configurations
The CAOS framework: democratize the acceleration of compute intensive applica...
LEGaTO: Software Stack Runtimes
Introduction to FPGA acceleration
Programmable Exascale Supercomputer
Fast Insights to Optimized Vectorization and Memory Using Cache-aware Rooflin...
Short.course.introduction.to.vhdl for beginners
Tieng Nguyen resume
SS-CPSIoT 2023_Kevin Mika and Piotr Zierhoffer presentation
Barcelona Supercomputing Center, Generador de Riqueza
PREP_ASIC.pptx KS KKA SPNNDPS FK KMAKDK D
The Principle Of Ultrasound Imaging System
The CAOS framework: Democratize the acceleration of compute intensive applica...
Exascale Capabl
Summer training vhdl
Summer training vhdl
DATE 2020: Design, Automation and Test in Europe Conference
Onnc intro
Nt1310 Unit 5 Algorithm
CV-RENJINIK-27062016
Ad

More from byteLAKE (20)

PDF
Agent AI (LLM) dla Grupy Prawniczej (WGPR, byteLAKE)
PDF
AI Innovation: Digital Automation with Cognitive Services
PDF
byteLAKE's AI Products (use cases) (short)
PDF
byteLAKE's AI Products (use cases) - presentation
PDF
byteLAKE's AI Products for Industries (2024-02)
PDF
byteLAKE's CFD Suite (AI-accelerated CFD) (2024-02)
PDF
AI Solutions for Industries | Quality Inspection | Data Insights | Predictive...
PDF
Self-Checkout for Restaurants / AI Restaurants (2024-02)
PDF
Self-Checkout (AI for Restautants) - case study by byteLAKE's partner: Simpra
PDF
byteLAKE: Sztuczna Inteligencja dla Przemysłu i Usług
PDF
Przegląd zastosowań sztucznej inteligencji (2024-01)
PDF
Przegląd zastosowań Sztucznej inteligencjI
PDF
AI Solutions for Industries
PDF
AI-accelerated CFD (Computational Fluid Dynamics)
PDF
Advanced Quality Inspection and Data Insights (Artificial Intelligence)
PDF
AI Solutions for Industries (short)
PDF
Self-Checkout (AI for Restautants)
PDF
Applying Industrial AI Models to Product Quality Inspection
PDF
byteLAKE and Intel Partnership
PDF
CFD Suite (AI-accelerated CFD) - Sztuczna Inteligencja Przyspiesza Symulacje ...
Agent AI (LLM) dla Grupy Prawniczej (WGPR, byteLAKE)
AI Innovation: Digital Automation with Cognitive Services
byteLAKE's AI Products (use cases) (short)
byteLAKE's AI Products (use cases) - presentation
byteLAKE's AI Products for Industries (2024-02)
byteLAKE's CFD Suite (AI-accelerated CFD) (2024-02)
AI Solutions for Industries | Quality Inspection | Data Insights | Predictive...
Self-Checkout for Restaurants / AI Restaurants (2024-02)
Self-Checkout (AI for Restautants) - case study by byteLAKE's partner: Simpra
byteLAKE: Sztuczna Inteligencja dla Przemysłu i Usług
Przegląd zastosowań sztucznej inteligencji (2024-01)
Przegląd zastosowań Sztucznej inteligencjI
AI Solutions for Industries
AI-accelerated CFD (Computational Fluid Dynamics)
Advanced Quality Inspection and Data Insights (Artificial Intelligence)
AI Solutions for Industries (short)
Self-Checkout (AI for Restautants)
Applying Industrial AI Models to Product Quality Inspection
byteLAKE and Intel Partnership
CFD Suite (AI-accelerated CFD) - Sztuczna Inteligencja Przyspiesza Symulacje ...

Recently uploaded (20)

PDF
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PPTX
Chapter 5: Probability Theory and Statistics
PPT
Geologic Time for studying geology for geologist
PDF
A review of recent deep learning applications in wood surface defect identifi...
PPTX
The various Industrial Revolutions .pptx
PPTX
Benefits of Physical activity for teenagers.pptx
PPT
Module 1.ppt Iot fundamentals and Architecture
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PPTX
observCloud-Native Containerability and monitoring.pptx
PDF
Getting Started with Data Integration: FME Form 101
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PDF
August Patch Tuesday
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
Architecture types and enterprise applications.pdf
PDF
Developing a website for English-speaking practice to English as a foreign la...
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
A novel scalable deep ensemble learning framework for big data classification...
PPTX
Web Crawler for Trend Tracking Gen Z Insights.pptx
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Chapter 5: Probability Theory and Statistics
Geologic Time for studying geology for geologist
A review of recent deep learning applications in wood surface defect identifi...
The various Industrial Revolutions .pptx
Benefits of Physical activity for teenagers.pptx
Module 1.ppt Iot fundamentals and Architecture
Final SEM Unit 1 for mit wpu at pune .pptx
observCloud-Native Containerability and monitoring.pptx
Getting Started with Data Integration: FME Form 101
A contest of sentiment analysis: k-nearest neighbor versus neural network
August Patch Tuesday
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
DP Operators-handbook-extract for the Mautical Institute
Architecture types and enterprise applications.pdf
Developing a website for English-speaking practice to English as a foreign la...
Hindi spoken digit analysis for native and non-native speakers
A novel scalable deep ensemble learning framework for big data classification...
Web Crawler for Trend Tracking Gen Z Insights.pptx

byteLAKE's Alveo FPGA Solutions

  • 1. Alveo Products Marketplace Expertise Software Services Alveo Solutions
  • 2. Expertise in Alveo FPGA programming PCIe x86 CPU Host Application Runtime and Drivers Acceleration API FPGA Accelerated Functions DMA Engine AXI Interfaces byteLAKE’s Solutions Xilinx Acceleration Platform C/C++ code with OpenCL API calls C/C++ or OpenCL C FPGACPU 2 More at: byteLAKE.com/en/Alveo
  • 3. • Xilinx pioneered C to FPGA compilation technology (aka “HLS”) in 2011 3 Source code in C, C++ or OpenCL loop_main:for(int j=0;j<NUM_SIMGROUPS;j+=2) { loop_share:for(uint k=0;k<NUM_SIMS;k++) { loop_parallel:for(int i=0;i<NUM_RNGS;i++) { mt_rng[i].BOX_MULLER(&num1[i][k],&num2[i][k],ratio4,ratio3); float payoff1 = expf(num1[i][k])-1.0f; float payoff2 = expf(num2[i][k])-1.0f; if(num1[i][k]>0.0f) pCall1[i][k]+= payoff1; else pPut1[i][k]-=payoff1; if(num2[i][k]>0.0f) pCall2[i][k]+=payoff2; else pPut2[i][k]-=payoff2; } } } FPGACompile
  • 4. Xilinx FPGAs highlights 4 • No predefined instruction set or underlying architecture • Developers customize the architecture to their needs – Custom data paths – Custom bit-width – Custom memory hierarchies • Excels at all types of parallelism – Deeply pipelined (e.g. Video codecs) – Bit manipulations (e.g. AES, SHA) – Wide data path (e.g. DNN) – Custom memory hierarchy (e.g. Data analytics) • Adapts to evolving algorithms and workload needs FPGAs - the Ultimate Parallel Processing Device
  • 5. • Compute domain divided into sub-domains • Host sends data to the FPGA global memory • Host calls kernels to execute them on FPGA (kernel is called many times) • Each kernel call represents a single time step • FPGA sends the output array back to the host Typical Architecture More at: byteLAKE.com/en/PPAM19
  • 6. • Kernel is distributed into 4 SLRs • Each sub-domain is allocated in different memory bank • Data transfer occurs between neighboring memory banks Example processing SLR0 Kernel_A SLR1 Kernel_B SLR2 Kernel_C SLR3 Kernel_D Kernel Bank0 Bank1 Bank2 Bank3 Sub-domain Sub-domain Sub-domain Sub-domain 19
  • 7. Case study: CFD Kernels adaptation Typical CFD workflow From CAD to MESH… (meshing) Image source: https://www.openfoam.com/products/visualcfd.php …to CFD simulation and visualization. • MESH conversion (input) • byteLAKE’s CFD Kernels • Data output for visualization upto5% ofsimulationtime major workload OPENFOAM® is a registered trademark of ESI Group. This offering is not approved or endorsed by ESI Group, the producer of the OpenFOAM software and owner of the OPENFOAM® and OpenCFD® trademarks.
  • 8. byteLAKE created set of highly optimized CFD kernels for Xilinx Alveo Datacentre accelerator cards –Advection (movement of some material, dissolved or suspended in the fluid) –Pseudo velocity (approximation of the relative velocity) –Divergence (measures how much of fluid is flowing into/ out of a certain point in a vector field) –Thomas algorithm (simplified form of Gaussian elimination for tridiagonal system of equations) 8Download solution description: bytelake.com/en/download/2716/ CFD Kernels
  • 9. CFD acceleration with Alveo FPGA 9 More at: byteLAKE.com/en/FPGA
  • 10. MPDATA Accelerated CFD / Advection algorithm optimized for heterogeneous computing.
  • 11. CFD Computational Fluid Dynamics • Numerical analysis and algorithms to solve fluid flows problems –how liquids and gases flow and interact with surfaces • Widely used across industries: –automotive, chemical, aerospace, biomedical, power and energy, and construction etc. • Typical applications –weather simulations, –aerodynamic characteristics modelling and optimization, –petroleum mass flow rate assessment 11
  • 12. • MPDATA (Multidimensional Positive Definite Advection Transport Algorithm) – main part of the dynamic core of the Eulerian/ semi-Lagrangian (EULAG) model – EULAG (MPDATA+elliptic solver) is the established computational model, developed for simulating thermo-fluid flows across a wide range of scales and physical scenarios – currently, this model is being implemented as the new dynamic core of the COSMO (Consortium for Small-scale Modeling) weather prediction framework – advection (together with the elliptic solver) is a key part of many frameworks that allow users to implement their simulations • Advection – movement of some material (dissolved or suspended) in the fluid. Algorithm: Advection (MPDATA) General Information
  • 13. • Easy to integrate – Can work as a standalone application or be called as a function via our dedicated interface (e.g. can be called as a function with input and output arrays) – Compatible with frameworks like TensorFlow for integrating deep learning with CFD codes • Easy to visualize the results – Results can be stored in a raw format as a binary file of the output arrays or converted via byteLAKE tools to a ParaView format • See benefits already in 1-node HPC configurations – Strongly adapted to Alveo U250, were single card supports the max size of arrays: 2,1 Gcells (max compute domain: 1264 x 1264 x 1264) ~ 60 GB • Scalable to many cards per node and many nodes Algorithm: Advection (MPDATA) byteLAKE’s implementation compatibility
  • 14. • First-order-accurate step of the advection scheme. Second-order is an option. • Input data – Array X – non-diffusive quantity (e.g. temperature of water vapor, ice, precipitation, etc.) – Arrays V1, V2, V3 - each of them stores the velocity vectors in one direction – (optional) Arrays Fi, Fe - implosion and explosion forces acting on a structure of X – (optional) Array D with density – (optional) Array rho which defines an interface for the coupling of COSMO and EULAG dynamic core (used to provide the transformation of the X variable) – DT – time step (scalar) • Output data – single X array that was updated in the given time step Algorithm: Advection (MPDATA) Technical Information
  • 15. • Applications include – To characterize the sub-grid scales effect in global numerical simulations of turbulent stellar interiors – To compare anelastic and compressible convection-permitting weather forecasts for the Alpine region – Modeling the prediction of forest fire spread – Flood simulations – Biomechanical modeling of brain injuries within the Voigt model (a linear system of differential equations where the motion of the brain tissue depends merely on the balance between viscous and elastic forces) – Simulation gravity wave turbulence in the Earth's atmosphere – Simulation of geophysical turbulence in the Earth's atmosphere – Ocean modeling: simulation of three-dimensional solitary wave generation and propagation using EULAG coupled to the barotropic NCOM (Navy Coastal Ocean Model) tidal model 15 Applications of Advection (MPDATA)
  • 16. • Applications include cont. – Oil and Gas: provides a significant return on investment (ROI) in seismic analysis, reservoir modelling and basin modelling. Used also to monitor drilling and seismic data to optimize drilling trajectories and minimize environmental risk. – AgriTech: models to track and predict various environmental impacts on crop yield such as weather changes. For example, daily weather predictions can be customized based on the needs of each client and range from hyperlocal to global. • Example adopters – Poznan Supercomputing and Networking Center, Poland: prognosis of air pollution – European Centre for Medium-Range Weather Forecasts, UK: weather forecast – Institute of Meteorology and Water Management, Poland: weather forecasts – German Aerospace Center: aeronautics, transport and energy areas – University of Cape Town, RPA: weather simulation – Montreal University: weather simulation – Warsaw University: ocean simulation Applications of Advection (MPDATA), cont. Full list
  • 17. Algorithm: Advection (MPDATA) Alveo FPGA Benchmark INTEL XEON E5-2995 INTEL XEON E5-2995 INTEL XEON GOLD 6148 INTEL XEON PLATINUM 8168 XILINX ALVEO U250 Performance (the higher the better) INTEL XEON E5-2995 INTEL XEON E5-2995 INTEL XEON GOLD 6148 INTEL XEON PLATINUM 8168 XILINX ALVEO U250 Energy (the lower the better) INTEL XEON E5-2995 INTEL XEON E5-2995 INTEL XEON GOLD 6148 INTEL XEON PLATINUM 8168 XILINX ALVEO U250 Performance/W (the higher the better) More at: byteLAKE.com/en/FPGA
  • 18. Explore byteLAKE’s CFD Suite www.byteLAKE.com/en/CFDSuite byteLAKE’s CFD Suite AI for CFD
  • 19. AI • highly optimized AI engines to analyze text, image, video, sound and time series data. • Detecting shapes & patterns. • Complex tasks automation. • IoT/ edge, Cloud, on-premise. HPC • accelerating time to results and adapting complex algorithms to GPU, FPGA, many-CPU architectures. • From single nodes to clusters. Meet byteLAKE AI and HPC Experts Your software partner for AI & HPC projects Experts in adapting & optimizing software for Select Products AI for CFD. Ultra fast results, radically lower TCO. New possibilities. Objects Detection Edge AI and real time computer vision. 56x faster AI training. R&I • R&D • Licensing
  • 20. HPC at byteLAKE Accelerating time to results and adapting complex algorithms to GPU, FPGA, many-CPU architectures. Unleashing the power: • selecting the right programming model to a given problem (task parallelism, data parallelism, mixture of these two) • providing the right balance between CPUs and GPUs/FPGAs • optimizing data transfers between host memory and accelerators • code adaptation to a variety of computing platforms Bottom line: lowering TCO thru various optimizations (performance, energy efficiency, accuracy of calculations) More at: byteLAKE.com/en/HPC Making the most of the hardware: • Speedup: accelerating time to results for complex algorithms • Green Computing: optimizing algorithms to reduce energy consumption • Scalability: from single nodes to clusters
  • 21. Products and Services Cognitive AutomationEdge AI Services HPC Products CFD Suite brainello Ewa Guard Federated Learning Green Computing (FPGA, GPU) Intelligent Restaurant Incubation
  • 22. byteLAKE among top AI companies in Poland! "It contains information on practically all meaningful companies operating in Poland which offer services or products in the field of modern technologies. We believe this map will be necessary to help both domestic and international investors looking for interesting projects in Poland.", Aleksander Kutela, President of Digital Poland Foundation