SlideShare a Scribd company logo
QPACE      QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)




     Heiko Joerg Schick
     Firmware Project and Bring-up Lead
     Böblingen, 2009-06-29



    IBM Deutschland Research & Development GmbH 10/15/09     © Copyright IBM Corporation 2009
Agenda

             Overview


             QPACE Architecture


             QPACE Node Card




2        IBM Deutschland Research & Development GmbH 10/15/09   © Copyright IBM Corporation 2009
Overview


QPACE = QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)


    The QPACE project is a research collaboration of IBM Development and European
    universities and research institutes with the goal to build a prototype of a cell processor-
    based supercomputer.

    The major part of this project will be funded by the German Research Foundation (DFG –
    Deutsche Forschungsgemeinschaft) as part of a Collaborative Research Center (SFB –
    Sonderforschungsbereich [TR55]). Additional funds come the PRACE project.




3              IBM Deutschland Research & Development GmbH 10/15/09            © Copyright IBM Corporation 2009
QPACE Architecture


Architecture

    System:
      Node card with PowerXCell 8i processor and network processor (NWP)
      Commodity processor interconnected by a custom network
      256 node-cards per rack = 26 Tflops peak performance

    Network:
      3-dimensional Torus Network: nearest-neighbour communication, 3-dimensional torus topology
      Tree Network: evaluation of global conditions and synchronization
      Gigabit Ethernet: One link per node, rack-level switches

    Applications:
      Optimized for calculations in theoretical particle physics: Simulation of Quantum Chromodynamics
      Target sustained performance of 20-30%

    Liquid cooling system:
      Closed node card housing acts as heat conductor
      Housing is connected to liquid-cooled “cold plate”
      Cold Plate is placed between two rows of node cards



4             IBM Deutschland Research & Development GmbH 10/15/09                © Copyright IBM Corporation 2009
QPACE Architecture




      Root Card
      (16 per rack)                                                                     Backplane
                                                                                         (8 per rack)




    Node Card
    (256 per rack)


                                                                      Power Supply and Power Adapter Card
                                                                                   (24 per rack)
                                                               Rack


5       IBM Deutschland Research & Development GmbH 10/15/09                         © Copyright IBM Corporation 2009
QPACE Node Card


Features

    Components:
      PowerXCell 8i processor 3.2 GHZ
      4 Gigabyte DDR2 memory 800 MHZ with ECC
      Network processor (NWP) Xilinx FPGA LX110T FPGA
      Ethernet PHY
      6 x 1GB/s external links using PCI Express physical layer
      Service Processor (SP) Freescale 52211
      FLASH (firmware and FPGA configuration)
      Power subsystem
      Clocking

    Network Processor:
      FLEXIO interface to PowerXCell 8i processor, 2 bytes with 3 GHZ bit rate
      Gigabit Ethernet
      UART FW Linux console
      UART SP communication
      SPI Master (boot flash)
      SPI Slave for training and configuration
      GPIO


6             IBM Deutschland Research & Development GmbH 10/15/09                © Copyright IBM Corporation 2009
QPACE Node Card


                                                 Network Processor   Network PHYs
               PowerXCell 8i                     (FPGA)
    Memory     Processor




7            IBM Deutschland Research & Development GmbH 10/15/09          © Copyright IBM Corporation 2009
QPACE Node Card


Block Diagram

                                                              DDR2          DDR2
                                                           DDR2           DDR2

                                                                               800MHz


          I2C
                       Power                        SPI                                    RW
                     Subsystem                                                           (Debug)
                                                               PowerXCell 8i




                                                          FLEXIO            FLEXIO
                      Clocking
                                                           6GB/s             6GB/s


                                                                                                         RS232
                                   SPI
        I2C
                   SP                                                FPGA Virtex-5
                                 UART
                Freescale
                MCF52211                                                                     GigE         PHY



                                 SPI                                            384 IO@250MHZ
                  Flash
                                                                                           4*8*2*6 = 384 IO
                                                                                        680 available (LX110T)
                                                                   6x 1GB/s PHY




                                                                Compute Network




8       IBM Deutschland Research & Development GmbH 10/15/09                                                     © Copyright IBM Corporation 2009
QPACE Node Card


FPGA Architecture
                                                           FLEXIO


                                                           Rocket IO                                IBM:
                                                                                                    •  RocketIO Logic
                                                           IOC (IOIF)
                                                            IOC (IOIF)
                                                             FELX iO                                •  IOC Logic
                                                                                                    •  GBIF Logic


                         Slave(BE)                            GBIF                  Master(BE)




                     Receive Requests                                               Make Requests


                                                  Switch / Address Decode / FIFOs
                                                           Bus Controller



                                                                                                    Academic Partners:
                                                                                                    •  Network Processor Logic




     6 x 1GB/S




9                IBM Deutschland Research & Development GmbH 10/15/09                               © Copyright IBM Corporation 2009
QPACE Node Card


FlexIO Processor Interface

     Interface:
       High bandwidth interface between IBM PowerXCell 8i processor and Xilinx Viretx-5 FPGA via an
        interface implementation from Rambus Inc.
       Optimized for intra-board environments.


     Challenges:
       QPACE FlexIO connection is very challenging:
         –  Speed, Latency, Bandwidth and Timing (Clock)
         –  3 Gbyte/sec communication channel
         –  2 Byte link wide

     Requirements:
       FlexIO requires link training after power-on:
         –  Phase calibration (aligns the data for optimal sampling point)
         –  Parallel calibration (synchronizes the receive deserializer with the transmit serializer)
         –  Levelization calibration (aligns all data lanes)




10             IBM Deutschland Research & Development GmbH 10/15/09                                     © Copyright IBM Corporation 2009
11   IBM Deutschland Research & Development GmbH 10/15/09   © Copyright IBM Corporation 2009
12   IBM Deutschland Research & Development GmbH 10/15/09   © Copyright IBM Corporation 2009
Thank you very much for your attention.
13     IBM Deutschland Research & Development GmbH 10/15/09   © Copyright IBM Corporation 2009
Weitere Themen


Disclaimer

IBM®, DB2®, MVS/ESA, AIX®, S/390®, AS/400®, OS/390®, OS/400®, iSeries, pSeries, xSeries, zSeries, z/
OS, AFP, Intelligent Miner, WebSphere®, Netfinity®, Tivoli®, Informix und Informix® Dynamic ServerTM,
IBM, BladeCenter and POWER and others are trademarks of the IBM Corporation in US and/or other
countries.

Cell Broadband Engine is a trademark of Sony Computer Entertainment, Inc. in the United States,
other countries, or both and is used under license there from. Linux is a trademark of Linus Torvalds
in the United States, other countries or both.

Other company, product, or service names may be trademarks or service marks of others.
The information and materials are provided on an "as is" basis and are subject to change.




14              IBM Deutschland Research & Development GmbH 10/15/09                   © Copyright IBM Corporation 2009

More Related Content

PDF
Pic base a_1
PDF
An introduction to FPGAs and Their MPSOCs
PPTX
Introducing the ADSP BF609 Blackfin Processors
PDF
FPGA_Overview_Ibr_2014
PPT
Fpga(field programmable gate array)
PDF
Advance hdl design training on xilinx fpga
PPT
Flash Programming F28x
PPTX
DESIGN CHOICES FOR EMBEDDED REAL-TIME CONTROL SYSTEMS @ 4th FPGA Camp
Pic base a_1
An introduction to FPGAs and Their MPSOCs
Introducing the ADSP BF609 Blackfin Processors
FPGA_Overview_Ibr_2014
Fpga(field programmable gate array)
Advance hdl design training on xilinx fpga
Flash Programming F28x
DESIGN CHOICES FOR EMBEDDED REAL-TIME CONTROL SYSTEMS @ 4th FPGA Camp

What's hot (20)

PPTX
Fpga in space
PDF
Nexys4ddr rm FPGA board Datasheet
PPTX
An Automatic Generation of NoC Architectures: An Application-Mapping Approach
PPTX
PCIe Gen 3.0 Presentation @ 4th FPGA Camp
PPTX
Do You Like Coffee with Your dessert? Java and the Raspberry Pi - Simon Ritte...
PDF
GlusterFS モジュール超概論
PDF
AMC & VPX Form Factor Boards With High Speed SERDES: Embedded World 2010
PPTX
Digital Design Flow
PDF
NNSA Explorations: ARM for Supercomputing
PDF
Adv. FPGA Motor Control--EBV & Univ. of Koln: Embedded World 2010
PPTX
小型安価なFPGAボードの紹介と任意波形発生器
PDF
Basys Board Digilent Manual
PDF
POWER10 innovations for HPC
PPTX
What's new in cdt 8.1
PDF
FPGA_BasedGCD
PDF
Under the Armor of Knights Corner: Intel MIC Architecture at Hotchips 2012
PDF
Continuum PCAP
PPTX
ISBI MPI Tutorial
PDF
ds894-zynq-ultrascale-plus-overview
Fpga in space
Nexys4ddr rm FPGA board Datasheet
An Automatic Generation of NoC Architectures: An Application-Mapping Approach
PCIe Gen 3.0 Presentation @ 4th FPGA Camp
Do You Like Coffee with Your dessert? Java and the Raspberry Pi - Simon Ritte...
GlusterFS モジュール超概論
AMC & VPX Form Factor Boards With High Speed SERDES: Embedded World 2010
Digital Design Flow
NNSA Explorations: ARM for Supercomputing
Adv. FPGA Motor Control--EBV & Univ. of Koln: Embedded World 2010
小型安価なFPGAボードの紹介と任意波形発生器
Basys Board Digilent Manual
POWER10 innovations for HPC
What's new in cdt 8.1
FPGA_BasedGCD
Under the Armor of Knights Corner: Intel MIC Architecture at Hotchips 2012
Continuum PCAP
ISBI MPI Tutorial
ds894-zynq-ultrascale-plus-overview
Ad

Similar to QPACE QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.) (20)

PDF
Training Lecture
PPTX
Experiences in Application Specific Supercomputer Design - Reasons, Challenge...
PDF
Power Blades Implementation
PDF
QPACE - QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)
PDF
Sun sparc enterprise t5440 server technical presentation
PDF
05 2012 power_roadshow_software_on_power
PPTX
Intel_Low Power Intelligent Solutions with Intel Atom Processor
PDF
IBM I and blade center update 2009
PDF
Vigor Ex
PPTX
IBM System x3850 X5 Technical Presenation abbrv.
PDF
Power Systems 2009 Hardware
PDF
Sun sparc enterprise t5140 and t5240 servers technical presentation
PDF
QsNetIII, An HPC Interconnect For Peta Scale Systems
PPT
The Cell Processor
PPT
Modeling System Behaviors: A Better Paradigm on Prototyping
PPTX
8-way-server
PDF
Nucleus GP
PDF
Amd accelerated computing -ufrj
PDF
Xilinx manual
PDF
GPUDirect RDMA and Green Multi-GPU Architectures
Training Lecture
Experiences in Application Specific Supercomputer Design - Reasons, Challenge...
Power Blades Implementation
QPACE - QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)
Sun sparc enterprise t5440 server technical presentation
05 2012 power_roadshow_software_on_power
Intel_Low Power Intelligent Solutions with Intel Atom Processor
IBM I and blade center update 2009
Vigor Ex
IBM System x3850 X5 Technical Presenation abbrv.
Power Systems 2009 Hardware
Sun sparc enterprise t5140 and t5240 servers technical presentation
QsNetIII, An HPC Interconnect For Peta Scale Systems
The Cell Processor
Modeling System Behaviors: A Better Paradigm on Prototyping
8-way-server
Nucleus GP
Amd accelerated computing -ufrj
Xilinx manual
GPUDirect RDMA and Green Multi-GPU Architectures
Ad

More from Heiko Joerg Schick (16)

PDF
Da Vinci - A scaleable architecture for neural network computing (updated v4)
PDF
Huawei empowers healthcare industry with AI technology
PDF
The 2025 Huawei trend forecast gives you the lowdown on data centre facilitie...
PDF
The Smarter Car for Autonomous Driving
PDF
From edge computing to in-car computing
PDF
Need and value for various levels of autonomous driving
PPT
Petascale Analytics - The World of Big Data Requires Big Analytics
PDF
Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS
PDF
Blue Gene Active Storage
PDF
Browser and Management App for Google's Person Finder
PDF
High Performance Computing - Challenges on the Road to Exascale Computing
PDF
IBM Corporate Service Corps - Helping Create Interactive Flood Maps
PDF
Real time Flood Simulation for Metro Manila and the Philippines
PPT
Slimline Open Firmware
PPT
Agnostic Device Drivers
PPTX
directCell - Cell/B.E. tightly coupled via PCI Express
Da Vinci - A scaleable architecture for neural network computing (updated v4)
Huawei empowers healthcare industry with AI technology
The 2025 Huawei trend forecast gives you the lowdown on data centre facilitie...
The Smarter Car for Autonomous Driving
From edge computing to in-car computing
Need and value for various levels of autonomous driving
Petascale Analytics - The World of Big Data Requires Big Analytics
Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS
Blue Gene Active Storage
Browser and Management App for Google's Person Finder
High Performance Computing - Challenges on the Road to Exascale Computing
IBM Corporate Service Corps - Helping Create Interactive Flood Maps
Real time Flood Simulation for Metro Manila and the Philippines
Slimline Open Firmware
Agnostic Device Drivers
directCell - Cell/B.E. tightly coupled via PCI Express

Recently uploaded (20)

DOCX
NFL London Browns Keep Sanders at No. 4 QB Slot.docx
PDF
FIFA World Cup Semi Final Los Angeles is a Global Soccer Powerhouse for the F...
PPTX
Best All-Access Digital Pass me .pptxxxx
PPTX
India – The Diverse and Dynamic Country | TIDA Sports
PPTX
Introduction to Chess with piece movement information
PDF
Understanding Volunteering_ A Look at Its True Meaning by David Bennett Gallo...
DOCX
Europa League Final 2026 Showcase13.docx
DOC
Bishop's毕业证学历认证,维耶蒙特利尔学校毕业证毕业证文凭
PPTX
Presentación powerpoint ---deportes.pptx
DOCX
NFL Dublin Rondale Moore’s Comeback Ends in Heartbreak.docx
DOCX
NFL London Jets QB Room Dealing with Multiple Injuries.docx
DOCX
MetLife Stadium Seeks Volunteers for FIFA 2026, Including the Final Match.docx
DOCX
NFL Dublin Labriola on Steelers’ Victory Over the Jaguars.docx
PDF
FIFA World Cup Scaloni Hopeful for Messi’s FIFA World Cup 2026 Participation.pdf
DOCX
FIFA World Cup Semi Final: Lionel Messi one win from crowning glory after Arg...
DOCX
NFL Dublin Vikings’ Blueprint to Stop the Patriots’ Attack.docx
PDF
Best All-Access Digital Pass me .... pdf
DOCX
FIFA World Cup 2026 Run-Up Just 10 Months Until Kickoff.docx
DOCX
FA Cup Final 2026 Siring: Arne Slot Crit
PPTX
Badminton Sport group presentation pathfit
NFL London Browns Keep Sanders at No. 4 QB Slot.docx
FIFA World Cup Semi Final Los Angeles is a Global Soccer Powerhouse for the F...
Best All-Access Digital Pass me .pptxxxx
India – The Diverse and Dynamic Country | TIDA Sports
Introduction to Chess with piece movement information
Understanding Volunteering_ A Look at Its True Meaning by David Bennett Gallo...
Europa League Final 2026 Showcase13.docx
Bishop's毕业证学历认证,维耶蒙特利尔学校毕业证毕业证文凭
Presentación powerpoint ---deportes.pptx
NFL Dublin Rondale Moore’s Comeback Ends in Heartbreak.docx
NFL London Jets QB Room Dealing with Multiple Injuries.docx
MetLife Stadium Seeks Volunteers for FIFA 2026, Including the Final Match.docx
NFL Dublin Labriola on Steelers’ Victory Over the Jaguars.docx
FIFA World Cup Scaloni Hopeful for Messi’s FIFA World Cup 2026 Participation.pdf
FIFA World Cup Semi Final: Lionel Messi one win from crowning glory after Arg...
NFL Dublin Vikings’ Blueprint to Stop the Patriots’ Attack.docx
Best All-Access Digital Pass me .... pdf
FIFA World Cup 2026 Run-Up Just 10 Months Until Kickoff.docx
FA Cup Final 2026 Siring: Arne Slot Crit
Badminton Sport group presentation pathfit

QPACE QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.)

  • 1. QPACE QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.) Heiko Joerg Schick Firmware Project and Bring-up Lead Böblingen, 2009-06-29 IBM Deutschland Research & Development GmbH 10/15/09 © Copyright IBM Corporation 2009
  • 2. Agenda Overview QPACE Architecture QPACE Node Card 2 IBM Deutschland Research & Development GmbH 10/15/09 © Copyright IBM Corporation 2009
  • 3. Overview QPACE = QCD Parallel Computing on the Cell Broadband Engine™ (Cell/B.E.) The QPACE project is a research collaboration of IBM Development and European universities and research institutes with the goal to build a prototype of a cell processor- based supercomputer. The major part of this project will be funded by the German Research Foundation (DFG – Deutsche Forschungsgemeinschaft) as part of a Collaborative Research Center (SFB – Sonderforschungsbereich [TR55]). Additional funds come the PRACE project. 3 IBM Deutschland Research & Development GmbH 10/15/09 © Copyright IBM Corporation 2009
  • 4. QPACE Architecture Architecture System:   Node card with PowerXCell 8i processor and network processor (NWP)   Commodity processor interconnected by a custom network   256 node-cards per rack = 26 Tflops peak performance Network:   3-dimensional Torus Network: nearest-neighbour communication, 3-dimensional torus topology   Tree Network: evaluation of global conditions and synchronization   Gigabit Ethernet: One link per node, rack-level switches Applications:   Optimized for calculations in theoretical particle physics: Simulation of Quantum Chromodynamics   Target sustained performance of 20-30% Liquid cooling system:   Closed node card housing acts as heat conductor   Housing is connected to liquid-cooled “cold plate”   Cold Plate is placed between two rows of node cards 4 IBM Deutschland Research & Development GmbH 10/15/09 © Copyright IBM Corporation 2009
  • 5. QPACE Architecture Root Card (16 per rack) Backplane (8 per rack) Node Card (256 per rack) Power Supply and Power Adapter Card (24 per rack) Rack 5 IBM Deutschland Research & Development GmbH 10/15/09 © Copyright IBM Corporation 2009
  • 6. QPACE Node Card Features Components:   PowerXCell 8i processor 3.2 GHZ   4 Gigabyte DDR2 memory 800 MHZ with ECC   Network processor (NWP) Xilinx FPGA LX110T FPGA   Ethernet PHY   6 x 1GB/s external links using PCI Express physical layer   Service Processor (SP) Freescale 52211   FLASH (firmware and FPGA configuration)   Power subsystem   Clocking Network Processor:   FLEXIO interface to PowerXCell 8i processor, 2 bytes with 3 GHZ bit rate   Gigabit Ethernet   UART FW Linux console   UART SP communication   SPI Master (boot flash)   SPI Slave for training and configuration   GPIO 6 IBM Deutschland Research & Development GmbH 10/15/09 © Copyright IBM Corporation 2009
  • 7. QPACE Node Card Network Processor Network PHYs PowerXCell 8i (FPGA) Memory Processor 7 IBM Deutschland Research & Development GmbH 10/15/09 © Copyright IBM Corporation 2009
  • 8. QPACE Node Card Block Diagram DDR2 DDR2 DDR2 DDR2 800MHz I2C Power SPI RW Subsystem (Debug) PowerXCell 8i FLEXIO FLEXIO Clocking 6GB/s 6GB/s RS232 SPI I2C SP FPGA Virtex-5 UART Freescale MCF52211 GigE PHY SPI 384 IO@250MHZ Flash 4*8*2*6 = 384 IO 680 available (LX110T) 6x 1GB/s PHY Compute Network 8 IBM Deutschland Research & Development GmbH 10/15/09 © Copyright IBM Corporation 2009
  • 9. QPACE Node Card FPGA Architecture FLEXIO Rocket IO IBM: •  RocketIO Logic IOC (IOIF) IOC (IOIF) FELX iO •  IOC Logic •  GBIF Logic Slave(BE) GBIF Master(BE) Receive Requests Make Requests Switch / Address Decode / FIFOs Bus Controller Academic Partners: •  Network Processor Logic 6 x 1GB/S 9 IBM Deutschland Research & Development GmbH 10/15/09 © Copyright IBM Corporation 2009
  • 10. QPACE Node Card FlexIO Processor Interface Interface:   High bandwidth interface between IBM PowerXCell 8i processor and Xilinx Viretx-5 FPGA via an interface implementation from Rambus Inc.   Optimized for intra-board environments. Challenges:   QPACE FlexIO connection is very challenging: –  Speed, Latency, Bandwidth and Timing (Clock) –  3 Gbyte/sec communication channel –  2 Byte link wide Requirements:   FlexIO requires link training after power-on: –  Phase calibration (aligns the data for optimal sampling point) –  Parallel calibration (synchronizes the receive deserializer with the transmit serializer) –  Levelization calibration (aligns all data lanes) 10 IBM Deutschland Research & Development GmbH 10/15/09 © Copyright IBM Corporation 2009
  • 11. 11 IBM Deutschland Research & Development GmbH 10/15/09 © Copyright IBM Corporation 2009
  • 12. 12 IBM Deutschland Research & Development GmbH 10/15/09 © Copyright IBM Corporation 2009
  • 13. Thank you very much for your attention. 13 IBM Deutschland Research & Development GmbH 10/15/09 © Copyright IBM Corporation 2009
  • 14. Weitere Themen Disclaimer IBM®, DB2®, MVS/ESA, AIX®, S/390®, AS/400®, OS/390®, OS/400®, iSeries, pSeries, xSeries, zSeries, z/ OS, AFP, Intelligent Miner, WebSphere®, Netfinity®, Tivoli®, Informix und Informix® Dynamic ServerTM, IBM, BladeCenter and POWER and others are trademarks of the IBM Corporation in US and/or other countries. Cell Broadband Engine is a trademark of Sony Computer Entertainment, Inc. in the United States, other countries, or both and is used under license there from. Linux is a trademark of Linus Torvalds in the United States, other countries or both. Other company, product, or service names may be trademarks or service marks of others. The information and materials are provided on an "as is" basis and are subject to change. 14 IBM Deutschland Research & Development GmbH 10/15/09 © Copyright IBM Corporation 2009