SlideShare a Scribd company logo
May 9, 2016
1
HEAT
(Hardware enabled Algorithmic tester)
for 2.5D HBM Solution
May 9, 2016
Kunal Varshney, Open-Silicon
Ganesh Venkatkrishnan, Open-Silicon
Pankaj Prajapati, Open-Silicon
May 9, 2016
2
Agenda
• High Bandwidth Memory (HBM) Basics
• Chip Introduction & Motivation
• Challenges
• HEAT Features
• Conclusion
May 9, 2016
3
Understanding HBM
• JEDEC standard
• DRAM embedded in ASIC Package (Stacked Memory Dies)
• Backed by AMD, SK-Hynix, Samsung, nVidia
• 2.5D Interposer based ASIC implementation
• Initial adoption expected in GPU & HPC markets, now followed
by Networking
• HBM Gen2 JEDEC specifications:
– 8 Gb per DRAM die
– 2 Gbps/pin (DDR)
– 4/8 High stack for 4GB/8GB density
– 256 GB/s total bandwidth
3
May 9, 2016
4
Understanding HBM
4
From Memory Vendor
Developed at
Open-Silicon
May 9, 2016
5
Chip Introduction & Motivation
• HBM Test Chip
• Technology node : 16nm FF from TSMC
• 2.5D Interposer technology
• HBM Memory : SK-Hynix
• Key Motivation: On-chip Silicon Validation of in-house HBM
capabilities
• HBM Protocol Controller
• HBM PHY
• D2D IOs
5
May 9, 2016
6
Chip Block Diagram
6
hbm_top.v
HBM PHY
HBM
Controller IO
PADS
PLL
HEAT
Tap
Controller
TAPSEL_EN
Jtag_2_csr(C)
Reset De-assertion
Reset De-assertion
HBM D2D
Channel 0
HBM D2D
Channel 1
IEEE 1500
clk_reset_ctrl.v
Clk_CNTRL/
DEBUG
PLL
Clock/Reset
Signals
Reset
Chatter
Controller
CSR i/f
UIF 0
UIF 1
UIF 2
UIF 3
Jtag_2_uif
AUTO
BISTAUTO
BISTAUTO
BISTAUTO
BIST
Padring.v
GPIO
Logic Die
HBM reset to
memory
Soc reset
HBM_clk
HBM Memory DIE
Package
JTAG 5
5
14
7
DA
60
DFT Speed Test Module/ IO
May 9, 2016
7
Challenges
• At speed Functional Testing (1Ghz)
– At speed Traffic Generation ( mimicking system traffic)
– Single cycle data integrity check
• Performance Measurement
• Power aware Design
• Minimal Package IO Count
• Fall-back chip booting options
• Functional Debug
• User Interface Debug
• Testing Logic die before Assembly
7
May 9, 2016
8
HEAT Architecture
8
May 9, 2016
9
Functional Testing
• Traffic generation
– Galois LFSRs used
– Modes of operation:
• BW: WWWW…..
• BR: RRRRR…...
• RAW: WRWRWR….
WWRRWWRR….
– Control on DATA/ADDR
• Constant, INC/DEC by N, Random, W0, W1, CB
• Scenarios: Memory sweep, Bank Switching, Max Latency, Max BW etc.
• Data Integrity
– Wr_data stored in 16 deep CAM. ID used as addr of CAM
– Single cycle data comparator when rd response is received.
9
May 9, 2016
10
Challenges
• At speed Functional Testing (1Ghz)
– At speed Traffic Generation ( mimicking system traffic)
– Single cycle data integrity check
• Performance Measurement
• Power aware Design
• Minimal Package IO Count
• Fall-back chip booting options
• Functional Debug
• User Interface Debug
• Testing Logic die before Assembly
10
May 9, 2016
11
Performance Measurement
• Latency:
– Min latency with transaction details
– Max latency with transaction details
– Avg Latency
• Bandwidth:
– Total number of clks taken by a sequence.
• Configurable Timeout Value and Timeout status.
• Byte wise Granularity in LFSR to reduce Power consumption
11
May 9, 2016
12
Challenges
• At speed Functional Testing (1Ghz)
– At speed Traffic Generation ( mimicking system traffic)
– Single cycle data integrity check
• Performance Measurement
• Power aware Design
• Minimal Package IO Count
• Fall-back chip booting options
• Functional Debug
• User Interface Debug
• Testing Logic die before Assembly
12
May 9, 2016
13
Package IO Count
• Test Chip : Limited Package IO availability
• JTAG Selected as i/f for Programming the HEAT
• Functional JTAG and DFT JTAG Muxed
• Functional IO Muxing.
– Same IO used as i/p during chip reset and o/p after chip reset
– Inputs from board used for fall-back chip booting options.
13
May 9, 2016
14
Challenges
• At speed Functional Testing (1Ghz)
– At speed Traffic Generation ( mimicking system traffic)
– Single cycle data integrity check
• Performance Measurement
• Power aware Design
• Minimal Package IO Count
• Fall-back chip booting options
• Functional Debug
• User Interface Debug
• Testing Logic die before Assembly
14
May 9, 2016
15
Functional Debug
• Number of UIF read/write errors received
• Number of Data Mismatch errors with Expected data
and actual data received on UIF
• Info of first failed transaction
• Directed Mode
– Single or upto 4 transactions (read or write)
– User can create it’s own transaction
• Bitwise Data masking to pin-point single bit errors in
data stream
• Option of stop/continue on error.
15
May 9, 2016
16
Challenges
• At speed Functional Testing (1Ghz)
– At speed Traffic Generation ( mimicking system traffic)
– Single cycle data integrity check
• Performance Measurement
• Power aware Design
• Minimal Package IO Count
• Fall-back chip booting options
• Functional Debug
• User Interface Debug
• Testing Logic die before Assembly
16
May 9, 2016
17
User Interface Debug
• Bus Monitor design
– Monitors HEAT engine output and UIF Bus
– Helpful in Debugging on-chip failures
• Multiple mode of operations
– Bus Capture
– Transaction recorder
– Error recorder
17
May 9, 2016
18
Challenges
• At speed Functional Testing (1Ghz)
– At speed Traffic Generation ( mimicking system traffic)
– Single cycle data integrity check
• Performance Measurement
• Power aware Design
• Minimal Package IO Count
• Fall-back chip booting options
• Functional Debug
• User Interface Debug
• Testing Logic die before Assembly
18
May 9, 2016
19
Testing Logic Die before Assembly
• Loopback Handles for testing without memory
– Point of Loopback :
» Core Side Loopback (before D2D IO)
» IO loopback (Loopback using D2D IO)
– Loopback Transaction generation:
» UIF level Loopback
» DFI-i (input of PHY) level Address loopback
» DFI-i (input of PHY) level Data loopback
19
May 9, 2016
20
Conclusion
• At speed Functional Testing (1Ghz) 
– At speed Traffic Generation ( mimicking system traffic)
– Single cycle data integrity check
• Performance Measurement 
• Power aware Design 
• Minimal Package IO Count 
• Fall-back chip booting options 
• Functional Debug 
• User Interface Debug 
• Testing Logic die before Assembly 
20

More Related Content

PPTX
Eddy Kvetny, System Engineering Group Leader, Intel
PDF
SOUG_SDM_OracleDB_V3
PPTX
OVHcloud – Enterprise Cloud Databases
PPTX
Webinar: StorPool and WHIR - better storage, better business
PDF
Optimization_of_Virtual_Machines_for_High_Performance
PPTX
Nagios Conference 2014 - Frank Pantaleo - Nagios Monitoring of Netezza Databases
PDF
Обзор новой СХД EMC Unity. Планирование обновления с VNX\VNX2, Тимофей Григор...
PPTX
Varrow Madness 2014 - Virtualizing SQL
Eddy Kvetny, System Engineering Group Leader, Intel
SOUG_SDM_OracleDB_V3
OVHcloud – Enterprise Cloud Databases
Webinar: StorPool and WHIR - better storage, better business
Optimization_of_Virtual_Machines_for_High_Performance
Nagios Conference 2014 - Frank Pantaleo - Nagios Monitoring of Netezza Databases
Обзор новой СХД EMC Unity. Планирование обновления с VNX\VNX2, Тимофей Григор...
Varrow Madness 2014 - Virtualizing SQL

What's hot (20)

PPTX
Varrow datacenter storage today and tomorrow
PPTX
TDS-16489U - Dual Processor
PDF
SOUG_GV_Flashgrid_V4
PPTX
Nvidia grid and vGPU
PPTX
Recent Developments in Donard
PDF
Ceph Day Beijing - Ceph RDMA Update
PPTX
Ceph Day Taipei - How ARM Microserver Cluster Performs in Ceph
PDF
VMware - Virtual SAN - IT Changes Everything
PPTX
UniPlex vScaleDB pat. pending
PDF
Storage based snapshots for KVM VMs in CloudStack
PPTX
V mware2012 20121221_final
PPTX
Hyper-v Best Practices
PDF
1 sysadmin vs 250 clusters de stockage
PPTX
Using next gen storage in Cloudstack
PDF
VMworld 2013: Lowering TCO for Virtual Desktops with VMware View and VMware V...
PDF
Propelling IoT Innovation with Predictive Analytics
PPTX
Ceph Community Talk on High-Performance Solid Sate Ceph
PDF
Introduction to CloudStack Storage Subsystem
PPTX
Virtual SAN 6.2, hyper-converged infrastructure software
PDF
VMware Virtual SAN Presentation
Varrow datacenter storage today and tomorrow
TDS-16489U - Dual Processor
SOUG_GV_Flashgrid_V4
Nvidia grid and vGPU
Recent Developments in Donard
Ceph Day Beijing - Ceph RDMA Update
Ceph Day Taipei - How ARM Microserver Cluster Performs in Ceph
VMware - Virtual SAN - IT Changes Everything
UniPlex vScaleDB pat. pending
Storage based snapshots for KVM VMs in CloudStack
V mware2012 20121221_final
Hyper-v Best Practices
1 sysadmin vs 250 clusters de stockage
Using next gen storage in Cloudstack
VMworld 2013: Lowering TCO for Virtual Desktops with VMware View and VMware V...
Propelling IoT Innovation with Predictive Analytics
Ceph Community Talk on High-Performance Solid Sate Ceph
Introduction to CloudStack Storage Subsystem
Virtual SAN 6.2, hyper-converged infrastructure software
VMware Virtual SAN Presentation
Ad

Similar to Kunal Varshney, VLSI Engineer, Open-Silicon (20)

PDF
Tackling Network Bottlenecks with Hardware Accelerations: Cloud vs. On-Premise
PDF
Gert Goossens,Sen. Director, ASIP Tools, Synopsys
PDF
Ken Liao, Senior Associate VP, Faraday
PDF
[OpenStack Days Korea 2016] Track3 - OpenStack on 64-bit ARM with X-Gene
PPTX
Sagar Kadam, Lead Software Engineer, Open-Silicon
PDF
IBM Power9 Features and Specifications
PDF
SharkFest16_Palm_Online
PDF
Evoluzione dello storage
PDF
Heterogeneous Computing : The Future of Systems
PDF
Oow 2008 yahoo_pie-db
PDF
Performance challenges in software networking
PDF
OpenCAPI next generation accelerator
PDF
OpenCAPI Technology Ecosystem
PPT
MAGPI: Advanced Services: IPv6, Multicast, DNSSEC
PPTX
Introduction to DPDK
PDF
100 M pps on PC.
PPTX
Jvm problem diagnostics
PDF
QsNetIII, An HPC Interconnect For Peta Scale Systems
PDF
Power overview 2018 08-13b
PDF
IBM POWER8 Systems Technology Group Development
Tackling Network Bottlenecks with Hardware Accelerations: Cloud vs. On-Premise
Gert Goossens,Sen. Director, ASIP Tools, Synopsys
Ken Liao, Senior Associate VP, Faraday
[OpenStack Days Korea 2016] Track3 - OpenStack on 64-bit ARM with X-Gene
Sagar Kadam, Lead Software Engineer, Open-Silicon
IBM Power9 Features and Specifications
SharkFest16_Palm_Online
Evoluzione dello storage
Heterogeneous Computing : The Future of Systems
Oow 2008 yahoo_pie-db
Performance challenges in software networking
OpenCAPI next generation accelerator
OpenCAPI Technology Ecosystem
MAGPI: Advanced Services: IPv6, Multicast, DNSSEC
Introduction to DPDK
100 M pps on PC.
Jvm problem diagnostics
QsNetIII, An HPC Interconnect For Peta Scale Systems
Power overview 2018 08-13b
IBM POWER8 Systems Technology Group Development
Ad

More from chiportal (20)

PDF
Prof. Zhihua Wang, Tsinghua University, Beijing, China
PPTX
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
PPTX
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
PPTX
Prof. Uri Weiser,Technion
PDF
Prof. Danny Raz, Director, Bell Labs Israel, Nokia
PDF
Marco Casale-Rossi, Product Mktg. Manager, Synopsys
PPTX
Dr.Efraim Aharoni, ESD Leader, TowerJazz
PPTX
Dr. John Bainbridge, Principal Application Architect, NetSpeed
PPTX
Xavier van Ruymbeke, App. Engineer, Arteris
PPTX
Asi Lifshitz, VP R&D, Vtool
PPTX
Zvika Rozenshein,General Manager, EngineeringIQ
PPTX
Lewis Chu,Marketing Director,GUC
PPSX
Tuvia Liran, Director of VLSI, Nano Retina
PPTX
Ronen Shtayer,Director of ASG Operations & PMO, NXP Semiconductor
PDF
Prof. Emanuel Cohen, Technion
PPTX
Prof. Ramez Daniel, Technion
PPTX
Rotem Ben-Hur,Graduate Student,Technio
PPTX
Misbah Ramadan, Graduate Student,Technion
PPTX
Roman Kaplan, Graduate Student,Technion
PPTX
Prof. Yosef Bernstein, Ariel-University
Prof. Zhihua Wang, Tsinghua University, Beijing, China
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Uri Weiser,Technion
Prof. Danny Raz, Director, Bell Labs Israel, Nokia
Marco Casale-Rossi, Product Mktg. Manager, Synopsys
Dr.Efraim Aharoni, ESD Leader, TowerJazz
Dr. John Bainbridge, Principal Application Architect, NetSpeed
Xavier van Ruymbeke, App. Engineer, Arteris
Asi Lifshitz, VP R&D, Vtool
Zvika Rozenshein,General Manager, EngineeringIQ
Lewis Chu,Marketing Director,GUC
Tuvia Liran, Director of VLSI, Nano Retina
Ronen Shtayer,Director of ASG Operations & PMO, NXP Semiconductor
Prof. Emanuel Cohen, Technion
Prof. Ramez Daniel, Technion
Rotem Ben-Hur,Graduate Student,Technio
Misbah Ramadan, Graduate Student,Technion
Roman Kaplan, Graduate Student,Technion
Prof. Yosef Bernstein, Ariel-University

Recently uploaded (20)

PPTX
Amazon (Business Studies) management studies
PDF
BsN 7th Sem Course GridNNNNNNNN CCN.pdf
PDF
Training And Development of Employee .pdf
DOCX
unit 1 COST ACCOUNTING AND COST SHEET
PDF
pdfcoffee.com-opt-b1plus-sb-answers.pdfvi
PDF
How to Get Business Funding for Small Business Fast
PDF
Nidhal Samdaie CV - International Business Consultant
PPTX
HR Introduction Slide (1).pptx on hr intro
PDF
Reconciliation AND MEMORANDUM RECONCILATION
PPTX
CkgxkgxydkydyldylydlydyldlyddolydyoyyU2.pptx
PDF
Outsourced Audit & Assurance in USA Why Globus Finanza is Your Trusted Choice
PDF
Chapter 5_Foreign Exchange Market in .pdf
PPTX
The Marketing Journey - Tracey Phillips - Marketing Matters 7-2025.pptx
PDF
WRN_Investor_Presentation_August 2025.pdf
PPTX
AI-assistance in Knowledge Collection and Curation supporting Safe and Sustai...
PDF
MSPs in 10 Words - Created by US MSP Network
PDF
Stem Cell Market Report | Trends, Growth & Forecast 2025-2034
PDF
kom-180-proposal-for-a-directive-amending-directive-2014-45-eu-and-directive-...
PDF
A Brief Introduction About Julia Allison
PDF
20250805_A. Stotz All Weather Strategy - Performance review July 2025.pdf
Amazon (Business Studies) management studies
BsN 7th Sem Course GridNNNNNNNN CCN.pdf
Training And Development of Employee .pdf
unit 1 COST ACCOUNTING AND COST SHEET
pdfcoffee.com-opt-b1plus-sb-answers.pdfvi
How to Get Business Funding for Small Business Fast
Nidhal Samdaie CV - International Business Consultant
HR Introduction Slide (1).pptx on hr intro
Reconciliation AND MEMORANDUM RECONCILATION
CkgxkgxydkydyldylydlydyldlyddolydyoyyU2.pptx
Outsourced Audit & Assurance in USA Why Globus Finanza is Your Trusted Choice
Chapter 5_Foreign Exchange Market in .pdf
The Marketing Journey - Tracey Phillips - Marketing Matters 7-2025.pptx
WRN_Investor_Presentation_August 2025.pdf
AI-assistance in Knowledge Collection and Curation supporting Safe and Sustai...
MSPs in 10 Words - Created by US MSP Network
Stem Cell Market Report | Trends, Growth & Forecast 2025-2034
kom-180-proposal-for-a-directive-amending-directive-2014-45-eu-and-directive-...
A Brief Introduction About Julia Allison
20250805_A. Stotz All Weather Strategy - Performance review July 2025.pdf

Kunal Varshney, VLSI Engineer, Open-Silicon

  • 1. May 9, 2016 1 HEAT (Hardware enabled Algorithmic tester) for 2.5D HBM Solution May 9, 2016 Kunal Varshney, Open-Silicon Ganesh Venkatkrishnan, Open-Silicon Pankaj Prajapati, Open-Silicon
  • 2. May 9, 2016 2 Agenda • High Bandwidth Memory (HBM) Basics • Chip Introduction & Motivation • Challenges • HEAT Features • Conclusion
  • 3. May 9, 2016 3 Understanding HBM • JEDEC standard • DRAM embedded in ASIC Package (Stacked Memory Dies) • Backed by AMD, SK-Hynix, Samsung, nVidia • 2.5D Interposer based ASIC implementation • Initial adoption expected in GPU & HPC markets, now followed by Networking • HBM Gen2 JEDEC specifications: – 8 Gb per DRAM die – 2 Gbps/pin (DDR) – 4/8 High stack for 4GB/8GB density – 256 GB/s total bandwidth 3
  • 4. May 9, 2016 4 Understanding HBM 4 From Memory Vendor Developed at Open-Silicon
  • 5. May 9, 2016 5 Chip Introduction & Motivation • HBM Test Chip • Technology node : 16nm FF from TSMC • 2.5D Interposer technology • HBM Memory : SK-Hynix • Key Motivation: On-chip Silicon Validation of in-house HBM capabilities • HBM Protocol Controller • HBM PHY • D2D IOs 5
  • 6. May 9, 2016 6 Chip Block Diagram 6 hbm_top.v HBM PHY HBM Controller IO PADS PLL HEAT Tap Controller TAPSEL_EN Jtag_2_csr(C) Reset De-assertion Reset De-assertion HBM D2D Channel 0 HBM D2D Channel 1 IEEE 1500 clk_reset_ctrl.v Clk_CNTRL/ DEBUG PLL Clock/Reset Signals Reset Chatter Controller CSR i/f UIF 0 UIF 1 UIF 2 UIF 3 Jtag_2_uif AUTO BISTAUTO BISTAUTO BISTAUTO BIST Padring.v GPIO Logic Die HBM reset to memory Soc reset HBM_clk HBM Memory DIE Package JTAG 5 5 14 7 DA 60 DFT Speed Test Module/ IO
  • 7. May 9, 2016 7 Challenges • At speed Functional Testing (1Ghz) – At speed Traffic Generation ( mimicking system traffic) – Single cycle data integrity check • Performance Measurement • Power aware Design • Minimal Package IO Count • Fall-back chip booting options • Functional Debug • User Interface Debug • Testing Logic die before Assembly 7
  • 8. May 9, 2016 8 HEAT Architecture 8
  • 9. May 9, 2016 9 Functional Testing • Traffic generation – Galois LFSRs used – Modes of operation: • BW: WWWW….. • BR: RRRRR…... • RAW: WRWRWR…. WWRRWWRR…. – Control on DATA/ADDR • Constant, INC/DEC by N, Random, W0, W1, CB • Scenarios: Memory sweep, Bank Switching, Max Latency, Max BW etc. • Data Integrity – Wr_data stored in 16 deep CAM. ID used as addr of CAM – Single cycle data comparator when rd response is received. 9
  • 10. May 9, 2016 10 Challenges • At speed Functional Testing (1Ghz) – At speed Traffic Generation ( mimicking system traffic) – Single cycle data integrity check • Performance Measurement • Power aware Design • Minimal Package IO Count • Fall-back chip booting options • Functional Debug • User Interface Debug • Testing Logic die before Assembly 10
  • 11. May 9, 2016 11 Performance Measurement • Latency: – Min latency with transaction details – Max latency with transaction details – Avg Latency • Bandwidth: – Total number of clks taken by a sequence. • Configurable Timeout Value and Timeout status. • Byte wise Granularity in LFSR to reduce Power consumption 11
  • 12. May 9, 2016 12 Challenges • At speed Functional Testing (1Ghz) – At speed Traffic Generation ( mimicking system traffic) – Single cycle data integrity check • Performance Measurement • Power aware Design • Minimal Package IO Count • Fall-back chip booting options • Functional Debug • User Interface Debug • Testing Logic die before Assembly 12
  • 13. May 9, 2016 13 Package IO Count • Test Chip : Limited Package IO availability • JTAG Selected as i/f for Programming the HEAT • Functional JTAG and DFT JTAG Muxed • Functional IO Muxing. – Same IO used as i/p during chip reset and o/p after chip reset – Inputs from board used for fall-back chip booting options. 13
  • 14. May 9, 2016 14 Challenges • At speed Functional Testing (1Ghz) – At speed Traffic Generation ( mimicking system traffic) – Single cycle data integrity check • Performance Measurement • Power aware Design • Minimal Package IO Count • Fall-back chip booting options • Functional Debug • User Interface Debug • Testing Logic die before Assembly 14
  • 15. May 9, 2016 15 Functional Debug • Number of UIF read/write errors received • Number of Data Mismatch errors with Expected data and actual data received on UIF • Info of first failed transaction • Directed Mode – Single or upto 4 transactions (read or write) – User can create it’s own transaction • Bitwise Data masking to pin-point single bit errors in data stream • Option of stop/continue on error. 15
  • 16. May 9, 2016 16 Challenges • At speed Functional Testing (1Ghz) – At speed Traffic Generation ( mimicking system traffic) – Single cycle data integrity check • Performance Measurement • Power aware Design • Minimal Package IO Count • Fall-back chip booting options • Functional Debug • User Interface Debug • Testing Logic die before Assembly 16
  • 17. May 9, 2016 17 User Interface Debug • Bus Monitor design – Monitors HEAT engine output and UIF Bus – Helpful in Debugging on-chip failures • Multiple mode of operations – Bus Capture – Transaction recorder – Error recorder 17
  • 18. May 9, 2016 18 Challenges • At speed Functional Testing (1Ghz) – At speed Traffic Generation ( mimicking system traffic) – Single cycle data integrity check • Performance Measurement • Power aware Design • Minimal Package IO Count • Fall-back chip booting options • Functional Debug • User Interface Debug • Testing Logic die before Assembly 18
  • 19. May 9, 2016 19 Testing Logic Die before Assembly • Loopback Handles for testing without memory – Point of Loopback : » Core Side Loopback (before D2D IO) » IO loopback (Loopback using D2D IO) – Loopback Transaction generation: » UIF level Loopback » DFI-i (input of PHY) level Address loopback » DFI-i (input of PHY) level Data loopback 19
  • 20. May 9, 2016 20 Conclusion • At speed Functional Testing (1Ghz)  – At speed Traffic Generation ( mimicking system traffic) – Single cycle data integrity check • Performance Measurement  • Power aware Design  • Minimal Package IO Count  • Fall-back chip booting options  • Functional Debug  • User Interface Debug  • Testing Logic die before Assembly  20