A Deep Dive into Energy Efficient Multi-core Processor
Speaker: 呂宗螢 (ZongYing Lyu)
Adviser: Prof. 梁文耀
Date: 2006/11/1
Embedded and Parallel Systems Laboratory
Hardware-Software Cooperation for Best Performance-Power Optimization
• ACPI
• Schedule an appropriate idle state: C1, C2, ..., Cn
• The OS scheduler decides the designated idle state
• Execute the BIOS code sequence
• Enter the idle state and periodically re-evaluate it
• Return quickly to the active state when a break event occurs
• Use idle states effectively
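
The selection step above can be sketched as follows. This is a minimal illustration, not the ACPI or OS algorithm itself: the `IdleState` type, the `pick_idle_state` function, and every latency and power figure are hypothetical values chosen for the example. It picks the deepest state whose exit latency fits both the predicted idle period and a latency limit.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IdleState:
    name: str
    exit_latency_us: int  # hypothetical wake-up latency
    power_mw: int         # hypothetical power drawn while in this state


# Ordered from shallowest to deepest. All numbers are illustrative only.
STATES = [
    IdleState("C1", 1, 1000),
    IdleState("C2", 10, 500),
    IdleState("C3", 50, 250),
    IdleState("C4", 150, 100),
]


def pick_idle_state(expected_idle_us, latency_limit_us, states=STATES):
    """Return the deepest state whose exit latency fits both the predicted
    idle period and the tolerated wake-up latency. Falls back to the
    shallowest state when nothing fits."""
    budget = min(expected_idle_us, latency_limit_us)
    chosen = states[0]
    for state in states:
        if state.exit_latency_us <= budget:
            chosen = state  # keep going: later entries are deeper
    return chosen


if __name__ == "__main__":
    print(pick_idle_state(expected_idle_us=100, latency_limit_us=1000).name)
```

Re-evaluating periodically, as the slide describes, would amount to calling `pick_idle_state` again with an updated idle-time prediction.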
Power States
• C0: core active state
• C1: auto halt state
  • Instruction execution is halted
  • Core clock is off
  • Fast transition: the core enters and leaves the state almost immediately
• C2: stop clock state
  • Core and bus clocks are off
  • The front-side bus can be placed in a lower power state
  • The chipset can initiate power-saving measures
• C3: deep sleep state
  • Clock generator is off
  • The processor disables its internal phase-locked loops (PLLs)
• C4: deeper sleep state
  • VCC is reduced
• DC4: further deeper sleep state
  • VCC is reduced further
  • DC4 eliminates most static power, but entering DC4 and returning to C0 takes a long time
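
The DC4 trade-off can be made concrete with a break-even estimate. This is a simplified model with assumed numbers, not measured data: it treats the cost of entering and leaving the deeper state as a fixed energy overhead, and `break_even_us` and `worth_entering` are hypothetical helper names.

```python
def break_even_us(transition_energy_uj, shallow_power_mw, deep_power_mw):
    """Idle time (in microseconds) beyond which the deeper state saves energy.

    Staying shallow for T costs  shallow_power * T.
    Going deep for T costs       transition_energy + deep_power * T.
    Setting the two equal gives  T = transition_energy / (shallow - deep).
    """
    saved_mw = shallow_power_mw - deep_power_mw
    if saved_mw <= 0:
        raise ValueError("the deeper state must draw less power")
    # microjoules / milliwatts = milliseconds, so scale by 1000 for microseconds
    return transition_energy_uj / saved_mw * 1000.0


def worth_entering(expected_idle_us, transition_energy_uj,
                   shallow_power_mw, deep_power_mw):
    """True when the predicted idle period exceeds the break-even time."""
    return expected_idle_us > break_even_us(
        transition_energy_uj, shallow_power_mw, deep_power_mw)


if __name__ == "__main__":
    # Illustrative values only: 30 uJ overhead, 100 mW shallow, 40 mW deep.
    print(break_even_us(30, 100, 40))
```

Under this model a short idle period favors a shallow state even though the deep state draws less power, which is why the long DC4 entry and exit time matters.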
Multi-core Power States
• Multi-core power state
  • Each core has its own power state
  • Resources shared among the cores have power states that apply to all cores
    • Maximum performance benefit
    • Minimum power consumption
• Multi-core power control partition
  • Typical individual core units: execution units, core scheduler, L1 cache, core clocking control
  • Typical shared resources: APIC, global power control, L2 cache, bus
Intel Core™ 2 Duo Processor
• One core can reduce its frequency while the other core still runs at full speed
• Three domains
  • Each core has its own power management domain
  • The shared resources form a power management domain
• All domains share a single power plane and a single core PLL (phase-locked loop)
• The coordination mechanism serves as a transparent layer between the individually controlled cores and the shared resources on the die and on the platform
  • Determines the required CPU C-state
    • Based on both cores' individual requests
  • Controls the state of the shared resources
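
One way to picture the coordination step is the sketch below. It assumes that the shared resources may go no deeper than the shallowest state any core requests; the slide only says the CPU C-state is based on both cores' requests, so this rule, the `CState` enum, and `resolve_package_state` are assumptions made for illustration.

```python
from enum import IntEnum


class CState(IntEnum):
    """Core idle states, ordered so that a larger value means deeper sleep."""
    C0 = 0
    C1 = 1
    C2 = 2
    C3 = 3
    C4 = 4


def resolve_package_state(core_requests):
    """Combine per-core requests into one state for the shared resources.

    Assumed rule: the shared domain follows the shallowest request, so an
    active core never loses access to the shared cache or bus.
    """
    if not core_requests:
        raise ValueError("at least one core request is required")
    return min(core_requests)


if __name__ == "__main__":
    # Core 0 asks for C4 while core 1 only asks for C1.
    print(resolve_package_state([CState.C4, CState.C1]).name)
```

Each core still enters the state it requested within its own domain; only the shared domain is bound by the combined result.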
Smart Cache Optimization
• Shrink the Smart Cache
  • Fewer accesses than the threshold
  • Multi-core conditions permit it
• Stop shrinking the cache
  • The other core is active
  • The multi-core shrinking condition is not met
  • An interrupt is pending
• Expand the cache
  • Access frequency exceeds the threshold
  • The other core's sleep state changes
  • The cores are trending toward the active state
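
The three groups of conditions above could be combined into a single decision function, sketched below. The priority order (expand first, then hold, then shrink), the reading of "multi-core conditions permit" as "the other core is not active", and all names are assumptions for illustration; the slides do not specify how the hardware arbitrates between the conditions.

```python
from enum import Enum


class CacheAction(Enum):
    SHRINK = "shrink"
    HOLD = "hold"      # stop shrinking, keep the current size
    EXPAND = "expand"


def decide_cache_action(access_rate, threshold, other_core_active,
                        interrupt_pending, other_core_state_changed=False):
    """Pick a cache resize action from the conditions listed on the slide.

    Assumed priority: expansion triggers win, then the stop conditions,
    and shrinking happens only when nothing else applies.
    """
    # Expand: demand is above the threshold or the other core's state changed.
    if access_rate > threshold or other_core_state_changed:
        return CacheAction.EXPAND
    # Stop shrinking: the other core is active or an interrupt is pending.
    if other_core_active or interrupt_pending:
        return CacheAction.HOLD
    # Shrink: fewer accesses than the threshold and nothing blocks it.
    if access_rate < threshold:
        return CacheAction.SHRINK
    return CacheAction.HOLD


if __name__ == "__main__":
    print(decide_cache_action(access_rate=10, threshold=100,
                              other_core_active=False,
                              interrupt_pending=False))
```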
