This document analyzes the efficiency of hardware resources in recent GPU architectures like Pascal compared to older architectures like Fermi. It simulates 9 benchmarks on a Fermi and Pascal-based GPU configuration using a cycle-accurate simulator. The results show that Pascal improves performance by 273% on average over Fermi. It also analyzes the impact of computing resources versus memory resources, varying the number of warp schedulers, and measuring barrier synchronization overhead. The goal is to understand how hardware upgrades in newer architectures translate to performance gains and guide future GPU development.