Hardware/Software Approaches for Reducing the Process Variation Impact on Instruction Fetches

KADAYIF, İSMAİL; TURKCAN, Mahir; KIZILTEPE, Seher; Ozturk, Ozcan

doi:10.1145/2489778

Hardware/Software Approaches for Reducing the Process Variation Impact on Instruction Fetches

Atıf İçin Kopyala

KADAYIF İ., TURKCAN M., KIZILTEPE S., Ozturk O.

ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, cilt.18, sa.4, 2013 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 18 Sayı: 4
Basım Tarihi: 2013
Doi Numarası: 10.1145/2489778
Dergi Adı: ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Çanakkale Onsekiz Mart Üniversitesi Adresli: Evet

Özet

As technology moves towards finer process geometries, it is becoming extremely difficult to control critical physical parameters such as channel length, gate oxide thickness, and dopant ion concentration. Variations in these parameters lead to dramatic variations in access latencies in Static Random Access Memory (SRAM) devices. This means that different lines of the same cache may have different access latencies. A simple solution to this problem is to adopt the worst-case latency paradigm. While this egalitarian cache management is simple, it may introduce significant performance overhead during instruction fetches when both address translation (instruction Translation Lookaside Buffer (TLB) access) and instruction cache access take place, making this solution infeasible for future high-performance processors. In this study, we first propose some hardware and software enhancements and then, based on those, investigate several techniques to mitigate the effect of process variation on the instruction fetch pipeline stage in modern processors. For address translation, we study an approach that performs the virtual-to-physical page translation once, then stores it in a special register, reusing it as long as the execution remains on the same instruction page. To handle varying access latencies across different instruction cache lines, we annotate the cache access latency of instructions within themselves to give the circuitry a hint about how long to wait for the next instruction to become available.