TY - GEN
T1 - A trace cache with DVFS techniques for a low power microprocessor
AU - Jang, Hyung Beom
AU - Choi, Lynn
AU - Chung, Sung Woo
PY - 2008
Y1 - 2008
N2 - The trace cache is a solution to achieving high instruction fetches bandwidth by buffering and reusing dynamic instruction traces. This work presents a new trace cache implementation that includes the DVFS (Dynamic Voltage and Frequency Scaling) techniques for energy efficiency. The focus of this paper is to compare the trace cache with DVFS techniques to the conventional trace cache organization where any DVFS technique is not applied. Instead of storing the basic blocks in the unified trace cache space, the first block of each trace is stored in the specific space of the trace cache and the other basic blocks are stored in the rest of the trace cache space. The first basic block area is not voltage scaled because the first basic block should be supplied to processor's front-end as soon as possible. On the other hand, other basic block area is voltage-scaled down in order to reduce the power consumption. Transistor switching speed of other basic block area is slower than that of the first basic block area due to the lowered supply voltage. Our experiments show that when we adopted the DVFS techniques to the conventional trace cache, 12.8% fetch engine energy consumption is reduced, on average. Applying different supply voltages to each different region of the trace cache, we can reduce the dynamic power consumption. However, we can know that the region which is supplied with lowered voltage inevitably deteriorates the performance of the trace cache by 5.7%.
AB - The trace cache is a solution to achieving high instruction fetches bandwidth by buffering and reusing dynamic instruction traces. This work presents a new trace cache implementation that includes the DVFS (Dynamic Voltage and Frequency Scaling) techniques for energy efficiency. The focus of this paper is to compare the trace cache with DVFS techniques to the conventional trace cache organization where any DVFS technique is not applied. Instead of storing the basic blocks in the unified trace cache space, the first block of each trace is stored in the specific space of the trace cache and the other basic blocks are stored in the rest of the trace cache space. The first basic block area is not voltage scaled because the first basic block should be supplied to processor's front-end as soon as possible. On the other hand, other basic block area is voltage-scaled down in order to reduce the power consumption. Transistor switching speed of other basic block area is slower than that of the first basic block area due to the lowered supply voltage. Our experiments show that when we adopted the DVFS techniques to the conventional trace cache, 12.8% fetch engine energy consumption is reduced, on average. Applying different supply voltages to each different region of the trace cache, we can reduce the dynamic power consumption. However, we can know that the region which is supplied with lowered voltage inevitably deteriorates the performance of the trace cache by 5.7%.
UR - http://www.scopus.com/inward/record.url?scp=57849096210&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=57849096210&partnerID=8YFLogxK
U2 - 10.1109/ICCIT.2008.148
DO - 10.1109/ICCIT.2008.148
M3 - Conference contribution
AN - SCOPUS:57849096210
SN - 9780769534077
T3 - Proceedings - 3rd International Conference on Convergence and Hybrid Information Technology, ICCIT 2008
SP - 587
EP - 592
BT - Proceedings - 3rd International Conference on Convergence and Hybrid Information Technology, ICCIT 2008
T2 - 3rd International Conference on Convergence and Hybrid Information Technology, ICCIT 2008
Y2 - 11 November 2008 through 13 November 2008
ER -