tailieunhanh - Báo cáo hóa học: " Research Article Hardware Architecture of Reinforcement Learning Scheme for Dynamic Power Management in Embedded Systems"

Tuyển tập báo cáo các nghiên cứu khoa học quốc tế ngành hóa học dành cho các bạn yêu hóa học tham khảo đề tài: Research Article Hardware Architecture of Reinforcement Learning Scheme for Dynamic Power Management in Embedded Systems | Hindawi Publishing Corporation EURASIP Journal on Embedded Systems Volume 2007 Article ID 65478 6 pages doi 2007 65478 Research Article Hardware Architecture of Reinforcement Learning Scheme for Dynamic Power Management in Embedded Systems Viswanathan Lakshmi Prabha1 and Elwin Chandra Monie2 1 Department of Electronics and Communication Engineering Government College of Technology Coimbatore 641-013 Tamil Nadu India 2 Thanthai Periyar Government Institute of Technology TPGIT Vellore 632002 Tamil Nadu India Received 6 July 2006 Revised 7 November 2006 Accepted 28 May 2007 Recommended by Rajesh K. Gupta Dynamic power management DPM is a technique to reduce power consumption of electronic systems by selectively shutting down idle components. In this paper a novel and nontrivial enhancement of conventional reinforcement learning RL is adopted to choose the optimal policy out of the existing DPM policies. A hardware architecture evolved from the VHDL model of Temporal Difference RL algorithm is proposed in this paper which can suggest the winner policy to be adopted for any given workload to achieve power savings. The effectiveness of this approach is also demonstrated by an event-driven simulator which is designed using JAVA for power-manageable embedded devices. The results show that RL applied to DPM can lead up to 28 power savings. Copyright 2007 V. L. Prabha and E. C. Monie. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use distribution and reproduction in any medium provided the original work is properly cited. 1. INTRODUCTION Dynamic power management DPM techniques aid energy efficient utilization of systems by selectively placing system components into low-power states when they are idle. A DPM system model consists of Service provider Service queue Service requestor and Power Manager. Power manager PM implements a control procedure or policy based on observations of the workload. It

TÀI LIỆU LIÊN QUAN