TY - JOUR
T1 - Estimating Power, Performance, and Area for On-Sensor Deployment of AR/VR Workloads Using an Analytical Framework
AU - Sun, Xiaoyu
AU - Peng, Xiaochen
AU - Zhang, Sai Qian
AU - Gomez, Jorge
AU - Khwa, Win San
AU - Sarwar, Syed Shakib
AU - Li, Ziyun
AU - Cao, Weidong
AU - Wang, Zhao
AU - Liu, Chiao
AU - Chang, Meng Fan
AU - De Salvo, Barbara
AU - Akarvardar, Kerem
AU - Wong, H. S.Philip
N1 - Publisher Copyright:
© 2024 Copyright held by the owner/author(s).
PY - 2024/9/18
Y1 - 2024/9/18
N2 - Augmented Reality and Virtual Reality have emerged as the next frontier of intelligent image sensors and computer systems. In these systems, 3D die stacking stands out as a compelling solution, enabling in situ processing capability of the sensory data for tasks such as image classification and object detection at low power, low latency, and a small form factor. These intelligent 3D CMOS Image Sensor (CIS) systems present a wide design space, encompassing multiple domains (e.g., computer vision algorithms, circuit design, system architecture, and semiconductor technology, including 3D stacking) that have not been explored in-depth so far. This article aims to fill this gap. We first present an analytical evaluation framework, STAR-3DSim, dedicated to rapid pre-RTL evaluation of 3D-CIS systems capturing the entire stack from the pixel layer to the on-sensor processor layer. With STAR-3DSim, we then propose several knobs for PPA (power, performance, area) improvement of the Deep Neural Network (DNN) accelerator that can provide up to 53%, 41%, and 63% reduction in energy, latency, and area, respectively, across a broad set of relevant AR/VR workloads. Last, we present full-system evaluation results by taking image sensing, cross-tier data transfer, and off-sensor communication into consideration.
AB - Augmented Reality and Virtual Reality have emerged as the next frontier of intelligent image sensors and computer systems. In these systems, 3D die stacking stands out as a compelling solution, enabling in situ processing capability of the sensory data for tasks such as image classification and object detection at low power, low latency, and a small form factor. These intelligent 3D CMOS Image Sensor (CIS) systems present a wide design space, encompassing multiple domains (e.g., computer vision algorithms, circuit design, system architecture, and semiconductor technology, including 3D stacking) that have not been explored in-depth so far. This article aims to fill this gap. We first present an analytical evaluation framework, STAR-3DSim, dedicated to rapid pre-RTL evaluation of 3D-CIS systems capturing the entire stack from the pixel layer to the on-sensor processor layer. With STAR-3DSim, we then propose several knobs for PPA (power, performance, area) improvement of the Deep Neural Network (DNN) accelerator that can provide up to 53%, 41%, and 63% reduction in energy, latency, and area, respectively, across a broad set of relevant AR/VR workloads. Last, we present full-system evaluation results by taking image sensing, cross-tier data transfer, and off-sensor communication into consideration.
KW - 3D CMOS image sensor
KW - Augmented reality
KW - DNN accelerator
KW - virtual reality
UR - http://www.scopus.com/inward/record.url?scp=85208033689&partnerID=8YFLogxK
U2 - 10.1145/3670404
DO - 10.1145/3670404
M3 - Article
AN - SCOPUS:85208033689
SN - 1084-4309
VL - 29
JO - ACM Transactions on Design Automation of Electronic Systems
JF - ACM Transactions on Design Automation of Electronic Systems
IS - 6
M1 - 93
ER -