Thermally Constrained Codesign of Heterogeneous 3-D Integration of Compute-in-Memory, Digital ML Accelerator, and RISC-V Cores for Mixed ML and Non-ML Workloads

Yuan Chun Luo*, Anni Lu, Janak Sharda, Moritz Scherer, Jorge Tomas Gomez, Syed Shakib Sarwar, Ziyun Li, Reid Frederick Pinkham, Barbara De Salvo, Shimeng Yu

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Heterogeneous 3-D (H3D) integration not only reduces the chip form factor and fabrication cost but also allows the merging of diverse compute paradigms that suit different applications. This is especially attractive when modern algorithms, such as the augmented reality/virtual reality (AR/VR) workloads, consist of mixed machine learning (ML) and non-ML workloads. To date, codesign that considers the thermal, latency, and power constraints of H3D hardware is largely unexplored. In this work, a thermally aware framework for H3D hardware design is developed to evaluate the thermal, latency, and power trade-offs for a heterogeneous system with compute-in-memory (CIM), digital ML cores, and RISC-V cores. The framework solves for runtime tunable operating points described as the optimal speedup factor, the number of activated RISC-V cores, the cooling coefficient, and the activity rate based on user-defined criteria, achieving up to 135 TOPS and 215 TOPS/W under 74 °C for the AR/VR workloads.

Original languageEnglish
Pages (from-to)1718-1725
Number of pages8
JournalIEEE Transactions on Very Large Scale Integration (VLSI) Systems
Volume32
Issue number9
DOIs
StatePublished - 2024
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 1993-2012 IEEE.

Keywords

  • Augmented reality/virtual reality
  • RISC-V
  • compute-in-memory (CIM)
  • heterogenous 3-D integration
  • machine learning (ML)
  • thermally constrained framework

Fingerprint

Dive into the research topics of 'Thermally Constrained Codesign of Heterogeneous 3-D Integration of Compute-in-Memory, Digital ML Accelerator, and RISC-V Cores for Mixed ML and Non-ML Workloads'. Together they form a unique fingerprint.

Cite this