Thermally Constrained Codesign of Heterogeneous 3-D Integration of Compute-in-Memory, Digital ML Accelerator, and RISC-V Cores for Mixed ML and Non-ML Workloads

Yuan Chun Luo, Anni Lu, Janak Sharda, Moritz Scherer, Jorge Tomas Gomez, Syed Shakib Sarwar, Ziyun Li, Reid Frederick Pinkham, Barbara De Salvo, Shimeng Yu

Research output: Contribution to journalArticlepeer-review

Abstract

Heterogeneous 3-D (H3D) integration not only reduces the chip form factor and fabrication cost but also allows the merging of diverse compute paradigms that suit different applications. This is especially attractive when modern algorithms, such as the augmented reality/virtual reality (AR/VR) workloads, consist of mixed machine learning (ML) and non-ML workloads. To date, codesign that considers the thermal, latency, and power constraints of H3D hardware is largely unexplored. In this work, a thermally aware framework for H3D hardware design is developed to evaluate the thermal, latency, and power trade-offs for a heterogeneous system with compute-in-memory (CIM), digital ML cores, and RISC-V cores. The framework solves for runtime tunable operating points described as the optimal speedup factor, the number of activated RISC-V cores, the cooling coefficient, and the activity rate based on user-defined criteria, achieving up to 135 TOPS and 215 TOPS/W under 74 <inline-formula> <tex-math notation="LaTeX">$^{\circ}$</tex-math> </inline-formula>C for the AR/VR workloads.

Original languageEnglish
Pages (from-to)1-8
Number of pages8
JournalIEEE Transactions on Very Large Scale Integration (VLSI) Systems
DOIs
StateAccepted/In press - 2024
Externally publishedYes

Bibliographical note

Publisher Copyright:
IEEE

Keywords

  • Augmented reality/virtual reality
  • compute-in-memory (CIM)
  • heterogenous 3-D integration
  • machine learning (ML)
  • RISC-V
  • thermally constrained framework

Fingerprint

Dive into the research topics of 'Thermally Constrained Codesign of Heterogeneous 3-D Integration of Compute-in-Memory, Digital ML Accelerator, and RISC-V Cores for Mixed ML and Non-ML Workloads'. Together they form a unique fingerprint.

Cite this