Home /Research /Circuits and Architectures for In-Memory Computing-Based Machine Learning Accelerators
PERCEPTION

Circuits and Architectures for In-Memory Computing-Based Machine Learning Accelerators

Aayush Ankit, Indranil Chakraborty, Amogh Agrawal, Mustafa Ali, Kaushik Roy

Year
2020
Citations
41

Abstract

Machine learning applications, especially deep neural networks (DNNs) have seen ubiquitous use in computer vision, speech recognition, and robotics. However, the growing complexity of DNN models have necessitated efficient hardware implementations. The key compute primitives of DNNs are matrix vector multiplications, which lead to significant data movement between memory and processing units in today's von Neumann systems. A promising alternative would be colocating memory and processing elements, which can be further extended to performing computations inside the memory itself. We believe in-memory computing is a propitious candidate for future DNN accelerators, since it mitigates the memory wall bottleneck. In this article, we discuss various in-memory computing primitives in both CMOS and emerging nonvolatile memory (NVM) technologies. Subsequently, we describe how such primitives can be incorporated in standalone machine learning accelerator architectures. Finally, we analyze the challenges associated with designing such in-memory computing accelerators and explore future opportunities.

Keywords

Computer scienceIn-Memory ProcessingVon Neumann architectureComputer architectureBottleneckComputing with MemoryMemory mapMemory managementParallel computingArtificial intelligence

Related papers

Browse all PERCEPTION papers