Design and Performance Evaluation of a Hardware-Accelerated VLSI Architecture for Deep Neural Network Inference
Keywords:
Deep Neural Network Inference, Hardware Acceleration, VLSI Architecture, Processing Element Array, Energy-Efficient Computing, Edge AI Systems
Abstract
Deep neural network (DNN) inference has become a dominant workload in edge and embedded computing systems, demanding high computational throughput under tight energy and area budgets. Conventional CPU- and GPU-based implementations suffer from pronounced memory bandwidth bottlenecks and poor power efficiency when deployed in resource-constrained systems. This paper introduces a hardware-accelerated VLSI architecture that enables scalable, low-latency, and energy-efficient DNN inference. The proposed design combines an array of parallel multiply-accumulate (MAC) processing elements (PEs) with pipelined computation and structured on-chip memory reuse to minimize off-chip data transfer. A computation and throughput model is developed to analytically characterize the architecture's scalability and performance limits. The design is synthesized and evaluated on representative convolutional neural network workloads, achieving significant improvements in latency and energy consumption relative to comparable baseline architectures. Experimental results demonstrate near-linear throughput scaling as the number of PEs increases, together with favorable area-performance trade-offs. The proposed architecture offers a practical and effective solution for real-time deep learning inference in edge and embedded VLSI systems.
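The analytical throughput model mentioned in the abstract can be illustrated with a simple roofline-style sketch: attainable throughput is the minimum of the compute roof set by the PE array and the memory roof set by off-chip bandwidth and data reuse. This is only an illustrative approximation under assumed parameters (PE count, clock rate, DRAM bandwidth, arithmetic intensity), not the paper's actual model.

```python
def pe_array_throughput(num_pes, clock_hz, macs_per_pe_per_cycle,
                        dram_bandwidth_bytes_s, macs_per_byte):
    """Roofline-style estimate of attainable throughput in MACs/s.

    compute_roof: peak MAC rate of the PE array.
    memory_roof:  MAC rate sustainable by off-chip bandwidth, given the
                  arithmetic intensity (MACs performed per byte fetched,
                  which rises with on-chip data reuse).
    All parameter values used by callers are hypothetical assumptions.
    """
    compute_roof = num_pes * macs_per_pe_per_cycle * clock_hz
    memory_roof = dram_bandwidth_bytes_s * macs_per_byte
    return min(compute_roof, memory_roof)


# With assumed parameters (1 GHz clock, 1 MAC/PE/cycle, 25.6 GB/s DRAM,
# 10 MACs/byte of reuse), throughput scales linearly with PE count until
# the memory roof is reached, matching the near-linear scaling regime
# described in the abstract.
small = pe_array_throughput(64, 1e9, 1, 25.6e9, 10)    # compute-bound
large = pe_array_throughput(1024, 1e9, 1, 25.6e9, 10)  # memory-bound
```

Doubling the PE count doubles throughput only while the design remains compute-bound; beyond the crossover, further PEs are wasted unless on-chip reuse (arithmetic intensity) also increases, which is why the architecture emphasizes minimizing off-chip data transfer.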
