Skip to main navigation Skip to search Skip to main content

An Exploration of Accuracy Configurable Matrix Multiply-Addition Architectures using HLS

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

Low-power consumption and constraint resources limit the implementation of deep learning inference solutions at the edge. Besides, the approximate computing paradigm reports promising techniques for the design of DNN accelerators to deal with inherent limitations of the edge. This paper summarises the automatic generation of generic matrix multiplication-addition (GEMMA) processing elements (PEs), leveraging High-Level Synthesis and emphasising in adaptable matrix size, data bit-width, and data type for accuracy configuration, and their impact on the overall design resource consumption. For generated PEs efficiency evaluation, this work presents a novel Figure of merit that considers computing performance and resource utilisation regarding the FPGA platform underneath. Finally, we analyse the impact of different design configurations in the numerical errors introduced due to the output bit-width preservation regarding the input, and matrix size, data bit-width and type configuration.

Original languageEnglish
Title of host publicationProceedings of the 2022 IEEE Dallas Circuits and Systems Conference, DCAS 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665498852
DOIs
StatePublished - 2022
Event15th IEEE Dallas Circuits and Systems Conference, DCAS 2022 - Richardson, United States
Duration: 17 Jun 202219 Jun 2022

Publication series

NameProceedings of the 2022 IEEE Dallas Circuits and Systems Conference, DCAS 2022

Conference

Conference15th IEEE Dallas Circuits and Systems Conference, DCAS 2022
Country/TerritoryUnited States
CityRichardson
Period17/06/2219/06/22

Keywords

  • AI accelerators
  • approximate computing
  • design automation
  • High-Level Synthesis

Fingerprint

Dive into the research topics of 'An Exploration of Accuracy Configurable Matrix Multiply-Addition Architectures using HLS'. Together they form a unique fingerprint.

Cite this