Skip to main navigation Skip to search Skip to main content

Acceleration of Fully Connected Layers on FPGA using the Strassen Matrix Multiplication

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

Deep Learning is one of the most popular techniques of Machine Learning (ML) but also one of the most computationally intensive and energy-demanding task in High-Performance Computing (HPC), resulting in concerns about the sustainability of executing Artificial Intelligence massively. The challenge becomes harder when performing inference tasks at the Edge, where computational resources and energy are scarce. Field Programmable Gate Arrays (FPGAs) are hardware-level reconfigurable devices with greater benefits in computer power vs energy consumption compared to CPUs and GPUs in the wide spectrum of HPC. High-Level Synthesis has helped reduce the complexity of using FPGAs to accelerate algorithms using high-level languages like C++. This opens opportunities to research new hardware architectures for accelerating Deep Learning Inference (DLI) using FPGAs for energy-constraint applications. Most ML applications involve Deep Neural Networks (DNNs), where matrix multiplication is one of the most popular operations. This work covers the optimisation of matrix multiplication by proposing a generic, HLS-compliant Strassen Matrix Multiplication Processing Element (PE) capable of adapting its implementation under different numerical precisions and hardware approximations. We discuss the PE's numerical and resource consumption analysis under several configurations and describe how it behaves in an actual DNN model dedicated to anomaly detection, spotting promising results for FPGA-based DLI at the Edge, saving up to 12.5% of DSP cells compared to the standard multiplication units on an XC7A50T (low-end FPGA) with negligible accuracy loss.

Original languageEnglish
Title of host publication5th IEEE International Conference on BioInspired Processing, BIP 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350330052
DOIs
StatePublished - 2023
Event5th IEEE International Conference on BioInspired Processing, BIP 2023 - San Carlos, Alajuela, Costa Rica
Duration: 28 Nov 202330 Nov 2023

Publication series

Name5th IEEE International Conference on BioInspired Processing, BIP 2023

Conference

Conference5th IEEE International Conference on BioInspired Processing, BIP 2023
Country/TerritoryCosta Rica
CitySan Carlos, Alajuela
Period28/11/2330/11/23

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • deep learning
  • edge computing
  • field programmable gate arrays
  • high-level synthesis
  • low-power electronics
  • Strassen multiplication

Fingerprint

Dive into the research topics of 'Acceleration of Fully Connected Layers on FPGA using the Strassen Matrix Multiplication'. Together they form a unique fingerprint.

Cite this