Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

A Quantitative Evaluation of Approximate Softmax Functions for Deep Neural Networks

  • Anthony Leiva-Valverde
  • , Fabricio Elizondo-Fernández
  • , Luis G. León-Vega
  • , Cristina Meinhardt
  • , Jorge Castro-Godínez
  • Costa Rica Institute of Technology
  • University of Trieste
  • Universidade Federal de Santa Catarina

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

Resumen

The softmax function is a widely used activation function in the output layers of neural networks, responsible for converting raw scores into class probabilities while introducing essential non-linearity. Implementing softmax efficiently poses challenges on low-end FPGAs due to limited hardware resources and the computational complexity of exponential and division operations. This work evaluates approximate computing techniques for softmax acceleration using Taylor series and interpolation methods using Look-Up Tables (LUTs). These approximations aim to reduce execution time and resource consumption while maintaining acceptable levels of numerical precision. Our findings show that quadratic interpolation with LUTs yields the lowest numerical error. In contrast, Taylor-based approximations offer significantly better performance in terms of execution time and resource efficiency due to their computational simplicity. When applied to real-world deep learning models such as LeNet 5 and MobileNet V2, the first- and second-order Taylor approximations provided substantial trade-offs between accuracy and resource savings, achieving up to 0.2% accuracy degradation and 14% resource reduction compared to exact implementations. These results highlight the effectiveness of approximate softmax designs on resource-constrained FPGAs and lay the groundwork for their integration into larger models, including large language models (LLMs).

Idioma originalInglés
Título de la publicación alojada2026 IEEE 17th Latin American Symposium on Circuits and Systems, LASCAS 2026 - Proceedings
EditorialInstitute of Electrical and Electronics Engineers Inc.
ISBN (versión digital)9798331570972
DOI
EstadoPublicada - 2026
Evento17th Latin American Symposium on Circuits and Systems, LASCAS 2026 - Arequipa, Perú
Duración: 24 feb 202627 feb 2026

Serie de la publicación

Nombre2026 IEEE 17th Latin American Symposium on Circuits and Systems, LASCAS 2026 - Proceedings

Conferencia

Conferencia17th Latin American Symposium on Circuits and Systems, LASCAS 2026
País/TerritorioPerú
CiudadArequipa
Período24/02/2627/02/26

ODS de las Naciones Unidas

Este resultado contribuye a los siguientes Objetivos de Desarrollo Sostenible

  1. ODS 8: Trabajo decente y crecimiento económico
    ODS 8: Trabajo decente y crecimiento económico
  2. ODS 12: Producción y consumo responsables
    ODS 12: Producción y consumo responsables

Huella

Profundice en los temas de investigación de 'A Quantitative Evaluation of Approximate Softmax Functions for Deep Neural Networks'. En conjunto forman una huella única.

Citar esto