Uncertainty Estimation in Large Language Models to Support Biodiversity Conservation

Research output: Chapter in book/report/conference proceeding; Conference contribution; Peer-reviewed

6 Citations (Scopus)

Abstract

Large Language Models (LLMs) provide significant value in question answering (QA) scenarios and have practical applications in complex decision-making contexts, such as biodiversity conservation. However, despite substantial performance improvements, they may still produce inaccurate outcomes. Consequently, incorporating uncertainty quantification alongside predictions is essential for mitigating the potential risks associated with their use. This study introduces an exploratory analysis of the application of Monte Carlo Dropout (MCD) and Expected Calibration Error (ECE) to assess the uncertainty of generative language models. To that end, we analyzed two publicly available language models (Falcon-7B and DistilGPT-2). Our findings suggest the viability of employing ECE as a metric to estimate uncertainty in generative LLMs. The findings from this research contribute to a broader project that aims to facilitate free and open access to standardized and integrated data and services about Costa Rica's biodiversity to support the development of science, education, and biodiversity conservation.
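For readers unfamiliar with ECE, the following is a minimal sketch of the standard equal-width-bin formulation, not the authors' implementation. The function name, the choice of 10 bins, and the input format (one confidence score and one correctness flag per answer) are assumptions made for illustration. How the confidences are obtained, for instance by averaging several stochastic forward passes with dropout left active, is outside this sketch.

```python
import numpy as np


def expected_calibration_error(confidences, correct, n_bins=10):
    """Equal-width-bin ECE (hypothetical helper, for illustration only).

    confidences: model confidence in [0, 1] for each answer.
    correct:     1 if the answer was right, 0 otherwise.
    """
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)

    # Bin edges 0.0, 0.1, ..., 1.0; bins are right-inclusive (lo, hi].
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    bin_ids = np.digitize(confidences, edges[1:-1], right=True)

    ece = 0.0
    for b in range(n_bins):
        mask = bin_ids == b
        if mask.any():
            # Gap between observed accuracy and mean confidence in this bin,
            # weighted by the fraction of samples that fall into the bin.
            gap = abs(correct[mask].mean() - confidences[mask].mean())
            ece += mask.mean() * gap
    return float(ece)
```

A lower value indicates that stated confidence tracks observed accuracy more closely; a model that reports 0.9 confidence on answers that are right only 75% of the time would contribute a gap of 0.15 for that bin.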

Original language: English
Title of host publication: Industry Track
Editors: Yi Yang, Aida Davani, Avi Sil, Anoop Kumar
Publisher: Association for Computational Linguistics (ACL)
Pages: 368-378
Number of pages: 11
ISBN (electronic): 9798891761209
DOI
Status: Published - 30 Jun 2024
Event: 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024 - Hybrid, Mexico City, Mexico
Duration: 18 Jun 2024 to 18 Jun 2024

Publication series

Name: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024
Volume: 6

Conference

Conference: 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2024
Country/Territory: Mexico
City: Hybrid, Mexico City
Period: 18/06/24 to 18/06/24
