Resumen
Deep learning methods rely on large labeled datasets. However, in real-world applications such as medical imaging, only scarcely labeled datasets might be available. Semi-supervised deep learning leverages both labeled and unlabeled datasets to improve the model's accuracy. However, semi-supervised methodologies frequently assume similar distributions of the labeled and unlabeled dataset. In real-world settings, this assumption might be violated. This gives rise to the need of enforcing data quality attributes such as the consistency between labeled and unlabeled data sources. In this work, we propose the usage of a Mahalanobis-based distance in the embedding space of the deep learning model to pre-assess labeled to unlabeled dataset affinity in a semi-supervised setting. We test how this distance can highly correlate with the accuracy of a popular semi-supervised method known as MixMatch. Moreover, the proposed method is considerably more efficient than previous work in the field.
Idioma original | Inglés |
---|---|
Título de la publicación alojada | 2024 IEEE 42nd Central America and Panama Convention, CONCAPAN 2024 |
Editorial | Institute of Electrical and Electronics Engineers Inc. |
Edición | 2024 |
ISBN (versión digital) | 9798350366723 |
DOI | |
Estado | Publicada - 2024 |
Evento | 42nd IEEE Central America and Panama Convention, CONCAPAN 2024 - San Jose, Costa Rica Duración: 27 nov 2024 → 29 nov 2024 |
Conferencia
Conferencia | 42nd IEEE Central America and Panama Convention, CONCAPAN 2024 |
---|---|
País/Territorio | Costa Rica |
Ciudad | San Jose |
Período | 27/11/24 → 29/11/24 |