TY - GEN
T1 - Squeeze Every Bit of Insight
T2 - 51st Latin American Computer Conference, CLEI 2025
AU - Fallas-Moya, Fabian
AU - Xie-Li, Danny
AU - Calderon-Ramirez, Saul
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Object detection (OD) typically demands large annotated datasets and substantial computational resources. To address these challenges, we propose a novel two-stage pipeline that integrates Visual Foundation Models (VFMs) for object proposal generation with few-shot learning models enhanced by Mahalanobis distance-based classification. Our approach improves upon traditional Euclidean-based methods by incorporating data covariance through support and context prototypes. We specifically focus on scenarios with only just a few annotated images, reflecting real-world limitations where large-scale labeling is not feasible. Validated on pineapple detection from drone imagery, our method outperforms state-of-The-Art (SOTA) few-shot models using minimal labeled data. Extensive experiments show that FastSAM, when combined with a Mahalanobis distance variant that applies singular value decomposition (SVD) and diagonal loading for regularization, achieves the highest mean average precision (mAP), offering a practical and effective tool for crop monitoring and management.
AB - Object detection (OD) typically demands large annotated datasets and substantial computational resources. To address these challenges, we propose a novel two-stage pipeline that integrates Visual Foundation Models (VFMs) for object proposal generation with few-shot learning models enhanced by Mahalanobis distance-based classification. Our approach improves upon traditional Euclidean-based methods by incorporating data covariance through support and context prototypes. We specifically focus on scenarios with only just a few annotated images, reflecting real-world limitations where large-scale labeling is not feasible. Validated on pineapple detection from drone imagery, our method outperforms state-of-The-Art (SOTA) few-shot models using minimal labeled data. Extensive experiments show that FastSAM, when combined with a Mahalanobis distance variant that applies singular value decomposition (SVD) and diagonal loading for regularization, achieves the highest mean average precision (mAP), offering a practical and effective tool for crop monitoring and management.
KW - agriculture
KW - few-shot
KW - mahalanobis
KW - object detection
UR - https://www.scopus.com/pages/publications/105036214026
U2 - 10.1109/CLEI67442.2025.11420556
DO - 10.1109/CLEI67442.2025.11420556
M3 - Contribución a la conferencia
AN - SCOPUS:105036214026
T3 - Proceedings - 2025 51st Latin American Computer Conference, CLEI 2025
BT - Proceedings - 2025 51st Latin American Computer Conference, CLEI 2025
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 27 October 2025 through 31 October 2025
ER -