TY - GEN
T1 - Design of a Background Extraction Platform with Semantic Elements
AU - Fernandez-Badilla, Justin
AU - Chavarria-Zamora, Luis
AU - Barboza-Artavia, Luis Alonso
AU - Jiménez, Jason Leitón
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - This work presents the design and implementation of a modular software platform for semantic segmentation and 3D reconstruction of road scenes using monocular vision and GPS data. The system focuses on extracting vertical traffic signs and background information from road images, generating georeferenced point clouds in. pcd format. The proposed method leverages a fine-tuned Dense Prediction Transformer (DPT) for semantic segmentation. The platform processes synchronized video and GPS data, applies inference on each frame, and fuses depth, segmantic data, and location information to build a spatially 3D representation of the environment. The fine-tuned model achieved high performance, with a maximum Intersection over Union (IoU) of 0.9447 and F1-score of 0.9709. The results demonstrate substantial improvements over the pre-trained baseline and help to give a field of expertise to the model.
AB - This work presents the design and implementation of a modular software platform for semantic segmentation and 3D reconstruction of road scenes using monocular vision and GPS data. The system focuses on extracting vertical traffic signs and background information from road images, generating georeferenced point clouds in. pcd format. The proposed method leverages a fine-tuned Dense Prediction Transformer (DPT) for semantic segmentation. The platform processes synchronized video and GPS data, applies inference on each frame, and fuses depth, segmantic data, and location information to build a spatially 3D representation of the environment. The fine-tuned model achieved high performance, with a maximum Intersection over Union (IoU) of 0.9447 and F1-score of 0.9709. The results demonstrate substantial improvements over the pre-trained baseline and help to give a field of expertise to the model.
KW - 3D point cloud
KW - DPT
KW - Semantic segmentation
KW - Vertical traffic signs
KW - Vision transformers
UR - https://www.scopus.com/pages/publications/105038790096
U2 - 10.1109/BIP68491.2025.11489147
DO - 10.1109/BIP68491.2025.11489147
M3 - Contribución a la conferencia
AN - SCOPUS:105038790096
T3 - 2025 IEEE 7th International Conference on BioInspired Processing, BIP 2025
BT - 2025 IEEE 7th International Conference on BioInspired Processing, BIP 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th IEEE International Conference on BioInspired Processing, BIP 2025
Y2 - 3 December 2025 through 5 December 2025
ER -