Monocular camera-based 3D point cloud reconstruction and traffic sign detection using vision transformers and YOLOv8

Research output: Contribution to journalArticlepeer-review

Abstract

This study presents a novel method for extracting point clouds of traffic signs using a single monocular camera sensor. Traditional light detection and ranging (LiDAR) techniques, although highly accurate, are expensive, require integration with cameras for segmentation tasks, and increase overall system complexity. The proposed approach is significant as it enables the generation of accurately segmented point clouds without relying on a LiDAR sensor, which was not available to the research group. The solution is flexible, allowing substitution with equivalent algorithms for monocular depth estimation, image segmentation, camera calibration, and global positioning system (GPS) association. Furthermore, the integration of machine learning techniques is proposed for traffic sign classification.

Original languageEnglish
Pages (from-to)1473-1485
Number of pages13
JournalInternational Journal of Advanced Technology and Engineering Exploration
Volume12
Issue number131
DOIs
StatePublished - Oct 2025

Keywords

  • Depth estimation
  • Image segmentation
  • Machine learning
  • Monocular vision
  • Point cloud extraction
  • Traffic sign detection

Fingerprint

Dive into the research topics of 'Monocular camera-based 3D point cloud reconstruction and traffic sign detection using vision transformers and YOLOv8'. Together they form a unique fingerprint.

Cite this