Monocular Image-Based 3D Object Detection for Industrial Pick-and-Place Tasks
Citations

SCOPUS

0

초록

This paper proposes a new pipeline that estimates depth information from a single RGB image and utilizes it for object detection. By fusing the pseudo depth map generated through the MiDaS-based Monocular Depth Estimation with the RGB image, the performance of the existing RGB-based detection model is improved. The proposed method is practical because spatial structure information can be utilized without an actual depth sensor, and shows robust performance, especially in occlusion between objects and complex background environments. As a result of experiments on the KITTI and SUN RGB-D datasets, the proposed method achieved performance improvement of 4.2% and 3.8% on mAP compared to the existing RGB-based method, respectively.

키워드

Computer VisionDeep LearningMonocular Depth EstimationObject DetectionRGB-D Fusion
제목
Monocular Image-Based 3D Object Detection for Industrial Pick-and-Place Tasks
저자
Jun, JihwanPark, ByungjunJeong, Jongpil
DOI
10.37394/23202.2025.24.65
발행일
2025
유형
Article
저널명
WSEAS Transactions on Systems
24
페이지
779 ~ 786