A 2-Stage Model for Vehicle Class and Orientation Detection with Photo-Realistic Image Generation
A 2-Stage Model for Vehicle Class and Orientation Detection with Photo-Realistic Image Generation
We aim to detect the class and orientation of a vehicle by training a model with synthetic data. However, the distribution of the classes in the training data is imbalanced, and the model trained on the synthetic image is difficult to predict in real-world images. We propose a two-stage detection model with photo-realistic image generation to tackle this issue. Our model mainly takes four steps to detect the class and orientation of the vehicle. (1) It builds a table containing the image, class, and location information of objects in the image, (2) transforms the synthetic images into real-world images style, and merges them into the meta table. (3) Classify vehicle class and orientation using images from the meta-table. (4) Finally, the vehicle class and orientation are detected by combining the pre-extracted location information and the predicted classes. We achieved 4th place in IEEE BigData Challenge 2022 Vehicle class and Orientation Detection (VOD) with our approach.
Youngmin Kim、Donghwa Kang、Hyeongboo Baek
10.1109/BigData55660.2022.10020472
计算技术、计算机技术
Youngmin Kim,Donghwa Kang,Hyeongboo Baek.A 2-Stage Model for Vehicle Class and Orientation Detection with Photo-Realistic Image Generation[EB/OL].(2025-06-02)[2025-07-02].https://arxiv.org/abs/2506.01338.点此复制
评论