A curated list of papers using UnrealCV.
This is a list of papers that use UnrealCV, an open-source project that helps researchers build virtual worlds with Unreal Engine 4. The papers are grouped by application, such as semantic understanding, 3D vision, and embodied vision.
The following labels are used to indicate the usage of UnrealCV in each paper:
- π Dataset: generate a synthetic dataset for training.
- π Diagnosis: control scene factors for model diagnosis.
- π Interaction: train/test agent(s) by interacting with the virtual worlds (e.g., RL).
If you find any papers that used UnrealCV but are not included in this list, please feel free to send a PR or open an issue.
When sending a PR, please place the new paper at the correct chronological position, using the following format:
- Usage Tags(πππ)
**Paper Title**
*Author(s)*
Conference/Journal Year. [[Paper](link)] [[code](link)] [[Website](link)]
- π ScaleNet: Guiding Object Proposal Generation in Supermarkets and Beyond. Siyuan Qiao, Wei Shen, Weichao Qiu, Chenxi Liu, Alan Yuille. ICCV 2017. [paper] [code]
- π Falling Things: A Synthetic Dataset for 3D Object Detection and Pose Estimation. Tremblay, Jonathan, Thang To, and Stan Birchfield. CVPR 2018 Workshop. [paper] [code]
- π A Framework for Self-Training Perceptual Agents in Simulated Photorealistic Environments. Patrick Mania, Michael Beetz. ICRA 2019. [paper] [code]
- π Semantic Part Detection via Matching: Learning to Generalize to Novel Viewpoints from Limited Training Data. Yutong Bai, Qing Liu, Lingxi Xie, Weichao Qiu, Yan Zheng, Alan Yuille. ICCV 2019. [paper]
- π AugPOD: Augmentation-oriented Probabilistic Object Detection. Chuan-Wei Wang*, Chin-An Cheng*, Ching-Ju Cheng*, Hou-Ning Hu, Hung-Kuo Chu, Min Sun. CVPR 2019 Workshop. [paper]
- π OmniSCV: An Omnidirectional Synthetic Image Generator for Computer Vision. Berenguel-Baeta, Bruno, Jesus Bermudez-Cameo, and Jose J. Guerrero. Sensors 2020. [paper]
- π UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World. Minghui Liao, Guan Pang, Jing Huang, Tal Hassner, Xiang Bai. CVPR 2020. [paper] [code]
- π Synthetic-to-Real Unsupervised Domain Adaptation for Scene Text Detection in the Wild. Weijia Wu, Ning Lu, Enze Xie, Yuxing Wang, Wenwen Yu, Cheng Yang, Hong Zhou. ACCV 2020. [paper] [code]
- π UnrealPerson: An Adaptive Pipeline for Costless Person Re-identification. Tianyu Zhang, Lingxi Xie, Longhui Wei, Zijie Zhuang, Yongfei Zhang, Bo Li, Qi Tian. CVPR 2021. [paper] [code]
- π Robust Person Re-Identification with Wireless Signals. Xi, Dong, Wengang Zhou, and Houqiang Li. ICME 2023. [paper]
- π Identity Preserve Transform: Understand What Activity Classification Models Have Learnt. Jialing Lyu, Weichao Qiu, Xinyue Wei, Yi Zhang, Alan Yuille, Zheng-Jun Zha. CVPR 2020 Workshop. [paper]
- π Active shooter detection and robust tracking utilizing supplemental synthetic data. Joshua R. Waite, Jiale Feng, Riley Tavassoli, Laura Harris, Sin Yong Tan, Subhadeep Chakraborty, Soumik Sarkar. arXiv 2023. [paper]
- π Unrealstereo: Controlling hazardous factors to analyze stereo vision. Yi Zhang, Weichao Qiu, Qi Chen, Xiaolin Hu, Alan Yuille. 3DV 2018. [paper] [code]
- π Coupled Real-Synthetic Domain Adaptation for Real-World Deep Depth Enhancement. Xiao Gu, Yao Guo, Fani Deligianni, Guang-Zhong Yang. IEEE Transactions on Image Processing (TIP) 2020. [paper]
- π π Enhancing optical-flow-based control by learning visual appearance cues for flying robots. G. C. H. E. de Croon, C. De Wagter, T. Seidl. Nature Machine Intelligence 2021. [paper]
- π SMD-Nets: Stereo Mixture Density Networks. Fabio Tosi, Yiyi Liao, Carolin Schmitt, Andreas Geiger. CVPR 2021. [paper] [code]
- π G2-MonoDepth: A General Framework of Generalized Depth Inference from Monocular RGB+X Data. Haotian Wang, Meng Yang, Nanning Zheng. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2023. [paper]
- π Learning Occluded Branch Depth Maps in Forest Environments Using RGB-D Images. Geckeler, C., Aucone, E., Schnider, Y., Simeon, A., von Bassewitz, J. P., Zhu, Y., & Mintchev, S. IEEE Robotics and Automation Letters (RA-L) 2024. [[paper](https://ieeexplore.ieee.org/document/10403997)]
- π Submodular Trajectory Optimization for Aerial 3D Scanning. Mike Roberts, Debadeepta Dey, Anh Truong, Sudipta Sinha, Shital Shah, Ashish Kapoor, Pat Hanrahan, Neel Joshi. ICCV 2017. [paper] [project]
- π GEN-SLAM: Generative Modeling for Monocular Simultaneous Localization and Mapping. Punarjay Chakravarty, Praveen Narayanan, Tom Roussel. ICRA 2019. [paper]
- π QuadricSLAM: Dual Quadrics from Object Detections as Landmarks in Object-oriented SLAM. Lachlan Nicholson, Michael Milford, Niko Sünderhauf. IEEE Robotics and Automation Letters (RA-L) 2019. [paper]
- π Path Planning for Active V-Slam Based on Reinforcement Learning. Borui Li, Fuchun Sun, Huaping Liu, Bin Fang. International Conference on Cognitive Systems and Signal Processing 2019. [paper]
- π UnrealNavigation: Simulation Software for testing SLAM in Virtual Reality. Anne M. Bettens, Benjamin Morrell, Mauricio Coen, Neil McHenry, Xiaofeng Wu, Peter Gibbens, Gregory Chamitoff. AIAA Scitech 2020 Forum. [paper] [code]
- π An Efficient Sampling-based Method for Online Informative Path Planning in Unknown Environments. Lukas Schmid, Michael Pantic, Raghav Khanna, Lionel Ott, Roland Siegwart, Juan Nieto. IEEE Robotics and Automation Letters (RA-L) 2020. [paper] [code]
- π Next-Best View Policy for 3D Reconstruction. Peralta, D., Casimiro, J., Nilles, A.M., Aguilar, J.A., Atienza, R., and Cajote, R. ECCV Workshops 2020. [paper] [code]
- π Flight Planning for Survey-Grade 3D Reconstruction of Truss Bridges. Zhexiong Shang, Zhigang Shen. Remote Sens. 2022. [paper]
- π View Planning Using Discrete Optimization for 3D Reconstruction of Row Crops. Athanasios Bacharis, Henry J. Nelson and Nikolaos Papanikolopoulos. IROS 2022. [paper]
- π A Unified Framework for Multi-View Multi-Class Object Pose Estimation. Chi Li, Jin Bai, Gregory D. Hager. ECCV 2018. [paper]
- π π CRAVES: Controlling Robotic Arm with a Vision-based, Economic System. Yiming Zuo*, Weichao Qiu*, Lingxi Xie, Fangwei Zhong, Yizhou Wang, Alan Yuille. CVPR 2019. [paper] [code]
- π Learning from Synthetic Animals. Jiteng Mu*, Weichao Qiu*, Gregory Hager, Alan Yuille. CVPR 2020 (Oral). [paper] [code]
- π AdaFuse: Adaptive Multiview Fusion for Accurate Human Pose Estimation in the Wild. Zhe Zhang, Chunyu Wang, Weichao Qiu, Wenhu Qin, Wenjun Zeng. IJCV 2020. [paper] [code]
- π Deep Learning for Spacecraft Pose Estimation from Photorealistic Rendering. Pedro F. Proença, Yang Gao. ICRA 2020. [paper] [project]
- π Learning From Synthetic Vehicles. Tae Soo Kim, Bohoon Shim, Michael Peven, Weichao Qiu, Alan Yuille, Gregory D. Hager. WACV 2022. [paper] [dataset]
- π Proactive Multi-Camera Collaboration for 3D Human Pose Estimation. Hai Ci*, Mickel Liu*, Xuehai Pan*, Fangwei Zhong, Yizhou Wang. ICLR 2023. [paper] [project]
- π Monocular 3D Human Pose Estimation for Sports Broadcasts using Partial Sports Field Registration. Tobias Baumgartner, Stefanie Klatt. CVPR-W 2023. [paper] [code]
- π Hardware-accelerated Mars Sample Localization via Deep Transfer Learning from Photorealistic Simulations. R. Castilla-Arquillo, C. J. Pérez-del-Pulgar, G. J. Paz-Delgado and L. Gerdes. IEEE Robotics and Automation Letters 2022. [paper] [code]
- π Local Light Field Fusion: Practical View Synthesis with Prescriptive Sampling Guidelines. Ben Mildenhall, Pratul P. Srinivasan, Rodrigo Ortiz-Cayon, Nima Khademi Kalantari, Ravi Ramamoorthi, Ren Ng, Abhishek Kar. SIGGRAPH 2019. [paper] [project]
- π ESIM: an Open Event Camera Simulator. Henri Rebecq, Daniel Gehrig, Davide Scaramuzza. CoRL 2018. [paper] [code]
- π Insights into Batch Selection for Event-Camera Motion Estimation. Valerdi, Juan L., Chiara Bartolozzi, and Arren Glover. Sensors 2023. [paper]
- π An Improved Method Based on Deep Reinforcement Learning for Target Searching. Xiao Long Wei, Xiang Lin Huang, Tao Lu, Ge Ge Song. International Conference on Robotics and Automation Engineering (ICRAE) 2019. [paper]
- π Training an Agent to Find and Reach an Object in Different Environments using Visual Reinforcement Learning and Transfer Learning. Evelyn Conceição Santos Batista, Wouter Caarls, Leonardo A. Forero, Marco Aurélio C. Pacheco. ICAART 2021. [paper]
- π π Enhancing optical-flow-based control by learning visual appearance cues for flying robots. G. C. H. E. de Croon, C. De Wagter, T. Seidl. Nature Machine Intelligence 2021. [paper]
- π π Simultaneous localization and mapping architecture for small bodies and space exploration. Bettens, A., Morrell, B., Coen, M., Wu, X., Gibbens, P., & Chamitoff, G. Advances in Space Research 2024, 73(1), 1185-1197. [paper]
- π End-to-end Active Object Tracking via Reinforcement Learning. Wenhan Luo*, Peng Sun*, Fangwei Zhong, Wei Liu, Tong Zhang, Yizhou Wang. ICML 2018. [paper] [project]
- π End-to-end Active Object Tracking and Its Real-world Deployment via Reinforcement Learning. Wenhan Luo*, Peng Sun*, Fangwei Zhong*, Wei Liu, Tong Zhang, Yizhou Wang. IEEE TPAMI 2019. [paper] [project]
- π AD-VAT: An Asymmetric Dueling mechanism for learning Visual Active Tracking. Fangwei Zhong, Wenhan Luo, Peng Sun, Tingyun Yan, Yizhou Wang. ICLR 2019. [paper] [code]
- π π AD-VAT+: An Asymmetric Dueling Mechanism for Learning and Understanding Visual Active Tracking. Fangwei Zhong, Wenhan Luo, Peng Sun, Tingyun Yan, Yizhou Wang. IEEE TPAMI 2019. [paper] [code]
- π Pose-Assisted Multi-Camera Collaboration for Active Object Tracking. Jing Li*, Jing Xu*, Fangwei Zhong*, Xiangyu Kong, Yu Qiao, Yizhou Wang. AAAI 2020. [paper] [code]
- π π Towards Distraction-Robust Active Visual Tracking. Fangwei Zhong, Wenhan Luo, Peng Sun, Tingyun Yan, Yizhou Wang. ICML 2021. [paper] [code] [project]
- π Anti-Distractor Active Object Tracking in 3D Environments. Mao Xi, Yun Zhou, Zheng Chen, Wengang Zhou, Houqiang Li. IEEE TCSVT 2022. [paper]
- π RSPT: Reconstruct Surroundings and Predict Trajectories for Generalizable Active Object Tracking. Fangwei Zhong*, Xiao Bi*, Yudi Zhang, Wei Zhang, Yizhou Wang. AAAI 2023. [paper] [project]
- π Pose-Assisted Multi-Camera Collaboration for Active Object Tracking. Jing Li*, Jing Xu*, Fangwei Zhong*, Xiangyu Kong, Yu Qiao, Yizhou Wang. AAAI 2020. [paper] [code]
- π Proactive Multi-Camera Collaboration for 3D Human Pose Estimation. Hai Ci*, Mickel Liu*, Xuehai Pan*, Fangwei Zhong, Yizhou Wang. ICLR 2023. [paper] [project]
- π A scalable pipeline to create synthetic datasets from functional–structural plant models for deep learning. Dirk Norbert Baker, Felix Maximilian Bauer, Mona Giraud, Andrea Schnepf, Jens Henrik Göbbert, Hanno Scharr, Ebba Hvannberg, Morris Riedel. in silico Plants, 6(1), diad022. [paper]
- π Adapting Agricultural Virtual Environments in Game Engines to Improve HPC Accessibility. Dirk Norbert Baker, Felix Bauer, Andrea Schnepf, Hanno Scharr, Morris Riedel, Jens Henrik Göbbert, Ebba Hvannberg. [paper]