Drones equipped with cameras have been fast deployed to a wide range of applications, such as agriculture, aerial photography, fast delivery, and surveillance. As the core steps in those applications, video object detection and tracking attracts much research effort in recent years. However, the current video object detection and tracking algorithms are not usually optimal for dealing with video sequences captured by drones, due to various challenges, such as viewpoint change and scales. To promote and track the development of the detection and tracking algorithms with drones, we organized the Vision Meets Drone Video Detection and Tracking (VisDrone-VDT2018) challenge, which is a subtrack of the Vision Meets Drone 2018 challenge workshop in conjunction with the 15th European Conference on Computer Vision (ECCV 2018). Specifically, this workshop challenge consists of two tasks, (1) video object detection, and (2) multi-object tracking. We present a large-scale video object detection and tracking dataset, which consists of 79 video clips with about 1.5 million annotated bounding boxes in 33,366 frames. We also provide rich annotations, including object categories, occlusion, and truncation ratios for better data usage. Being the largest such dataset ever published, the challenge enables extensive evaluation, investigation and tracking the progress of object detection and tracking algorithms on the drone platform. We present the evaluation protocol of the VisDrone-VDT2018 challenge and the results of the algorithms on the benchmark dataset, which are publicly available on the challenge website: http://www.aiskyeye.com/. We hope the challenge largely boost the research and development in related fields.
@inproceedings{Noor2018:ECCVW_Visdrone_Report,
author = "P. Zhu and L. Wen and D. Du and X. Bian and H. Ling and Q. Hu and H. Wu and Q. Nie and H. Cheng and C. Liu and X. Liu and W. Ma and L. Wang and A. Schumann and D. Wang and D. Ortego and E. Luna and E. Michail and E. Bochinski and F. Ni and F. Bunyak and G. Zhang and G. Seetharaman and G. Li and H. Yu and I. Kompatsiaris and J. Zhao and J. Gao and J. M. Martinez and J. C. S. Miguel and K. Palaniappan and K. Avgerinakis and L. Sommer and M. Lauer and M. Liu and N. M. Al-Shakarji and O. Acatay and P. Giannakeris and Q. Zhao and Q. Ma and Q. Huang and S. Vrochidis and T. Sikora and T. Senst and W. Song and W. Tian and W. Zhang and Y. Zhao and Y. Bai and Y. Wu and Y. Wang and Y. Li and Z. Pi and Z. Ma",
title = "VisDrone-VDT2018: The vision meets drone video detection and tracking challenge results",
year = 2018,
booktitle = "Proceedings of the European Conference on Computer Vision (ECCV) Workshops",
keywords = "drone-based multiple object tracking, drone performance evaluation, multi-object tracking",
url = "https://openaccess.thecvf.com/content_eccv_2018_workshops/w27/html/Zhu_VisDrone-VDT2018_The_Vision_Meets_Drone_Video_Detection_and_Tracking_Challenge_ECCVW_2018_paper.html"
}
P. Zhu, L. Wen, D. Du, X. Bian, H. Ling, Q. Hu, H. Wu, Q. Nie, H. Cheng, C. Liu, X. Liu, W. Ma, L. Wang, A. Schumann, D. Wang, D. Ortego, E. Luna, E. Michail, E. Bochinski, F. Ni, F. Bunyak, G. Zhang, G. Seetharaman, G. Li, H. Yu, I. Kompatsiaris, J. Zhao, J. Gao, J. M. Martinez, J. C. S. Miguel, K. Palaniappan, K. Avgerinakis, L. Sommer, M. Lauer, M. Liu, N. M. Al-Shakarji, O. Acatay, P. Giannakeris, Q. Zhao, Q. Ma, Q. Huang, S. Vrochidis, T. Sikora, T. Senst, W. Song, W. Tian, W. Zhang, Y. Zhao, Y. Bai, Y. Wu, Y. Wang, Y. Li, Z. Pi, and Z. Ma. VisDrone-VDT2018: The vision meets drone video detection and tracking challenge results. Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2018.