Automatic Ground Truths: Projected Image Annotations for Omnidirectional Vision

Victor Stamatescu, Peter Barsznica, Manjung Kim, Kin K. Liu, Mark McKenzie, Will Meakin, Gwilyn Saunders, Sebastien C. Wong, Russell S.A. Brinkworth

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We present a novel data set made up of omnidirectional video of multiple objects whose centroid positions are annotated automatically. Omnidirectional vision is an active field of research focused on the use of spherical imagery in video analysis and scene understanding, involving tasks such as object detection, tracking and recognition. Our goal is to provide a large and consistently annotated video data set that can be used to train and evaluate new algorithms for these tasks. Here we describe the experimental setup and software environment used to capture and map the 3D ground truth positions of multiple objects into the image. Furthermore, we estimate the expected systematic error on the mapped positions. In addition to final data products, we release publicly the software tools and raw data necessary to re-calibrate the camera and/or redo this mapping. The software also provides a simple framework for comparing the results of standard image annotation tools or visual tracking systems against our mapped ground truth annotations.

Original languageEnglish
Title of host publicationDICTA 2017 - 2017 International Conference on Digital Image Computing
Subtitle of host publicationTechniques and Applications
EditorsYi Guo, Hongdong Li, Weidong Tom Cai, Manzur Murshed, Zhiyong Wang, Junbin Gao, David Dagan Feng
Place of PublicationPiscataway, NJ
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1-8
ISBN (Electronic)9781538628393
DOIs
Publication statusPublished - 19 Dec 2017
Externally publishedYes
Event2017 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2017 - Sydney, Australia
Duration: 29 Nov 20171 Dec 2017

Publication series

NameDICTA 2017 - 2017 International Conference on Digital Image Computing: Techniques and Applications
Volume2017-December

Conference

Conference2017 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2017
CountryAustralia
CitySydney
Period29/11/171/12/17

Keywords

  • omnidirectional vision
  • spherical imagery
  • video analysis
  • algorithms
  • software tools
  • raw data

Fingerprint Dive into the research topics of 'Automatic Ground Truths: Projected Image Annotations for Omnidirectional Vision'. Together they form a unique fingerprint.

  • Cite this

    Stamatescu, V., Barsznica, P., Kim, M., Liu, K. K., McKenzie, M., Meakin, W., Saunders, G., Wong, S. C., & Brinkworth, R. S. A. (2017). Automatic Ground Truths: Projected Image Annotations for Omnidirectional Vision. In Y. Guo, H. Li, W. T. Cai, M. Murshed, Z. Wang, J. Gao, & D. D. Feng (Eds.), DICTA 2017 - 2017 International Conference on Digital Image Computing: Techniques and Applications (pp. 1-8). (DICTA 2017 - 2017 International Conference on Digital Image Computing: Techniques and Applications; Vol. 2017-December). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/DICTA.2017.8227409