Speak2Label: Using Domain Knowledge for Creating a Large Scale Driver Gaze Zone Estimation Dataset

Shreya Ghosh, Abhinav Dhall, Garima Sharma, Sarthak Gupta, Nicu Sebe

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

26 Citations (Scopus)

Abstract

Labelling of human behavior analysis data is a complex and time-consuming task. In this paper, a fully automatic technique for labelling an image-based gaze behavior dataset for driver gaze zone estimation is proposed. Domain knowledge is added to the data recording paradigm, and labels are later generated automatically using Speech-To-Text (STT) conversion. To remove the noise in the STT process due to the different illumination and ethnicity of subjects in our data, the speech frequency and energy are analysed. The resultant Driver Gaze in the Wild (DGW) dataset contains 586 recordings, captured at different times of the day, including evenings. The large-scale dataset contains 338 subjects with an age range of 18-63 years. As the data is recorded in different lighting conditions, an illumination-robust layer is proposed in the Convolutional Neural Network (CNN). The extensive experiments show that the variance in the dataset resembles real-world conditions and demonstrate the effectiveness of the proposed CNN pipeline. The proposed network is also fine-tuned for the eye gaze prediction task, which shows the discriminativeness of the representation learnt by our network on the proposed DGW dataset.
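As a rough illustration of the speech-driven labelling idea in the abstract, the sketch below maps time-stamped STT words onto video frames. It is a minimal sketch under stated assumptions, not the authors' pipeline: the `ZONE_WORDS` vocabulary, the `label_frames` function, and the `(word, start, end)` transcript format are all hypothetical, and the noise handling here is only a vocabulary filter rather than the frequency and energy analysis the abstract mentions.

```python
from typing import List, Optional, Tuple

# Hypothetical vocabulary: spoken word -> gaze zone id (an assumption, not from the paper).
ZONE_WORDS = {"one": 1, "two": 2, "three": 3}


def label_frames(
    transcript: List[Tuple[str, float, float]],
    fps: float,
    n_frames: int,
) -> List[Optional[int]]:
    """Give each video frame the zone id of the zone word spoken at that time.

    transcript holds (word, start_seconds, end_seconds) tuples, as an STT
    system with word-level timestamps might return (assumed format).
    Frames not covered by any zone word stay unlabelled (None).
    """
    labels: List[Optional[int]] = [None] * n_frames
    for word, start_s, end_s in transcript:
        zone = ZONE_WORDS.get(word.lower())
        if zone is None:
            continue  # words outside the vocabulary are treated as STT noise
        first = max(0, int(start_s * fps))
        last = min(n_frames - 1, int(end_s * fps))
        for i in range(first, last + 1):
            labels[i] = zone
    return labels


# Usage: a 2-second clip at 10 frames per second with two spoken zone words.
example = [("one", 0.0, 0.5), ("uh", 0.75, 0.875), ("two", 1.0, 1.5)]
frame_labels = label_frames(example, fps=10.0, n_frames=20)
```

The design choice here is to leave uncovered frames unlabelled rather than to interpolate, so any downstream training would only see frames with a spoken label attached.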

Original language: English
Title of host publication: Proceedings
Subtitle of host publication: 2021 IEEE/CVF International Conference on Computer Vision Workshops: ICCVW 2021
Place of Publication: United States
Publisher: Institute of Electrical and Electronics Engineers
Pages: 2896-2905
Number of pages: 10
ISBN (Electronic): 978-1-6654-0191-3
ISBN (Print): 978-1-6654-0192-0
Publication status: Published - 24 Nov 2021
Externally published: Yes
Event: 18th IEEE/CVF International Conference on Computer Vision Workshops - Virtual
Duration: 11 Oct 2021 to 17 Oct 2021

Publication series

Name: IEEE International Conference on Computer Vision
Publisher: Institute of Electrical and Electronics Engineers
Volume: 2021
ISSN (Print): 2473-9936
ISSN (Electronic): 2473-9944

Conference

Conference: 18th IEEE/CVF International Conference on Computer Vision Workshops
Abbreviated title: ICCVW 2021
City: Virtual
Period: 11/10/21 to 17/10/21

Keywords

  • Datasets
  • Human behaviour analysis
  • Automated techniques
  • Domain knowledge
  • Speech to text
  • Driver gaze
