Unsupervised Learning of Eye Gaze Representation from the Web

Neeru Dubey, Shreya Ghosh, Abhinav Dhall

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

17 Citations (Scopus)

Abstract

Automatic eye gaze estimation has interested researchers for a while now. In this paper, we propose an unsupervised learning based method for estimating the eye gaze region. To train the proposed network "Ize-Net" in self-supervised manner, we collect a large 'in the wild' dataset containing 1,54,251 images from the web. For the images in the database, we divide the gaze into three regions based on an automatic technique based on pupil-centers localization and then use a feature-based technique to determine the gaze region. The performance is evaluated on the Tablet Gaze and CAVE datasets by fine-tuning results of Ize-Net for the task of eye gaze estimation. The feature representation learned is also used to train traditional machine learning algorithms for eye gaze estimation. The results demonstrate that the proposed method learns a rich data representation, which can be efficiently finetuned for any eye gaze estimation dataset.

Original languageEnglish
Title of host publication2019 International Joint Conference on Neural Networks
Subtitle of host publicationIJCNN 2019
PublisherInstitute of Electrical and Electronics Engineers
Pages3488-3494
Number of pages7
ISBN (Electronic)9781728119854
DOIs
Publication statusPublished - 30 Sept 2019
Externally publishedYes
Event2019 International Joint Conference on Neural Networks - Budapest, Hungary
Duration: 14 Jul 201919 Jul 2019

Publication series

NameProceedings of the International Joint Conference on Neural Networks
PublisherInstitute of Electrical and Electronics Engineers
Volume2019-July
ISSN (Electronic)2161-4407

Conference

Conference2019 International Joint Conference on Neural Networks
Abbreviated titleIJCNN 2019
Country/TerritoryHungary
CityBudapest
Period14/07/1919/07/19

Fingerprint

Dive into the research topics of 'Unsupervised Learning of Eye Gaze Representation from the Web'. Together they form a unique fingerprint.

Cite this