MTGLS: Multi-Task Gaze Estimation with Limited Supervision

Shreya Ghosh, Munawar Hayat, Abhinav Dhall, Jarrod Knibbe

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

24 Citations (Scopus)

Abstract

Robust gaze estimation is a challenging task, even for deep CNNs, due to the non-availability of large-scale labeled data. Moreover, gaze annotation is a time-consuming process and requires specialized hardware setups. We propose MTGLS: a Multi-Task Gaze estimation framework with Limited Supervision, which leverages abundantly available non-annotated facial image data. MTGLS distills knowledge from off-the-shelf facial image analysis models, and learns strong feature representations of human eyes, guided by three complementary auxiliary signals: (a) the line of sight of the pupil (i.e. pseudo-gaze) defined by the localized facial landmarks, (b) the head-pose given by Euler angles, and (c) the orientation of the eye patch (left/right eye). To overcome inherent noise in the supervisory signals, MT-GLS further incorporates a noise distribution modelling approach. Our experimental results show that MTGLS learns highly generalized representations which consistently perform well on a range of datasets. Our proposed framework outperforms the unsupervised state-of-the-art on CAVE (by ∼ 6.43%) and even supervised state-of-the-art methods on Gaze360 (by ∼ 6.59%) datasets.

Original languageEnglish
Title of host publicationProceedings
Subtitle of host publication2022 IEEE Winter Conference on Applications of Computer Vision
Place of PublicationUnited States
PublisherInstitute of Electrical and Electronics Engineers
Pages1161-1172
Number of pages12
ISBN (Electronic)978-1-6654-0915-5
ISBN (Print)978-1-6654-0916-2
DOIs
Publication statusPublished - 15 Feb 2022
Externally publishedYes
Event22nd IEEE/CVF Winter Conference on Applications of Computer Vision - Waikoloa, United States
Duration: 3 Jan 20228 Jan 2022

Publication series

NameIEEE Winter Conference on Applications of Computer Vision
PublisherInstitute of Electrical and Electronics Engineers
Volume2022
ISSN (Print)2472-6737
ISSN (Electronic)2642-9381

Conference

Conference22nd IEEE/CVF Winter Conference on Applications of Computer Vision
Abbreviated titleWACV 2022
Country/TerritoryUnited States
CityWaikoloa
Period3/01/228/01/22

Keywords

  • Biometrics
  • Face Processing Human-Computer Interaction
  • Few-shot
  • Large-scale Vision Applications
  • Semi- and Un- supervised Learning
  • Transfer

Fingerprint

Dive into the research topics of 'MTGLS: Multi-Task Gaze Estimation with Limited Supervision'. Together they form a unique fingerprint.

Cite this