Skip to main navigation Skip to search Skip to main content

DIF: Dataset of perceived intoxicated faces for drunk person identification

  • Vineet Mehta
  • , Sai Srinadhu Katta
  • , Devendra Pratap Yadav
  • , Abhinav Dhall

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

12 Citations (Scopus)

Abstract

Traffic accidents cause over a million deaths every year, of which a large fraction is attributed to drunk driving. An automated intoxicated driver detection system in vehicles will be useful in reducing accidents and related financial costs. Existing solutions require special equipment such as electrocardiogram, infrared cameras or breathalyzers. In this work, we propose a new dataset called DIF (Dataset of perceived Intoxicated Faces) which contains audio-visual data of intoxicated and sober people obtained from online sources. To the best of our knowledge, this is the first work for automatic bimodal non-invasive intoxication detection. Convolutional Neural Networks (CNN) and Deep Neural Networks (DNN) are trained for computing the video and audio baselines, respectively. 3D CNN is used to exploit the Spatio-temporal changes in the video. A simple variation of the traditional 3D convolution block is proposed based on inducing nonlinearity between the spatial and temporal channels. Extensive experiments are performed to validate the approach and baselines.

Original languageEnglish
Title of host publicationICMI 2019 - Proceedings of the 2019 International Conference on Multimodal Interaction
EditorsWen Gao, Helen Mei Ling Meng, Matthew Turk, Susan R. Fussell, Bjorn Schuller, Bjorn Schuller, Yale Song, Kai Yu
Place of PublicationNew York, NY
PublisherAssociation for Computing Machinery, Inc
Pages367-374
Number of pages8
ISBN (Electronic)9781450368605
DOIs
Publication statusPublished - 14 Oct 2019
Externally publishedYes
Event2019 International Conference on Multimodal Interaction - Suzhou, China
Duration: 14 Oct 201918 Oct 2019
Conference number: 21st

Conference

Conference2019 International Conference on Multimodal Interaction
Abbreviated titleICMI 2019
Country/TerritoryChina
CitySuzhou
Period14/10/1918/10/19

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Affect recognition
  • Convolutional Neural Network
  • Intoxication Detection

Fingerprint

Dive into the research topics of 'DIF: Dataset of perceived intoxicated faces for drunk person identification'. Together they form a unique fingerprint.

Cite this