Abstract
Traffic accidents cause over a million deaths every year, a large fraction of which is attributed to drunk driving. An automated in-vehicle system for detecting intoxicated drivers would help reduce accidents and the related financial costs. Existing solutions require special equipment such as electrocardiographs, infrared cameras, or breathalyzers. In this work, we propose a new dataset called DIF (Dataset of perceived Intoxicated Faces), which contains audio-visual data of intoxicated and sober people obtained from online sources. To the best of our knowledge, this is the first work on automatic bimodal non-invasive intoxication detection. Convolutional Neural Networks (CNN) and Deep Neural Networks (DNN) are trained to compute the video and audio baselines, respectively. A 3D CNN is used to exploit the spatio-temporal changes in the video. A simple variation of the traditional 3D convolution block is proposed, based on inducing nonlinearity between the spatial and temporal channels. Extensive experiments are performed to validate the approach and baselines.
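The abstract does not spell out the layers of the proposed block. As a rough illustration only, the sketch below assumes that "inducing nonlinearity between the spatial and temporal channels" means splitting a 3D convolution into a per-frame spatial step and a per-pixel temporal step, with a ReLU in between. The factorization, the choice of ReLU, and all names (`spatial_conv`, `temporal_conv`, `factorized_block`) are hypothetical and are not taken from the paper.

```python
from typing import List

Frame = List[List[float]]
Video = List[Frame]  # indexed [time][row][col], single channel


def relu(x: float) -> float:
    return x if x > 0.0 else 0.0


def spatial_conv(video: Video, k: Frame) -> Video:
    """Apply the same 2D kernel to every frame ('valid' padding, stride 1)."""
    kh, kw = len(k), len(k[0])
    out = []
    for frame in video:
        h, w = len(frame), len(frame[0])
        out.append([
            [
                sum(k[i][j] * frame[r + i][c + j]
                    for i in range(kh) for j in range(kw))
                for c in range(w - kw + 1)
            ]
            for r in range(h - kh + 1)
        ])
    return out


def temporal_conv(video: Video, k: List[float]) -> Video:
    """Apply a 1D kernel along the time axis at every pixel ('valid' padding)."""
    kt = len(k)
    t, h, w = len(video), len(video[0]), len(video[0][0])
    return [
        [
            [sum(k[d] * video[s + d][r][c] for d in range(kt)) for c in range(w)]
            for r in range(h)
        ]
        for s in range(t - kt + 1)
    ]


def factorized_block(video: Video, spatial_k: Frame,
                     temporal_k: List[float], nonlinear: bool = True) -> Video:
    """Spatial conv, optional ReLU (the assumed 'induced nonlinearity'), temporal conv."""
    feat = spatial_conv(video, spatial_k)
    if nonlinear:
        feat = [[[relu(v) for v in row] for row in frame] for frame in feat]
    return temporal_conv(feat, temporal_k)


# Tiny two-frame clip: the second frame has a negative spatial response.
video = [
    [[1.0, 1.0], [1.0, 1.0]],
    [[-3.0, -1.0], [-1.0, -1.0]],
]
spatial_k = [[1.0, 1.0], [1.0, 1.0]]
temporal_k = [1.0, 1.0]

linear_out = factorized_block(video, spatial_k, temporal_k, nonlinear=False)
nonlinear_out = factorized_block(video, spatial_k, temporal_k, nonlinear=True)
```

With `nonlinear=False` the two steps compose into a single linear 3D convolution. With the ReLU in place, the negative spatial response of the second frame is clipped before the temporal step, so the two outputs differ.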
| Original language | English |
|---|---|
| Title of host publication | ICMI 2019 - Proceedings of the 2019 International Conference on Multimodal Interaction |
| Editors | Wen Gao, Helen Mei Ling Meng, Matthew Turk, Susan R. Fussell, Bjorn Schuller, Yale Song, Kai Yu |
| Place of Publication | New York, NY |
| Publisher | Association for Computing Machinery, Inc |
| Pages | 367-374 |
| Number of pages | 8 |
| ISBN (Electronic) | 9781450368605 |
| Publication status | Published - 14 Oct 2019 |
| Externally published | Yes |
| Event | 2019 International Conference on Multimodal Interaction, Suzhou, China; Duration: 14 Oct 2019 → 18 Oct 2019; Conference number: 21st |
Conference
| Conference | 2019 International Conference on Multimodal Interaction |
|---|---|
| Abbreviated title | ICMI 2019 |
| Country/Territory | China |
| City | Suzhou |
| Period | 14/10/19 → 18/10/19 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
- SDG 3 Good Health and Well-being
Keywords
- Affect recognition
- Convolutional Neural Network
- Intoxication Detection
Fingerprint
Dive into the research topics of 'DIF: Dataset of perceived intoxicated faces for drunk person identification'. Together they form a unique fingerprint.