Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3173574.3174226acmconferencesArticle/Chapter ViewAbstractPublication PageschiConference Proceedingsconference-collections
research-article

Cognitive Load Estimation in the Wild

Published: 21 April 2018 Publication History

Abstract

Cognitive load has been shown, over hundreds of validated studies, to be an important variable for understanding human performance. However, establishing practical, non-contact approaches for automated estimation of cognitive load under real-world conditions is far from a solved problem. Toward the goal of designing such a system, we propose two novel vision-based methods for cognitive load estimation, and evaluate them on a large-scale dataset collected under real-world driving conditions. Cognitive load is defined by which of 3 levels of a validated reference task the observed subject was performing. On this 3-class problem, our best proposed method of using 3D convolutional neural networks achieves 86.1% accuracy at predicting task-induced cognitive load in a sample of 92 subjects from video alone. This work uses the driving context as a training and evaluation dataset, but the trained network is not constrained to the driving environment as it requires no calibration and makes no assumptions about the subject's visual appearance, activity, head pose, scale, and perspective.

References

[1]
Linda S Angell, J Auflick, PA Austria, Dev S Kochhar, Louis Tijerina, W Biever, T Diptiman, J Hogsett, and S Kiger. 2006. Driver Workload Metrics. Technical Report.
[2]
Marie-Pierre Bruyas, Laëtitia Dumont, and France Bron. 2013. Sensitivity of Detection Response Task (DRT) to the driving demand and task difficulty. In Proceedings of the Seventh International Driving Symposium on Human Factors in Driver Assessment, Training, and Vehicle Design. 64--70.
[3]
Siyuan Chen and Julien Epps. 2013. Automatic classification of eye activity for cognitive load measurement with emotion interference. Computer methods and programs in biomedicine 110, 2 (2013), 111--124.
[4]
Melissa Patricia Coral. 2016. Analyzing Cognitive Workload Through Eye-related Measurements: A Meta-Analysis. Ph.D. Dissertation. Wright State University.
[5]
Lex Fridman, Daniel E. Brown, William Angell, Irman Abdic, Bryan Reimer, and Hae Young Noh. 2016a. Automated Synchronization of Driving Data Using Vibration and Steering Events. Pattern Recognition Letters (2016).
[6]
Lex Fridman, Joonbum Lee, Bryan Reimer, and Trent Victor. 2016b. Owl and Lizard: Patterns of Head Pose and Eye Pose in Driver Gaze Classification. IET Computer Vision (2016).
[7]
Eija Haapalainen, SeungJun Kim, Jodi F Forlizzi, and Anind K Dey. 2010. Psycho-physiological measures for assessing cognitive load. In Proceedings of the 12th ACM international conference on Ubiquitous computing. ACM, 301--310.
[8]
Tal Hassner, Shai Harel, Eran Paz, and Roee Enbar. 2015. Effective face frontalization in unconstrained images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 4295--4304.
[9]
Vahid Kazemi and Josephine Sullivan. 2014. One millisecond face alignment with an ensemble of regression trees. In Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on. IEEE, 1867--1874.
[10]
Davis E. King. 2009. Dlib-ml: A Machine Learning Toolkit. Journal of Machine Learning Research 10 (2009), 1755--1758.
[11]
Sergey Kirshner. 2005. Modeling of multivariate time series using hidden Markov models. Ph.D. Dissertation. UNIVERSITY OF CALIFORNIA, IRVINE.
[12]
Rainer Lienhart and Jochen Maydt. 2002. An extended set of haar-like features for rapid object detection. In Image Processing. 2002. Proceedings. 2002 International Conference on, Vol. 1. IEEE, I--900.
[13]
Bruce Mehler, Bryan Reimer, and Joseph F Coughlin. 2012. Sensitivity of physiological measures for detecting systematic variations in cognitive demand from a working memory task: an on-road study across three age groups. Human factors 54, 3 (2012), 396--412.
[14]
Bruce Mehler, Bryan Reimer, and JA Dusek. 2011. MIT AgeLab delayed digit recall task (n-back). Cambridge, MA: Massachusetts Institute of Technology (2011).
[15]
Fred Paas, Juhani E Tuovinen, Huib Tabbers, and Pascal WM Van Gerven. 2003. Cognitive load measurement as a means to advance cognitive load theory. Educational psychologist 38, 1 (2003), 63--71.
[16]
Bastian Pfleging, Drea K Fekety, Albrecht Schmidt, and Andrew L Kun. 2016. A Model Relating Pupil Diameter to Mental Workload and Lighting Conditions. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems. ACM, 5776--5788.
[17]
Thomas A Ranney, GH Baldwin, Larry A Smith, Elizabeth N Mazzae, and Russell S Pierce. 2014. Detection Response Task (DRT) Evaluation for Driver Distraction Measurement Application. Technical Report.
[18]
Miguel A Recarte and Luis M Nunes. 2003. Mental workload while driving: effects on visual search, discrimination, and decision making. Journal of experimental psychology: Applied 9, 2 (2003), 119.
[19]
Bryan Reimer, Bruce Mehler, J Dobres, and JF Coughlin. 2013. The effects of a production level "voice-command" interface on driver behavior: summary findings on reported workload, physiology, visual attention, and driving performance. (2013).
[20]
Bryan Reimer, Bruce Mehler, Jonathan Dobres, Hale McAnulty, Alea Mehler, Daniel Munger, and Adrian Rumpold. 2014. Effects of an'Expert Mode'Voice Command System on Task Performance, Glance Behavior & Driver Physiology. In Proceedings of the 6th international conference on automotive user interfaces and interactive vehicular applications. ACM, 1--9.
[21]
Bryan Reimer, Bruce Mehler, Ying Wang, and Joseph F Coughlin. 2012. A field study on the impact of variations in short-term memory demands on drivers' visual attention and driving performance across three age groups. Human Factors: The Journal of the Human Factors and Ergonomics Society 54, 3 (2012), 454--468.
[22]
Alexander Schliep, Benjamin Georgi, Wasinee Rungsarityotin, I Costa, and A Schonhuth. 2004. The general hidden markov model library: Analyzing systems with unobservable states. Proceedings of the Heinz-Billing-Price 2004 (2004), 121--135.
[23]
Gerald Schweighofer and Axel Pinz. 2008. Globally Optimal O (n) Solution to the PnP Problem for General Camera Models. In BMVC. 1--10.
[24]
Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).
[25]
John Sweller, Paul Ayres, and Slava Kalyuga. 2011. Measuring cognitive load. In Cognitive load theory. Springer, 71--85.
[26]
Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, and Manohar Paluri. 2015. Learning spatiotemporal features with 3d convolutional networks. In 2015 IEEE International Conference on Computer Vision (ICCV). IEEE, 4489--4497.
[27]
Ying Wang, Bryan Reimer, Jonathan Dobres, and Bruce Mehler. 2014. The sensitivity of different methodologies for characterizing drivers' gaze concentration under increased cognitive demand. Transportation research part F: traffic psychology and behaviour 26 (2014), 227--237.
[28]
Joe Yue-Hei Ng, Matthew Hausknecht, Sudheendra Vijayanarasimhan, Oriol Vinyals, Rajat Monga, and George Toderici. 2015. Beyond short snippets: Deep networks for video classification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 4694--4702.
[29]
Matthew D Zeiler and Rob Fergus. 2014. Visualizing and understanding convolutional networks. In European Conference on Computer Vision. Springer, 818--833.
[30]
Matthew D Zeiler, Graham W Taylor, and Rob Fergus. 2011. Adaptive deconvolutional networks for mid and high level feature learning. In 2011 International Conference on Computer Vision. IEEE, 2018--2025.
[31]
Yilu Zhang, Yuri Owechko, and Jing Zhang. 2004. Driver cognitive workload estimation: A data-driven perspective. In Intelligent Transportation Systems, 2004. Proceedings. The 7th International IEEE Conference on. IEEE, 642--647.

Cited By

View all
  • (2024)EEG, Pupil Dilations, and Other Physiological Measures of Working Memory Load in the Sternberg TaskMultimodal Technologies and Interaction10.3390/mti80400348:4(34)Online publication date: 19-Apr-2024
  • (2024)Evaluating the robustness of multimodal task load estimation modelsFrontiers in Computer Science10.3389/fcomp.2024.13711816Online publication date: 10-Apr-2024
  • (2024)PrivateGaze: Preserving User Privacy in Black-box Mobile Gaze Tracking ServicesProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/36785958:3(1-28)Online publication date: 9-Sep-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CHI '18: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems
April 2018
8489 pages
ISBN:9781450356206
DOI:10.1145/3173574
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 April 2018

Permissions

Request permissions for this article.

Check for updates

Badges

  • Honorable Mention

Author Tags

  1. deep learning
  2. human-centered artificial intelligence

Qualifiers

  • Research-article

Conference

CHI '18
Sponsor:

Acceptance Rates

CHI '18 Paper Acceptance Rate 666 of 2,590 submissions, 26%;
Overall Acceptance Rate 6,199 of 26,314 submissions, 24%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)289
  • Downloads (Last 6 weeks)46
Reflects downloads up to 29 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)EEG, Pupil Dilations, and Other Physiological Measures of Working Memory Load in the Sternberg TaskMultimodal Technologies and Interaction10.3390/mti80400348:4(34)Online publication date: 19-Apr-2024
  • (2024)Evaluating the robustness of multimodal task load estimation modelsFrontiers in Computer Science10.3389/fcomp.2024.13711816Online publication date: 10-Apr-2024
  • (2024)PrivateGaze: Preserving User Privacy in Black-box Mobile Gaze Tracking ServicesProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/36785958:3(1-28)Online publication date: 9-Sep-2024
  • (2024)Hide-and-seek: Detecting Workers' Emotional Workload in Emotional Labor Contexts Using Multimodal SensingProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/36785938:3(1-28)Online publication date: 9-Sep-2024
  • (2024)AdaptiveVoice: Cognitively Adaptive Voice Interface for Driving AssistanceProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642876(1-18)Online publication date: 11-May-2024
  • (2024)Driver Profiling and Bayesian Workload Estimation Using Naturalistic Peripheral Detection Study DataIEEE Transactions on Intelligent Vehicles10.1109/TIV.2023.33134199:1(3047-3060)Online publication date: Jan-2024
  • (2024)Multimodal Brain–Computer Interface for In-Vehicle Driver Cognitive Load Measurement: Dataset and BaselinesIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2023.334584625:6(5949-5964)Online publication date: Jun-2024
  • (2024)User-Aware Multilevel Cognitive Workload Estimation From Multimodal Physiological SignalsIEEE Transactions on Cognitive and Developmental Systems10.1109/TCDS.2023.334213916:4(1212-1222)Online publication date: Aug-2024
  • (2024)DeepFace-Attention: Multimodal Face Biometrics for Attention Estimation With Application to e-LearningIEEE Access10.1109/ACCESS.2024.343729112(111343-111359)Online publication date: 2024
  • (2024)Democratizing EEG: Embedding Electroencephalography in a Head-Mounted Display for Ubiquitous Brain-Computer InterfacingInternational Journal of Human–Computer Interaction10.1080/10447318.2024.2388368(1-25)Online publication date: 19-Aug-2024
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media