Early Prediction of Children’s Disengagement in a Tablet Tutor Using Visual Features

Bikram Boote¹³,
Mansi Agarwal¹⁴ &
Jack Mostow¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12749))

Included in the following conference series:

International Conference on Artificial Intelligence in Education

4651 Accesses

Abstract

Intelligent tutoring systems could benefit from human teachers’ ability to monitor students’ affective states by watching them and thereby detecting early warning signs of disengagement in time to prevent it. Toward that goal, this paper describes a method that uses input from a tablet tutor’s user-facing camera to predict whether the student will complete the current activity or disengage from it. Training a disengagement predictor is useful not only in itself but also in identifying visual indicators of negative affective states even when they don’t lead to non-completion of the task. Unlike prior work that relied on tutor-specific features, the method relies solely on visual features and so could potentially apply to other tutors. We present a deep learning method to make such predictions based on a Long Short Term Memory (LSTM) model that uses a target replication loss function. We train and test the model on screen capture videos of children in Tanzania using a tablet tutor to learn basic Swahili literacy and numeracy. We achieve balanced-class-size prediction accuracy of 73.3% when 40% of the activity is still left. We also analysed how prediction accuracy varies among tutor activities, revealing two distinct causes of disengagement.

B. Boote and M. Agarwal—Both student authors contributed equally.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Real-Time Detection of Student Engagement: Deep Learning-Based System

User Engagement Recognition Using Transfer Learning and Multi-task Classification

Enhancing Engagement Prediction in Online Environment Using Temporal Features

References

Agarwal, M., Mostow, J.: Semi-supervised learning to perceive children’s affective states in a tablet tutor. In: Thirty-Fourth AAAI Conference on Artificial Intelligence, pp. 13350–13357 (2020)
Google Scholar
Arroyo, I., Woolf, B.P.: Inferring learning and attitudes from a Bayesian network of log file data. In: International Conference on Artificial Intelligence in Education, pp. 33–40 (2005)
Google Scholar
Baltrušaitis, T., Robinson, P., Morency, L.P.: OpenFace: an open source facial behavior analysis toolkit. In: 2016 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 1–10. IEEE (2016)
Google Scholar
Bosch, N., D’Mello, S.: Automatic detection of mind wandering from video in the lab and in the classroom. IEEE Trans. Affect. Comput. (2019)
Google Scholar
Bosch, N., D’mello, S.K., Ocumpaugh, J., Baker, R.S., Shute, V.: Using video to automatically detect learner affect in computer-enabled classrooms. ACM Trans. Interact. Intell. Syst. (TiiS) 6(2), 1–26 (2016)
Article Google Scholar
Chang, C., Zhang, C., Chen, L., Liu, Y.: An ensemble model using face and body tracking for engagement detection. In: Proceedings of the 20th ACM International Conference on Multimodal Interaction, pp. 616–622 (2018)
Google Scholar
Dewan, M.A.A., Murshed, M., Lin, F.: Engagement detection in online learning: a review. Smart Learn. Environ. 6(1), 1 (2019)
Article Google Scholar
González-Brenes, J.P., Mostow, J.: Predicting task completion from rich but scarce data (2010)
Google Scholar
Liang, W.C., Yuan, J., Sun, D.C., Lin, M.H.: Changes in physiological parameters induced by indoor simulated driving: effect of lower body exercise at mid-term break. Sensors 9(9), 6913–6933 (2009)
Article Google Scholar
Lipton, Z.C., Kale, D.C., Elkan, C., Wetzel, R.: Learning to diagnose with LSTM recurrent neural networks. arXiv preprint arXiv:1511.03677 (2015)
McReynolds, A.A., Naderzad, S.P., Goswami, M., Mostow, J.: Toward learning at scale in developing countries: lessons from the global learning XPRIZE field study. In: Proceedings of the Seventh ACM Conference on Learning@ Scale, pp. 175–183 (2020)
Google Scholar
Mostow, J., Chang, K., Nelson, J.: Toward exploiting EEG input in a reading tutor. In: Biswas, G., Bull, S., Kay, J., Mitrovic, A. (eds.) AIED 2011. LNCS (LNAI), vol. 6738, pp. 230–237. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-21869-9_31
Chapter Google Scholar
Thomas, C., Jayagopi, D.B.: Predicting student engagement in classrooms using facial behavioral cues. In: Proceedings of the 1st ACM SIGCHI International Workshop on Multimodal Interaction for Education, pp. 33–40 (2017)
Google Scholar
Yuen, H., Princen, J., Illingworth, J., Kittler, J.: Comparative study of Hough transform methods for circle finding. Image Vis. Comput. 8(1), 71–77 (1990)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Jadavpur University, Kolkata, India
Bikram Boote
Delhi Technological University, Rohini, India
Mansi Agarwal
Carnegie Mellon University, Pittsburgh, USA
Jack Mostow

Authors

Bikram Boote
View author publications
You can also search for this author in PubMed Google Scholar
Mansi Agarwal
View author publications
You can also search for this author in PubMed Google Scholar
Jack Mostow
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Technion – Israel Institute of Technology, Haifa, Israel
Ido Roll
Arizona State University, Tempe, AZ, USA
Danielle McNamara
Utrecht University, Utrecht, The Netherlands
Sergey Sosnovsky
London Knowledge Lab, London, UK
Rose Luckin
University of Leeds, Leeds, UK
Vania Dimitrova

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Boote, B., Agarwal, M., Mostow, J. (2021). Early Prediction of Children’s Disengagement in a Tablet Tutor Using Visual Features. In: Roll, I., McNamara, D., Sosnovsky, S., Luckin, R., Dimitrova, V. (eds) Artificial Intelligence in Education. AIED 2021. Lecture Notes in Computer Science(), vol 12749. Springer, Cham. https://doi.org/10.1007/978-3-030-78270-2_17

Download citation

DOI: https://doi.org/10.1007/978-3-030-78270-2_17
Published: 12 June 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-78269-6
Online ISBN: 978-3-030-78270-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics