Nothing Special   »   [go: up one dir, main page]

Skip to main content
Log in

PTSD in the wild: a video database for studying post-traumatic stress disorder recognition in unconstrained environments

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

POST-traumatic stress disorder (PTSD) is a chronic and debilitating mental condition that is developed in response to catastrophic life events, such as military combat, sexual assault, and natural disasters. PTSD is characterized by flashbacks of past traumatic events, intrusive thoughts, nightmares, hypervigilance, and sleep disturbance, all of which affect a person’s life and lead to considerable social, occupational, and interpersonal dysfunction. The diagnosis of PTSD is done by medical professionals using self-assessment questionnaire of PTSD symptoms as defined in the Diagnostic and Statistical Manual of Mental Disorders (DSM). In this paper, and for the first time, we collected, annotated, and prepared for public distribution a new video database for automatic PTSD diagnosis, called PTSD-in-the-wild dataset. The database exhibits “natural" and big variability in acquisition conditions with different pose, facial expression, lighting, focus, resolution, age, gender, race, occlusions and background. In addition to describing the details of the dataset collection, we provide a benchmark for evaluating machine learning-based approaches on PTSD-in-the-wild dataset. In addition, we propose and we evaluate a deep learning-based approach for PTSD detection in respect to the given benchmark. The proposed approach shows very promising results. Interested researcher can download a copy of PTSD-in-the wild dataset from http://www.lissi.fr/PTSD-Dataset/.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data Availability

The datasets generated during and/or analysed during the current study are available in the LISSI repository. Interested researcher can download a copy of PTSD-in-the wild dataset from http://www.lissi.fr/PTSD-Dataset/

Notes

  1. https://www.youtube.com/c/VeteransMTC

  2. https://www.youtube.com/c/VeteransHealthAdmin

  3. GQ: https://www.youtube.com/c/GQ

  4. https://github.com/DTaoo/VGGish

  5. https://github.com/tensorflow/models/tree/master/research/audioset/vggish

  6. https://keras.io/api/layers/recurrent_layers/lstm/

  7. ResNet50v2: https://keras.io/api/applications/resnet/#resnet50v2-function

References

  1. Aadam Tubaishat A, Al-Obeidat F, Halim Z, Waqas M, Qayum F (2022) Emopercept: Eeg-based emotion classification through perceiver. Soft Computing, pp 1–8

  2. Abualigah L, Alfar HE, Shehab M, Hussein AM (2020) Sentiment analysis in healthcare: a brief review. Recent advances in NLP: The case of arabic language, pp 129–141

  3. Baevski A, Auli M, Mohamed A (2019) Effectiveness of self-supervised pre-training for speech recognition

  4. Baevski A, Zhou Y, Mohamed A, Auli M (2020) wav2vec 2.0: A framework for self-supervised learning of speech representations

  5. Banerjee D, Islam K, Xue K, Mei G, Xiao L, Zhang G, Xu R, Lei C, Ji S, Li J (2019) A deep transfer learning approach for improved post-traumatic stress disorder diagnosis. Knowl Inf Syst 60(3):1693–1724

    Article  Google Scholar 

  6. Batbaatar E, Li M, Ryu KH (2019) Semantic-emotion neural network for emotion recognition from text. IEEE Access 7:111866–111878

    Article  Google Scholar 

  7. Bauer MR, Ruef AM, Pineles SL, Japuntich SJ, Macklin ML, Lasko NB, Orr SP (2013) Psychophysiological assessment of PTSD: a potential research domain criteria construct. Psychol Assess 25(3):1037–1043

    Article  Google Scholar 

  8. de Beurs E, Thomaes K, Kronemeijer H, Dekker J (2020) the PTSD checklist for DSM-5 (PCL-5): comparing responsivity with the outcome questionnaire (OQ-45) and practical utility. Tijdschr Psychiatr 62(6):448–456

    Google Scholar 

  9. Deng J, Dong W, Socher R, Li LJ, Li K, Fei-Fei L (2009) Imagenet: A large-scale hierarchical image database. In: 2009 IEEE Conference on computer vision and pattern recognition, pp 248–255. IEEE

  10. Desmet B, Hoste V (2013) Emotion detection in suicide notes. Exp Syst Appl 40(16):6351–6358

    Article  Google Scholar 

  11. Devlin J, Chang MW, Lee K, Toutanova K (2018) Bert: Pre-training of deep bidirectional transformers for language understanding

  12. Gemmeke JF, Ellis DP, Freedman D, Jansen A, Lawrence W, Moore RC, Plakal M, Ritter M (2017) Audio set: An ontology and human-labeled dataset for audio events. In: 2017 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 776–780. IEEE

  13. Gratch J, Artstein R, Lucas GM, Stratou G, Scherer S, Nazarian A, Wood R, Boberg J, DeVault D, Marsella S, et al (2014) The distress analysis interview corpus of human and computer interviews. Technical report, UNIVERSITY OF SOUTHERN CALIFORNIA LOS ANGELES

  14. Graves A, Fernández S, Gomez F, Schmidhuber J (2006) Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks. In: Proceedings of the 23rd International Conference on Machine Learning, ICML ’06, pp 369–376, New York, NY, USA. Association for computing machinery

  15. Gupta S, Goel L, Singh A, Agarwal AK, Singh RK (2022) Toxgb: Teamwork optimization based xgboost model for early identification of post-traumatic stress disorder. Cognitive Neurodynamics, pp 1–14

  16. Halim Z, Rehan M (2020) On identification of driving-induced stress using electroencephalogram signals: A framework based on wearable safety-critical scheme and machine learning. Inf Fusion 53:66–79

    Article  Google Scholar 

  17. He K, Zhang X, Ren S, Sun J (2016) Identity mappings in deep residual networks. In: European conference on computer vision, pp 630–645. Springer

  18. Hershey S, Chaudhuri S, Ellis DP, Gemmeke JF, Jansen A, Moore RC, Plakal M, Platt D, Saurous RA, Seybold B, et al (2017) Cnn architectures for large-scale audio classification. In: 2017 ieee international conference on acoustics, speech and signal processing (icassp), pp 131–135. IEEE

  19. Islam KA, Perez D, Li J (2018) A transfer learning approach for the 2018 femh voice data challenge. In: 2018 IEEE International conference on big data (Big Data), pp 5252–5257. IEEE

  20. Kaur S, Aggarwal H, Rani R (2020) Hyper-parameter optimization of deep learning model for prediction of parkinson’s disease. Mach Vis Appl 31:1–15

    Article  Google Scholar 

  21. Kingma DP, Ba J (2014) Adam: A method for stochastic optimization. arXiv:1412.6980

  22. Kusters R, Misevic D, Berry H, Cully A, Le Cunff Y, Dandoy L, Díaz-Rodríguez N, Ficher M, Grizou J, Othmani A et al (2020) Interdisciplinary research in artificial intelligence: Challenges and opportunities. Front Big Data 3:577974

    Article  Google Scholar 

  23. Loshchilov I, Hutter F (2017) Decoupled weight decay regularization

  24. Dia M, Khodabandelou G, Othmani A (2023) A novel stochastic transformer-based approach for post-traumatic stress disorder detection using audio recording of clinical interviews. In: 36th IEEE International Symposium on Computer-Based Medical Systems (IEEE CBMS2023)

  25. McLean SA, Ressler K, Koenen KC, Neylan T, Germine L, Jovanovic T, Clifford GD, Zeng D, An X, Linnstaedt S et al (2020) The aurora study: a longitudinal, multimodal library of brain biology and function after traumatic stress exposure. Mol Psyc 25(2):283–296

    Article  Google Scholar 

  26. Muzammel M, Salam H, Hoffmann Y, Chetouani M, Othmani A (2020) Audvowelconsnet: A phoneme-level based deep cnn architecture for clinical depression diagnosis. Mach Learn Appl 2:100005

    Google Scholar 

  27. Muzammel M, Salam H, Othmani A (2021) End-to-end multimodal clinical depression recognition using deep neural networks: A comparative analysis. Comput Methods Prog Biomed 211:106433

    Article  Google Scholar 

  28. O’Malley T, Bursztein E, Long J, Chollet F, Jin H, Invernizzi L, et al (2019) Kerastuner. https://github.com/keras-team/keras-tuner

  29. Othmani A, Brahem B, Haddou Y (2023) Machine learning-based approaches for post-traumatic stress disorder diagnosis using video and eeg sensors: A review

  30. Othmani A, Kadoch D, Bentounes K, Rejaibi E, Alfred R, Hadid A (2021) Towards robust deep neural networks for affect and depression recognition from speech. In: International conference on pattern recognition, pp 5–19. Springer

  31. Pampouchidou A, Pediaditis M, Kazantzaki E, Sfakianakis S, Apostolaki IA, Argyraki K, Manousos D, Meriaudeau F, Marias K, Yang F et al (2020) Automated facial video-based recognition of depression and anxiety symptom severity: cross-corpus validation. Mach Vis Appl 31(4):30

    Article  Google Scholar 

  32. Rahman AU, Halim Z (2023) Identifying dominant emotional state using handwriting and drawing samples by fusing features. Appl Intell 53(3):2798–2814

    Article  Google Scholar 

  33. Rejaibi E, Komaty A, Meriaudeau F, Agrebi S, Othmani A (2022) Mfcc-based recurrent neural network for automatic clinical depression recognition and assessment from speech. Biom Signal Process Control 71:103107

    Article  Google Scholar 

  34. Rozgic V, Vazquez-Reina A, Crystal M, Srivastava A, Tan V, Berka C (2014) Multi-modal prediction of ptsd and stress indicators. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp 3636–3640. IEEE

  35. Schoneveld L, Othmani A, Abdelkawy H (2021) Leveraging recent advances in deep learning for audio-visual emotion recognition. Pattern Recognit Lett 146:1–7

    Article  Google Scholar 

  36. Schultebraucks K, Yadav V, Shalev AY, Bonanno GA, Galatzer-Levy IR (2022) Deep learning-based classification of posttraumatic stress disorder and depression following trauma utilizing visual and auditory markers of arousal and mood. Psychol Med 52(5):957–967

    Article  Google Scholar 

  37. Alice Othmani Sirine Chaari, El Ouni C (2022) A mobile monitoring application for post-traumatic stress disorder. In: Proceedings of 2022 International Conference on Medical Imaging and Computer-Aided Diagnosis (MICAD 2022)

  38. Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Res 15(56):1929–1958

    MathSciNet  Google Scholar 

  39. Stappen L, Baird A, Schumann L, Bjorn S (2021) The multimodal sentiment analysis in car reviews (muse-car) dataset: Collection, insights and improvements. arXiv:2101.06053

  40. Stappen L, Meßner EM, Cambria E, Zhao G, Schuller BW (2021) Muse 2021 challenge: Multimodal emotion, sentiment, physiological-emotion, and stress detection. In: Proceedings of the 29th ACM International conference on multimedia, pp 5706–5707

  41. Tokuno S, Tsumatori G, Shono S, Takei E, Yamamoto T, Suzuki G, Mituyoshi S, Shimura M (2011) Usage of emotion recognition in military health care. In: 2011 Defense Science Research Conference and Expo (DSR), pp 1–5. IEEE

  42. Ullah S, Halim Z (2021) Imagined character recognition through eeg signals using deep convolutional neural network. Med Biol Eng Comput 59(5):1167–1183

    Article  Google Scholar 

  43. Yang L, Sahli H, Xia X, Pei E, Oveneke MC, Jiang D (2017) Hybrid depression classification and estimation from audio video and text information. In: Proceedings of the 7th annual workshop on audio/visual emotion challenge, pp 45–51

  44. Zhang K, Zhang Z, Li Z, Qiao Y (2016) Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Process Lett 23(10):1499–1503

    Article  Google Scholar 

  45. Zhuang X, Rozgić V, Crystal M, Marx BP (2014) Improving speech-based ptsd detection via multi-view learning. In: 2014 IEEE Spoken Language Technology Workshop (SLT), pp 260–265. IEEE

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alice Othmani.

Ethics declarations

Not applicable

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Moctar Abdoul Latif Sawadogo and Furkan Pala are contributed equally to this work.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sawadogo, M.A.L., Pala, F., Singh, G. et al. PTSD in the wild: a video database for studying post-traumatic stress disorder recognition in unconstrained environments. Multimed Tools Appl 83, 42861–42883 (2024). https://doi.org/10.1007/s11042-023-17203-x

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-17203-x

Keywords

Navigation