Does Informativeness Matter? Active Learning for Educational Dialogue Act Classification

Wei Tan¹²,
Jionghao Lin^12,13,
David Lang¹⁴,
Guanliang Chen¹²,
Dragan Gašević¹²,
Lan Du¹² &
…
Wray Buntine^12,15

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13916))

Included in the following conference series:

International Conference on Artificial Intelligence in Education

4210 Accesses
4 Citations

Abstract

Dialogue Acts (DAs) can be used to explain what expert tutors do and what students know during the tutoring process. Most empirical studies adopt the random sampling method to obtain sentence samples for manual annotation of DAs, which are then used to train DA classifiers. However, these studies have paid little attention to sample informativeness, which can reflect the information quantity of the selected samples and inform the extent to which a classifier can learn patterns. Notably, the informativeness level may vary among the samples and the classifier might only need a small amount of low informative samples to learn the patterns. Random sampling may overlook sample informativeness, which consumes human labelling costs and contributes less to training the classifiers. As an alternative, researchers suggest employing statistical sampling methods of Active Learning (AL) to identify the informative samples for training the classifiers. However, the use of AL methods in educational DA classification tasks is under-explored. In this paper, we examine the informativeness of annotated sentence samples. Then, the study investigates how the AL methods can select informative samples to support DA classifiers in the AL sampling process. The results reveal that most annotated sentences present low informativeness in the training dataset and the patterns of these sentences can be easily captured by the DA classifier. We also demonstrate how AL methods can reduce the cost of manual annotation in the AL sampling process.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://github.com/allenai/cartography.
2.
Due to the space limit, we only present a part of the analysis results, and the full results can be accessed at https://github.com/jionghaolin/INFO.
3.
Due to the space limit, we documented our full results in a digital appendix, which is accessible via https://github.com/jionghaolin/INFO.

References

Boyer, K., Ha, E.Y., Phillips, R., Wallis, M., Vouk, M., Lester, J.: Dialogue act modeling in a complex task-oriented domain. In: Proceedings of the SIGDIAL 2010 Conference, pp. 297–305 (2010)
Google Scholar
Clark, K., Luong, M.T., Le, Q.V., Manning, C.D.: Electra: pre-training text encoders as discriminators rather than generators. In: ICLR (2019)
Google Scholar
D’Mello, S., Olney, A., Person, N.: Mining collaborative patterns in tutorial dialogues. J. Educ. Data Min. 2(1), 1–37 (2010)
Google Scholar
Du, B., et al.: Exploring representativeness and informativeness for active learning. IEEE Trans. Cybern. 47(1), 14–26 (2015)
Article Google Scholar
Du Boulay, B., Luckin, R.: Modelling human teaching tactics and strategies for tutoring systems: 14 years on. IJAIED 26(1), 393–404 (2016)
Google Scholar
Ezen-Can, A., Boyer, K.E.: Understanding student language: an unsupervised dialogue act classification approach. JEDM 7(1), 51–78 (2015)
Google Scholar
Ezen-Can, A., Grafsgaard, J.F., Lester, J.C., Boyer, K.E.: Classifying student dialogue acts with multimodal learning analytics. In: Proceedings of the Fifth LAK, pp. 280–289 (2015)
Google Scholar
Hastings, P., Hughes, S., Britt, M.A.: Active learning for improving machine learning of student explanatory essays. In: Penstein Rosé, C., et al. (eds.) AIED 2018. LNCS (LNAI), vol. 10947, pp. 140–153. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93843-1_11
Chapter Google Scholar
Hennessy, S., et al.: Developing a coding scheme for analysing classroom dialogue across educational contexts. Learn. Cult. Soc. Interact. 9, 16–44 (2016)
Article Google Scholar
Karamcheti, S., Krishna, R., Fei-Fei, L., Manning, C.D.: Mind your outliers! Investigating the negative impact of outliers on active learning for visual question answering. In: Proceedings of the 59th ACL, pp. 7265–7281 (2021)
Google Scholar
Karumbaiah, S., Lan, A., Nagpal, S., Baker, R.S., Botelho, A., Heffernan, N.: Using past data to warm start active machine learning: does context matter? In: LAK21: 11th LAK, pp. 151–160 (2021)
Google Scholar
Lin, J., et al.: Is it a good move? Mining effective tutoring strategies from human-human tutorial dialogues. Futur. Gener. Comput. Syst. 127, 194–207 (2022)
Article Google Scholar
Lin, J., et al.: Enhancing educational dialogue act classification with discourse context and sample informativeness. IEEE TLT (in press)
Google Scholar
Nye, B.D., Graesser, A.C., Hu, X.: Autotutor and family: a review of 17 years of natural language tutoring. IJAIED 24(4), 427–469 (2014)
Google Scholar
Nye, B.D., Morrison, D.M., Samei, B.: Automated session-quality assessment for human tutoring based on expert ratings of tutoring success. Int. Educ. Data Min. Soc. (2015)
Google Scholar
Rus, V., Maharjan, N., Banjade, R.: Dialogue act classification in human-to-human tutorial dialogues. In: Innovations in Smart Learning. LNET, pp. 183–186. Springer, Singapore (2017). https://doi.org/10.1007/978-981-10-2419-1_25
Chapter Google Scholar
Rus, V., et al.: An analysis of human tutors’ actions in tutorial dialogues. In: The Thirtieth International Flairs Conference (2017)
Google Scholar
Samei, B., Li, H., Keshtkar, F., Rus, V., Graesser, A.C.: Context-based speech act classification in intelligent tutoring systems. In: Trausan-Matu, S., Boyer, K.E., Crosby, M., Panourgia, K. (eds.) ITS 2014. LNCS, vol. 8474, pp. 236–241. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-07221-0_28
Chapter Google Scholar
Settles, B.: Active Learning. Synthesis Digital Library of Engineering and Computer Science. Morgan & Claypool, San Rafael (2012)
Google Scholar
Sha, L., Li, Y., Gasevic, D., Chen, G.: Bigger data or fairer data? Augmenting BERT via active sampling for educational text classification. In: Proceedings of the 29th COLING, pp. 1275–1285 (2022)
Google Scholar
Swayamdipta, S., et al.: Dataset cartography: mapping and diagnosing datasets with training dynamics. In: Proceedings of EMNLP. ACL, Online (2020)
Google Scholar
Tan, W., Du, L., Buntine, W.: Diversity enhanced active learning with strictly proper scoring rules. In: NeurIPS, vol. 34 (2021)
Google Scholar
Vail, A.K., Grafsgaard, J.F., Boyer, K.E., Wiebe, E.N., Lester, J.C.: Predicting learning from student affective response to tutor questions. In: Micarelli, A., Stamper, J., Panourgia, K. (eds.) ITS 2016. LNCS, vol. 9684, pp. 154–164. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-39583-8_15
Chapter Google Scholar
Vail, A.K., Boyer, K.E.: Identifying effective moves in tutoring: on the refinement of dialogue act annotation schemes. In: Trausan-Matu, S., Boyer, K.E., Crosby, M., Panourgia, K. (eds.) ITS 2014. LNCS, vol. 8474, pp. 199–209. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-07221-0_24
Chapter Google Scholar
Yang, Y., Loog, M.: Active learning using uncertainty information. In: Proceedings of 23rd International Conference on Pattern Recognition, pp. 2646–2651 (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

Monash University, Clayton, Australia
Wei Tan, Jionghao Lin, Guanliang Chen, Dragan Gašević, Lan Du & Wray Buntine
Carnegie Mellon University, Pittsburgh, USA
Jionghao Lin
Stanford University, Stanford, USA
David Lang
VinUniversity, Hanoi, Vietnam
Wray Buntine

Authors

Wei Tan
View author publications
You can also search for this author in PubMed Google Scholar
Jionghao Lin
View author publications
You can also search for this author in PubMed Google Scholar
David Lang
View author publications
You can also search for this author in PubMed Google Scholar
Guanliang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Dragan Gašević
View author publications
You can also search for this author in PubMed Google Scholar
Lan Du
View author publications
You can also search for this author in PubMed Google Scholar
Wray Buntine
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jionghao Lin .

Editor information

Editors and Affiliations

University of Southern California, Los Angeles, CA, USA
Ning Wang
University of British Columbia, Vancouver, BC, Canada
Genaro Rebolledo-Mendez
North Carolina State University, Raleigh, NC, USA
Noboru Matsuda
Despacho 3.01, UNED-Grupo de Investigación aDeNu, Madrid, Spain
Olga C. Santos
University of Leeds, Leeds, UK
Vania Dimitrova

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tan, W. et al. (2023). Does Informativeness Matter? Active Learning for Educational Dialogue Act Classification. In: Wang, N., Rebolledo-Mendez, G., Matsuda, N., Santos, O.C., Dimitrova, V. (eds) Artificial Intelligence in Education. AIED 2023. Lecture Notes in Computer Science(), vol 13916. Springer, Cham. https://doi.org/10.1007/978-3-031-36272-9_15

Download citation

DOI: https://doi.org/10.1007/978-3-031-36272-9_15
Published: 26 June 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-36271-2
Online ISBN: 978-3-031-36272-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics