research-article

Classification of Discussions in MOOC Forums: An Incremental Modeling Approach

Authors:

Anastasios Ntourmas,

Yannis Dimitriadis,

Sophia Daskalaki,

Nikolaos AvourisAuthors Info & Claims

L@S '21: Proceedings of the Eighth ACM Conference on Learning @ Scale

Pages 183 - 194

https://doi.org/10.1145/3430895.3460137

Published: 08 June 2021 Publication History

Get Access

Abstract

Supervised classification models are commonly used for classifying discussions in a MOOC forum. In most cases these models require a tedious process for manual labeling the forum messages as training data. So, new methods are needed to reduce the human effort necessary for the preparation of such training datasets. In this study we follow an incremental approach in order to examine how soon after the beginning of a new course, we have collected enough data for training a supervised classification model. We show that by employing features that derive from a seeded topic modeling method, we achieve classifiers with reliable performance early enough in the course life, thus reducing significantly the human effort. The content of the MOOC platform is used to bias the topic extraction towards discussions related to (a) course content, (b) logistics, or (c) social interactions. Then, we develop a supervised model at the start of each week based on the topic features of all previous weeks and evaluate its performance in classifying the discussions for the rest of the course. Our approach was implemented in three different MOOCs of different subjects and different sizes. The findings reveal that supervised models are able to perform reliably quite early in a MOOC's life and retain a steady overall accuracy across the remaining weeks, without requiring to be trained with the entire forum dataset.

Supplementary Material

MP4 File (L-at-S21-lsfp034.mp4)

In this video we present the study "Classification of Discussions in MOOC Forums: an Incremental Modeling Approach". In this study we address the need for new methods that are needed to reduce the human effort necessary for the preparation of training datasets in supervised classification tasks for MOOC forum discussions. We follow an incremental approach in order to examine how soon after the beginning of a course, we have collected enough data for training a supervised classifier. We show that by employing features that derive from a seeded topic modeling method biased by the content of the MOOC platform, we achieve a reliable performance early enough in the course life, thus reducing significantly the human effort. Our approach was implemented in three MOOCs of different subjects. The findings reveal that supervised models are able to perform reliably quite early in a MOOC?s life and retain a steady overall accuracy across the remaining weeks, without requiring to be trained with the entire forum dataset.

Download
269.28 MB

References

[1]

Melody M. Terras and Judith Ramsay. 2015. Massive open online courses (MOOCs): Insights and challenges from a psychological perspective. British Journal of Educational Technology 46, 3 (2015), 472--487.

Abstract

Supplementary Material

References

Cited By

Index Terms

Recommendations

A Framework for Topic Generation and Labeling from MOOC Discussions

Superposter behavior in MOOC forums

Untangling chaos in discussion forums: A temporal analysis of topic-relevant forum posts in MOOCs

Comments

Information

Published In

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations