Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/2696704.2696758guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Mining for Marks: A Comparison of Classification Algorithms when Predicting Academic Performance to Identify "Students at Risk"

Published: 18 December 2013 Publication History

Abstract

A major concern for higher education institutions is the high failure and drop-out rates amongst students, especially first year students. Tertiary institutions thus have a common interest in identifying students at risk of failing or dropping out. Previous research studies have identified factors that influence success/failure which include, but are not limited to, the students' personal information, academic background and social environment. This study aims to use the emerging field of Educational Data Mining as a preventative measure rather than reiterate factors that influence success. The first year student data collected and stored in the School of Computer Science at the University of the Witwatersrand has been utilised in this study. The study used the students' first semester/midyear mark to predict success/failure at the end of the academic year. This will assist in identifying students at risk of failing and could assist with early intervention. A modified version of the CRISP-DM methodology was used. The investigation was broken down into two phases: training and test phase. In the training phase, student data from the years 2009 to 2011 were modelled using the WEKA Explorer GUI. Three classifiers: J48 classifier, Naïve Bayes and Decision Table, were used for modelling and were also compared. Using both the run information from WEKA and performance metrics, the J48 classifier was shown to be the better performing algorithm in the training phase. This algorithm was then integrated into the back-end of the Success Or Failure Determiner (SOFD) tool, which was created specifically for this study. In the test phase 92% of the instances were predicted correctly. Furthermore 23 of the 25 students who failed were flagged. The research findings indicated that the midyear mark can be considered as a factor which correctly predicts the Computer Science I final year marks. After further investigation with larger sample sizes, the tool can be used practically in the school of Computer Science to identify students at risk of failing.

References

[1]
Bhullar, M.S., Kaur, A.: Use of Data Mining in Education Sector. Lecture Notes in Engineering and Computer Science, vol.ä2200, pp. 513—516 (2012)
[2]
Butcher, D.F., Muth, W.A.: Perdicting performance in an introductory computer science course. ACMä28(3) (1985)
[3]
Campbell, P., McCabe, G.: Predicting the success of freshmen in a computer science major. Commun. ACMä27(11), 1108—1113 (1984), http://doi.acm.org/10.1145/1968.358288
[4]
Chandra, E., Nandhini, K.: Predicting student performance using classification techniques. In: Proceedings of SPIT-IEEE Clloquium and International Conference
[5]
Delavari, N., Phon-Amnuaisuk, S., Beikzadeh, M.: Data mining application in higher learning institutions. International Journal of Informatics in Educationä7(1), 31—54 (2008)
[6]
Durant, K.T., Smith, M.D.: Predicting unix commands using decision tables and decision trees. In: Proceedings of the Third International Conference on Data Mining, pp. 427—436 (September 2004)
[7]
Fraser, W.J., Killen, R.: Factors influencing academic success or failure of first-year and senior university students: do education students and lecturers perceive things differently. South African Journal of Educationä23(4), 254—260
[8]
Garcia-Saiz, D., Zorrilla, M.: Comparing classification methods for predicting distance students performance. In: JMLR: Workshop and Conference Proceedings 17, 2nd Workshop on Applications of Pattern Analysis 2011, pp. 26—32 (2011)
[9]
Kumar, V., Rathee, N.: Knowledge discovery from database using an integration of clustering and classification. IJACSA - International Journal of Advanced Computer Science and Applicationsä2(3), 29—33 (2011)
[10]
Panday, U.K., Pal, S.: Data Mining: A prediction of performer or underperformer using classification. International Journal of Computer Science and Information Technologiesä2, 686—690 (2011)
[11]
Landis, J.R., Koch, G.G.: The measurement of observer agreement for categorical data. Biometricsä33(1), 159—174 (1977), http://www.jstor.org/stable/2529310
[12]
Naik, N., Purohit, S.: Article: Prediction of final result and placement of students using classification algorithm. International Journal of Computer Applicationsä56(12), 35—40 (2012), published by Foundation of Computer Science, New York, USA
[13]
O'Byrne, J., Britton, S., George, A., Franklin, S., Frey, A.: Using academic predictors to identify first year science students at risk of failing. CAL-laborate Internationalä17 (2009)
[14]
Osmanbegović, E., Suljić, M.: Data mining approach for predicting student performance. Economic Reviewä10(1) (2012)
[15]
Riesenfeld, R.: Bayes' Theorem (2011), http://www.eng.utah.edu/~cs5961/Resources/bayes.pdf
[16]
Rauchas, S., Rosman, B., Konidaris, G.: Language performance at high school and success in first year computer science. SIGCSE 2006 (2006)
[17]
Obsivac, T., Popelinsky, L., Bydzovska, J.B.J.G., H.: Predicting drop-out from social behaviour of students, p. 103
[18]
Turner, E.H., Turner, R.M.: Teaching entering students to think like computer scientists. SIGCSE (2005)
[19]
Wimshurst, K.J., Wortley, R.K.: Academic success and failure: Student characteristics and broader implications for research in higher education. In: Effective Teaching and Learning. Griffith Institute for Higher Education (2005)
[20]
Yadav, S., Pal, S.: Data mining: A prediction for performance improvement of engineering students using classification. World of Computer Science and Information Technology Journal (WCSIT)ä2(2), 51—56 (2012)

Cited By

View all
  • (2018)Predicting academic performance: a systematic literature reviewProceedings Companion of the 23rd Annual ACM Conference on Innovation and Technology in Computer Science Education10.1145/3293881.3295783(175-199)Online publication date: 2-Jul-2018
  • (2015)Predicting Student Performance in Distance Higher Education Using Semi-supervised TechniquesProceedings of the 5th International Conference on Model and Data Engineering - Volume 934410.1007/978-3-319-23781-7_21(259-270)Online publication date: 26-Sep-2015
  1. Mining for Marks: A Comparison of Classification Algorithms when Predicting Academic Performance to Identify "Students at Risk"

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image Guide Proceedings
      MIKE 2013: Proceedings of the First International Conference on Mining Intelligence and Knowledge Exploration - Volume 8284
      December 2013
      842 pages
      ISBN:9783319038438
      • Editors:
      • Rajendra Prasath,
      • T. Kathirvalavakumar

      Publisher

      Springer-Verlag

      Berlin, Heidelberg

      Publication History

      Published: 18 December 2013

      Author Tags

      1. Decision Table
      2. Educational Data Mining
      3. GUI
      4. J48 Classifier
      5. Naïve Bayes
      6. WEKA

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 22 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2018)Predicting academic performance: a systematic literature reviewProceedings Companion of the 23rd Annual ACM Conference on Innovation and Technology in Computer Science Education10.1145/3293881.3295783(175-199)Online publication date: 2-Jul-2018
      • (2015)Predicting Student Performance in Distance Higher Education Using Semi-supervised TechniquesProceedings of the 5th International Conference on Model and Data Engineering - Volume 934410.1007/978-3-319-23781-7_21(259-270)Online publication date: 26-Sep-2015

      View Options

      View options

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media