Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3696952.3697003acmotherconferencesArticle/Chapter ViewFull TextPublication PagesiciipConference Proceedingsconference-collections
research-article
Open access

Research on Data Mining Algorithms for Vocational Education Based on Student Behaviour Analysis

Published: 21 November 2024 Publication History

Abstract

In the era of big data and information, the traditional management of student behaviour is increasingly showing the disadvantages of untimely intervention and "backwardness" of the governance model. Nowadays, by applying education big data in the analysis and monitoring of students' daily behaviours, administrators are able to proactively grasp the characteristics and patterns of students' behaviours and make research judgments accordingly. With the development of information management systems in colleges and universities, it has become easier and more convenient to collect and analyse students' behavioural data, and extracting useful features from these behavioural patterns is beneficial to the understanding of students' learning process, and is also an important factor reflecting their learning styles and living habits at school. This paper describes the modeling of data mining algorithms for vocational education based on student behavioural analysis, extracts traditional features of vocational education, and is oriented towards modelling campus behavioural sequences to study the prediction of student performance.

1 Introduction

Data mining appeared in the late 1980s, and has developed rapidly in the field of artificial intelligence in the last decade, and has become a major hotspot in various industries in the era of big data. The so-called data mining technology refers to the non-trivial process of revealing implicit, noisy, random, previously unknown and potentially valuable information from large databases. It helps decision makers to analyse historical data as well as current data through data filtering and data preprocessing, highly automated analysis of original data and inductive reasoning. In the research component of this paper, data mining techniques are used to explore the patterns and patterns of students' daily behaviours, and through the study of students' campus behavioural data as a means of exploring and classifying and predicting the factors that correlate students' behaviours with their performance in vocational education.

2 Modelling of data mining algorithms for vocational education based on student behaviour analysis

A recurrent neural network captures features of an input sequence by recursively updating its internal hidden state and can handle sequences of varying lengths. Questions for recurrent neural networks: what are the components of its specific structure; how does it maintain its long-term memory capability; what is the mechanism of the backpropagation over time algorithm; how does the long-term dependency problem arise; and what are the different processing modes of a recurrent neural network depending on the goal of the task, as shown in Figure 1.
Figure 1:
Figure 1: Recurrent neural network diagram

3 Extraction of traditional features of vocational education based on student behavioural analysis

3.1 Data collection and cleansing

The data used in this paper comes from the open-source dataset of a third-party platform, which consists of data on the behaviour of students using campus one-card swipe cards in two academic years in X university and data on grade ranking in the teaching management system. Specifically, it includes book borrowing data, one-card data, dormitory access data, library access data, and student achievement data. The student information in the dataset has been anonymised through data desensitisation.

3.2 Student Behavioural Characteristics Extraction

In this paper, we study students' campus card behavioural data based on learning and summarising previous experiences to understand that the side-by-side assessment of students' performance rankings related to behaviour is multifaceted. Firstly, statistical analysis methods are used to select the behavioural attributes related to performance ranking. In this paper, 18 features are manually extracted, and based on the existing machine learning algorithms, the most suitable parameters are searched for to form the optimal state for predicting and evaluating the performance of students with different ranking grades. The feature fields about Studying Habits (SH), Living Habits (LH) and ConsumptionHabits (CH) are shown in Table 1:
Table 1:
Study Habits SH (1-8)Life Habits LH (9-15)Consumption habits CH (16-18)
borrow book dailyearly dormave Canteen
borrow book testlate dormave Market
late librarystay In dorm hourave Water
early libraryshower weekly 
stay in lab hourearly breakfast 
library testprint center daily 
library dailyprint center test 
pos statistics  
Table 1: Types of features extracted by traditional methods
The study habits of college students are formed over a long period of time, and their tendencies and behaviours cannot be easily changed with external factors. Characteristics 1-8 represent the number of times students borrowed books during non-exam time, the number of times they borrowed books during exam time, the number of times they entered the library earlier than 8 o'clock, the number of times they left the library later than 22 o'clock, the average daily stay in the library, the number of times they went to the library during non-exam time, the number of times they went to the library during exam time, and the number of times they spent money on the POS during class time, which were categorised as study habits. Good and regular habits of living behaviour are beneficial to academic performance, and the regularity of these habits is closely related to students' ability of self-control and self-discipline. In this paper, we choose indicators such as the average number of times per day to fetch water and dormitory stay time as the characteristics of living habits. Characteristics 9-15 represent the number of times students leave the dormitory earlier than 8:00, the number of times they enter the dormitory later than 22:00, the average daily stay in the dormitory, the number of weekly showers, the number of times they eat breakfast earlier than 8:00, the number of times they go to the copy centre in non-examination time periods, and the number of times they go to the copy centre in examination time periods, respectively. There are differences in the amount of campus card spending among college students with different academic achievements, reflecting the different consumption needs and consumption psychology of college students. Characteristics 16-18 represent the average daily consumption in the canteen, the average daily consumption in the supermarket and the average daily number of times to fetch water, respectively.

3.3 Analysis of student behavioural data

In order to better interpret the behavioural data and to have a more comprehensive understanding of the data in advance, this paper firstly classifies the students' performance ranking into three categories, i.e. good, fair and poor, based on the normal distribution. The labels were set as "1", accounting for 19.84% of the total number of students, "2", accounting for 60.02% of the total number of students, and "3", accounting for 20.14% of the total number of students, respectively. 20.14 per cent. The campus behavioural and ranking data were then analysed to convert the raw behavioural data into behavioural characteristics related to academic performance. This paper compares the differences in the number of visits to the library, books borrowed and visits to the literature and printing centre between exam and non-exam periods for the three categories of students. Library borrowing data were collected from students of all colleges over a period of two academic years. The results show that the average number of books borrowed by the students of category I is 52, the average number of books borrowed by the students of category II is 48 and the average number of books borrowed by the students of category III is 42. The behaviour of studying and borrowing books in the library decreases in the order of student category. Category 1 students are most equipped with good study and borrowing habits, while Category 3 students have relatively few of the above three behaviours and lack practical action in their daily campus routine.
In order to demonstrate the correlation between students' behavioural traits and academic performance, this paper further uses the statistical methods of factor analysis and principal component analysis to explain the correlation. Firstly, factor analysis is used to extract the common factors, and the factors affecting academic performance are rationalised by rotating the component matrix to explore the weight of each factor and common factor on the impact of performance. The basic structure of the observed data was explored by examining the internal dependencies of students' behavioural characteristics and several hypothetical variables were used to represent their basic data structure.
Before factor analysis, the selected characteristics were tested using KMO and Bartlett's test.KMO test was used to check the correlation and partial correlation between variables.The closer the KMO statistic is to 1, the stronger the correlation between variables and the weaker the partial correlation, the better the factor analysis. KMO result of this research is 0.720 (>0.6), which is suitable for factor analysis.The Bartlett's spherical test determines that if the correlation array is a unit array, the independent factor analysis method for each variable is invalid. The results showed that the approximate chi-square value of Bartlett's test was 47483.939 and the concomitant probability value of Sig. < 0.05 reached the significance level. It indicates that the correlation coefficient matrix of the factors is not a unit matrix and there is correlation among the variables. Therefore, based on the results of the above analysis, the original variables are suitable for factor analysis.
Principal component analysis is to find out the independent composite indicators that reflect multiple variables and how the internal structure between multiple variables can be revealed by several principal components. Using principal component analysis to examine the correlation between multiple variables and the predictive function for 18 factors affecting student achievement rankings, as shown in Tables 3, the first seven factors were able to explain 69.942% of the overall variance, effectively reflecting the overall information and having a significant relationship with academic achievement.
Table 2:
ComponentInitial eigenvaluesExtraction sums of squared loadingRotation sums of squared
Total% ofVarianceCumulative %Total% of VarianceCumulative %Total% of VarianceCumulative %
14.30723.92923.9294.30723.92923.9293.41218.95518.955
21.90610.58734.5161.90610.58734.5161.7799.88128.836
31.7029.45743.9731.7029.45743.9731.7519.73038.566
41.3907.72251.6951.3907.72251.6951.6439.12747.693
51.1686.48758.1831.1686.48758.1831.5528.62356.316
61.1076.15064.3321.1076.15064.3321.3067.25563.571
71.0105.61069.9421.0105.61069.9421.1476.37169.942
80.9055.02674.969      
90.8814.89379.861      
100.7494.15984.020      
110.7093.94287.962      
120.4752.63690.598      
130.4312.39592.993      
140.3662.03495.026      
150.3021.68096.706      
160.2771.53798.243      
170.1881.04299.286      
180.1290.714100.000      
Table 2: Total variance explained
Campus card swiping behaviour belongs to students' spontaneous behaviour and is also an important factor reflecting students' academic performance. Due to the individual differences of each student and the diversity and complexity of environmental factors, it is not possible to generalise and explain the model by extracting campus card swiping behavioural characteristics alone. Behavioural indicators affecting academic ranking can be discussed in the context of student performance classification, and data mining algorithms can be used to explore the practical value of these behavioural data in management, teaching and learning.

4 A Study of Grade Prediction for Campus Behavioural Sequence Modelling

4.1 Attention-based feature extraction for short-term behavioural sequences

In the first stage of the sequence-based performance prediction task, this paper uses a hybrid attention-based encoder-decoder model as the first classifier to learn sequential features of student behaviour. The inputs of the top and bottom encoders\(x = [ {{x}_1,{x}_2, \ldots ,{x}_{t - 1},{x}_t} ]\) are the same data of students' historical behaviours (e.g., x1 for "cafeteria" and xt for "shower"), while the outputs are the implicit representations of the students' sequential behavioural features hi . Notice that\(h_t^g\) and \(h_t^l\) play different roles, but they have the same value. The upper encoder encodes the entire initial sequence information through the last hidden layer state\(h_t^g\), while the lower encoder computes the attentional weights with the previous hidden states through the last hidden state\(h_t^l\), which then unifies the input and output high-dimensional vectors.
The encoder of the basic sequence is connected to the output of the attention-based sequence encoder to be fed to the sequence feature generator. Both use GRU as the basic network unit. This is because the GRU not only maintains important features in short-term propagation, but also solves the gradient vanishing problem well. The specific steps of the algorithm are, for each behavioural sequence xi, the GRU takes the historical campus behavioural sequences as inputs, and the output is a linear transformation between the previous implied state ht-1 and the candidate implied state\(h_t^{'}\) . This process can be expressed as:
\begin{equation} {z}_t = \sigma \left( {{W}^{\left( z \right)}{x}_t + {U}^{\left( z \right)}{h}_{t - 1}} \right) \end{equation}
(1)
\begin{equation} {r}_t = \sigma \left( {{W}^{\left( r \right)}{x}_t + {U}^{\left( r \right)}{h}_{t - 1}} \right) \end{equation}
(2)
\begin{equation} h_t^{'} = \tan \left( {W{x}_t + {r}_t \odot U{h}_{t - 1}} \right) \end{equation}
(3)
\begin{equation} {h}_t = {z}_t \odot {h}_{t - 1} + (1 - {z}_t) \odot h_t^{'} \end{equation}
(4)
Where ht-1 and \(h_t^{'}\) represent the previous hidden state and current hidden state, respectively. Equations 3-1 to 3-4 represent update gates, reset gates, new network units and hidden states, respectively. In particular, the update gate zt controls how much information from the previously hidden state ht-1 needs to be forgotten and how much information from the currently hidden state\(h_t^{'}\) needs to be remembered. The reset gate rt determines how much previously memorised information needs to be retained. The process interpolates linearly between the existing hidden states and the current hidden state, and the final hidden state of the encoder carries the information of the entire initial sequence. The final hidden state ht (later denoted as\(h_t^g\) ) is actually used as a representation of the student's behavioural sequence features, i.e. the basic sequence encoder.
\begin{equation} c_t^g = {h}^t = h_t^g \end{equation}
(5)
Noting that not all students' behaviours are related to their academic performance, the prediction is made with the hope that the SPC model will pay more attention to behavioural interactions that are related to their achievement performance. Therefore, this paper proposes a DEVICE-level attention mechanism to model this assumption, called Attention-Based Sequence Encoder.
\begin{equation} c_t^l = \sum\limits_{}^t {{\alpha }_{ti}{h}_i} \end{equation}
(6)
Where \(\sigma\) is the sigmoid function which converts ht and hi into potential space via a matrix. This function does not directly add up all the hidden states learned by the RNN network to represent the sequence of student behaviours. Rather, a weighting factor\({\alpha }_{ti}\) is used to indicate which hidden state/interaction is important to the encoder. Then, the weighted sum of the above hidden states is computed to represent the student's behavioural sequence features. To better understand \(c_t^l\), hi can also be represented as the last hidden state hi (i.e.,\(h_t^l\) ) at time step t. Therefore,\(c_t^l\) is modified as,
\begin{equation} c_t^l = \sum\limits_{}^t {{\alpha }_{ti}} {h}_i = \sum\limits_{}^t {{\alpha }_{ti}} h_t^l \end{equation}
(7)
The \(h_t^g\)are incorporated into\(c_t^g\), and \(h_t^l\) and \({\alpha }_{ti}\) are incorporated into\(c_t^l\), which together provide a behavioural sequence representation for the SPC model. Specifically, the last hidden state of the sequence-based encoder\(h_t^g\) is responsible for encoding the entire sequence behaviour, while the attention-based sequence encoder\(h_t^l\) is responsible for computing the attention weights of the previous hidden state. With this hybrid scheme, the basic sequence encoder and the attention-based sequence encoder can eventually be modelled as a unified representation ct, i.e., the sequence feature generator, which is represented as a splice of the vectors\(c_t^g\) and \(c_t^l\),
\begin{equation} {c}_t = \left[ {c_t^g;c_t^l} \right] = \left[ {h_t^g;\sum\limits_{}^t {{\alpha }_{ti}h_t^l} } \right] \end{equation}
(8)
For better student performance prediction, this paper applies a bilinear decoding mechanism with selectivity between the current sequence implicit representation and each campus card device to compute the similarity score Si .
\begin{equation} {S}_i = emb_i^mT{c}_t \end{equation}
(9)
Where T is a\(| D | * | H |\) matrix,\(| D |\) denotes the dimension of each campus card device embedding for mapping each behavioural vector to a low-dimensional space, and \(| H |\) is the dimension of the sequence representation. The similarity score of each campus card end device is then fed to the softmax layer to obtain the probability of a deep behavioural feature (denoted as F afterwards) for model decoding. For the sequence-based prediction task, the basic sequence encoder summarises the behaviours produced by the whole student each week, whereas the attention-based sequence encoder can adaptively select relevant behaviours to capture the student's main intention, which can focus on the most recent actions performed by the student. Therefore, a representation of the sequence behaviours and the previous hidden state is used to calculate the attentional weights for each occurring behaviour. The sequence behavioural features are then combined with the student's main intention features to form an extended representation of each timestamp, which allows for the learning of deeply periodic sequence features from the student's behaviour.

4.2 Classification model for predicting performance based on student behaviour

In the linearly separable case, the support vector classifier tries to find an optimal classification hyperplane\({W}^T \cdot x + b = 0\) to maximise the separation interval. To find this hyperplane, the following quadratic programming problem needs to be solved,
\begin{equation} \begin{array}{@{}l@{}} \min \Phi (w) = \frac{1}{2}{\left\| w \right\|}^2\\ s.t.{y}_i\left[ {\left( {{w}^T \cdot {x}_i + b} \right) - 1} \right] \ge 0,i = 1,2, \ldots ,n \end{array} \end{equation}
(10)
Where w is the normal vector, b is the bias term, and x is denoted as the identity. The solution of the above quadratic programming problem by the Lagrangian duality,
\begin{equation} \min L(w,b,a) = \frac{1}{2}{\left\| w \right\|}^2 - \sum\limits_{i = 1}^t {{\alpha }_i} \left[ {{y}_i\left( {{w}^T \cdot {x}_i + b} \right) - 1} \right] \end{equation}
(11)
This equation is the original problem, and its dual problem is obtained from the differential formula and the simplification of the relationship between w and \(\alpha\), b and \(\alpha\) as,
\begin{equation} \begin{array}{@{}l@{}} \max W(\alpha ) = \sum\limits_{i = 1}^t {{\alpha }_i} - \frac{1}{2}\sum\limits_{i,j = 1}^t {{\alpha }_i} {\alpha }_j{y}_i{y}_jx_i^T{x}_j\\ s.t.\sum\limits_{i = 1}^t {{\alpha }_i} {y}_i{\alpha }_i \ge 0,i = 1,2, \cdots ,l \end{array} \end{equation}
(12)
The solution steps are similar to the linearly divisible case, where \(C\sum\limits_{}^l {{\xi }_i}\) is the penalty term. The classification hyperplane at this point is. \(f(x) = \sum\limits_{}^l {\alpha _i^ * } {y}_iK( {{x}_i,{x}_j} ) + {b}^ *\) The nature of the kernel function corresponding to the inner product of the high-dimensional space is exploited to enable the linear classifier to implicitly build the classification plane of the high-dimensional space. The support vector machine constructs the optimal segmentation hyperplane in the feature space based on the theory of structural risk minimisation, which allows the learner to be globally optimised and the expected risk of the entire sample space to satisfy some upper bound with a certain probability.
In solving the multi-classification problem, this paper chooses a one-to-one approach, i.e., classifying each two categories separately.The input to the SVM is the complete sequence of behavioural features formed in the HRNN network model\(F = [ {{f}_1,{f}_2,...,{f}_{t - 1},{f}_t,} ]\), and the output is the real student's performance ranking classification\(y \in \{ {1,2,3} \}\) . Let\(( {{f}_i,{y}_i} )\) be the sample set and y denote the category. The academic performance of the students is classified into three main categories, students belonging to the first category A rank account for about 20% of the total number of students, students belonging to the second category B rank account for about 60% of the total number of people, and the last category C rank accounts for about 20% of the total number of people. Therefore, the method can be used to consider the prediction of students' academic performance as a short-term sequence modelling problem. If weekly behavioural data of a target student is provided, the student's academic performance can be predicted based on the SPC two-stage classifier in order to identify students with learning crisis in time.

5 Conclusion

With the development of computer hardware technology, the application of data mining techniques in education is an emerging interdisciplinary research area. The basic purpose of educational data mining is to indicate students' academic performance and evaluate learners. The characteristics of students' daily behaviour on campus contain rich information, and this paper explores the relationship between sequences of students' behaviours recorded from digital forms and their performance based on the sequence modelling perspective of deep learning. The results show that combining recurrent neural networks with data mining can be more effective in accurately classifying academic performance.

Acknowledgments

This research is funded by the Opening Foundation of Key Lab of Intelligent Optimization and Information Processing, Minnan Normal University (NO. ZNYH202402) and Construction of Labor Education System for Applied Universities, Wuhan Education & Science Planning project (No. 2022C145).

References

[1]
Zhou Limei, Wang Chunyan. 2024. Research on User Behaviour Data Mining and Consumption Behaviour Prediction on E-commerce Platform [J]. Old Brand Marketing, (09): 18-20.
[2]
Jia Yanling, Yang Liu, Song Zhiyang. 2024. Application of data mining in the use of big data in libraries [J]. Science and Technology Information, 22 (06): 224-226.
[3]
Xie Hongtao, Wang Yang. 2024. A study on the use of data mining technology in power data analysis [J]. China Information Community,(01): 102-105.
[4]
Mengna Lu. 2024. User behaviour analysis and data mining in intelligent lighting system [J]. China Lighting Appliance, (02): 69-71.
[5]
Yilong Ruan, Hongjun Zhang. 2024. A study of data mining techniques and their application in telecommunication industry [J]. Software, 45 (01): 13-17.
[6]
Li Yi, Zhao Yuanyuan. 2023. Design of learning behaviour analysis and performance prediction system for higher vocational students [J]. Journal of Shijiazhuang Institute of Vocational Technology, 35 (06): 23-28.
[7]
Sun Lin. 2023. Exploration of College Students' Online Learning Behaviour Supported by Data Mining Technology [J]. Information and Computer (Theoretical Edition), 35 (24): 184-186.
[8]
Xuhao Hu, Chenghao Han. 2023. Research on student behaviour analysis and early warning mechanism based on campus big data [J]. Information Record Material, 24 (09): 65-68.
[9]
Wei M, Xu H S, Tang H, Bai Y Shuai. 2023. Characterisation of online learning behaviour based on data mining [J]. Journal of Mianyang Normal College, 42 (08): 97-104.
[10]
Wang Jian, Chen Kaiquan. 2023. Learning Behaviour Analysis and Teaching Implications for College Students Based on Educational Data Mining Techniques–A Case Study of "Advanced Mathematics" Course in a University [J]. Digital Education, 9 (03): 41-48.

Index Terms

  1. Research on Data Mining Algorithms for Vocational Education Based on Student Behaviour Analysis
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICIIP '24: Proceedings of the 2024 9th International Conference on Intelligent Information Processing
    November 2024
    419 pages
    ISBN:9798400718076
    DOI:10.1145/3696952

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 21 November 2024

    Check for updates

    Author Tags

    1. Student behaviour analysis
    2. data mining
    3. vocational education

    Qualifiers

    • Research-article

    Conference

    ICIIP 2024

    Acceptance Rates

    Overall Acceptance Rate 87 of 367 submissions, 24%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 6
      Total Downloads
    • Downloads (Last 12 months)6
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 18 Nov 2024

    Other Metrics

    Citations

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media