Nothing Special   »   [go: up one dir, main page]

Next Article in Journal
Multi-Step Parking Demand Prediction Model Based on Multi-Graph Convolutional Transformer
Previous Article in Journal
Industrial Organizations Adapting to the Novel EU Taxonomy While Developing Socio-Technical Systems: A Literature Review
You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Research on Forecasting Sales of Pure Electric Vehicles in China Based on the Seasonal Autoregressive Integrated Moving Average–Gray Relational Analysis–Support Vector Regression Model

1
School of Economics and Management, Yanshan University, Qinhuangdao 066000, China
2
School of Vehicle and Energy, Yanshan University, Qinhuangdao 066004, China
*
Author to whom correspondence should be addressed.
Systems 2024, 12(11), 486; https://doi.org/10.3390/systems12110486
Submission received: 25 September 2024 / Revised: 4 November 2024 / Accepted: 11 November 2024 / Published: 13 November 2024
(This article belongs to the Topic Data-Driven Group Decision-Making)

Abstract

:
Aiming to address the complexity and challenges of predicting pure electric vehicle (EV) sales, this paper integrates a time series model, support vector machine and combined model to forecast EV sales in China. Firstly, a seasonal autoregressive integrated moving average (SARIMA) model was constructed using historical EV sales data, and the model was trained on sales statistics to obtain forecasting results. Secondly, variables that were highly correlated with sales were analyzed using gray relational analysis (GRA) and utilized as input parameters for the support vector regression (SVR) model, which was constructed to optimize sales predictions for EVs. Finally, a combined model incorporating different algorithms was verified against market sales data to explore the optimal sales prediction approach. The results indicate that the SARIMA-GRA-SVR model with the squared prediction error and inverse method achieved the best predictive performance, with MAPE, MAE and RMSE values of 12%, 1.45 and 2.08, respectively. This empirical study validates the effectiveness and superiority of the SARIMA-GRA-SVR model in forecasting EV sales.

1. Introduction

With the transformation of the global energy structure and the increasing awareness of environmental protection, EVs have experienced rapid development worldwide in recent years [1,2]. Particularly in China, driven by strong governmental support and the growing environmental consciousness among consumers, the market penetration of EVs has significantly increased. Data show that the EV sales surged from 409,000 units in 2016 to 6.657 million units in 2023, with the market share rising from 1.46% to 22.12%. China’s position in the global EV market has become increasingly prominent, with its sales accounting for over 60% of the global market in 2023, far exceeding the United States (approximately 10%) and Europe (approximately 20%). The Chinese government has implemented favorable policies, such as tax reductions and subsidies, which have substantially lowered the purchase costs for consumers and stimulated market demand [3,4]. Concurrently, technological advancements, particularly in battery, motor and charging technologies, have enhanced the driving range, charging efficiency and safety of EVs, further propelling market expansion. As zero-emission transportation options, EVs play a crucial role in reducing air pollution and mitigating greenhouse gas emissions, and their market demand is expected to continue growing [5,6]. The rapid development of China’s EV sales not only promotes the rise of the domestic EV industry but also alters the global EV market dynamics [7]. Through technological innovation, supply chain integration and policy guidance, China has continuously strengthened its influence in the global EV market, accelerated the global electrification process worldwide and provided strong support for the global goal of carbon neutrality [8]. With the international expansion of Chinese enterprises, the impact of China’s EV industry on the global market is expected to deepen further in the future [9]. However, the EV market remains in a phase of rapid development, influenced by factors such as macroeconomic conditions, policy directions and technological progress, which often exhibit complex nonlinear relationships, making sales forecasting highly challenging. Nonetheless, China’s position in the global EV market remains solid, consistently leading in terms of sales, technology, policy support and supply chain integration [10]. As the global market continues to expand, China will also face increasingly intense international competition. Therefore, accurately forecasting China’s EV sales is crucial for both companies and the government in formulating market strategies and optimizing resource allocation [11].
In the field of EV sales forecasting, despite the accumulation of predictive experiences related to traditional vehicle sales and the overall new energy vehicle market, both domestically and internationally, specialized research on the subcategory of EVs remains limited. Among the existing vehicle sales forecasting methods, the SARIMA model has demonstrated broad applicability due to its ability to comprehensively account for data trends, seasonal fluctuations and random disturbances, successfully uncovering the intrinsic patterns of sales dynamics. Researchers such as Gao [12], Ding [13], Chen [14] and Zhang [15] have proposed various forecasting models for new energy vehicle sales, including models based on structural relationship identification, SARIMA and Long Short-Term Memory (LSTM) hybrid models, ARIMA models as well as univariate and multivariate time series models. All of these approaches have shown strong performance in terms of forecasting accuracy and stability. Additionally, scholars have applied time series analysis to forecast sales in other industries, such as Li’s prediction of the CPI index [16], Li’s demand forecasting for pharmaceutical e-commerce platforms [17], Fantazzini’s forecast of automobile sales [18] and Nie’s prediction of GDP [19]. These studies have not only enriched the methodology of sales forecasting but also provided valuable support for market analysis and strategic planning by businesses.
However, given the complex and diverse factors influencing EV sales, relying solely on the SARIMA model may struggle to accurately capture the nonlinear and highly volatile characteristics of this market. To address this limitation, the SVR model has been introduced as a powerful supplementary tool. The SVR model achieves regression fitting for complex data patterns by constructing an optimal hyperplane, and its excellent nonlinear handling capability enables it to effectively address the complex variable relationships and uncertainties in EV sales forecasting, thus improving the model’s accuracy and robustness. Yao Jinhai successfully predicted stock market price fluctuations with high accuracy through the construction of a hybrid forecasting model that combines ARIMA with an information granulation-based SVR model [20]. Xu and others applied the SVR model to forecast the development potential of hydrogen fuel cell vehicles, improving prediction accuracy through parameter optimization [21]. Fan [22] and Liang [23] also utilized the SVR model to predict carbon market prices and short-term micro-tourism demand, both achieving promising results.
Sales data in the EV market may exhibit structural variations due to sudden policy shifts (such as tax adjustments or stricter environmental regulations) or technological innovations (such as breakthroughs in battery technology). Although advanced modeling techniques like LSTM and Artificial Neural Networks (ANNs) theoretically hold substantial potential, their practical application is limited by the need for large-scale datasets, showing limited advantages in environments with small or simple datasets [24]. Additionally, these models are complex, have extensive parameter distributions, require cumbersome training and involve challenging hyperparameter tuning. The interpretability and practical applicability of their predictions are also restricted, which hinders accurate forecasts for China’s EV sales. Thus, there is an urgent need to explore predictive methods that are effective with small datasets, highly interpretable and practically applicable to foster precise market forecasting and healthy development. Both the SARIMA and SVR models rely on historical data for modeling, making it challenging for a single model to quickly adapt to sudden changes, potentially distorting forecast accuracy. The SARIMA model excels at capturing linear and dynamic components in time series, while the SVR model is better suited to nonlinear relationships [25]. Since EV sales are influenced by various factors and may encompass linear, nonlinear, instantaneous and sporadic features, a single model cannot fully capture all patterns. Therefore, the SVR model and SARIMA model are combined to capture both linear and nonlinear characteristics in the data, so as to improve the prediction accuracy.
In addition, to address the challenges of data sparsity or varying quality in the forecasting process, GRA serves as a powerful tool for system analysis and decision support. It excels in revealing potential correlations and uncertainties among multiple factors under conditions of limited or low-quality data, providing a solid theoretical foundation for constructing and optimizing forecasting models. Tong and others developed a GRA method to analyze the correlation between multiple factors affecting new energy vehicle sales, offering a valuable tool for sales forecasting [26]. Yan and colleagues applied GRA to study the influencing factors of medical tourism development, identifying close relationships between medical conditions, tourism resources and tourism demand [27]. Xu utilized GRA to forecast new energy vehicle sales, achieving high prediction accuracy [28]. Therefore, incorporating GRA into the EV sales forecasting framework not only enhances the model’s adaptability to complex environments but also provides more scientific and comprehensive references for management decision-making.
In summary, the combined forecasting method incorporating SARIMA, SVR and GRA holds promise for improving prediction accuracy in the field of EV sales forecasting, thus providing strong support for industry planning, policy formulation and corporate strategy adjustments [29]. Accurate sales forecasting is particularly crucial for precise manufacturing and sales planning in the EV industry. On the one hand, accurate sales predictions enable automakers to minimize costs and time, optimize resources and enhance operational efficiency [30]. On the other hand, inaccurate forecasts may lead to negative consequences such as excess inventory, insufficient production supply, high labor costs and reputational damage [31,32]. Although previous studies have integrated different models, they have not fully combined influencing factors with the forecasting process. Therefore, this paper proposes a sales forecasting model based on SARIMA-GRA-SVR, aiming to improve the accuracy and reliability of EV sales forecasts in China.
In this study, we first conducted stationarity and seasonality tests on the time series data of EV sales. Based on the test results, a suitable SARIMA model was constructed to capture the time series characteristics and seasonal patterns of sales. The SARIMA model was trained using historical data to obtain preliminary sales forecasts. Next, GRA was employed to calculate the correlation between EV sales and various influencing factors. Based on the correlation results, the key factors affecting sales were identified, and those with a high correlation to sales were selected as input variables for the subsequent SVR model. Following this, an SVR model was constructed using the key influencing factors identified through GRA to predict EV sales, generating forecasts based on these influencing factors. Finally, the sales forecasts from the SARIMA model were combined with those from the GRA-SVR model in both serial and parallel configurations. The combined model was then evaluated using test set data, and based on the evaluation results, the model parameters were adjusted and optimized. The optimized model was applied to future real-world forecasting scenarios. A structural diagram of this paper’s contents is shown in Figure 1.
Based on the above research framework, this paper presents the following innovations:
(1)
This paper innovatively utilizes GRA to select the key factors influencing EV sales as input variables for the SVR model, enhancing the model’s adaptability to complex market environments.
(2)
A novel serial and parallel combination method of SARIMA and GRA-SVR models is proposed, integrating the forecasting strengths of different models and further improving the stability and accuracy of the prediction results.

2. Combined Model Theory

2.1. Data Sources

The data for this study are sourced from the China Association of Automobile Manufacturers, the China Electric Vehicle Charging Infrastructure Promotion Alliance, the National Bureau of Statistics, Qichacha, Huajing Intelligence Network, the Carbon Trading Network, the China Business Industry Research Institute and the Wind database. The sales data for EVs in China exhibit the following characteristics: consumers tend to be younger, the historical sales data span is relatively short and the volume of sales data is limited.

2.2. Combined Model Structure

In real-world time series, both linear and nonlinear information coexist. A combined forecasting dataset has a lower mean squared error than any single forecasting dataset [33]. Combined models are categorized into two types: parallel and serial.

2.2.1. Serial Combined Model Theory

A serial combined model divides the training data into linear and nonlinear components. Different models, each with distinct predictive strengths, are used to train these two parts separately. Finally, predictions from both models are combined to obtain the final forecast. Based on the principle of the serial combined model, this study assigns the linear part of the EV sales time series to the SARIMA model for forecasting, while the nonlinear part, or the residual, is predicted using the SVR model. The final forecast result is obtained by summing the two components (Figure 2).

2.2.2. Parallel Combined Model Theory

In a parallel combined model, weights are determined by calculating the prediction error of each individual model, and the weighted predictions are then aggregated to produce a final forecast that theoretically combines the strengths of multiple models for greater accuracy and robustness (Figure 3). The core of this approach is to reduce the bias and uncertainty that may arise from a single model by integrating multiple models, thereby enhancing prediction accuracy and reliability. In this study, the weights calculated from the error in the EV sales time series data are used to assign portions of the data to the SARIMA and SVR models, respectively. The final prediction of the parallel combined model is obtained by summing the results from both models. The specific calculation formula is as follows:
Y i = v 1 y i 1 + v 2 y i 2
In the above equation, v1 and v2 represent the weights of the SARIMA and SVR models, respectively, while yi1 and yi2 represent the prediction results of the SARIMA and SVR models, respectively.
Common methods for calculating weights include the simple weighting method, the reciprocal of the sum of squared prediction errors method, the reciprocal of the mean squared error method and the matrix advantage method [34].
(a)
Simple Weighting Method: The simple weighting method improves upon the equal weighting approach. The calculation formula is as follows:
v j = j j = 1 J j = 2 j J J + 1 , j = 1 , 2 , , J
where vj represents the weight of the j model, and J is the total number of models.
(b)
Reciprocal of the Sum of Squared Prediction Errors Method: The sum of squared prediction errors quantifies the deviation between actual and predicted values and is inversely proportional to the model’s accuracy. The calculation formula is as follows:
v j = R j 1 j = 1 J R j 1 , j = 1 , 2 , , J
where Rj−1 is the reciprocal of the sum of squared prediction errors for the j model, and the formula for this is the following:
R j 1 = 1 t = 1 N ( x t x ^ t ( j ) ) 2 , j = 1 , 2 , , J
(c)
Reciprocal of the Mean Squared Error Method: This method is similar to the reciprocal of the sum of squared prediction errors, where the mean squared error of the model is inversely proportional to the weight. The calculation formula is as follows:
v j = R j 1 2 j = 1 J R j 1 2 , j = 1 , 2 , , J
(d)
Matrix Advantage Method: This method involves comparing the predicted values of the SARIMA model and the SVR model with actual values. The number of times each model’s predictions are more accurate is recorded: let n1 denote the instances where the SARIMA model’s predictions are more accurate than those of the SVR model, and n2 denote the instances where the SVR model outperforms the SARIMA model. The weights assigned to the two models in the combined model are then the following:
v 1 = n 1 n 1 + n 2 , v 2 = n 2 n 1 + n 2

2.3. SARIMA Model

Due to the significant seasonal fluctuations in the EV sales in China and the suitability of the SARIMA model for forecasting time series data with distinct periodic or seasonal components, this study employed the SARIMA model to predict EV sales in China [35]. Defining the lag operator B, the specific expression for SARIMA (p, d, q) (P, D, Q, s) is as follows:
ϕ p B Φ P B s 1 B d 1 B s D y t = c + θ q B Θ Q B s ε t ϕ p B = 1 φ 1 B φ p B p Φ P B S = 1 φ 1 B s φ p B s p θ q B = 1 λ 1 B λ q B q Θ Q B s = 1 λ 1 B S λ Q B S Q
where p is the non-seasonal autoregressive order; q is the non-seasonal moving average order; P is the seasonal autoregressive order; Q is the seasonal moving average order; ϕ(P)(B) is the non-seasonal autoregressive characteristic polynomial; Φ(P)(Bs) is the seasonal autoregressive polynomial; θ(q)(B) is the non-seasonal moving average characteristic polynomial; Θ(Q)(Bs) is the seasonal moving average characteristic polynomial; (1 − B)d is the non-seasonal differencing operator; (1 − Bs)D is the seasonal differencing operator, both of which transform the time series yt into a stationary series; εt is white noise, normally distributed with a mean of 0 and constant variance; Bs is the seasonal lag operator; S is the period length; c is a constant; the subscript t denotes time; φ1, φ2,…, φp are the autoregressive coefficients; and λ1, λ2,…, λq are the moving average coefficients.

2.4. SVR Model

2.4.1. SVR Model Theory

Support vector machines (SVMs) are a type of machine learning model primarily used for data analysis in classification and regression tasks [36,37]. SVR is a specific application of SVMs designed for regression tasks. The principle of the SVR model is to identify a regression plane that minimizes the distance from all data points in the dataset to this plane (Figure 4). This study uses economic, policy and technological influencing factors, selected based on GRA, as the sample set. The SVR model is then used to forecast EV sales, resulting in sales predictions based on these influencing factors.
For a given training sample set U = {(x1, y1), (x2, y2), …, (xm, ym)}, it is necessary to find a regression model f(x) = wTx + b such that f(x) is as close as possible to y. Here, w is the normal vector of the decision plane, determining its direction, and b determines the position of the decision plane. Let the error be ε, and consider an error interval centered at f(x) with width 2ε. The optimization problem for the SVR model is then as follows:
min w , b w 2 2 + C i = 1 m l ε f x i y i l ε f x i y i = 0 , i f f x i y i ε f x i y i ε , o t h e r w i s e
where C is a constant greater than 0, serving as the penalty parameter, and l ε is the ε’s insensitive loss function.
By solving this, the optimal solution to the primal problem is obtained, as follows: w = i = 1 m α i α ^ i ϕ x i .
Introduce the kernel function: K ( x i , x j ) = ϕ ( x i ) T ϕ ( x j ) . Among them, the Gaussian kernel function K ( x i , x j ) = e x i x j 2 2 σ 2 is the most commonly used [38]. The final SVR model is as shown here:
f x = i = 1 m α i α ^ i ϕ x i T ϕ x j + b

2.4.2. SVR Parameter Optimization

For SVR model, this paper employs K-fold cross-validation. This method first splits the dataset into K subsets, using K − 1 subsets for training and 1 subset for validating model accuracy. During cross-validation, the validation process is performed K times, and a single estimate is produced by aggregating the results from each validation [39].

2.5. Variable Selection

2.5.1. SARIMA Model Variable Selection

The data in this section were sourced from the official website of the China Association of Automobile Manufacturers, covering the period from January 2016 to December 2023, encompassing a total 96 data points (Table 1). The 12 months after January 2023 are designated as the test sample, while the remaining data serve as the training sample.

2.5.2. SVR Model Variable Selection

Based on the key influencing factors of the EV sales in China identified through GRA [40], an SVR model was constructed to predict the sales using these factors.
(1)
GRA Theory
The GRA method describes the relationships among various factors and measures the degree of association between them [41]. Its advantages include low sample size requirements, no need for regularity in the sample, a small computational load, and results that align with qualitative analysis—making it suitable for the current small data volume of EVs. The steps involved are as follows:
(a)
Identify the reference sequence and the characteristic sequence. The reference sequence reflects the system’s characteristics and is denoted as x0 = (x0(1), x0(2), …, x0(n)); the characteristic sequence consists of factors affecting the system’s behavior and is represented in matrix form as follows:
( x 0 , x 1 , , x k ) T = x ( 1 ) 0 x ( 1 ) 1 x ( 1 ) k x ( 2 ) 0 x ( 2 ) 1 x ( 2 ) k x ( n ) 0 x ( n ) 1 x ( n ) k
where k is the number of influencing factors, and i is the number of data points.
(b)
Data preprocessing. To eliminate the impact of dimensionality, the data must be normalized. This paper uses the mean normalization method, which is as follows:
x i ' = x i ' ( 1 ) , x i ' ( 2 ) , , x i ' ( n ) T = x i ( 1 ) x ¯ i , x i ( 2 ) x ¯ i , , x i ( n ) x ¯ i T , i = 1 , 2 , , k
where x ¯ i is the mean value of the i-th influencing factor, expressed as follows:
x ¯ i = h = 1 n x i h n , i = 1 , 2 , , k
(c)
Calculate the gray relational coefficients. For a reference sequence x0 with k comparison sequences (x1, x2, …, xk)T, the gray relational coefficient between each comparison sequence and the reference sequence at each time point can be calculated using the following formulas:
Δ i h = x 0 h x i h , i = 1 , 2 , , k , h = 1 , 2 , , n
m = min i   min j   Δ i ( h ) , i = 1 , 2 , , k , h = 1 , 2 , , n
M = max i   max h   Δ i ( h ) , i = 1 , 2 , , k , h = 1 , 2 , , n
ξ i ( h ) = m + ρ M Δ i ( h ) + ρ M , i = 1 , 2 , , k , h = 1 , 2 , , n
where ρ is the distinguishing coefficient, which is generally set as [0,1].
(d)
Calculate the gray relational degree. According to the above formula, the value of γ ranges from 1 to γ. Each factor thus has γ gray relational coefficients with the reference sequence. To assess the overall association of a factor with the reference sequence, it is difficult to measure each of these γ coefficients individually. Therefore, the arithmetic mean of these γ gray relational coefficients is calculated, which represents the gray relational degree:
γ i = 1 n k = 1 n ξ i k , i = 1 , 2 , , k , h = 1 , 2 , n
(e)
Gray relational degree ranking. Perform a ranking analysis on the k calculated gray relational degrees and select the desired indicator features.
(2)
Specific Variable Selection
As of 22 October 2023, a total of 688 companies in the new energy vehicle sector were identified on the Royal Flush network financial station. This sector includes five main categories: complete vehicle manufacturing, components, battery power, electric motors and charging piles. To ensure the validity of the empirical results, further screening was performed using the Wind database, excluding data from companies with ST or ST* stock codes. Ultimately, data from A-share listed companies in EV manufacturing from 2016 to 2023 were selected, resulting in a sample of 20 companies (Table 2). Among these, tax refunds refer to the amount disclosed in the cash flow statement of the company’s annual report, government subsidies are the government grants included in the current period’s profit and loss, and automotive R&D expenses are the reported R&D investment data in the company’s annual report [42,43].
Through a comprehensive analysis of the factors influencing the sales of EV in China, and considering the availability of data, this paper will examine 14 variables from three perspectives: policy, economic and technological aspects, as potential determinants of annual EV sales. The descriptions of these variables are shown in Table 3.

2.6. Model Evaluation

In the forecasting of EV sales, commonly used accuracy evaluation metrics include Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and Mean Absolute Percentage Error (MAPE), each reflecting the model’s performance under different types of errors [44].
The determination of forecasting errors is complex and variable, influenced by the purpose of the forecast, industry standards and market uncertainties. The forecasting context, market volatility and decision-making requirements dictate varying tolerance levels for errors: short-term decisions typically require MAPE to be controlled within 5% to 10%, whereas in long-term strategic planning and under high uncertainty conditions, the tolerance range for MAPE can be relaxed to 15% to 30%. The priority of different evaluation metrics depends on the forecasting application scenario: RMSE is sensitive to large deviations and is suitable for situations where significant impacts from large deviations are critical for decision-making, such as inventory overstock or sales losses; MAE reflects the average difference between predicted and actual values, making it appropriate for tasks that demand high overall accuracy and consistency; MAPE is used to assess relative errors and can effectively reflect nonlinear market changes, generally considered reasonable within the range of 5% to 20%, although this must be judged in conjunction with the forecasting time span and market volatility.
This study selected the scenario of short-term sales forecasting for China’s EVs, aiming for the minimum MAPE while also achieving the smallest values for MAE and RMSE. The calculation formulas are as follows:
RMSE = 1 n i = 1 n y ^ i y i 2
MAE = 1 n i = 1 n y ^ i y i
MAPE = 1 n i = 1 n y ^ i y i y i
yi represents the actual value, and y ^ i represents the predicted value.

3. Results

3.1. Sales Forecast Results Based on SARIMA Model

Before modeling and forecasting sales using SARIMA, a time series plot of the sales of EVs in China from January 2016 to December 2023 was created to illustrate the development trend, as shown in Figure 5.
In this section, data from the first 7 years, totaling 84 months (from January 2016 to December 2022), were selected as training data for the SARIMA model. The monthly sales data of EVs in China exhibit an increasing trend, indicating that the series is non-stationary. Therefore, a first-order differencing was applied, as shown in Figure 6, where the differenced time series fluctuates around a mean of zero without changing over time. Figure 7 and Figure 8 display the autocorrelation plot and partial autocorrelation plot of the first-differenced data series, respectively. The autocorrelation plot shows that the lag-1 autocorrelation value does not exceed the boundary, while the partial autocorrelation plot indicates a significant non-zero value at lag-1, confirming that the series is stationary.
Based on the initial assessment of the order using the autocorrelation and partial autocorrelation plots, and after verification by comparing adjacent orders within the determined p, d, q range, the optimal model identified was SARIMA (3,1,3) (3,1,0)12. The fitting results are presented in Table 4. The residual sequence passed the white noise test, indicating that the model fits well. The prediction results are shown in Figure 9 and Table 5, forecasting that the monthly sales of EVs will exhibit seasonal fluctuations over the next year, reaching 884,500 units by the end of the year. The accuracy evaluation shows that the RMSE is 5.92, MAE is 3.54, and MAPE is 22%, indicating that the prediction results exhibit a certain level of accuracy. However, the potential sources of bias may stem from the SARIMA model’s assumption that EV sales can be predicted through linear trends based on historical data. When factors such as changes in government subsidy policies or advancements in battery technology trigger nonlinear fluctuations in sales, SARIMA struggles to effectively capture these complex relationships, resulting in prediction errors.

3.2. GRA Results

The influencing factors for EVs in China are all continuous time series, and there is a complex interplay among these factors. Based on the data from the 14 influencing factors, the parameter setting results are shown in Table 6. This study selects the annual sales of EVs in China as the reference series x0, with 14 representative factors (listed in Table 7) designated as comparison series x1x14, applying a normalization method for dimensionless processing. The gray relational degrees between each factor and sales are calculated according to Formula 16, as shown in Table 8. The goal is to identify the factors that closely align with the development trend of EVs, in order to analyze their impact on the growth trajectory of EVs.
As shown in Table 9, the correlation coefficients for the selected factors are all above 0.5, indicating a medium to high degree of correlation with the development of EVs. Among these factors, six have a gray relational degree greater than 0.6, ranked as follows: battery installation volume (x11) > number of registered EV-related enterprises (x14) > number of charging piles (x3) > tax refunds (x1) > cumulative number of registered battery recycling enterprises (x12) > carbon trading prices (x8). This indicates that these six factors significantly influence EV sales in China.

3.3. Sales Forecast Results Based on the GRA-SVR Model

The data used for the SVR model consist of the monthly data (from January 2016 to December 2022) for the six important factors identified through the gray relational degree analysis. Missing data were handled using the averaging method. The parameter ranges were set as follows: cost in the range of [2−5,215], and ε in the range of [0,1], with a step size of 0.01. Additionally, a Gaussian radial basis function kernel was employed, with the kernel parameter set to radial. After multiple experiments using MATLAB 2022b, we determined that the penalty parameter C is 90.5097, the gamma value g is 0.0156 and the error term ε is 1.0000 × 10−4, as shown in Table 10.
The SVR model was trained using the first 84 sample data points, yielding fitting results (Table 11) with an RMSE of 2.86, MAE of 1.79 and MAPE of 13%, indicating a good fit. The training set is illustrated in Figure 10, where the fitted data closely match the actual monthly sales data, further validating the model’s fitting effectiveness. The SVR model’s fitting results consistently outperform those of the SARIMA model, indicating a superior prediction accuracy. This result also reflects that the sales data of EVs are influenced by multiple factors, which exhibit complex nonlinear relationships. From a policy-driven perspective, sudden changes in government subsidy policies or vehicle purchase taxes can lead to dramatic increases or decreases in sales, which are difficult to capture using linear models. In terms of technological advancements, breakthroughs in battery technology and charging infrastructure significantly influence consumer decisions and consequently the market. These technological advancements and the degree of market acceptance exhibit nonlinear trends, making it challenging for traditional linear models to accurately describe them. Regarding consumer behavior, with the increasing or shifting awareness of environmental issues, purchasing intentions may also experience nonlinear fluctuations. Therefore, the GRA-SVR model, by using GRA to select key factors closely related to sales and then applying the SVR model for nonlinear modeling, can better capture the complex nonlinear patterns in sales data compared to the SARIMA model. Finally, the SVR model’s sales forecast for January to December 2023 is presented in Figure 11 and Table 12, indicating that the monthly sales of EVs in China could reach a maximum of 809,200 units during this period.

3.4. Sales Forecast Results Based on the SARIMA-GRA-SVR Model

The comparison of monthly sales forecasts for EVs in China using the two single models is shown in Figure 12. The GRA-SVR model better captures the periodicity of the data compared to the SARIMA model, with its predictions being closer to the actual values. To further explore the development trend of EVs in China, both the serial and parallel combinations of the SARIMA and GRA-SVR models were employed to analyze the monthly sales data.

3.4.1. Forecast Results of the SARIMA-GRA-SVR Serial Combined Model

In this section, the linear component of the EV sales time series is predicted using the SARIMA model, while the nonlinear component, i.e., the residual part, is predicted using the SVR model. The final prediction results are obtained by summing the predictions from both models. In this simulation, the nonlinear component consists of the residual series of monthly EV sales in China predicted by the SARIMA (3,1,3) (3,1,0)12 model, which is then trained using the SVR model. The parameter ranges were set as follows: cost in the range of [2−5,215], and ε in the range of [0,1], with a step size of 0.01. A Gaussian radial basis function kernel was used, with the kernel parameter set to radial. After multiple simulation experiments with MATLAB 2022b, the model parameters were C = 0.0221, g = 11.3137 and ε = 1.0000 × 100. The predicted residual series was then summed with the forecast results from the SARIMA model to obtain the prediction results of the serial model, as shown in Figure 13.
Based on the serial model’s predictions, the monthly sales forecast of EVs in China for January to December 2023 is shown in Figure 13 and Table 13. There is some deviation between the predicted and actual values. The results in Table 14 indicate that the fitting results of the serial model are higher than those of the GRA-SVR model, suggesting that the model’s prediction accuracy is insufficient. This also demonstrates that the serial model is not feasible for forecasting the EV sales in China.

3.4.2. Forecast Results of the SARIMA-GRA-SVR Parallel Combined Model

Here, the SARIMA-GRA-SVR parallel combined model was employed for forecasting EV sales. First, the weight values of the four parallel methods for each model were determined, as shown in Table 15. These results were then applied to the forecast outcomes of the SARIMA and SVR models, allowing for the calculation of the prediction results for each parallel model.
Figure 14 illustrates the fitting results of the parallel model for the EV sales in China from February 2017 to December 2023. The black line represents the original sales data, while the other lines represent the sales forecasts from each parallel model. According to Figure 14 and Table 16, the parallel models using the advantage matrix method and the simple weighting method exhibit significant deviation from the actual values, indicating a poor fit for these methods. In contrast, the parallel combined models using the prediction error sum of squares inverse method and the mean square inverse method fit the original data better and demonstrate improved accuracy compared to the single models.

4. Discussion

This chapter will discuss the prediction results in detail. Through a comparison of model performance (Table 16), the optimal forecasting scheme for EV sales was established. RMSE, MAE and MAPE are used as metrics, leading to the conclusion that model ⑦ performs the worst, while model ⑤ (MAPE = 12%, MAE = 1.45, RMSE = 2.08) exhibits the best fitting results. Table 17 presents the monthly sales forecast data for EVs in China for 2023.
When comparing the goodness of fit of model ⑤ with other models, the following results were obtained: MAPE improved by 45.45% compared to model ① (MAPE = 22%, MAE = 3.54, RMSE = 5.92), while MAE and RMSE decreased by 59% and 64.86%, respectively; compared to model ② (MAPE = 13%, MAE = 1.79, RMSE = 2.86), MAPE improved by 7.7%, and MAE and RMSE decreased by 18.99% and 27.27%; MAPE improved by 42.86% compared to model ③ (MAPE = 21%, MAE = 3.44, RMSE = 5.89), with MAE and RMSE decreasing by 57.85% and 64.69%, respectively; MAPE improved by 29.41% compared to model ④ (MAPE = 17%, MAE = 2.67, RMSE = 4.48), while MAE and RMSE decreased by 45.69% and 53.57%; MAPE remains the same as model ⑥ (MAPE = 12%, MAE = 1.56, RMSE = 2.37), with MAE and RMSE decreasing by 7.05% and 12.24%; MAPE improved by 78.57% compared to model ⑦ (MAPE = 56%, MAE = 6.55, RMSE = 8.41), while MAE and RMSE decreased by 77.86% and 75.27%, respectively. Actual data validation showed that the prediction error squared reciprocal method SARIMA-GRA-SVR parallel combined model (model ⑤) demonstrates high accuracy in forecasting monthly sales of EVs, accurately depicting seasonal fluctuations and development trends in sales, providing industry participants with a more solid data support and decision-making basis.
In summary, the prediction error squared reciprocal method parallel combined model has the highest accuracy in short-term forecasting. However, due to the complexity of factors influencing EV sales, along with the high nonlinearity and unpredictability, the model struggles to accurately capture market dynamics. Factors that may affect forecasting accuracy include the following: (1) Policy changes: Government subsidy policies are crucial in driving EV sales. Changes in policies can cause fluctuations in market sales, leading to forecasting deviations. Therefore, it is necessary to adjust the model promptly, incorporate external variables, and closely monitor policy changes. (2) Macroeconomic factors: A gradually recovering economy will bolster consumer purchasing power and confidence, promoting demand for EVs. Although macroeconomic changes and sudden events may not yield certainty, companies are thus more proactive in risk management and market research on hoe to adapt to new environments. (3) Technological advancements: Breakthroughs in battery technology, cost reductions or improved charging speeds can greatly enhance consumer purchasing willingness, potentially leading to sales exceeding expectations. The rapid development of charging infrastructure can also alleviate consumer concerns about charging convenience, driving sales growth. (4) Global supply chain risks: EVs rely on global supply chains, and any disruptions in the supply of critical raw materials or chips can affect production and sales. Fluctuations in the global supply chain may lead to insufficient market supply, resulting in sales falling short of predictions, and models often struggle to fully account for the impact of such unforeseen events. (5) International market uncertainties: Changes in international oil prices, geopolitical factors and trade policies significantly affect EV sales. In summary, the forecasting of the EV market faces multiple complex external risks, and the model should be flexibly adjusted to respond to these nonlinear and unpredictable factors.

5. Conclusions

5.1. Main Conclusions

This study explored the development trajectory of monthly EV sales in China by comprehensively applying market data, enhancing time series analysis and utilizing support vector machines. It systematically assessed the advantages and limitations of each theoretical framework and constructed a hybrid forecasting model using both serial and parallel combinations. This model analyzed the dynamic changes in monthly EV sales in China from January 2016 to December 2022 to forecast monthly sales for January to December 2023.
Firstly, the SARIMA model was developed to analyze the temporal dependencies and seasonality within the sales data, yielding initial forecasts based on historical training data. Secondly, GRA was introduced to analyze multidimensional factors affecting EV sales, identifying key influencing factors for input into the SVR model to enhance prediction accuracy. Subsequently, a serial and parallel strategy effectively integrated the time series forecasting strengths of the SARIMA model with the nonlinear forecasting capabilities of the SVR model, creating a complementary forecasting system. Finally, the prediction model’s accuracy was verified through actual data, demonstrating its feasibility and providing industry participants with a robust data foundation for decision-making.

5.2. Recommendations

(1)
The forecast results indicate that the EV market is expected to continue robust growth in the near future, although the growth rate may stabilize as market maturity increases. This insight is crucial for guiding corporate strategic planning, optimizing production layouts, and formulating government policies.
(2)
Infrastructural Investment: The government should continue to increase investment in infrastructure, particularly charging facilities for EVs, to meet the growing market demand. As the growth rate of EV sales stabilizes, the government may consider adjusting incentive policies, such as purchase subsidies and tax benefits, to maintain market vitality.
Corporate Strategy: Companies should invest in research and development in the field of EV technology, enhancing product quality and performance to meet consumer demands for high-quality products. Based on the forecast results, companies should strategically position themselves in the market and develop targeted marketing strategies to adapt to market changes.

6. Future Outlook

6.1. Limitations

The SARIMA-GRA-SVR model performs well in short-term forecasting; however, its predictive accuracy may decline for long-term forecasts. This is because the model for predicting EV sales in China heavily relies on past time series and historical data. Additionally, external factors (such as policies, economic fluctuations and technological advancements) exhibit high nonlinearity and unpredictability, making it difficult for the prediction model to accurately capture real market changes, which leads to poor long-term forecasting performance. Therefore, to reduce forecasting gaps, the model must be continuously adjusted and more external variables introduced, while maintaining close monitoring of policies, technologies and market trends. Separating long-term and short-term forecasts is advisable; the SARIMA-GRA-SVR combined model can continue to be used for short-term predictions, while long-term forecasting could utilize scenario simulation or expert system models to address the complexity of the external environment.

6.2. Future Studies

In order to further enhance the predictive performance of the model and address its inertia, complexity regarding external factors and deficiencies in long-term forecasting capabilities, future research will focus on the following three aspects:
(1)
Model Optimization: By introducing dynamic external parameters, regularization techniques, model simplification and hybrid architectures, we can improve the predictive accuracy and robustness of the model.
(2)
Integration of Multisource Data: Combining data from internet big data and industry planning with the forecasting model can enhance the accuracy and timeliness of EV sales predictions.
(3)
Long-term Forecasting: Building on short-term and medium-term forecasts and exploring methods and technologies for long-term trend prediction will provide strong support for the sustainable development of the EV industry.
In summary, the SARIMA-GRA-SVR parallel combined model has demonstrated significant advantages and potential in predicting EV sales, providing valuable support for government decision-making and corporate strategic planning. With continued optimization of the model and advancements in multisource data integration technology, this model is expected to play an increasingly important role in the EV industry.

Author Contributions

Conceptualization, R.Y. and X.W.; methodology, X.X.; software, R.Y.; validation, X.W.; investigation, R.Y. and X.X.; resources, X.X. and X.W.; writing—original draft preparation, R.Y.; writing—review and editing, R.Y. and Z.Z.; supervision, X.X.; project administration, X.X.; funding acquisition, R.Y. and X.W. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the Hebei Provincial Higher Education Science and Technology Research Fund Project “Research on Industrial Carbon Emission Pathways in the Beijing Tianjin Hebei Region under the Background of Digital Economy” (Grant No. QN2023213).

Data Availability Statement

The original contributions presented in this study are included in the article.

Acknowledgments

The authors would like to thank the editor and anonymous reviewers for their patient and valuable comments on this paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Sun, C.W.; Zhang, W.Y.; Luo, Y.; Xu, Y. The improvement and substitution effect of transportation infrastructure on air quality: An empirical evidence from China’s rail transit construction. Energy Policy 2019, 129, 949–957. [Google Scholar] [CrossRef]
  2. Wu, D.S.; Xie, Y.; Lyu, X.Y. The impacts of heterogeneous traffic regulation on air pollution: Evidence from China. Transp. Res. Part D Transp. Environ. 2022, 109, 103388. [Google Scholar] [CrossRef]
  3. Wu, Z.C.; Fan, X.J.; Zhu, B.Z.; Xia, J.; Zhang, L.; Wang, P. Do government subsidies improve innovation investment for new energy firms: A quasi-natural experiment of China’s listed companies. Technol. Forecast. Soc. Change 2022, 175, 121418. [Google Scholar] [CrossRef]
  4. Zhang, L.H.; Wang, L.W.; Chai, J.X. Influence of new energy vehicle subsidy policy on emission reduction of atmospheric pollutants: A case study of Beijing, China. J. Clean. Prod. 2020, 275, 124069. [Google Scholar] [CrossRef]
  5. Xie, Y.; Wu, D.S.; Zhu, S.J. Can new energy vehicle subsidies curb urban air pollution? Empirical evidence from pilot cities in China. Sci. Total Environ. 2021, 754, 142232. [Google Scholar] [CrossRef]
  6. Watabe, A.; Leaver, J. Comparative economic and environmental benefits of ownership of both new and used light-duty hydrogen fuel cell vehicles in Japan. Int. J. Hydrogen Energy 2021, 46, 26582–26593. [Google Scholar] [CrossRef]
  7. Li, J. Charging Chinese future: The roadmap of China’s policy for new energy automotive industry. Int. J. Hydrogen Energy 2020, 45, 11409–11423. [Google Scholar] [CrossRef]
  8. National Development and Reform Commission (NDRC). In Enhanced Actions on Climate Change: China’s Intended Nationally Determined Contributions; National Development and Reform Commission (NDRC): Beijing, China, 2015.
  9. Li, C.; Feng, D.; Zhuo, C.; Jiang, Y.; Li, Z.; Xu, H. Spatial distribution patterns and influencing factors of China’s new energy vehicle industry. J. Clean. Prod. 2022, 379, 134641. [Google Scholar] [CrossRef]
  10. Feng, D.; Liu, Y. Policy evolution and effect evaluation of the new-energy vehicle industry in China. Resour. Policy 2020, 67, 101655. [Google Scholar] [CrossRef]
  11. Guo, D.; Yan, W.; Gao, X.; Hao, Y.; Xu, Y.; E, W.; Tan, X.; Zhang, T. Forecast of passenger car market structure and environmental impact analysis in China. Sci. Total Environ. 2021, 772, 144950. [Google Scholar] [CrossRef]
  12. Gao, J.; Xie, Y.; Gu, F.; Yu, H. China automotive sales forecast based on structural relationship identification. J. Dalian Univ. Technol. 2018, 58, 92–98. [Google Scholar]
  13. Ding, R.; Li, W.; Wang, R. Forecasting model based on SARIMA and LSTM combination. Comput. Digit. Eng. 2020, 48, 304–307,337. [Google Scholar]
  14. Chen, K.; Liu, J. Sales forecast of Euler Black Cat new energy vehicles based on ARIMA model. Mod. Ind. Econ. Informatiz. 2022, 12, 169–171. [Google Scholar] [CrossRef]
  15. Zhang, Y.; Song, C.Y.; Geng, N.N. Forecasting electric vehicle sales with univariate and multivariate time series models: The case of China. PLoS ONE 2017, 12, e0176729. [Google Scholar] [CrossRef] [PubMed]
  16. Li, Z.; Liu, S. A comparison of forecasts based on the ARIMA model, grey model, and regression model. Stat. Decis. 2019, 35, 38–41. [Google Scholar] [CrossRef]
  17. Li, J.; Lei, M.; Dai, B.; Cai, X. Research on demand forecasting for pharmaceutical e-commerce platforms considering promotional factors. Chin. J. Manag. Sci. 2022, 30, 120–130. [Google Scholar] [CrossRef]
  18. Fantazzini, D.; Tokamysova, Z. Forecasting German car sales using Google data and multivariate models. Int. J. Prod. Econ. 2015, 170, 97–135. [Google Scholar] [CrossRef]
  19. Nie, S. Comparison of GDP forecasting performance based on four types of time series models. Stat. Decis. 2024, 40, 63–67. [Google Scholar] [CrossRef]
  20. Yao, J. Research on stock index forecasting based on ARIMA and information granulation SVR combination. Oper. Res. Manag. 2022, 31, 214–220. [Google Scholar]
  21. Xu, X.; Peng, L.; Ji, Z.; Zheng, S.; Guan, X.; Cui, H. Forecast analysis of the development potential of hydrogen fuel cell vehicles in Beijing. Technol. Econ. 2023, 42, 52–63. [Google Scholar]
  22. Fan, L.; Dong, H.; Jian, L. Research on carbon market price decomposition and integrated forecasting based on rolling time window. Chin. J. Manag. Sci. 2023, 31, 277–286. [Google Scholar] [CrossRef]
  23. Liang, C.; Ma, Y.; Chen, R.; Liang, Y. Daily tourism demand forecasting based on the SVR-ARMA combination model. J. Manag. Eng. 2015, 29, 122–127. [Google Scholar] [CrossRef]
  24. Hu, Y.; Ni, J.; Wen, L. A hybrid deep learning approach by integrating LSTM-ANN networks with GARCH model for copper price volatility prediction. Phys. A Stat. Mech. Its Appl. 2020, 557, 124907. [Google Scholar] [CrossRef]
  25. Xu, S.; Chan, H.K.; Zhang, T. Forecasting the demand of the aviation industry using hybrid time series SARIMA-SVR approach. Transp. Res. Part E Logist. Transp. Rev. 2019, 122, 169–180. [Google Scholar] [CrossRef]
  26. Tong, F.; Lan, F.; Chen, J. Analysis of influencing factors and prediction of the ownership of new energy vehicles. Sci. Technol. Manag. Res. 2016, 36, 112–116. [Google Scholar]
  27. Yan, R.; Geng, S.; Tang, J. Analysis of the correlation of influencing factors on the development of medical tourism in China. Jiangxi Soc. Sci. 2022, 42, 49–58. [Google Scholar]
  28. Xu, M. Research on Sales Forecasting and Development Paths of New Energy Vehicles in China. Master’s Thesis, Shandong University of Finance and Economics, Jinan, China, 2023. [Google Scholar] [CrossRef]
  29. Arslankaya, S. Time series analysis of sales quantity in an automotive company and estimation by artificial neural networks. Sak. Univ. J. Sci. 2018, 22, 1482–1492. [Google Scholar] [CrossRef]
  30. Aslam, J.; Saleem, A.; Khan, N.T.; Kim, Y.B. Factors influencing blockchain adoption in supply chain management practices: A study based on the oil industry. J. Innov. Knowl. 2021, 6, 124–134. [Google Scholar] [CrossRef]
  31. Gao, J.; Xie, Y.; Gu, F.; Xiao, W.; Hu, J.; Yu, W. A hybrid optimization approach to forecast automobile sales in China. Adv. Mech. Eng. 2017, 9, 1687814017719422. [Google Scholar] [CrossRef]
  32. Kaya, A.; Kaya, G.; Cebi, F. Forecasting automobile sales in Turkey with artificial neural networks. Int. J. Bus. Anal. 2019, 6, 50–60. [Google Scholar] [CrossRef]
  33. Bates, J.M.; Granger, C.W.J. The combination of forecasts. J. Oper. Res. Soc. 1969, 20, 451–468. [Google Scholar] [CrossRef]
  34. Zhang, L. Research on Civil Aviation Passenger Volume Forecasting Based on Combined Models. Master’s Thesis, Liaoning Normal University, Dalian, China, 2019. [Google Scholar]
  35. Deretić, N.; Stanimirović, D.; Awadh, M.A.; Vujanović, N.; Djukić, A. SARIMA modelling approach for forecasting traffic accidents. Sustainability 2022, 14, 4403. [Google Scholar] [CrossRef]
  36. Zhou, Z. Machine Learning; Tsinghua University Press: Beijing, China, 2016; pp. 133–137. [Google Scholar]
  37. Zhang, J.R.; Lin, C.Y.; Tang, H.M.; Wen, T.; Tannant, D.D.; Zhang, B. Input-parameter optimization using a SVR based ensemble model to predict landslide displacements in a reservoir area: A comparative study. Appl. Soft Comput. 2024, 150, 111107. [Google Scholar] [CrossRef]
  38. Lin, S.; Liu, Z. Parameter selection for support vector machines with RBF kernel function. J. Zhejiang Univ. Technol. 2007, 2, 163–167. [Google Scholar]
  39. Zhang, X.Y.; Liu, C.A. Model averaging prediction by K-fold cross-validation. J. Econom. 2023, 235, 280–301. [Google Scholar] [CrossRef]
  40. Rietmann, N.; Hügler, B.; Lieven, T. Forecasting the trajectory of electric vehicle sales and the consequences for worldwide CO2 emissions. J. Clean. Prod. 2020, 261, 121038. [Google Scholar] [CrossRef]
  41. Zhang, M.; Zhao, H.; Fan, L.; Yi, J. Dynamic modulus prediction model and analysis of factors influencing asphalt mixtures using gray relational analysis methods. J. Mater. Res. Technol. 2022, 19, 1312–1321. [Google Scholar] [CrossRef]
  42. Li, S.; Dai, X.; Liu, G. Research on the relationship between tax incentives, fiscal subsidies, and the development of new energy vehicle companies. Resour. Dev. Mark. 2018, 34, 1552–1557. [Google Scholar]
  43. Li, S.; Li, Y. The relationship between R&D investment and tax policies in new energy vehicle companies. Technol. Econ. 2016, 4, 32–36. [Google Scholar]
  44. Matyjaszek, M.; Riesgo Fernández, P.; Krzemień, A.; Wodarski, K.; Valverde, G.F. Forecasting coking coal prices by means of ARIMA models and neural networks, considering the transgenic time series theory. Resour. Policy 2019, 61, 283–292. [Google Scholar] [CrossRef]
Figure 1. Article structure chart.
Figure 1. Article structure chart.
Systems 12 00486 g001
Figure 2. SARIMA-SVR serial combined model.
Figure 2. SARIMA-SVR serial combined model.
Systems 12 00486 g002
Figure 3. SARIMA-SVR parallel combined model.
Figure 3. SARIMA-SVR parallel combined model.
Systems 12 00486 g003
Figure 4. Principle diagram of the SVR model.
Figure 4. Principle diagram of the SVR model.
Systems 12 00486 g004
Figure 5. Monthly sales of EVs in China.
Figure 5. Monthly sales of EVs in China.
Systems 12 00486 g005
Figure 6. Monthly sales of EVs in China after first-order differencing.
Figure 6. Monthly sales of EVs in China after first-order differencing.
Systems 12 00486 g006
Figure 7. ACF plot of monthly sales of EVs in China.
Figure 7. ACF plot of monthly sales of EVs in China.
Systems 12 00486 g007
Figure 8. PACF plot of monthly sales of EVs in China.
Figure 8. PACF plot of monthly sales of EVs in China.
Systems 12 00486 g008
Figure 9. Sales forecast of EVs in China using the SARIMA model.
Figure 9. Sales forecast of EVs in China using the SARIMA model.
Systems 12 00486 g009
Figure 10. Sales training plot of EVs in China using the GRA-SVR model.
Figure 10. Sales training plot of EVs in China using the GRA-SVR model.
Systems 12 00486 g010
Figure 11. Sales forecast of EVs in China using the GRA-SVR model.
Figure 11. Sales forecast of EVs in China using the GRA-SVR model.
Systems 12 00486 g011
Figure 12. Comparison of single model predictions and actual values.
Figure 12. Comparison of single model predictions and actual values.
Systems 12 00486 g012
Figure 13. Sales forecast of EVs in China using the serial model.
Figure 13. Sales forecast of EVs in China using the serial model.
Systems 12 00486 g013
Figure 14. Sales forecast of EVs in China using the parallel combined model.
Figure 14. Sales forecast of EVs in China using the parallel combined model.
Systems 12 00486 g014
Table 1. Monthly sales of EVs from 2016 to 2023 in China (10 t.h.s unit).
Table 1. Monthly sales of EVs from 2016 to 2023 in China (10 t.h.s unit).
Time20162017201820192020202120222023
January0.780.502.687.493.3015.0034.1028.70
February1.721.392.353.971.079.2025.8037.60
March1.82.545.229.604.0019.0039.5049.00
April2.42.866.487.105.1017.1023.1047.10
May2.63.858.208.326.4017.9034.7052.20
June3.44.806.3012.918.2021.2047.6057.30
July2.64.406.006.107.8022.0045.7054.10
August2.85.607.326.908.8026.5052.2059.70
September3.56.409.426.3011.2029.7053.9062.70
October3.97.7011.125.9013.3031.6054.1064.60
November5.810.5913.818.1016.7036.1061.5070.20
December9.314.4019.3014.0321.1044.8062.4082.50
Total40.965.0398.2096.72106.97290.10534.60665.70
Table 2. Data on A-share listed companies manufacturing EVs.
Table 2. Data on A-share listed companies manufacturing EVs.
Stock AbbreviationStock CodeListing DateStock AbbreviationStock CodeListing Date
Ankai Bus00086825 July 1997FAW Jiefang00080018 June 1997
BAIC BluePark60073316 August 1996Yutong Bus60006608 May 1997
BYD00259403 June 2011Hanma Technology60037510 April 2003
Dongfeng Motor60000627 July 1997JAC Motors60041824 August 2001
GAC Group60123809 March 2012JMC00055001 December 1993
Haima Automobile00057208 August 1994King Long Bus60068608 November 1993
Lifan Technology60177725 November 2010Changan Automobile00062510 June 1997
Seres60112715 June 2016Great Wall Motor60163328 September 2011
SAIC Motor60010425 November 1997Zhongtong Bus00095713 January 2000
Yaxing Bus60021331 August 1999Foton Motor60016602 June 1998
Table 3. Factors influencing the sales of EVs.
Table 3. Factors influencing the sales of EVs.
Primary IndicatorsSecondary IndicatorsTertiary Indicators
Factors Influencing the Sales of EVPolicy Factorsx1Tax Refunds/100 million RMB
x2Government Subsidies/100 million RMB
x3Number of Charging Piles/10,000 Units
Economic Factorsx4Per Capita GDP/10,000 RMB
x5Consumer Price Index (1978 = 100)
x6Per Capita Disposable Income of Urban Residents/10,000 RMB
x7Per Capita Disposable Income of Rural Residents/10,000 RMB
x8Carbon Trading Price/RMB
x9Crude Oil Production/10,000 Tons
Technological Factorsx10Automobile R&D Expenditure/100 million RMB
x11Installed Capacity of Power Batteries/GWh
x12Cumulative Registered Battery Recycling Companies/10,000 Companies
x13Traditional Automobile Sales/10,000 Units
x14Registered EV-Related Companies/10,000 Companies
Table 4. Fitting results of the SARIMA model parameters.
Table 4. Fitting results of the SARIMA model parameters.
Prediction AccuracyRMSEMAEMAPENormalized BIC
Value5.923.5422%2.9
Table 5. SARIMA model forecast results for January–December 2023 (10 t.h.s unit).
Table 5. SARIMA model forecast results for January–December 2023 (10 t.h.s unit).
MonthMonthJanuaryFebruaryMarchAprilMayJuneJulyAugustSeptemberOctoberNovember
Value54.9149.5158.0748.4558.1367.1770.7874.4877.2981.1185.5388.45
Table 6. Parameter settings.
Table 6. Parameter settings.
ParametersmM ρ
Value0.00441.73410.5
Table 7. Influencing factors of EV sales.
Table 7. Influencing factors of EV sales.
Time20162017201820192020202120222023
Factor
x152.6959.2697.92106.39140.22142.71383.23465.06
x275.6889.30137.22155.43118.09152.37149.70149.60
x320.4044.6077.70121.90168.10261.70521.00859.60
x45.385.966.557.017.188.148.578.94
x51783.201901.702041.902166.002111.902352.802374.102586.50
x63.363.643.934.244.384.744.935.18
x71.241.341.461.601.711.892.012.17
x816.3317.5219.4222.6129.1942.8055.3068.13
x919,969.0019,150.6118,932.4219,162.8319,476.8619,888.1020,472.2020,900.00
x10354.46398.50481.03517.61532.67666.87828.731107.48
x1128.2036.2456.9062.2064.00154.50294.60387.70
x12743.001096.001331.003737.005925.0026,839.0043,097.0045,949.00
x132437.692471.832370.982144.402017.802148.202356.303009.40
x142.312.994.364.708.1817.7524.6930.93
Table 8. Correlation coefficients of influencing factors for EV sales.
Table 8. Correlation coefficients of influencing factors for EV sales.
Time20162017201820192020202120222023
Factor
x10.87930.94160.87160.82860.73940.66730.86790.7967
x20.67600.67380.57060.52060.65860.95950.44540.3487
x30.90360.89560.88500.93340.82850.80260.78270.6256
x40.60300.61210.63760.60730.62280.90120.45050.3593
x50.57200.59030.62160.59520.63180.86470.43030.3529
x60.58820.60330.63540.60150.61230.87850.44110.3544
x70.60680.62340.65560.61460.61200.90100.45320.3674
x80.73770.78190.84490.77090.68920.95740.58430.5265
x90.50930.55580.61480.60760.62690.80140.41790.3345
x100.68080.69700.69940.66480.68320.86870.49360.4707
x110.95980.99300.99200.94460.99490.91260.91870.9279
x120.87420.80930.72530.83240.89750.66210.67070.9324
x130.50410.53100.59720.63670.69430.73350.40960.3640
x140.97640.97310.94660.98230.80240.77210.82010.8038
Table 9. Gray relational degree of influencing factors for EV sales.
Table 9. Gray relational degree of influencing factors for EV sales.
Influencing FactorRelational DegreeInfluencing FactorRelational Degree
Tax Refunds (x1)0.7245Carbon Trading Price (x8)0.6708
Government Subsidies (x2)0.5631Crude Oil Production (x9)0.5167
Number of Charging Piles (x3)0.7539Automobile R&D Expenditure (x10)0.5984
Per Capita GDP (x4)0.5543Installed Capacity of Power Batteries (x11)0.8395
Consumer Price Index (1978 = 100) (x5)0.5382Cumulative Registered Battery Recycling Companies (x12)0.6839
Per Capita Disposable Income of Urban Residents (x6)0.5450Traditional Automobile Sales (x13)0.5133
Per Capita Disposable Income of Rural Residents (x7)0.5583Registered EV-Related Companies (x14)0.7841
Table 10. GRA-SVR model parameters.
Table 10. GRA-SVR model parameters.
ParametersCgε
Value90.50970.01561.0000 × 10−4
Table 11. Fitting results of the GRA-SVR model parameters.
Table 11. Fitting results of the GRA-SVR model parameters.
Prediction AccuracyRMSEMAEMAPE
Value2.861.7913%
Table 12. GRA-SVR model forecast results for January–December 2023 (10 t.h.s unit).
Table 12. GRA-SVR model forecast results for January–December 2023 (10 t.h.s unit).
MonthJanuaryFebruaryMarchAprilMayJuneJulyAugustSeptemberOctoberNovemberDecember
Value34.8544.2552.9649.1653.7460.8660.1763.8165.6465.8277.1880.92
Table 13. SARIMA-GRA-SVR serial combined model forecast results for January–December 2023 (10 t.h.s unit).
Table 13. SARIMA-GRA-SVR serial combined model forecast results for January–December 2023 (10 t.h.s unit).
MonthJanuaryFebruaryMarchAprilMayJuneJulyAugustSeptemberOctoberNovemberDecember
Value54.9949.5658.0948.4558.1267.1570.7674.4577.2681.0985.5188.44
Table 14. Fitting results of the SARIMA-GRA-SVR serial combined parameters.
Table 14. Fitting results of the SARIMA-GRA-SVR serial combined parameters.
Prediction AccuracyRMSEMAEMAPE
Value5.893.4421%
Table 15. Weights of the SARIMA-GRA-SVR parallel combined model.
Table 15. Weights of the SARIMA-GRA-SVR parallel combined model.
Parallel MethodsSARIMA ModelSVR Model
Simple Weighting Method66.67%33.33%
Reciprocal of the Sum of Squared Prediction Error Method10.97%89.03%
Reciprocal of the Mean Squared Error Method25.98%74.00%
Matrix Advantage Method30.12%69.88%
Table 16. Model evaluation metrics.
Table 16. Model evaluation metrics.
Prediction AccuracyRMSEMAEMAPE
① SARIMA Model5.923.5422%
② GRA-SVR Model2.861.7913%
③ SARIMA-GRA-SVR Serial Combined Model5.893.4421%
④ Simple Weighting Method4.482.6717%
⑤ Reciprocal of the Sum of Squared Prediction Errors Method2.081.4512%
⑥ Reciprocal of the Mean Squared Error Method2.371.5612%
⑦ Matrix Advantage Method8.416.5556%
Table 17. SARIMA-GRA-SVR parallel model forecast results for January–December 2023 (10 t.h.s unit).
Table 17. SARIMA-GRA-SVR parallel model forecast results for January–December 2023 (10 t.h.s unit).
MonthJanuaryFebruaryMarchAprilMayJuneJulyAugustSeptemberOctoberNovemberDecember
47.3646.9255.4547.7555.7364.0666.1869.8472.4574.8981.3784.43
33.7342.2450.7346.4851.4158.4557.8761.4663.7063.6773.8677.18
36.9943.3651.8646.7852.4459.7959.8663.4665.7966.3575.6678.91
38.5343.8952.3946.9252.9360.4260.7964.4166.7767.6276.5079.73
Actual values28.7037.6049.0047.1052.2057.3054.1059.7062.7064.6070.2082.50
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yu, R.; Wang, X.; Xu, X.; Zhang, Z. Research on Forecasting Sales of Pure Electric Vehicles in China Based on the Seasonal Autoregressive Integrated Moving Average–Gray Relational Analysis–Support Vector Regression Model. Systems 2024, 12, 486. https://doi.org/10.3390/systems12110486

AMA Style

Yu R, Wang X, Xu X, Zhang Z. Research on Forecasting Sales of Pure Electric Vehicles in China Based on the Seasonal Autoregressive Integrated Moving Average–Gray Relational Analysis–Support Vector Regression Model. Systems. 2024; 12(11):486. https://doi.org/10.3390/systems12110486

Chicago/Turabian Style

Yu, Ru, Xiaoli Wang, Xiaojun Xu, and Zhiwen Zhang. 2024. "Research on Forecasting Sales of Pure Electric Vehicles in China Based on the Seasonal Autoregressive Integrated Moving Average–Gray Relational Analysis–Support Vector Regression Model" Systems 12, no. 11: 486. https://doi.org/10.3390/systems12110486

APA Style

Yu, R., Wang, X., Xu, X., & Zhang, Z. (2024). Research on Forecasting Sales of Pure Electric Vehicles in China Based on the Seasonal Autoregressive Integrated Moving Average–Gray Relational Analysis–Support Vector Regression Model. Systems, 12(11), 486. https://doi.org/10.3390/systems12110486

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop