Production Prediction and Influencing Factors Analysis of Horizontal Well Plunger Gas Lift Based on Interpretable Machine Learning
<p>Workflow diagram.</p> "> Figure 2
<p>Entropy diagram.</p> "> Figure 3
<p>Mutual information diagram.</p> "> Figure 4
<p>Sixfold cross-validation schematic diagram.</p> "> Figure 5
<p>Proportion of missing data values.</p> "> Figure 6
<p>Histogram of characteristic data and gas production.</p> "> Figure 6 Cont.
<p>Histogram of characteristic data and gas production.</p> "> Figure 7
<p>Pearson correlation heatmap.</p> "> Figure 8
<p>Mutual information values of 11 gas production values relative to plunger gas lift wells.</p> "> Figure 9
<p>Prediction graph and error statistics graph of XGBoost model.</p> "> Figure 10
<p>Prediction graph and error statistics graph of random forest model.</p> "> Figure 11
<p>Prediction graph and error statistics graph of SVR model.</p> "> Figure 12
<p>XGBoost model SHAP value global interpretation.</p> "> Figure 13
<p>XGBoost model SHAP value local interpretation.</p> "> Figure 14
<p>Global interpretation of SHAP values of Random Forest model.</p> "> Figure 15
<p>Local interpretation of SHAP values of random forest model.</p> ">
Abstract
:1. Introduction
- Mechanism-driven models have limitations. As the theoretical and practical understanding of plunger gas lift performance parameters has grown, significant improvements have been made in plunger gas lift technology. Approximate dynamic models have been developed to facilitate our understanding of plunger lift performance [6,7,8]. These studies offer a comprehensive analysis of various factors influencing plunger gas lift performance and provide essential guidelines for actual production, enabling quick and effective diagnosis to ensure optimal lift performance. However, owing to our insufficient understanding of the underlying mechanism, these models rely on numerous assumptions and simplifications, making it challenging for them to accurately predict production capacity. Furthermore, mechanism-driven models are complex and diverse, and verifying their accuracy and reliability is also quite difficult. Additionally, the data generated by these models may be artificially adjusted without an actual on-site basis, introducing a degree of subjectivity and potentially lacking universal applicability;
- The nature of machine learning models is often seen as a black box. Ajay Singh [9] utilized classification and regression tree (CART) technology to analyze actual field data from natural gas wells, proposing a method for root cause identification and production diagnosis based on operational data. The regression tree model swiftly identified well groups with either good or poor performance and offered improvement suggestions based on statistical analysis. It emphasized the importance of adding additional operating variables to enhance diagnostic effectiveness and suggested future work to improve the predictive ability of regression tree analysis. A. Ranjan et al. [10] optimized the gas lift rate to maximize daily hydrocarbon production by using an artificial neural network model, which exhibited better accuracy and performance than previous models. Naresh N. Nandola et al. [11] proposed an efficient optimization method for the plunger lift process in shale gas wells. By transforming time series data and manipulating variables, a reduced-order cyclic model was established to maximize daily production while meeting operational constraints. This was achieved by Junfeng Shi et al. [12] employing big data analysis and deep recurrent neural networks to study and select 11 parameters from more than 40,000 artificial lift wells of PetroChina. An effect evaluation function was established, the best artificial lift method was successfully selected, and a calculation result compliance rate of 90.56% was achieved in over 5000 gas wells. This provided a reliable, practical, and intelligent method for optimizing and selecting artificial lift. Thanawit Ounsakul et al. [13] utilized supervised machine learning methods to enhance the artificial lift well selection process, with the aim of minimizing life cycle costs and increasing production. The machine learning model can detect differences and continuously learn under dynamic conditions, offering a breakthrough artificial intelligence solution for the oil and gas industry. Tan Chaodong et al. [14] established a data-based optimal decision model by analyzing the production characteristics of plunger gas lift. The similarity weight was calculated through the K nearest neighbor algorithm, the production system was optimized, and drainage efficiency and production were significantly improved. Anvar Akhiiatdinov et al. [15] used machine learning methods to simulate gas production in plunger lift wells and proposed a neural network model with acceptable accuracy. The model can run quickly, monitor the gas flow rate of individual wells, and optimize the plunger lift cycle based on cumulative gas production. Nagham Amer Sami [16] successfully developed a model using machine learning algorithms such as decision tree regression, random forest regression, and K nearest neighbor regression. This model can predict the pipe pressure of artificial intermittent gas lift wells with an accuracy of over 99.9%. Although these machine learning models have enriched the prediction methods of gas lift technology and improved prediction accuracy, they lack interpretability and cannot easily provide practical guidance for engineering design.
- The lack of comprehensive methods is evident. Yukun Xie et al. [17] introduced an unsupervised clustering method based on the Transformer encoder to identify periodic points in plunger lift data. This approach achieves high-quality clustering through the autoencoder of a deep neural network and optimizes plunger lift parameters. Currently, few methods combine the advantages of mechanism-driven and data-driven approaches due to their fundamental differences. Mechanism-driven models emphasize theoretical foundations and physical explanations, while data-driven models focus on automatically extracting patterns and relationships from data. Effectively combining these two to leverage the physical interpretation ability of mechanism models and the prediction accuracy of data-driven models poses a complex challenge.
2. Methodology
- (1)
- Original data collection: The dataset comprises the primary engineering parameters that influence shale gas wells, along with indicators for evaluating their productivity. These engineering parameters primarily encompass dynamic factors, resistance factors, and volume factors [25]. Productivity refers to the gas production of shale gas wells;
- (2)
- Data preprocessing involves several steps. First, perform data cleaning, interpolate missing data, reduce data dimensions, and convert the data [26]. Subsequently, divide the preprocessed dataset into a training set and a test set. The division ratio typically ranges from 70% to 30% or 80% to 20%. In this paper, the data division ratio is 80% to 20%;
- (3)
- Feature selection aims to identify the optimal subset of features, eliminating those that are irrelevant or redundant. This process not only diminishes the feature count, but also enhances model accuracy and expedites running time. Furthermore, selecting genuinely pertinent features streamlines the model, facilitating our comprehension of the underlying data generation process. Given that machine learning frequently confronts the challenge of overfitting, where model parameters become overly reliant on training data, performing feature selection on the data is imperative to mitigate this issue;
- (4)
- Machine learning model. Use the divided training set data to establish the corresponding capacity model, mainly to find the optimal parameter values in the machine learning model. Commonly used methods include grid search, multi-fold cross-validation, and autonomous learning;
- (5)
- The evaluation of the capacity forecasting model involves assessing its accuracy using the test set data. Commonly employed accuracy metrics encompass the coefficient of determination (R2), mean absolute error (MAE), and mean square error (MSE). Based on these evaluation indicators, the machine learning algorithm demonstrating the highest predictive performance on the test set is chosen to establish the definitive capacity forecasting model;
- (6)
- Model interpretation: Based on the established optimal capacity forecasting model, the SHAP value method is used to provide global and local interpretations of the capacity forecast.
2.1. Mutual Information Method
2.2. XGBoost Model
2.2.1. CART Regression Tree
2.2.2. XGBoost Algorithm
2.2.3. XGBoost Principle
2.3. Shapely Value Method
2.4. Error Metric Method
3. Data Evaluation
3.1. Statistical Data Description
3.2. Feature Selection
4. Results and Discussion
4.1. Model Capacity Forecast
4.2. SHAP Value Interpretation Model Based on Mutual Information
4.2.1. XGBoost
4.2.2. Random Forest
5. Conclusions
- (1)
- This study is the first to attempt to combine the advantages of mechanism-driven and data-driven approaches in plunger lift technology. By integrating the XGBoost model with the SHAP value interpretation method, the accuracy and interpretability of plunger lift production predictions were improved. This method not only captures complex nonlinear relationships, but also provides a transparent physical explanation.
- (2)
- The mutual information method was used for feature selection, effectively extracting features highly related to yield. This optimized the model’s input and improved prediction accuracy and robustness. Compared with traditional methods, this approach significantly reduced feature dimensions and enhanced the model’s computational efficiency. Production capacity predictions were made using three machine learning models: XGBoost, random forest, and SVR, based on 11 contributing feature values. The R2 values on the test set were 0.85, 0.83, and 0.77, respectively, indicating high prediction performance.
- (3)
- The SHAP value method was introduced to interpret the model, making the model’s output interpretable and enabling the clear identification of factors with the greatest impact on production. Compared with traditional black box models, this method provides more powerful guidance for engineering practice. According to the SHAP value results, the five factors of the starting time of wells, GLR, time to close the well, catcher well inclination angle, and shut-in tubing pressure were identified as the dominant factors affecting the gas production of plunger gas lift wells.
- (4)
- In future research, the feature selection method should be further studied and optimized, and more actual production data should be incorporated to enhance the scientificity and effectiveness of feature selection, thereby further improving the predictive ability and stability of the model. Additionally, the SHAP value method can be applied to other machine learning models to improve their interpretability, enabling the model to not only enhance prediction accuracy in practical applications, but also provide clear guidance for operations and decision-making. Moreover, it is necessary to compare more different types of machine learning models, evaluate their performance on various oil and gas field data sets, find the most suitable application scenario model, and improve the universality and wide applicability of the prediction.
- (5)
- In this numerical study, only the engineering factors and production factors of the wellbore part were considered, without obtaining the corresponding reservoir and other related data. Therefore, when this machine learning method is subsequently used for production prediction and influencing factor analysis, it is recommended that consideration be given to adding reservoir and other related information to make the prediction and influencing factor analysis of the model more complete.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Nomenclature
SCP | Shut-in casing pressure | TP | Transfer pressure |
STP | Shut-in tubing pressure | CD | Catcher depth |
OCP | Open-well casing pressure | STW | Starting time of wells |
TCW | Time to close well | PRT | Plunger rising time |
GLR | Gas–liquid ratio | CWIA | Catcher well inclination angle |
GP | Gas production | OTP | Open-well tubing pressure |
References
- Wenzhi, Z.; Ailin, J.; Yunsheng, W.; Junlei, W.; Hanqing, Z. Progress in shale gas exploration in China and prospects for future development. China Pet. Explor. 2020, 25, 31–44. [Google Scholar]
- Caineng, Z.; Qun, Z.; Hongyan, W.; Qian, S.; Nan, S.; Zhiming, H.; Dexun, L. Theory and Technology of Unconventional Oil and Gas Exploration and Development Helps China Increase Oil and Gas Reserves and Production. Pet. Sci. Technol. Forum 2021, 40, 72–79. [Google Scholar]
- National Energy Administration. China Natural Gas Development Report; Petroleum Industry Press: Beijing, China, 2023.
- Bochun, L.; Jianhua, X.; Fan, X.; Linjuan, Z.; Kun, Z.; Mi, J.; Fan, Y. Large-Scale Application and Effect Analysis of Plunger Gas Lift Technology in Changning Shale Gas Reservoir. Drill. Prod. Technol. 2023, 46, 65–70. [Google Scholar]
- Miao, J.; Niu, L. A survey on feature selection. Procedia Comput. Sci. 2016, 91, 919–926. [Google Scholar] [CrossRef]
- Lea, J.F. Dynamic Analysis of Plunger Lift Operations. J. Pet. Technol. 1982, 34, 2617–2629. [Google Scholar] [CrossRef]
- Gasbarri, S.; Wiggins, M.L. A Dynamic Plunger Lift Model for Gas Wells. SPE Prod. Fac. 2001, 16, 89–96. [Google Scholar] [CrossRef]
- Ozkan, E.; Keefer, B.; Miller, M.G. Optimization of Plunger-Lift Performance in Liquid Loading Gas Wells. In Proceedings of the Canadian International Petroleum Conference, Denver, CO, USA, 5–8 October 2003. [Google Scholar] [CrossRef]
- Ajay Singh Singh, A. Application of data mining for quick root-cause identification and automated production diagnostic of gas wells with plunger lift. SPE Prod. Oper. 2017, 32, 279–293. [Google Scholar]
- Ranjan, A.; Verma, S.; Singh, Y. Gas lift optimization using artificial neural network. In SPE Middle East Oil & Gas Show and Conference; OnePetro: Richardson, TX, USA, 2015. [Google Scholar]
- Nandola, N.N.; Kaisare, N.S.; Gupta, A. Online optimization for a plunger lift process in shale gas wells. Comput. Chem. Eng. 2018, 108, 89–97. [Google Scholar] [CrossRef]
- Shi, J.; Chen, S.; Zhang, X.; Zhao, R.; Liu, Z.; Liu, M.; Zhang, N.; Sun, D. Artificial lift methods optimising and selecting based on big data analysis technology. In Proceedings of the International Petroleum Technology Conference, IPTC, Beijing, China, 26–28 March 2019; p. D011S010R003. [Google Scholar]
- Ounsakul, T.; Sirirattanachatchawan, T.; Pattarachupong, W.; Yokrat, Y.; Ekkawong, P. Artificial lift selection using machine learning. In Proceedings of the International Petroleum Technology Conference, IPTC, Beijing, China, 26–28 March 2019; p. D021S042R003. [Google Scholar]
- Chaodong, T.; Wenrong, S.; Loulou, L.; Peng, Q.; Zhaomin, G.; Wu, H. Research on Optimization Decision of Plunger Gas Lift Operation Based on Data Driven. In Proceedings of the 2019 IEEE Eurasia Conference on IOT, Communication and Engineering (ECICE), Yunlin, Taiwan, 3–6 October 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 263–266. [Google Scholar]
- Akhiiartdinov, A.; Pereyra, E.; Sarica, C.; Severino, J. Data Analytics Application for Conventional Plunger Lift Modeling and Optimization. In SPE Artificial Lift Conference and Exhibition—Americas; OnePetro: Richardson, TX, USA, 2020. [Google Scholar]
- Sami, N.A. Application of machine learning algorithms to predict tubing pressure in intermittent gas lift wells. Pet. Res. 2022, 7, 246–252. [Google Scholar] [CrossRef]
- Xie, Y.; Ma, S.; Wang, H.; Li, N.; Zhu, J.; Wang, J. Unsupervised clustering for the anomaly diagnosis of plunger lift operations. Geoenergy Sci. Eng. 2023, 231, 212305. [Google Scholar] [CrossRef]
- Petch, J.; Di, S.; Nelson, W. Opening the black box: The promise and limitations of explainable machine learning in cardiology. Can. J. Cardiol. 2022, 38, 204–213. [Google Scholar] [CrossRef] [PubMed]
- Narwaria, M. Does explainable machine learning uncover the black box in vision applications? Image Vis. Comput. 2022, 118, 104353. [Google Scholar] [CrossRef]
- Aas, K.; Jullum, M.; Løland, A. Explaining individual predictions when features are dependent: More accurate approximations to Shapley values. Artif. Intell. 2021, 298, 103502. [Google Scholar] [CrossRef]
- Lubo-Robles, D.; Devegowda, D.; Jayaram, V.; Bedle, H.; Marfurt, K.J.; Pranter, M.J. Machine learning model interpretability using SHAP values: Application to a seismic Facies classification task. In SEG Technical Program Expanded Abstracts 2020; Society of Exploration Geophysicists: Houston, TX, USA, 2020. [Google Scholar] [CrossRef]
- Tran, N.L.; Gupta, I.; Devegowda, D.; Jayaram, V.; Karami, H.; Rai, C.; Sondergeld, C.H. Application of Interpretable Machine-Learning Workflows to Identify Brittle, Fracturable, and Producible Rock in Horizontal Wells Using Surface Drilling Data. SPE Reserv. Eval. Eng. 2020, 23, 1328–1342. [Google Scholar] [CrossRef]
- Cross, T.; Sathaye, K.; Darnell, K.; Niederhut, D.; Crifasi, K. Predicting water production in the williston basin using a machine learning model. In Proceedings of the 8th Unconventional Resources Technology Conference, Virtual, 20–22 July 2020; American Association of Petroleum Geologists: Houston, TX, USA, 2020. [Google Scholar]
- Ma, X.; Zhou, D.; Cai, W.; Li, X.; He, M. An Interpretable Machine Learning Approach to Prediction Horizontal Well Productivity. J. Southwest Pet. Univ. Sci. Technol. Ed. 2022, 44, 81–90. [Google Scholar] [CrossRef]
- Lu, X.; Zhao, Y.; Peng, J. Analysis of Influencing Factors of Plunger Gas Lift Technology. Mech. Electr. Eng. Technol. 2022, 51, 141–144. [Google Scholar]
- Kong, Q.; Ye, C.; Sun, Y. Research on data preprocessing methods for big data. Comput. Technol. Dev. 2018, 28, 1–4. [Google Scholar] [CrossRef]
- Shannon, C.E. A mathematical theory of communication. SIGMOBILE Mob. Comput. Commun. Rev. 2001, 5, 3–55. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
- Shapley, L.S. Contributions to the Theory of Games (AM-28); Kuhn, H.W., Tucker, A.W., Eds.; Princeton University Press: Princeton, NJ, USA, 1953; pp. 307–318. [Google Scholar]
- Saifulizan, S.S.B.H.; Busahmin, B.; Prasad, D.M.R.; Elmabrouk, S. Evaluation of Different Well Control Methods Concentrating on the Application of Conventional Drilling Technique. ARPN J. Eng. Appl. Sci. 2023, 18, 1851–1857. [Google Scholar]
- Profillidis, V.A.; Botzoris, G.N. Econometric, Gravity, and the 4-Step Methods. In Modeling of Transport Demand; Elsevier: Amsterdam, The Netherlands, 2019; pp. 271–351. [Google Scholar] [CrossRef]
- Li, Q.; Han, Y.; Liu, X.; Ansari, U.; Cheng, Y.; Yan, C. Hydrate as a by-product in CO2 leakage during the long-term sub-seabed sequestration and its role in preventing further leakage. Environ. Sci. Pollut. Res. 2022, 29, 77737–77754. [Google Scholar] [CrossRef]
- Li, Q.; Wang, F.; Wang, Y.; Forson, K.; Cao, L.; Zhang, C.; Chen, J. Experimental investigation on the high-pressure sand suspension and adsorption capacity of guar gum fracturing fluid in low-permeability shale reservoirs: Factor analysis and mechanism disclosure. Environ. Sci. Pollut. Res. 2022, 29, 53050–53062. [Google Scholar] [CrossRef] [PubMed]
Name | Shut-in Casing Pressure (MPa) | Shut-in Tubing Pressure (MPa) | Open-Well Casing Pressure (MPa) | Open-Well Tubing Pressure (MPa) | Transfer Pressure (MPa) | Catcher Depth (m) |
---|---|---|---|---|---|---|
Mean | 4.98 | 4.55 | 4.57 | 2.48 | 2.41 | 3.34 |
Std | 1.11 | 1.00 | 1.07 | 0.80 | 0.70 | 0.17 |
Min | 3.22 | 1.95 | 2.92 | 1.49 | 0.00 | 3.11 |
25% | 4.25 | 3.89 | 3.86 | 2.02 | 2.01 | 3.20 |
50% | 4.64 | 4.25 | 4.24 | 2.22 | 2.16 | 3.27 |
75% | 5.76 | 5.07 | 5.25 | 2.64 | 2.54 | 3.50 |
Max | 7.93 | 7.77 | 7.81 | 7.02 | 5.30 | 3.62 |
Name | Starting Time of Wells (h) | Time to Close the Wells (h) | Plunger Rising Time (h) | GLR (104 m3/m3) | Catcher Well Inclination Angle (°) | Gas Production (104 m3/d) |
---|---|---|---|---|---|---|
Mean | 4.59 | 1.34 | 1.27 | 1.69 | 64.12 | 2.53 |
Std | 6.31 | 0.46 | 0.54 | 0.69 | 3.03 | 1.12 |
Min | 0.33 | 0.92 | 0.10 | 0.17 | 54.78 | 0.00 |
25% | 1.00 | 1.00 | 1.00 | 1.19 | 63.38 | 1.77 |
50% | 1.67 | 1.25 | 1.00 | 1.58 | 65.00 | 2.48 |
75% | 4.50 | 1.50 | 1.50 | 2.04 | 66.18 | 3.21 |
Max | 35.00 | 6.00 | 6.00 | 4.47 | 67.86 | 6.54 |
With Mutual Information Feature Selection | ||||||
---|---|---|---|---|---|---|
Cross-Validation Test Result | Test Set Result | |||||
XGBoost | RF | SVR | XGBoost | RF | SVR | |
RMSE | 0.38 | 0.35 | 0.44 | 0.37 | 0.41 | 0.51 |
MAE | 0.21 | 0.23 | 0.27 | 0.17 | 0.19 | 0.29 |
R2 | 0.87 | 0.87 | 0.82 | 0.85 | 0.83 | 0.77 |
MSE | 0.15 | 0.17 | 0.19 | 0.14 | 0.18 | 0.26 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Liu, J.; Shi, H.; Hong, J.; Wang, S.; Yang, Y.; Liu, H.; Guo, J.; Liu, Z.; Liao, R. Production Prediction and Influencing Factors Analysis of Horizontal Well Plunger Gas Lift Based on Interpretable Machine Learning. Processes 2024, 12, 1888. https://doi.org/10.3390/pr12091888
Liu J, Shi H, Hong J, Wang S, Yang Y, Liu H, Guo J, Liu Z, Liao R. Production Prediction and Influencing Factors Analysis of Horizontal Well Plunger Gas Lift Based on Interpretable Machine Learning. Processes. 2024; 12(9):1888. https://doi.org/10.3390/pr12091888
Chicago/Turabian StyleLiu, Jinbo, Haowen Shi, Jiangling Hong, Shengyuan Wang, Yingqiang Yang, Honglei Liu, Jiaojiao Guo, Zelin Liu, and Ruiquan Liao. 2024. "Production Prediction and Influencing Factors Analysis of Horizontal Well Plunger Gas Lift Based on Interpretable Machine Learning" Processes 12, no. 9: 1888. https://doi.org/10.3390/pr12091888
APA StyleLiu, J., Shi, H., Hong, J., Wang, S., Yang, Y., Liu, H., Guo, J., Liu, Z., & Liao, R. (2024). Production Prediction and Influencing Factors Analysis of Horizontal Well Plunger Gas Lift Based on Interpretable Machine Learning. Processes, 12(9), 1888. https://doi.org/10.3390/pr12091888