Comparative Study on Supervised Learning Models for Productivity Forecasting of Shale Reservoirs Based on a Data-Driven Approach
Autor: | Jihun Jung, Sunil Kwon, Dongkwon Han |
---|---|
Rok vydání: | 2020 |
Předmět: |
clustering analysis
variables importance method Computer science 020209 energy 02 engineering and technology Machine learning computer.software_genre lcsh:Technology Field (computer science) Data-driven lcsh:Chemistry Hydraulic fracturing 020401 chemical engineering 0202 electrical engineering electronic engineering information engineering General Materials Science 0204 chemical engineering Cluster analysis lcsh:QH301-705.5 Instrumentation Fluid Flow and Transfer Processes lcsh:T business.industry Process Chemistry and Technology Supervised learning shale gas General Engineering lcsh:QC1-999 Computer Science Applications Random forest Support vector machine machine learning lcsh:Biology (General) lcsh:QD1-999 lcsh:TA1-2040 data-driven Artificial intelligence Gradient boosting lcsh:Engineering (General). Civil engineering (General) business computer lcsh:Physics |
Zdroj: | Applied Sciences Volume 10 Issue 4 Applied Sciences, Vol 10, Iss 4, p 1267 (2020) |
ISSN: | 2076-3417 |
DOI: | 10.3390/app10041267 |
Popis: | Due to the rapid development of shale gas, a system has been established that can utilize a considerable amount of data using the database system. As a result, many studies using various machine learning techniques were carried out to predict the productivity of shale gas reservoirs. In this study, a comprehensive analysis is performed for a machine learning method based on data-driven approaches that evaluates productivity for shale gas wells by using various parameters such as hydraulic fracturing and well completion in Eagle Ford shale gas field. Two techniques are used to improve the performance of the productivity prediction machine learning model developed in this study. First, the optimal input variables were selected by using the variables importance method (VIM). Second, cluster analysis was used to analyze the similarities in the datasets and recreate the machine learning models for each cluster to compare the training and test results. To predict productivity, we used random forest (RF), gradient boosting tree (GBM), and support vector machine (SVM) supervised learning models. Compared to other supervised learning models, RF, which is applied with the VIM, has the best prediction performance. The retraining model through cluster analysis has excellent predictive performance. The developed model and prediction workflow are considered useful for reservoir engineers planning of field development plan. |
Databáze: | OpenAIRE |
Externí odkaz: |