<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xml:lang="en" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">EE</journal-id>
<journal-id journal-id-type="nlm-ta">EE</journal-id>
<journal-id journal-id-type="publisher-id">EE</journal-id>
<journal-title-group>
<journal-title>Energy Engineering</journal-title>
</journal-title-group>
<issn pub-type="epub">1546-0118</issn>
<issn pub-type="ppub">0199-8595</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">62876</article-id>
<article-id pub-id-type="doi">10.32604/ee.2025.062876</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Models for Predicting the Minimum Miscibility Pressure (MMP) of CO<sub>2</sub>-Oil in Ultra-Deep Oil Reservoirs Based on Machine Learning</article-title>
<alt-title alt-title-type="left-running-head">Models for Predicting the Minimum Miscibility Pressure (MMP) of CO<sub>2</sub>-Oil in Ultra-Deep Oil Reservoirs Based on Machine Learning</alt-title>
<alt-title alt-title-type="right-running-head">Models for Predicting the Minimum Miscibility Pressure (MMP) of CO<sub>2</sub>-Oil in Ultra-Deep Oil Reservoirs Based on Machine Learning</alt-title>
</title-group>
<contrib-group>
<contrib id="author-1" contrib-type="author">
<name name-style="western"><surname>Li</surname><given-names>Kun</given-names></name><xref ref-type="aff" rid="aff-1">1</xref></contrib>
<contrib id="author-2" contrib-type="author" corresp="yes">
<name name-style="western"><surname>Li</surname><given-names>Tianfu</given-names></name><xref ref-type="aff" rid="aff-2">2</xref><email>2022210392@student.cup.edu.cn</email></contrib>
<contrib id="author-3" contrib-type="author">
<name name-style="western"><surname>Wang</surname><given-names>Xiuwei</given-names></name><xref ref-type="aff" rid="aff-1">1</xref></contrib>
<contrib id="author-4" contrib-type="author">
<name name-style="western"><surname>Meng</surname><given-names>Qingchun</given-names></name><xref ref-type="aff" rid="aff-1">1</xref></contrib>
<contrib id="author-5" contrib-type="author">
<name name-style="western"><surname>Wang</surname><given-names>Zhenjie</given-names></name><xref ref-type="aff" rid="aff-1">1</xref></contrib>
<contrib id="author-6" contrib-type="author">
<name name-style="western"><surname>Luo</surname><given-names>Jinyang</given-names></name><xref ref-type="aff" rid="aff-1">1</xref><xref ref-type="aff" rid="aff-2">2</xref></contrib>
<contrib id="author-7" contrib-type="author">
<name name-style="western"><surname>Wang</surname><given-names>Zhaohui</given-names></name><xref ref-type="aff" rid="aff-1">1</xref></contrib>
<contrib id="author-8" contrib-type="author">
<name name-style="western"><surname>Yao</surname><given-names>Yuedong</given-names></name><xref ref-type="aff" rid="aff-2">2</xref></contrib>
<aff id="aff-1"><label>1</label><institution>Huabei Oilfield Company, PetroChina</institution>, <addr-line>Renqiu, 062552</addr-line>, <country>China</country></aff>
<aff id="aff-2"><label>2</label><institution>College of Petroleum Engineering, China University of Petroleum (Beijing)</institution>, <addr-line>Beijing, 102249</addr-line>, <country>China</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>&#x002A;</label>Corresponding Author: Tianfu Li. Email: <email>2022210392@student.cup.edu.cn</email></corresp>
</author-notes>
<pub-date date-type="collection" publication-format="electronic">
<year>2025</year></pub-date>
<pub-date date-type="pub" publication-format="electronic">
<day>29</day>
<month>05</month>
<year>2025</year></pub-date>
<volume>122</volume>
<issue>6</issue>
<fpage>2215</fpage>
<lpage>2238</lpage>
<history>
<date date-type="received">
<day>30</day>
<month>12</month>
<year>2024</year>
</date>
<date date-type="accepted">
<day>19</day>
<month>3</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2025 The Authors.</copyright-statement>
<copyright-year>2025</copyright-year>
<copyright-holder>Published by Tech Science Press.</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_EE_62876.pdf"></self-uri>
<abstract>
<p>CO<sub>2</sub> flooding for enhanced oil recovery (EOR) not only enables underground carbon storage but also plays a critical role in tertiary oil recovery. However, its displacement efficiency is constrained by whether CO<sub>2</sub> and crude oil achieve miscibility, necessitating precise prediction of the minimum miscibility pressure (MMP) for CO<sub>2</sub>-oil systems. Traditional methods, such as experimental measurements and empirical correlations, face challenges including time-consuming procedures and limited applicability. In contrast, artificial intelligence (AI) algorithms have emerged as superior alternatives due to their efficiency, broad applicability, and high prediction accuracy. This study employs four AI algorithms&#x2014;Random Forest Regression (RFR), Genetic Algorithm Based Back Propagation Artificial Neural Network (GA-BPNN), Support Vector Regression (SVR), and Gaussian Process Regression (GPR)&#x2014;to establish predictive models for CO<sub>2</sub>-oil MMP. A comprehensive database comprising 151 data entries was utilized for model development. The performance of these models was rigorously evaluated using five distinct statistical metrics and visualized comparisons. Validation results confirm their accuracy. Field applications demonstrate that all four models are effective for predicting MMP in ultra-deep reservoirs (burial depth &#x003E;5000 m) with complex crude oil compositions. Among them, the RFR and GA-BPNN models outperform SVR and GPR, achieving root mean square errors (RMSE) of 0.33% and 2.23%, and average absolute percentage relative errors (AAPRE) of 0.01% and 0.04%, respectively. Sensitivity analysis of MMP-influencing factors reveals that reservoir temperature (T<sub>R</sub>) exerts the most significant impact on MMP, while Xint (mole fraction of intermediate oil components, including C<sub>2</sub>-C<sub>4</sub>, CO<sub>2</sub>, and H<sub>2</sub>S) exhibits the least influence.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>MMP</kwd>
<kwd>random forest regression</kwd>
<kwd>genetic algorithm based back propagation artificial neural network</kwd>
<kwd>support vector regression</kwd>
<kwd>gaussian process regression</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<label>1</label>
<title>Introduction</title>
<p>In recent years, the rapid growth of the global economy has driven a steady rise in demand for oil resources across nations [<xref ref-type="bibr" rid="ref-1">1</xref>]. However, conventional oil recovery methods often exhibit low extraction efficiency, prompting heightened interest in technologies capable of enhancing hydrocarbon recovery [<xref ref-type="bibr" rid="ref-2">2</xref>]. Among these, CO<sub>2</sub> flooding stands out as a highly effective and economically viable enhanced oil recovery (EOR) technique. Its significance stems not only from its ability to achieve carbon capture, utilization, and storage (CCUS)&#x2014;thereby contributing to the mitigation of the greenhouse effect&#x2014;but also from its cost-effectiveness and operational efficiency [<xref ref-type="bibr" rid="ref-3">3</xref>&#x2013;<xref ref-type="bibr" rid="ref-5">5</xref>]. The mechanisms by which CO<sub>2</sub> enhances oil recovery primarily include viscosity reduction through dissolution into crude oil, interfacial tension reduction, and most critically, miscible or immiscible displacement achieved via compositional interactions with the crude oil. When CO<sub>2</sub> is miscible with crude oil, the oil recovery rate will be greatly improved, and theoretically, the displacement efficiency can reach 100% [<xref ref-type="bibr" rid="ref-6">6</xref>].</p>
<p>Related experimental studies and field practices [<xref ref-type="bibr" rid="ref-7">7</xref>,<xref ref-type="bibr" rid="ref-8">8</xref>] have demonstrated that oil recovery rates under miscible and immiscible conditions exhibit marked disparities. Therefore, accurate determination of the minimum miscible pressure (MMP)&#x2014;the critical threshold delineating miscible and immiscible regimes&#x2014;is crucial for improving oil recovery efficiency. At present, the main methods for determining the minimum mixed phase pressure include experimental methods, empirical formulas, specific correlations, and intelligent algorithms.</p>
<p>The experimental determination of the MMP is widely adopted in CO<sub>2</sub> flooding projects due to its high accuracy and precision. Currently, commonly used experimental methods for determining the MMP include capillary tests, the bubble apparatus method, and the interfacial tension elimination method [<xref ref-type="bibr" rid="ref-9">9</xref>&#x2013;<xref ref-type="bibr" rid="ref-11">11</xref>]. Although these experimental techniques exhibit high reliability, they are associated with limitations such as time-consuming procedures, high operational costs, and complex operating procedures [<xref ref-type="bibr" rid="ref-12">12</xref>]. Given these constraints, developing rapid and inexpensive method methods for CO<sub>2</sub>-oil MMP estimation is imperative. <xref ref-type="table" rid="table-1">Table 1</xref> summarizes prior studies investigating MMP under diverse conditions using experimental and empirical approaches.</p>
<table-wrap id="table-1">
<label>Table 1</label>
<caption>
<title>Previous experimental and empirical studies on MMP</title>
</caption>
<table>
<colgroup>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
</colgroup>
<thead>
<tr>
<th align="center">Author</th>
<th align="center">Method</th>
<th align="center">Independent parameters</th>
<th align="center">Temperature (<bold>&#x00B0;</bold>C)</th>
<th align="center">Remarks</th>
</tr>
</thead>
<tbody>
<tr>
<td>Holm et al. (1974) [<xref ref-type="bibr" rid="ref-13">13</xref>]</td>
<td>Experimental/</td>
<td>T<sub>R</sub>, MWC<sub>5&#x002B;</sub></td>
<td>32.2&#x2013;82.2</td>
<td>-Temperature ramg limit (32.2&#x00B0;C to 82.2&#x00B0;C)</td>
</tr>
<tr>
<td/>
<td>plot method</td>
<td/>
<td/>
<td>-Pressure range limit (9.65 MPa to 22 MPa)</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td>-180 &#x003C; MWC<sub>5&#x002B;</sub> &#x003C; 240</td>
</tr>
<tr>
<td>Yellig and Metcalfc (1980) [<xref ref-type="bibr" rid="ref-16">16</xref>]</td>
<td>Empirical methods</td>
<td>T<sub>R</sub></td>
<td>35&#x2013;88.9</td>
<td>Temperature rang limit (35&#x00B0;C to 88.9&#x00B0;C)</td>
</tr>
<tr>
<td>Glas&#x00F8; (1985) [<xref ref-type="bibr" rid="ref-17">17</xref>]</td>
<td>Empirical methods</td>
<td>T<sub>R</sub>, MWC<sub>7&#x002B;</sub>, C<sub>2</sub>-C<sub>6</sub></td>
<td>54.4&#x2013;105.5</td>
<td>Influenced by C<sub>2</sub>-C<sub>6</sub> components</td>
</tr>
<tr>
<td>Alston et al. (1985) [<xref ref-type="bibr" rid="ref-18">18</xref>]</td>
<td>Empirical methods</td>
<td>T<sub>R</sub>, MWC<sub>5&#x002B;</sub>, X<sub>vol</sub>, X<sub>int</sub></td>
<td>32.2&#x2013;117.2</td>
<td>Limited by bubble point pressure</td>
</tr>
<tr>
<td>Abdurrahman et al. (2015) [<xref ref-type="bibr" rid="ref-19">19</xref>]</td>
<td>Experimental</td>
<td>T<sub>R</sub>, P<sub>r</sub>,</td>
<td>60&#x2013;66</td>
<td>Experimental operation is complex</td>
</tr>
<tr>
<td>Mansour et al. (2018) [<xref ref-type="bibr" rid="ref-14">14</xref>]</td>
<td>Empirical methods</td>
<td>Tc, API, MWC<sub>5&#x002B;</sub>, MWC<sub>7&#x002B;</sub></td>
<td>32.2&#x2013;118.9</td>
<td>Limited by oil samples</td>
</tr>
<tr>
<td>Chen et al. (2020) [<xref ref-type="bibr" rid="ref-15">15</xref>]</td>
<td>Improved empirical methods</td>
<td>T<sub>R</sub>, MWC<sub>7&#x002B;</sub>, dataset of pure and impure CO<sub>2</sub></td>
<td>30.5&#x2013;147.8</td>
<td>Empirical corelation</td>
</tr>
<tr>
<td>Ge et al. (2021) [<xref ref-type="bibr" rid="ref-20">20</xref>]</td>
<td>Empirical methods</td>
<td>T<sub>R</sub>, reservoir oil composition</td>
<td>71.6&#x2013;108.4</td>
<td>Limited by oil samples</td>
</tr>
<tr>
<td>Yu et al. (2024) [<xref ref-type="bibr" rid="ref-7">7</xref>]</td>
<td>Empirical methods</td>
<td>T<sub>R</sub>, composition of injected gas, reservoir oil composition</td>
<td>40.6&#x2013;122.5</td>
<td>Limited by crude oil composition and data sets</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Due to the limitations of experimental methods and the need for rapid prediction of the MMP, various empirical approaches have been proposed, demonstrating significant applicability and gradually emerging as alternatives to traditional experimental techniques. For instance, Holm et al. [<xref ref-type="bibr" rid="ref-13">13</xref>] introduced the plate method for MMP estimation, which relies on analyzing changes in crude oil density within reservoir formations. However, this method is restricted to reservoirs with temperatures between 32.2&#x00B0;C and 82.2&#x00B0;C and formation pressures ranging from 9.65 to 22 MPa. Mansour et al. [<xref ref-type="bibr" rid="ref-14">14</xref>] developed an empirical formula that integrates T<sub>C</sub>, API, MWC<sub>5&#x002B;</sub>, and MWC<sub>7&#x002B;</sub> to predict MMP for live oil systems. Similarly, Chen et al. [<xref ref-type="bibr" rid="ref-15">15</xref>] proposed a multi-factor empirical model accounting for ten variables influencing MMP, which exhibits superior accuracy compared to earlier formulations. Despite these advancements, empirical formulas often suffer from narrow applicability, being valid only for reservoirs with specific characteristics and failing to generalize across diverse geological settings. Consequently, there is a pressing need to develop more accurate and universally adaptable methods for CO<sub>2</sub>-oil MMP prediction.</p>
<p>Recent advancements in artificial intelligence (AI), particularly its superior machine learning (ML) capabilities and robust data analytics proficiency, have catalyzed the increasing adoption of ML techniques to address complex challenges in oilfield production. Key applications spanning well-test interpretation, log analysis, reservoir classification, production performance forecasting, and gas solubility prediction in crude oil have demonstrated the efficacy of ML-driven approaches, showcasing their adaptability and precision in handling datasets [<xref ref-type="bibr" rid="ref-21">21</xref>&#x2013;<xref ref-type="bibr" rid="ref-24">24</xref>]. Given these successes, researchers are now actively exploring ML-based frameworks to predict the CO<sub>2</sub>-oil MMP. Huang et al. [<xref ref-type="bibr" rid="ref-25">25</xref>] developed a three-layer back propagation artificial neural network(BPNN) model and comprehensively considered the effects of Molecular weight of C<sub>5&#x002B;</sub> fraction, reservoir temperature, and volatility to predict pure and implied CO<sub>2</sub>-oil MMP. Shokrollahi et al. [<xref ref-type="bibr" rid="ref-26">26</xref>] developed a Least Squares Support Vector Machine model (LSSVM) that comprehensively considers the effects of reservoir temperature, MWC<sub>5&#x002B;</sub>, and the contents of y<sub>CO2</sub>, y<sub>H2S</sub>, y<sub>C1</sub>, and N<sub>2</sub> in the injected gas to predict pure and impure MMP. Kamari et al. [<xref ref-type="bibr" rid="ref-27">27</xref>] developed a Genetic expression Programming (GEP) model to predict pure and impure CO<sub>2</sub>-oil MMP in live reservoir oil systems during CO<sub>2</sub> flooding. <xref ref-type="table" rid="table-2">Table 2</xref> provides a comparative summary of predictive models for the CO<sub>2</sub>-oil MMP developed in previous studies, detailing their input variables and methodological constraints.</p>
<table-wrap id="table-2">
<label>Table 2</label>
<caption>
<title>A summary of Previous studies on models for prediction CO<sub>2</sub>-oil MMP</title>
</caption>
<table>
<colgroup>
<col/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
</colgroup>
<thead>
<tr>
<th>Author</th>
<th align="center">Method</th>
<th align="center">Independent parameters</th>
<th align="center">Temperature (<bold>&#x00B0;</bold>C)</th>
<th align="center">Remarks</th>
</tr>
</thead>
<tbody>
<tr>
<td>Huang et al. (2003) [<xref ref-type="bibr" rid="ref-25">25</xref>]</td>
<td>BPNN</td>
<td>TR, MW<sub>C5&#x002B;</sub>, X<sub>vol</sub>, X<sub>int</sub></td>
<td>16.7&#x2013;86.7</td>
<td>Limited by data rang, possibility of overfitting</td>
</tr>
<tr>
<td>Emera et al. (2005) [<xref ref-type="bibr" rid="ref-28">28</xref>]</td>
<td>GA</td>
<td>TR, MW<sub>C5&#x002B;</sub>, X<sub>vol</sub>, X<sub>int</sub></td>
<td>Not mentioned</td>
<td>Limited by bubble point pressure, narrow range of application</td>
</tr>
<tr>
<td>Birang et al. (2007) [<xref ref-type="bibr" rid="ref-29">29</xref>]</td>
<td>ANN</td>
<td>TR, MW<sub>C2-C5</sub> and MW<sub>C7&#x002B;</sub> component, X<sub>vol</sub>, X<sub>C1-C5</sub>, composition of injected gas</td>
<td>55.5&#x2013;106.5</td>
<td>Limited by hidden neurons, missing sample data points</td>
</tr>
<tr>
<td>Shokrollahi et al. (2013) [<xref ref-type="bibr" rid="ref-26">26</xref>]</td>
<td>LSSVM</td>
<td>T<sub>R</sub>, y<sub>CO2</sub>, y<sub>H2S</sub>, y<sub>C1</sub>, MW<sub>C5&#x002B;</sub>, N<sub>2</sub></td>
<td>31.7&#x2013;117.8</td>
<td>Difficulty in selecting hyper-parameters of LSSVM method</td>
</tr>
<tr>
<td>Kamari et al. (2015) [<xref ref-type="bibr" rid="ref-27">27</xref>]</td>
<td>GEP</td>
<td>T<sub>R</sub>, MW<sub>C5&#x002B;</sub>, X<sub>vol</sub>/X<sub>int</sub>, composition of injected gas</td>
<td>31.7&#x2013;117.8</td>
<td>The model has a low R<sup>2</sup> (0.63) on the validation set</td>
</tr>
<tr>
<td>Bian et al. (2016) [<xref ref-type="bibr" rid="ref-30">30</xref>]</td>
<td>SVR-GA</td>
<td>T<sub>R</sub>, X<sub>int</sub>, X<sub>vol</sub> MWC<sub>5&#x002B;</sub></td>
<td>31.7&#x2013;117.8</td>
<td>The sample data size is not mention</td>
</tr>
<tr>
<td>Saeedi Dehaghani et al. (2020) [<xref ref-type="bibr" rid="ref-31">31</xref>]</td>
<td>ANN, POS-ANN, ANN-TLBO</td>
<td>T<sub>R</sub>, MW<sub>C5&#x002B;</sub>, X<sub>vol</sub>, X<sub>int,</sub> composition of injected gas</td>
<td>66.27&#x2013;118.3</td>
<td>Lack of discussion on sensitivity analysis</td>
</tr>
<tr>
<td>Nait Amar et al. (2020) [<xref ref-type="bibr" rid="ref-32">32</xref>]</td>
<td>SVM</td>
<td>T<sub>R</sub>, C<sub>2</sub>-C<sub>6</sub>, MW<sub>C5&#x002B;</sub>, X<sub>vol/</sub>X<sub>int</sub></td>
<td>32.2&#x2013;136.7</td>
<td>The sample datapoints are not mention</td>
</tr>
<tr>
<td>Ghiasi et al. (2021) [<xref ref-type="bibr" rid="ref-33">33</xref>]</td>
<td>hybrid-ANFIS and AdaBoost-CART</td>
<td>T<sub>R</sub>, T<sub>c</sub>, composition of injected gas, MW<sub>C5&#x002B;</sub>, X<sub>vol</sub>/X<sub>int</sub></td>
<td>32.2&#x2013;136.7</td>
<td>The sample datapoints are not mention and limited by membership functions setting</td>
</tr>
<tr>
<td>Al-Khafaji et al. (2023) [<xref ref-type="bibr" rid="ref-34">34</xref>]</td>
<td>DT, SVR, KNN,<break/> RF, MLR</td>
<td>MW<sub>C5&#x002B;</sub>, T<sub>R</sub>, y<sub>H2S</sub></td>
<td>31.7&#x2013;136.7</td>
<td>Limited by model training time</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The results of these machine learning models for predicting MMP indicate that reservoir temperature and crude oil composition have significantly impact on MMP [<xref ref-type="bibr" rid="ref-21">21</xref>,<xref ref-type="bibr" rid="ref-23">23</xref>,<xref ref-type="bibr" rid="ref-24">24</xref>,<xref ref-type="bibr" rid="ref-26">26</xref>,<xref ref-type="bibr" rid="ref-27">27</xref>]. For ultra-deep oil reservoirs buried deeper than 5000 m, characteristics such as high reservoir temperature and complex crude oil composition are present. Existing prediction models exhibit low accuracy and local overfitting problems when predicting MMP in ultra-deep oil reservoirs. In order to quickly and accurately predict MMP in ultra-deep oil reservoirs, we developed multiple models with ten input parameters (i.e., T<sub>R</sub>, X<sub>vol</sub>, X<sub>int</sub>, X<sub>C5-C6</sub>, MWC<sub>7&#x002B;</sub>, mole fraction of CO<sub>2</sub> in solution (y<sub>CO2</sub>), mole fraction of C<sub>1</sub> in the CO<sub>2</sub> injection gas (y<sub>C1</sub>), mole fraction of N<sub>2</sub> in the injection gas (y<sub>N2</sub>), mole fraction of H<sub>2</sub>S in the injection gas (y<sub>H2S</sub>), and mole fraction of C<sub>2</sub>-C<sub>4</sub> in the injection gas (y<sub>C2-C4</sub>)) to predict MMP in both pure and impure CO<sub>2</sub> injection cases in ultra-deep oil reservoirs. Based on the above description, multiple models were established and optimized, and suitable model for CO<sub>2</sub> MMP in ultra-deep oil reservoirs was selected by comparing the prediction results of different models. Grid search was employed to tune and optimize the hyperparameters of these models, which predict the CO<sub>2</sub>-oil MMP in ultra-deep oil reservoirs, thereby ensuring their predictive accuracy. In addition, a sensitivity analysis was systematically conducted to quantify the relative contributions of individual factors affecting MMP and to elucidate the functional dependencies of MMP on these critical variables.</p>
</sec>
<sec id="s2">
<label>2</label>
<title>Minimum Miscibility Pressure (MMP)</title>
<p>During the process of injecting CO<sub>2</sub> into oil reservoirs for enhanced oil recovery, interactions between gas, oil, and water phases occur in the rock layers, resulting in interphase component transfer, phase transitions, and other complex phase behaviors. CO<sub>2</sub> undergoes multiple contact displacements with crude oil under reservoir temperature and pressure, achieving either miscible or immiscible flooding. The CO<sub>2</sub> mixed-phase drive refers to the formation of a stable mixed-phase zone at the leading edge between CO<sub>2</sub> and crude oil when the injection pressure is higher than a certain value and the disappearance of the oil-gas interface, resulting in zero interfacial tension in porous media. This pressure is called the minimum miscibility pressure.</p>
<p>It is widely recognized that temperature, crude oil components, and the composition of injected gas significantly influence the MMP. To date, researchers [<xref ref-type="bibr" rid="ref-27">27</xref>,<xref ref-type="bibr" rid="ref-30">30</xref>,<xref ref-type="bibr" rid="ref-31">31</xref>,<xref ref-type="bibr" rid="ref-33">33</xref>] have conducted comprehensive studies on the influencing factors affecting MMP. Reservoir temperature plays a crucial role in the MMP during CO<sub>2</sub> flooding. As the reservoir temperature increases, the amount of CO<sub>2</sub> dissolved in crude oil decreases, and the supercritical nature of CO<sub>2</sub> diminishes, particularly in terms of density. This reduction seriously affects the ability of CO<sub>2</sub> extraction to extract light hydrocarbons. Currently, most research indicates a linear positive correlation between CO<sub>2</sub>-oil MMP and reservoir temperature. Alston et al. [<xref ref-type="bibr" rid="ref-18">18</xref>] studied the crude oil components, dividing them into three parts&#x2014;X<sub>vol</sub>, X<sub>int</sub>, and MWC<sub>5&#x002B;</sub>&#x2014;have an impact on MMP, but MW<sub>C5&#x002B;</sub> has a more significant effect on CO<sub>2</sub>-oil MMP. Additionally, injecting gas components also has a significant impact on MMP. Sebastian et al. [<xref ref-type="bibr" rid="ref-35">35</xref>] found that injecting CO<sub>2</sub> gas containing C<sub>1</sub>, N<sub>2</sub>, H<sub>2</sub>S, and light component hydrocarbons (C<sub>2</sub>-C<sub>4</sub>) can have an impact on CO<sub>2</sub>-oil MMP.</p>
<p>Consequently, ten factors (T<sub>R</sub>, x<sub>int</sub>, x<sub>vol</sub>, x<sub>C5&#x2013;C6</sub>, MW<sub>C7&#x002B;</sub>, y<sub>C1</sub>, y<sub>N2</sub>, y<sub>CO2</sub>, y<sub>C2-C4</sub>, and y<sub>H2S</sub>) that significantly affect MMP were selected as the input variables and imported to design multiple models for MMP simulation. To evaluate the correlation between these characteristic variables and their influence on the MMP, this study employs the partial correlation analysis to detect any redundant information among the variables and ensure their independence. The results are shown in <xref ref-type="fig" rid="fig-1">Fig. 1</xref>. It can be seen from the <xref ref-type="fig" rid="fig-1">Fig. 1</xref> that the r values between any two feature variables and MMP differ, indicating that the ten features are independent of each other and there is no significant collinearity problem.</p>
<fig id="fig-1">
<label>Figure 1</label>
<caption>
<title>Partial correlation analysis results heatmap</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="EE_62876-fig-1.tif"/>
</fig>
</sec>
<sec id="s3">
<label>3</label>
<title>Basic Description of Models and Experimental Data</title>
<sec id="s3_1">
<label>3.1</label>
<title>GA-BPNN Model</title>
<p>Artificial Neural Networks (ANNs) are computational models designed to simulate the structure and function of neurons in the human brain. Similar to the way the human brain works, artificial neural networks can automatically process multiple input variables in parallel, passing information from the input layer to the output layer through the learning process [<xref ref-type="bibr" rid="ref-36">36</xref>,<xref ref-type="bibr" rid="ref-37">37</xref>]. In addition, ANNs also have the ability to reveal potential linear or nonlinear relationships between input and output data. A complete neural network consists of an input layer, an output layer, and at least one hidden layer. The basic structural unit of a neural network is the neuron, which can receive, process, and output signals. It is a nonlinear element with multiple inputs and a single output. A large number of neurons are interconnected via connection weights, bias values, and transfer functions between neurons in each layer, forming a neural network with robust information processing capabilities. A back propagation neural network (BPNN) is a type of feedforward neural network that incorporates a backpropagation learning algorithm. Due to its adjustable architecture, capability for forward signal propagation, and mechanism of error backpropagation, the BPNN has emerged as the most widely implemented neural network paradigm. The schematic architecture of a standard three-layer BPNN is depicted in <xref ref-type="fig" rid="fig-2">Fig. 2</xref>. However, ANN models are prone to challenges, including overfitting, suboptimal parameter selection, and computational inefficiency during training. To mitigate these limitations, a hybrid framework integrating a Genetic Algorithm (GA) and BPNN is employed to optimize network weights and biases, thereby enhancing prediction accuracy [<xref ref-type="bibr" rid="ref-38">38</xref>].</p>
<fig id="fig-2">
<label>Figure 2</label>
<caption>
<title>Typical three-layer backpropagation neural network model structure diagram</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="EE_62876-fig-2.tif"/>
</fig>
<p>The GA, first proposed by American scholar John Holland, it a global optimization algorithm that simulates the process of biological evolution.GA mimics principles of natural selection, crossover, and mutation to enhance individual fitness within a population. Its workflow comprises four stages: parameter encoding, population initialization, fitness evaluation, and iterative genetic operations [<xref ref-type="bibr" rid="ref-39">39</xref>,<xref ref-type="bibr" rid="ref-40">40</xref>]. Genetic operations include selection, crossover, and mutation. The selection operation can inherit excellent chromosomes to the next generation or generate new individuals through crossover operation and pass them on to the next generation. By adopting an optimal preservation method, the fitness values of individuals within the same population are sorted in descending order, with priority given to selecting individuals with higher fitness values for inheritance to the next generation. The crossover operation is the process of selecting chromosomes from a population and recombining them into new chromosomes through crossover operations. This operation can effectively inherit excellent genes from the population. The mutation operation involves using other genes in an individual to replace specific genes, which can enhance local search ability and effectively preserve population diversity. In general, selection operations ensure that the population evolves toward higher fitness, crossover operations accelerate the speed of genetic algorithm searching for optimal solutions, and mutation operations help prevent the algorithm from falling into local optima too early. Together, the three genetic operations of selection, crossover, and mutation enable GA to efficiently search the solution space and gradually approach the optimal solution.</p>
<p>Optimizing the BPNN model through GA and developing the GA-BPNN hybrid model. The specific steps are as follows:</p>
<p>(1) Initialization: Define the BPNN architecture and initialize GA parameters and population.</p>
<p>(2) Fitness Evaluation: Train the BPNN on the dataset and calculate individual fitness based on prediction accuracy.</p>
<p>(3) GA Execution: Perform selection, crossover, and mutation operations within the GA.</p>
<p>(4) Population Update: Replace old individuals with newly generated individuals to form a new population.</p>
<p>(5) Termination Check: Determine whether the termination condition is satisfied. If the termination condition is not satisfied, return to step 2 to continue the iteration.</p>
<p>(6) Model Finalization: Deploy the BPNN with optimized weights and biases from the highest-fitness individual.</p>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>Random Forest Regression</title>
<p>RFR is an ensemble algorithm composed of multiple decision trees based on the bootstrap aggregating method; it was initially designed for classification tasks. As research progressed, the RFR model was further applied to predict CO<sub>2</sub>-oil MMP. For instance, Al Khafaji et al. [<xref ref-type="bibr" rid="ref-34">34</xref>] used the RFR model to predict MMP. The basic principles of RFR can be summarized as four steps: sample selection, feature selection, decision tree construction, and prediction. The number of trees in a random forest is represented as <italic>B</italic>, and the predicted result of the <italic>b</italic>-th tree is represented as F<sub>b</sub>(<italic>x</italic>). Therefore, the prediction formula for RFR can be expressed as:
<disp-formula id="eqn-1"><label>(1)</label><mml:math id="mml-eqn-1" display="block"><mml:mrow><mml:mover><mml:mrow><mml:mtext>y</mml:mtext></mml:mrow><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mi>B</mml:mi></mml:mfrac><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>b</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>B</mml:mi></mml:mrow></mml:munderover><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>The steps for establishing a random forest regression model are as follows:</p>
<p>(1) Data Standardization: Standardize all data to ensure that their values are between &#x2013;1 and 1, in order to obtain accurate prediction results and accelerate training speed.</p>
<p>(2) Import Data and Select Parameters: Import all data and output the dataset to the model for model training and testing.</p>
<p>(3) Bootstrap Sampling: The bootstrap resampling method is employed to randomly sample the original dataset with replacement, generating multiple subsets. Each subset trains a decision tree, while the unsampled instances (termed out-of-bag (OOB) data) are reserved for validation.</p>
<p>(4) Tree Growth: Each decision tree randomly selects several features from the input variable x for the current node&#x2019;s split subset. The optimal feature for splitting is determined by maximizing variance reduction, which enables maximal tree growth.</p>
<p>(5) OOB Validation: The OOB data, which are excluded from the training process, serve to evaluate model performance and accuracy. The optimal number of decision trees, T, is determined by iteratively refining the model based on OOB prediction errors until convergence.</p>
<p>(6) Prediction: For the input variable x<sub>i</sub> (i &#x003D; 1, 2, &#x2026; , k) that needs to be predicted. Each decision tree will output a corresponding predicted value y<sub>i</sub> (i &#x003D; 1, 2, &#x2026; , k).The predicted value of RFR is the average of the predicted values of all decision trees.</p>
</sec>
<sec id="s3_3">
<label>3.3</label>
<title>Support Vector Regression</title>
<p>SVR is a regression analysis method derived from Support Vector Machines (SVM). While SVR shares conceptual similarities with traditional SVM classification algorithms, its objective diverges by focusing on identifying an optimal fitting function for continuous prediction problems, as opposed to delineating classification boundaries [<xref ref-type="bibr" rid="ref-41">41</xref>]. The basic idea of the SVR model is to find a function that can predict the output value as accurately as possible while keeping the complexity of the model low to avoid overfitting. To achieve this goal, SVR uses a loss function called &#x201C;&#x03B5;-insensitive loss function&#x201D;, which means that the loss is only calculated when the prediction error exceeds a certain threshold, &#x03B5; [<xref ref-type="bibr" rid="ref-42">42</xref>]. Its basic definition is as follows:
<disp-formula id="eqn-2"><label>(2)</label><mml:math id="mml-eqn-2" display="block"><mml:mi>L</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>y</mml:mi><mml:mo>,</mml:mo><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mtable columnalign="left left" rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mrow><mml:mo>|</mml:mo><mml:mi>y</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mo>&#x2264;</mml:mo><mml:mi>&#x03B5;</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mo>|</mml:mo><mml:mi>y</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi>&#x03B5;</mml:mi></mml:mtd><mml:mtd><mml:mi>o</mml:mi><mml:mi>t</mml:mi><mml:mi>h</mml:mi><mml:mi>e</mml:mi><mml:mi>r</mml:mi><mml:mi>w</mml:mi><mml:mi>i</mml:mi><mml:mi>s</mml:mi><mml:mi>e</mml:mi></mml:mtd></mml:mtr></mml:mtable><mml:mo fence="true" stretchy="true" symmetric="true"></mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>In the nonlinear case, by introducing relaxation variables <inline-formula id="ieqn-1"><mml:math id="mml-ieqn-1"><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-2"><mml:math id="mml-ieqn-2"><mml:msubsup><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:math></inline-formula>, as well as a penalty coefficient C, the penalty coefficient is used to control the trade-off between model complexity and data fitting. This transforms the optimization problem of SVR into solving the following optimization problem:
<disp-formula id="eqn-3"><label>(3)</label><mml:math id="mml-eqn-3" display="block"><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac><mml:msup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi>w</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:mi>C</mml:mi><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:munderover><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msup></mml:math></disp-formula></p>
<p>The model is subject to the following constraints:
<disp-formula id="eqn-4"><label>(4)</label><mml:math id="mml-eqn-4" display="block"><mml:mrow><mml:mo>{</mml:mo><mml:mtable columnalign="left" rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mi>w</mml:mi><mml:mo>&#x22C5;</mml:mo><mml:mi>x</mml:mi><mml:mo>+</mml:mo><mml:mi>b</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2264;</mml:mo><mml:mi>&#x03B5;</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mo>(</mml:mo><mml:mi>w</mml:mi><mml:mo>&#x22C5;</mml:mo><mml:mi>x</mml:mi><mml:mo>+</mml:mo><mml:mi>b</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2264;</mml:mo><mml:mi>&#x03B5;</mml:mi><mml:mo>+</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msup></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>&#x03B5;</mml:mi><mml:mo>&#x2265;</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msup><mml:mo>&#x2265;</mml:mo><mml:mn>0</mml:mn></mml:mtd></mml:mtr></mml:mtable><mml:mo fence="true" stretchy="true" symmetric="true"></mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>The solution to this problem can be achieved by introducing Lagrange functions <inline-formula id="ieqn-3"><mml:math id="mml-ieqn-3"><mml:mi>L</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>w</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>, transforming the original problem into a dual problem, and using KKT conditions to find the optimal solution. The specific process is as follows:
<disp-formula id="eqn-5"><label>(5)</label><mml:math id="mml-eqn-5" display="block"><mml:mtable columnalign="right left right left right left right left right left right left" rowspacing="3pt" columnspacing="0em 2em 0em 2em 0em 2em 0em 2em 0em 2em 0em" displaystyle="true"><mml:mtr><mml:mtd><mml:mi>L</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>w</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mi></mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac><mml:msup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:mi>w</mml:mi><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:mi>C</mml:mi><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:munderover><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msup><mml:mo>&#x2212;</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:munderover><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>&#x03B5;</mml:mi><mml:mo>+</mml:mo><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mi>w</mml:mi><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mi>b</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd /><mml:mtd><mml:mi></mml:mi><mml:mo>&#x2212;</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:munderover><mml:msup><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>&#x03B5;</mml:mi><mml:mo>+</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msup><mml:mo>+</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:mi>w</mml:mi><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:mi>b</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:munderover><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03B7;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>&#x03B7;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msup><mml:msup><mml:mrow><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>where, <inline-formula id="ieqn-4"><mml:math id="mml-ieqn-4"><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msubsup><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03B7;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msubsup><mml:mi>&#x03B7;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:mrow></mml:msubsup></mml:math></inline-formula> are Lagrange operators, partial derivatives are obtained for <inline-formula id="ieqn-5"><mml:math id="mml-ieqn-5"><mml:mi>w</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03B5;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, respectively, to obtain saddle points. Then, the results are substituted into the original optimization problem, which can be transformed into the following dual problem:
<disp-formula id="eqn-6"><label>(6)</label><mml:math id="mml-eqn-6" display="block"><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:mi>x</mml:mi><mml:mi>W</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>a</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mfrac><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:munderover><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mi>K</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi>&#x03B5;</mml:mi><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:munderover><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:munderover><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula></p>
<p>The constraints imposed on the SVR model are as follows:
<disp-formula id="eqn-7"><label>(7)</label><mml:math id="mml-eqn-7" display="block"><mml:mrow><mml:mo>{</mml:mo><mml:mtable columnalign="left" rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mo>&#x2264;</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msup><mml:mo>&#x2264;</mml:mo><mml:mi>C</mml:mi></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:munderover><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:munderover><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable><mml:mo fence="true" stretchy="true" symmetric="true"></mml:mo></mml:mrow></mml:math></disp-formula>where, <inline-formula id="ieqn-6"><mml:math id="mml-ieqn-6"><mml:mi>K</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is a kernel function used to calculate the inner product of data points, <inline-formula id="ieqn-7"><mml:math id="mml-ieqn-7"><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-8"><mml:math id="mml-ieqn-8"><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> in high-dimensional space. The commonly used kernel functions include linear kernel, polynomial kernel, Gaussian kernel, etc. Among them, Gaussian kernel has the best applicability and is also known as radial basis function kernel (RBF).</p>
<p>Based on the above analysis, obtain the optimal solution and derive the final regression function:
<disp-formula id="eqn-8"><label>(8)</label><mml:math id="mml-eqn-8" display="block"><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:munderover><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:mi>K</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mover><mml:mi>b</mml:mi><mml:mo accent="false">&#x00AF;</mml:mo></mml:mover></mml:math></disp-formula>where, <inline-formula id="ieqn-9"><mml:math id="mml-ieqn-9"><mml:mover><mml:mi>b</mml:mi><mml:mo accent="false">&#x00AF;</mml:mo></mml:mover></mml:math></inline-formula> is the bias term determined through training data.</p>
<p>The steps for establishing a support vector machine regression model are as follows:</p>
<p>(1) Read and preprocess the dataset.</p>
<p>(2) Build a model and define a hyperparameter search space.</p>
<p>(3) Use grid search cross-validation to find the optimal hyperparameters.</p>
<p>(4) Train the model using the optimal hyperparameters and conduct model evaluation.</p>
<p>(5) Use the trained model to predict new input data and output the prediction results.</p>
</sec>
<sec id="s3_4">
<label>3.4</label>
<title>Gaussian Process Regression</title>
<p>GPR is a data-driven method proposed by Williams and Rasmussen. Its essence lies in learning the kernel function with probabilistic significance [<xref ref-type="bibr" rid="ref-43">43</xref>]. The optimal hyperparameters are obtained by analyzing historical sample data, after which a prediction model is developed to forecast subsequent samples. The prediction generated by the GPR method carry probabilistic significance and can describe associated with these predictions. Compared with ANN, SVR and other methods, the hyperparameters required for prediction are easier to obtain and are suitable for high-dimensional nonlinear systems. <xref ref-type="fig" rid="fig-3">Fig. 3</xref> illustrates the specific process of GPR. The GPR algorithm is described below.</p>
<fig id="fig-3">
<label>Figure 3</label>
<caption>
<title>Process of GPR</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="EE_62876-fig-3.tif"/>
</fig>
<p>Given the training set D:<inline-formula id="ieqn-10"><mml:math id="mml-ieqn-10"><mml:msubsup><mml:mrow><mml:mo>{</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>}</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula>, <italic>N</italic> is the number of training samples, <italic>X</italic><sub><italic>i</italic></sub> is the vector defining time <italic>t</italic><sub><italic>i</italic></sub>, and the joint probability distribution function of the random variables in the Gaussian process is:
<disp-formula id="eqn-9"><label>(9)</label><mml:math id="mml-eqn-9" display="block"><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="normal">&#x0398;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>}</mml:mo></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mtext>Z</mml:mtext></mml:mrow></mml:mfrac><mml:mi>exp</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mfrac><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mi>&#x03BC;</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mrow><mml:mtext>T</mml:mtext></mml:mrow></mml:mrow></mml:msup><mml:mo>&#x22C5;</mml:mo><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="normal">&#x0398;</mml:mi><mml:msup><mml:mo stretchy="false">)</mml:mo><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mi>&#x03BC;</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>where, <italic>X</italic><sub><italic>m</italic></sub> and <italic>X</italic><sub><italic>n</italic></sub> represent the <italic>m</italic>-th and <italic>n</italic>-th vectors, <inline-formula id="ieqn-11"><mml:math id="mml-ieqn-11"><mml:mi>C</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="normal">&#x0398;</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is parameter covariance function, <inline-formula id="ieqn-12"><mml:math id="mml-ieqn-12"><mml:mi>&#x03BC;</mml:mi></mml:math></inline-formula> represents mean vector.</p>
<p>The most widely used covariance function is:
<disp-formula id="eqn-10"><label>(10)</label><mml:math id="mml-eqn-10" display="block"><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="normal">&#x0398;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi>&#x03B8;</mml:mi><mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub><mml:mi>exp</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mfrac><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mrow><mml:mtext>i</mml:mtext></mml:mrow><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>L</mml:mi></mml:mrow></mml:munderover><mml:mfrac><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>m</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>l</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>&#x2212;</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>l</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow><mml:msup><mml:mrow><mml:mi mathvariant="normal">&#x03B4;</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mrow></mml:msup></mml:mfrac><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>where, <italic>L</italic> represents the number of elements in vector <italic>X</italic><sub><italic>i</italic></sub>, <inline-formula id="ieqn-13"><mml:math id="mml-ieqn-13"><mml:mi>&#x03B4;</mml:mi></mml:math></inline-formula> is variance, <inline-formula id="ieqn-14"><mml:math id="mml-ieqn-14"><mml:msub><mml:mi>&#x03B8;</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-15"><mml:math id="mml-ieqn-15"><mml:msub><mml:mi>&#x03B8;</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> are parameters.</p>
<p>For a new input vector X<sub>N&#x002B;1</sub>, use a predictive model to estimate its t<sub>N&#x002B;1</sub> value rate distribution, expectation, and variance are used for prediction, and the model function is represented as:
<disp-formula id="eqn-11"><label>(11)</label><mml:math id="mml-eqn-11" display="block"><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>y</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:msub><mml:mi>v</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:math></disp-formula>where, <italic>y</italic>(<italic>X</italic><sub><italic>n</italic></sub>) represents modeling functions, <inline-formula id="ieqn-16"><mml:math id="mml-ieqn-16"><mml:msub><mml:mi>&#x03BD;</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the prediction error.</p>
<p>So the model probability is:
<disp-formula id="eqn-12"><label>(12)</label><mml:math id="mml-eqn-12" display="block"><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mo>{</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>}</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mi>A</mml:mi><mml:mo>,</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mo>&#x222B;</mml:mo><mml:mrow><mml:mtext>P</mml:mtext></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mo>{</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>}</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>,</mml:mo><mml:mi>v</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>y</mml:mi><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>A</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x22C5;</mml:mo><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>v</mml:mi><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>B</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mi>d</mml:mi><mml:mi>y</mml:mi><mml:mi>d</mml:mi><mml:mi>v</mml:mi></mml:math></disp-formula>where, <italic>P(y|A)</italic> is the pre probability distribution of y(x), <italic>A</italic> is a set of hyperparameters of <italic>P(y|A)</italic>, <italic>P</italic>(<inline-formula id="ieqn-17"><mml:math id="mml-ieqn-17"><mml:mi>&#x03BD;</mml:mi></mml:math></inline-formula><italic>|B)</italic> is the pre probability distribution of prediction error, <italic>B</italic> is a parameter representing the error <inline-formula id="ieqn-18"><mml:math id="mml-ieqn-18"><mml:mi>&#x03BD;</mml:mi></mml:math></inline-formula>.</p>
<p>Let T<sub>N</sub> &#x003D; (t<sub>1</sub>, t<sub>2</sub>, &#x2026; , t<sub>n</sub>), T<sub>N&#x002B;1</sub> &#x003D; (t<sub>1</sub>, t<sub>2</sub>, &#x2026; , t<sub>n</sub>, t<sub>n&#x002B;1</sub>), therefore the conditional distribution of t<sub>N&#x002B;1</sub> can be represented by <xref ref-type="disp-formula" rid="eqn-13">Eq. (13)</xref>, and it can be used to predict t<sub>N&#x002B;1</sub>:
<disp-formula id="eqn-13"><label>(13)</label><mml:math id="mml-eqn-13" display="block"><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>D</mml:mi><mml:mo>,</mml:mo><mml:mi>A</mml:mi><mml:mo>,</mml:mo><mml:mi>B</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mo>{</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>}</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mi>A</mml:mi><mml:mo>,</mml:mo><mml:mi>B</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mo>{</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>}</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mi>A</mml:mi><mml:mo>,</mml:mo><mml:mi>B</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mfrac></mml:math></disp-formula></p>
<p>According to Bayesian method, it can be inferred that the distribution function of t<sub>N&#x002B;1</sub> is:
<disp-formula id="eqn-14"><label>(14)</label><mml:math id="mml-eqn-14" display="block"><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>D</mml:mi><mml:mo>,</mml:mo><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mtext>n</mml:mtext></mml:mrow></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mtext>m</mml:mtext></mml:mrow></mml:mrow></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="normal">&#x0398;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">&#x0398;</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mtext>m</mml:mtext></mml:mrow></mml:mrow></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="normal">&#x0398;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="normal">&#x0398;</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>}</mml:mo></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mtext>n</mml:mtext></mml:mrow></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mtext>m</mml:mtext></mml:mrow></mml:mrow></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="normal">&#x0398;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:mi mathvariant="normal">&#x0398;</mml:mi><mml:mo>,</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>}</mml:mo></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mfrac></mml:math></disp-formula></p>
<p>By substituting the Gaussian distribution formula, the preliminary calculation can be simplified as:
<disp-formula id="eqn-15"><label>(15)</label><mml:math id="mml-eqn-15" display="block"><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>D</mml:mi><mml:mo>,</mml:mo><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mtext>n</mml:mtext></mml:mrow></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mtext>m</mml:mtext></mml:mrow></mml:mrow></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="normal">&#x0398;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">&#x0398;</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:msub><mml:mi>Z</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>Z</mml:mi><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub></mml:mfrac><mml:mi>exp</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mfrac><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mi>t</mml:mi><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msubsup><mml:msubsup><mml:mi>C</mml:mi><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msubsup><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msubsup><mml:mi>t</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msubsup><mml:msubsup><mml:mi>C</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msubsup><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math></disp-formula>where, <italic>Z</italic><sub><italic>N</italic></sub> and <italic>Z</italic><sub><italic>N</italic>&#x002B;1</sub> are two normalization constants.</p>
<p>After further simplification, the following equation can be obtained:
<disp-formula id="eqn-16"><label>(16)</label><mml:math id="mml-eqn-16" display="block"><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>D</mml:mi><mml:mo>,</mml:mo><mml:mi>C</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mtext>n</mml:mtext></mml:mrow></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mrow><mml:mtext>m</mml:mtext></mml:mrow></mml:mrow></mml:msub><mml:mo>;</mml:mo><mml:mi mathvariant="normal">&#x0398;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi mathvariant="normal">&#x0398;</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mtext>z</mml:mtext></mml:mrow></mml:mfrac><mml:mi>exp</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mfrac><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mrow><mml:mtext>t</mml:mtext></mml:mrow><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mtext>t</mml:mtext></mml:mrow><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow><mml:msubsup><mml:mi>&#x03C3;</mml:mi><mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mi>t</mml:mi><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mrow></mml:msubsup></mml:mrow></mml:mfrac><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>where, <inline-formula id="ieqn-19"><mml:math id="mml-ieqn-19"><mml:msub><mml:mrow><mml:mover><mml:mi>t</mml:mi><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> represented expected value. Therefore, predictions can be made based on their expectations and variances.</p>
</sec>
<sec id="s3_5">
<label>3.5</label>
<title>Experimental Database</title>
<p>The predictive reliability and accuracy of machine learning models are critically contingent upon the representativeness and robustness of the training dataset. In the frameworks of ANN, RFR, GPR, and SVR, input data are conventionally partitioned into training and testing subsets. The former serves to calibrate model parameters and optimize predictive performance, whereas the latter is employed to assess the accuracy and stability of the models.</p>
<p>The experimental dataset employed in this investigation comprises 151 data entries, partitioned into two categories: 89 records of pure CO<sub>2</sub> injection and 62 records of impure CO<sub>2</sub> injection. The dataset was divided into training (90%, <italic>n</italic> &#x003D; 136) and testing (10%, <italic>n</italic> &#x003D; 15) subsets to ensure unbiased model validation. These data were collected from various mature experimental works in the literature [<xref ref-type="bibr" rid="ref-13">13</xref>,<xref ref-type="bibr" rid="ref-15">15</xref>,<xref ref-type="bibr" rid="ref-44">44</xref>,<xref ref-type="bibr" rid="ref-45">45</xref>]. Data sets include experimental values for reservoir temperature, composition of drive gas (y<sub>CO2</sub>, y<sub>H2S</sub>, y<sub>N2</sub>, and y<sub>C2&#x2013;C4</sub>), molecular weight of the C<sub>7&#x002B;</sub> fraction in crude oil (MW<sub>C7&#x002B;</sub>), the ratio of volatile (X<sub>vol</sub>) to intermediate (X<sub>int</sub>) components in crude oil, pure and impure CO<sub>2</sub>-oil MMP. <xref ref-type="table" rid="table-3">Table 3</xref> presents a statistical description of the dataset used. A detailed analysis of the experimental data highlights that the reservoir temperature ranges from 31.00&#x00B0;C to 148.90&#x00B0;C, the experimental values for MMP range from 6.55 to 40.22 MPa.</p>
<table-wrap id="table-3">
<label>Table 3</label>
<caption>
<title>Statistical description of the dataset used in this studies</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Parameters</th>
<th>Unit</th>
<th>Min</th>
<th>Max</th>
<th>Mean</th>
<th>STD</th>
<th>25%</th>
<th>50%</th>
<th>75%</th>
</tr>
</thead>
<tbody>
<tr>
<td>T<sub>R</sub></td>
<td>&#x00B0;C</td>
<td>31.00</td>
<td>148.90</td>
<td>80.49</td>
<td>81.11</td>
<td>57.20</td>
<td>81.66</td>
<td>112.10</td>
</tr>
<tr>
<td>y<sub>CO2</sub></td>
<td>%</td>
<td>30.00</td>
<td>100.00</td>
<td>88.78</td>
<td>100.00</td>
<td>83.00</td>
<td>100.00</td>
<td>100.00</td>
</tr>
<tr>
<td>y<sub>H2S</sub></td>
<td>%</td>
<td>0</td>
<td>50.00</td>
<td>2.81</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>y<sub>C1</sub></td>
<td>%</td>
<td>0</td>
<td>54.00</td>
<td>6.27</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>8.05</td>
</tr>
<tr>
<td>y<sub>N2</sub></td>
<td>%</td>
<td>0</td>
<td>19.00</td>
<td>1.83</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>X<sub>int</sub></td>
<td>%</td>
<td>0</td>
<td>42.44</td>
<td>16.16</td>
<td>14.28</td>
<td>11.37</td>
<td>14.28</td>
<td>20.59</td>
</tr>
<tr>
<td>X<sub>vol</sub></td>
<td>%</td>
<td>0</td>
<td>65.61</td>
<td>24.77</td>
<td>25.01</td>
<td>10.50</td>
<td>25.56</td>
<td>38.80</td>
</tr>
<tr>
<td>y<sub>C2-C4</sub></td>
<td>%</td>
<td>0</td>
<td>25.00</td>
<td>1.83</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>X<sub>C5-C6</sub></td>
<td>%</td>
<td>0.01</td>
<td>11.79</td>
<td>6.19</td>
<td>6.52</td>
<td>4.22</td>
<td>6.47</td>
<td>8.21</td>
</tr>
<tr>
<td>MW<sub>C7&#x002B;</sub></td>
<td>g/mol</td>
<td>153.90</td>
<td>402.70</td>
<td>227.26</td>
<td>220.00</td>
<td>200.00</td>
<td>220.00</td>
<td>240.00</td>
</tr>
<tr>
<td>MMP</td>
<td>MPa</td>
<td>6.55</td>
<td>40.22</td>
<td>20.31</td>
<td>20.24</td>
<td>12.96</td>
<td>20.43</td>
<td>25.31</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec id="s4">
<label>4</label>
<title>Results and Discussions</title>
<p>In this study, we have researched and implemented four intelligent algorithm technologies, including GA-BPNN, RFR, GPR and SVR, and established an accurate and stable MMP prediction model. The optimal configuration of the GA-BPNN model is obtained through trial and error. <xref ref-type="table" rid="table-4">Table 4</xref> shows the optimal network structure of the GA-BPNN model for CO<sub>2</sub>-oil MMP prediction. All models are written and implemented in Python.</p>
<table-wrap id="table-4">
<label>Table 4</label>
<caption>
<title>The optimal network structure of GA-BPNN model for CO<sub>2</sub>-oil MMP prediction</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Parameters</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>Number of hidden layer</td>
<td>2</td>
</tr>
<tr>
<td>Number of neurons in hidden layer 1</td>
<td>128</td>
</tr>
<tr>
<td>Number of neurons in hidden layer 2</td>
<td>128</td>
</tr>
<tr>
<td>Transition function of input-hidden layer 1</td>
<td>Relu</td>
</tr>
<tr>
<td>Transition function of hidden layer 1-hidden</td>
<td>Relu</td>
</tr>
<tr>
<td>Transition function of hidden-output layer</td>
<td>Relu</td>
</tr>
<tr>
<td>Input data</td>
<td>[&#x2212;1, 1]</td>
</tr>
<tr>
<td>Learning rate</td>
<td>0.001</td>
</tr>
<tr>
<td>Training algorithm</td>
<td>IM</td>
</tr>
<tr>
<td>Cycle index</td>
<td>9000</td>
</tr>
<tr>
<td>Population size of GA</td>
<td>60</td>
</tr>
<tr>
<td>Selectivity factor</td>
<td>1.0</td>
</tr>
<tr>
<td>Crossover factor</td>
<td>0.75</td>
</tr>
<tr>
<td>Mutation factor</td>
<td>0.2</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>To significantly demonstrate the predictive performance of each model, four statistical error functions were selected to evaluate the performance of the models: the coefficient of variation (CV), absolute percent relative error (APRE), average absolute percent relative error (AAPRE), and root mean square error (RMSE). The closer the value is to 0, the higher the accuracy of the model. Additionally, the coefficient of determination (R<sup>2</sup>) is also used as the regression evaluation index, and the closer R<sup>2</sup> is to 1, the better the fitting effect. The specific calculation methods for the four statistical error functions are as follows:
<disp-formula id="eqn-17"><label>(17)</label><mml:math id="mml-eqn-17" display="block"><mml:mi>C</mml:mi><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:msqrt><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>N</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow></mml:mfrac><mml:msubsup><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msubsup><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mfrac><mml:mrow><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>&#x2212;</mml:mo><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mfrac><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mrow></mml:msup></mml:msqrt><mml:mover><mml:mrow><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo accent="false">&#x00AF;</mml:mo></mml:mover></mml:mfrac></mml:math></disp-formula>
<disp-formula id="eqn-18"><label>(18)</label><mml:math id="mml-eqn-18" display="block"><mml:mi>A</mml:mi><mml:mi>P</mml:mi><mml:mi>R</mml:mi><mml:mi>E</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:mfrac><mml:msubsup><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msubsup><mml:mrow><mml:mo>(</mml:mo><mml:mfrac><mml:mrow><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>&#x2212;</mml:mo><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mfrac><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x00D7;</mml:mo><mml:mrow><mml:mn>100</mml:mn></mml:mrow><mml:mi mathvariant="normal">&#x0025;</mml:mi></mml:math></disp-formula>
<disp-formula id="eqn-19"><label>(19)</label><mml:math id="mml-eqn-19" display="block"><mml:mi>A</mml:mi><mml:mi>A</mml:mi><mml:mi>P</mml:mi><mml:mi>R</mml:mi><mml:mi>E</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:mfrac><mml:msubsup><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msubsup><mml:mrow><mml:mo>|</mml:mo><mml:mfrac><mml:mrow><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>&#x2212;</mml:mo><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mfrac><mml:mo>|</mml:mo></mml:mrow><mml:mo>&#x00D7;</mml:mo><mml:mrow><mml:mn>100</mml:mn></mml:mrow><mml:mi mathvariant="normal">&#x0025;</mml:mi></mml:math></disp-formula>
<disp-formula id="eqn-20"><label>(20)</label><mml:math id="mml-eqn-20" display="block"><mml:mtable columnalign="right left right left right left right left right left right left" rowspacing="3pt" columnspacing="0em 2em 0em 2em 0em 2em 0em 2em 0em 2em 0em" displaystyle="true"><mml:mtr><mml:mtd /><mml:mtd><mml:mi>R</mml:mi><mml:mi>M</mml:mi><mml:mi>S</mml:mi><mml:mi>E</mml:mi><mml:mo>=</mml:mo><mml:msqrt><mml:mfrac><mml:mrow><mml:msubsup><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msubsup><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>&#x2212;</mml:mo><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mrow></mml:msup></mml:mrow><mml:mi>N</mml:mi></mml:mfrac></mml:msqrt></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula></p>
<p><disp-formula id="eqn-21"><label>(21)</label><mml:math id="mml-eqn-21" display="block"><mml:mtable columnalign="right left right left right left right left right left right left" rowspacing="3pt" columnspacing="0em 2em 0em 2em 0em 2em 0em 2em 0em 2em 0em" displaystyle="true"><mml:mtr><mml:mtd /><mml:mtd><mml:msup><mml:mi>R</mml:mi><mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mrow></mml:msup><mml:mo>=</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mfrac><mml:mrow><mml:msubsup><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msubsup><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>&#x2212;</mml:mo><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:msubsup><mml:mo movablelimits="false">&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msubsup><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi></mml:mrow></mml:msubsup><mml:mo>&#x2212;</mml:mo><mml:mover><mml:mrow><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo accent="false">&#x00AF;</mml:mo></mml:mover><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mrow></mml:msup></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>where, <inline-formula id="ieqn-20"><mml:math id="mml-ieqn-20"><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mi>e</mml:mi><mml:mi>a</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> is MMP value of experimental, <inline-formula id="ieqn-21"><mml:math id="mml-ieqn-21"><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> is the MMP predicted value, <inline-formula id="ieqn-22"><mml:math id="mml-ieqn-22"><mml:mover><mml:mrow><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msubsup><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>p</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:msubsup></mml:mrow><mml:mo accent="false">&#x00AF;</mml:mo></mml:mover></mml:math></inline-formula> is the MMP predicted average value.</p>
<sec id="s4_1">
<label>4.1</label>
<title>Comparison between Different Models</title>
<p>In this section, we will statistically analyze the values of four statistical error functions, with <xref ref-type="table" rid="table-5">Table 5</xref> displaying the implementation performance of each model. In this table, the performance of the training and the testing data is listed separately. From <xref ref-type="table" rid="table-5">Table 5</xref>, it can be observed that the RFR model has R<sup>2</sup> values close to 1 compared to the other three models on both the training and testing sets, specifically are 0.98 and 0.95, respectively. The MAPE and RMSE values on the testing set are also quite low, at 4.31% and 1.00%, respectively. These indicators suggest that the RFR model exhibits the highest prediction accuracy and strong stability. Besides, the GA-BPNN model demonstrates good prediction performance, ranking second to RFR. In comparison, it is evident that GPR has the lowest performance among the four models.</p>
<table-wrap id="table-5">
<label>Table 5</label>
<caption>
<title>Evaluation index values of the four MMP prediction models</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Data base</th>
<th>Evaluating indicator</th>
<th>GA-BPNN</th>
<th>RFR</th>
<th>SVR</th>
<th>GPR</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="5">Training data</td>
<td>R<sup>2</sup></td>
<td>0.95</td>
<td>0.98</td>
<td>0.93</td>
<td>0.91</td>
</tr>
<tr>
<td>APRE</td>
<td>5.10</td>
<td>4.31</td>
<td>5.45</td>
<td>8.05</td>
</tr>
<tr>
<td>AAPRE</td>
<td>2.91</td>
<td>2.51</td>
<td>3.28</td>
<td>4.76</td>
</tr>
<tr>
<td>RMSE</td>
<td>1.25</td>
<td>1.00</td>
<td>1.47</td>
<td>1.54</td>
</tr>
<tr>
<td>CV</td>
<td>11.35</td>
<td>5.98</td>
<td>7.17</td>
<td>10.06</td>
</tr>
<tr>
<td rowspan="5">Test data</td>
<td>R<sup>2</sup></td>
<td>0.93</td>
<td>0.95</td>
<td>0.90</td>
<td>0.88</td>
</tr>
<tr>
<td>APRE</td>
<td>6.55</td>
<td>5.98</td>
<td>7.26</td>
<td>10.64</td>
</tr>
<tr>
<td>AAPRE</td>
<td>5.54</td>
<td>5.05</td>
<td>6.56</td>
<td>8.51</td>
</tr>
<tr>
<td>RMSE</td>
<td>2.45</td>
<td>2.13</td>
<td>3.01</td>
<td>4.02</td>
</tr>
<tr>
<td>CV</td>
<td>8.65</td>
<td>9.60</td>
<td>7.19</td>
<td>8.38</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Among the four models evaluated, the RFR operates under the ensemble learning paradigm, which aggregates predictions from multiple decision trees to mitigate overfitting and enhance generalization. This method has obvious advantages over neural networks in processing small and medium-sized data sets (&#x003C;10,000 samples).The GPR model is distinguished by kernel flexibility. By selecting the appropriate covariance function, GPR can adapt to different data structures (periodic/non-periodic), and provide probability output to quantify the prediction uncertainty, which is a very valuable feature for risk perception decision-making in reservoir engineering. However, GPR &#x00027;s dependence on computationally intensive hyperparameter optimization often leads to local optimum, especially in high-dimensional parameter space, so advanced global optimization techniques such as Bayesian optimization are needed. Due to its &#x03B5;-insensitive loss function, SVR is theoretically robust to outliers and noise, and can ignore the error within the tolerance threshold. Its performance depends heavily on the choice of kernel functions (e.g., linear, polynomial) and hyperparameter tuning, but it has interpretability defects and sensitivity to parameter initialization. The predictive performance of the neural network model in this study was not outstanding, and the BPNN algorithm optimized with GA did not demonstrate its outstanding predictive ability. It is speculated that this may be due to the lack of massive data in related research fields, and the big data fitting advantage of the GA-BPNN model has not been fully utilized.</p>
<p>To better illustrate the model performance, <xref ref-type="fig" rid="fig-4">Fig. 4</xref> displays the MMP data predicted by RFR, GA-BPNN, SVR, and GPR models. From <xref ref-type="fig" rid="fig-4">Fig. 4</xref>, it can be seen that the RFR, GA-BPNN, and SVR models have achieved good alignment near the unit slope compared to the GPR model. Among the four models, the error distribution of the RFR and the GA-BPNN models is relatively balanced, with fewer outliers, thus ensuring good stability. The error visualization of the GPR model around the unit slope is quite evident, but it remains within an acceptable range.</p>
<fig id="fig-4">
<label>Figure 4</label>
<caption>
<title>Prediction performance of four models</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="EE_62876-fig-4a.tif"/>
<graphic mimetype="image" mime-subtype="tif" xlink:href="EE_62876-fig-4b.tif"/>
</fig>
</sec>
<sec id="s4_2">
<label>4.2</label>
<title>Comparison Models with Correlations from Literature</title>
<p>To further validate the predictive performance of the model, a statistical error analysis will be conducted based on previous models. The prediction results of the RFR model and GA-BPNN model developed in this paper will be compared with six forecasting MMP methods proposed by Lee [<xref ref-type="bibr" rid="ref-46">46</xref>], Cronquist [<xref ref-type="bibr" rid="ref-47">47</xref>], Yellig and Metcalfc [<xref ref-type="bibr" rid="ref-16">16</xref>], Alston et al. [<xref ref-type="bibr" rid="ref-18">18</xref>], Emera et al. [<xref ref-type="bibr" rid="ref-28">28</xref>], and Yuan et al. [<xref ref-type="bibr" rid="ref-48">48</xref>]. The six comparative models selected in this article simultaneously consider the influence of CO<sub>2</sub> injection gas impurities on the prediction results of CO<sub>2</sub>-oil MMP, as well as the influencing factors of MMP prediction under pure CO<sub>2</sub> injection. Therefore, they were used for comparison with RFR and GA-BPNN models.</p>
<p><xref ref-type="fig" rid="fig-5">Fig. 5</xref> visually displays the comparison results between the simulated MMP values and literature methods. The unit slope line in <xref ref-type="fig" rid="fig-5">Fig. 5</xref> serves as the evaluation criterion; the closer the data points are to the unit slope line, the higher the consistency between the simulated and experimental values. <xref ref-type="table" rid="table-6">Table 6</xref> shows the comparison of various indicators of the model. These results indicate that the RFR model and GA-BPNN model have strong predictive abilities.</p>
<fig id="fig-5">
<label>Figure 5</label>
<caption>
<title>Comparison of the simulated MMP values with that calculated from literature models [<xref ref-type="bibr" rid="ref-16">16</xref>,<xref ref-type="bibr" rid="ref-18">18</xref>,<xref ref-type="bibr" rid="ref-28">28</xref>,<xref ref-type="bibr" rid="ref-46">46</xref>&#x2013;<xref ref-type="bibr" rid="ref-48">48</xref>]</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="EE_62876-fig-5.tif"/>
</fig><table-wrap id="table-6">
<label>Table 6</label>
<caption>
<title>Statistical error analysis of different literature models and the model presented in this studies</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Model name</th>
<th>AAPRE</th>
<th>APRE</th>
<th>RMSE</th>
</tr>
</thead>
<tbody>
<tr>
<td>RFR</td>
<td>4.55</td>
<td>1.01</td>
<td>0.89</td>
</tr>
<tr>
<td>GA-BPNN</td>
<td>6.45</td>
<td>2.86</td>
<td>1.66</td>
</tr>
<tr>
<td>Emera et al. [<xref ref-type="bibr" rid="ref-28">28</xref>]</td>
<td>15.46</td>
<td>5.12</td>
<td>3.84</td>
</tr>
<tr>
<td>Yuan et al. [<xref ref-type="bibr" rid="ref-48">48</xref>]</td>
<td>17.12</td>
<td>6.46</td>
<td>4.76</td>
</tr>
<tr>
<td>Alston et al. [<xref ref-type="bibr" rid="ref-18">18</xref>]</td>
<td>18.32</td>
<td>7.04</td>
<td>5.12</td>
</tr>
<tr>
<td>Yellig and Metcalfc [<xref ref-type="bibr" rid="ref-16">16</xref>]</td>
<td>19.65</td>
<td>7.85</td>
<td>5.85</td>
</tr>
<tr>
<td>Cronquist [<xref ref-type="bibr" rid="ref-47">47</xref>]</td>
<td>20.12</td>
<td>8.64</td>
<td>7.12</td>
</tr>
<tr>
<td>Lee [<xref ref-type="bibr" rid="ref-46">46</xref>]</td>
<td>22.54</td>
<td>9.54</td>
<td>8.86</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s4_3">
<label>4.3</label>
<title>Sensitivity Analysis</title>
<p>Analyze the impact of the characteristic parameters of the four models used in this article on predicting MMP in order to deepen our understanding of the CO<sub>2</sub>-oil interaction mechanism and its intrinsic correlation. In this study, the relevance factor (R) was utilized to assess the extent of influence that each parameter has on MMP. For input variables, the R is a value ranging from &#x2212;1 to 1. When R is negative, it represents that the input variable has a negative effect on the output variable, and when R is positive, it indicates that the input variable has a positive effect on the output variable. The absolute value of R represents the impact of the input variable on the output variable, and the larger the absolute value of R, the greater the impact of the input variable on the output variable. The calculation formula for R is as follows:
<disp-formula id="eqn-22"><label>(22)</label><mml:math id="mml-eqn-22" display="block"><mml:mi>R</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:mi>P</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:mover><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>J</mml:mi></mml:mrow></mml:msub><mml:mo accent="false">&#x00AF;</mml:mo></mml:mover><mml:mo>)</mml:mo></mml:mrow><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:munderover><mml:mrow><mml:mo>(</mml:mo><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:mover><mml:mrow><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mo accent="false">&#x00AF;</mml:mo></mml:mover><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:msqrt><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:mover><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>J</mml:mi></mml:mrow></mml:msub><mml:mo accent="false">&#x00AF;</mml:mo></mml:mover><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mrow></mml:msup><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:munderover><mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:mover><mml:mrow><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mo accent="false">&#x00AF;</mml:mo></mml:mover><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:mrow></mml:msup></mml:msqrt></mml:mfrac></mml:math></disp-formula>where, <inline-formula id="ieqn-23"><mml:math id="mml-ieqn-23"><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-24"><mml:math id="mml-ieqn-24"><mml:mover><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo accent="false">&#x00AF;</mml:mo></mml:mover></mml:math></inline-formula> is the <italic>i</italic>-th and average number of input <italic>j</italic>, <italic>MMP</italic><sub><italic>i</italic></sub> indicates the <italic>i</italic>-th, <inline-formula id="ieqn-25"><mml:math id="mml-ieqn-25"><mml:mover><mml:mrow><mml:mi>M</mml:mi><mml:mi>M</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mo accent="false">&#x00AF;</mml:mo></mml:mover></mml:math></inline-formula> represents the average value of the MMP output.</p>
<p><xref ref-type="fig" rid="fig-6">Fig. 6</xref> visually displays the R value of each input variable. From <xref ref-type="fig" rid="fig-6">Fig. 6</xref>, it can be seen that X<sub>int</sub>, X<sub>C5-C6</sub>, and y<sub>H2S</sub> can reduce the value of MMP. The observed negative correlations between X<sub>int</sub>, X<sub>C5-C6</sub>, y<sub>H2S</sub>, and MMP can be attributed to their thermodynamic roles in reducing CO<sub>2</sub>-oil interfacial tension. The ability of y<sub>H2S</sub> to reduce MMP is due to the polarity of H<sub>2</sub>S molecules, which can effectively reduce the interfacial tension between CO<sub>2</sub> and oil. The C<sub>2</sub>-C<sub>6</sub> components in crude oil have similar molecular weights and properties to CO<sub>2</sub>, which helps to increase the solubility of CO<sub>2</sub> in crude oil and reduce CO<sub>2</sub>-oil MMP.</p>
<fig id="fig-6">
<label>Figure 6</label>
<caption>
<title>Relevancy factor of input variables on CO<sub>2</sub>-oil MMP</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="EE_62876-fig-6.tif"/>
</fig>
<p>The increase of the other seven factors (T<sub>R</sub>, MW<sub>C7&#x002B;</sub>, X<sub>vol</sub>, y<sub>C2-C4</sub>, y<sub>CO2</sub>, y<sub>C1</sub>, y<sub>N2</sub>) will lead to an increase in CO<sub>2</sub>-oil MMP. The increase of T<sub>R</sub> will result in a decrease in the solubility of CO<sub>2</sub> in crude oil, which seriously affects the ability of CO<sub>2</sub> extraction to extract light hydrocarbons, ultimately leading to an increase in CO<sub>2</sub>-oil MMP. An increase of MW<sub>C7&#x002B;</sub> may create a greater difference between crude oil and CO<sub>2</sub> molecules, resulting in an increase in interfacial tension and thus affecting MMP. For the light components (X<sub>vol</sub>) in crude oil, due to their easy volatilization, they are prone to entering the CO<sub>2</sub> gas phase from the crude oil during the CO<sub>2</sub> oil multi-stage mixed phase contact process, which results in a decrease in CO<sub>2</sub> purity and an increase in MMP. The increase in CO<sub>2</sub> concentration enhances the frequency of contact between CO<sub>2</sub> and crude oil. As CO<sub>2</sub> continuously extracts lighter components from the crude oil, the proportion of heavier components increases. Since these heavier components exhibit poor miscibility with CO&#x2082;, a higher pressure is required to achieve miscibility. Within the parameter curve range, the influence of the ten influencing factors on MMP is ranked in descending order: T<sub>R</sub> &#x003E; X<sub>C5-C6</sub> &#x003E; MW<sub>C7&#x002B;</sub> &#x003E; X<sub>vol</sub> &#x003E; y<sub>H2S</sub> &#x003E; y<sub>C2-C4</sub> &#x003E; y<sub>CO2</sub> &#x003E; y<sub>C1</sub>, y<sub>N2</sub> &#x003E; X<sub>int</sub>.</p>
</sec>
<sec id="s4_4">
<label>4.4</label>
<title>Models Application to Ultra-Deep Oil Reservoir</title>
<p>The Xinghua Block of Bayan oil field is a typical ultra-deep oil reservoir characterized by a large burial depth (&#x003E;5000 m), high formation temperature and pressure, complex crude oil composition, and intricate fluid phases [<xref ref-type="bibr" rid="ref-49">49</xref>,<xref ref-type="bibr" rid="ref-50">50</xref>]. To verify the applicability of the established CO<sub>2</sub>-oil MMP prediction models for ultra-deep oil reservoirs, experimental measurement data of CO<sub>2</sub> flooding MMP from two wells in Xinghua block of Bayan oil field were selected. The thin tube experiment refers to a simulated displacement test conducted in a thin tube model, and the experimental flowchart is shown in <xref ref-type="fig" rid="fig-7">Fig. 7</xref>. The criterion for determining whether the thin tube experiment qualifies as a mixed-phase displacement is that the crude oil recovery rate, when injected with 1.2 pore volume(PV) exceeds 90%. Additionally, as the displacement pressure increases, there is no significant rise in displacement efficiency, and mixed-phase fluids can be observed in the observation window. The method for determining MMP is to plot the relationship curve between the recovery degree and displacement pressure when injecting 1.2 PV in each thin tube experiment, while ensuring that the thin tube experiment achieves three cycles of miscible flooding and three cycles of immiscible flooding [<xref ref-type="bibr" rid="ref-51">51</xref>].The pressure corresponding to the intersection of the non-mixed phase and mixed phase curves is the minimum mixed phase pressure (MMP), as shown in <xref ref-type="fig" rid="fig-8">Fig. 8</xref>. The results of the slim tube test showed that injecting pure CO<sub>2</sub> at formation temperatures of 139&#x00B0;C and 133.9&#x00B0;C resulted in MMP values of 49.79 and 48.48 MPa for the reservoir, respectively.</p>
<fig id="fig-7">
<label>Figure 7</label>
<caption>
<title>Flow chart of thin tube test</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="EE_62876-fig-7.tif"/>
</fig><fig id="fig-8">
<label>Figure 8</label>
<caption>
<title>Relationship curve between CO<sub>2</sub> displacement recovery degree and displacement pressure in the formation oil slim tube experiment of Bayan oil field</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="EE_62876-fig-8.tif"/>
</fig>
<p>Calculate the MMP using the development prediction models and compare the calculated value with the experimental value measured by the thin tube experiment, <xref ref-type="table" rid="table-7">Table 7</xref> shows the comparison results between the MMP values predicted by four models and the experimental values measured by capillary experiments. As summarized in <xref ref-type="table" rid="table-7">Table 7</xref>, both the RFR and GA-BPNN models demonstrate superior predictive accuracy, with RMSE values below 3% and AAPRE less than 0.1%. Furthermore, all four models exhibit minimal deviations between predicted and experimentally measured MMP, with errors consistently confined within engineering-acceptable thresholds (relative error &#x003C; 5%). These results validate the robustness and applicability of the proposed models for MMP prediction in ultra-deep reservoirs characterized by high-temperature, high-pressure conditions.</p>
<table-wrap id="table-7">
<label>Table 7</label>
<caption>
<title>Comparison of experimental and predicted values of CO<sub>2</sub>-oil MMP</title>
</caption>
<table>
<colgroup>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
</colgroup>
<thead>
<tr>
<th align="center">Model</th>
<th align="center">Temp (&#x00B0;C)</th>
<th align="center">MW<sub>C7&#x002B;</sub><break/> (g/mol)</th>
<th align="center">X<sub>vol</sub> (%)</th>
<th align="center">Xint (%)</th>
<th align="center">X<sub>C5-C6</sub> (%)</th>
<th align="center">Experimental value (MPa)</th>
<th align="center">Prediction value (MPa)</th>
<th align="center">RMSE (%)</th>
<th align="center">AAPRE (%)</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="2">RFR</td>
<td>139.00</td>
<td>296.92</td>
<td>8.95</td>
<td>3.95</td>
<td>4.40</td>
<td>49.79</td>
<td>50.12</td>
<td>0.33</td>
<td>0.01</td>
</tr>
<tr>
<td>133.90</td>
<td>279.88</td>
<td>9.95</td>
<td>4.99</td>
<td>4.14</td>
<td>48.18</td>
<td>49.00</td>
<td>0.82</td>
<td>0.02</td>
</tr>
<tr>
<td rowspan="2">GA-BPNN</td>
<td>139.00</td>
<td>296.92</td>
<td>8.95</td>
<td>3.95</td>
<td>4.40</td>
<td>49.79</td>
<td>47.56</td>
<td>2.23</td>
<td>0.04</td>
</tr>
<tr>
<td>133.90</td>
<td>279.88</td>
<td>9.95</td>
<td>4.99</td>
<td>4.14</td>
<td>48.18</td>
<td>45.65</td>
<td>2.53</td>
<td>0.05</td>
</tr>
<tr>
<td rowspan="2">SVR</td>
<td>139.00</td>
<td>296.92</td>
<td>8.95</td>
<td>3.95</td>
<td>4.40</td>
<td>49.79</td>
<td>53.85</td>
<td>4.06</td>
<td>0.08</td>
</tr>
<tr>
<td>133.90</td>
<td>279.88</td>
<td>9.95</td>
<td>4.99</td>
<td>4.14</td>
<td>48.18</td>
<td>52.45</td>
<td>4.27</td>
<td>0.09</td>
</tr>
<tr>
<td rowspan="2">GPR</td>
<td>139.00</td>
<td>296.92</td>
<td>8.95</td>
<td>3.95</td>
<td>4.40</td>
<td>49.79</td>
<td>43.56</td>
<td>6.23</td>
<td>0.13</td>
</tr>
<tr>
<td>133.90</td>
<td>279.88</td>
<td>9.95</td>
<td>4.99</td>
<td>4.14</td>
<td>48.18</td>
<td>43.12</td>
<td>5.06</td>
<td>0.11</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec id="s5">
<label>5</label>
<title>Conclusion</title>
<p>In this study, we developed four MMP prediction models based on machine learning, namely RFR,GA-BPNN,SVR and GPR. Additionally, we separately investigated pure CO<sub>2</sub> gas injection and impure CO<sub>2</sub> injection as described in the literature. Across these four models, total of ten impact factors (e.g., T<sub>R</sub>, x<sub>int</sub>, x<sub>vol</sub>, x<sub>C5&#x2013;C6</sub>, MW<sub>C7&#x002B;</sub>, y<sub>C1</sub>, y<sub>N2</sub>, y<sub>CO2</sub>, y<sub>C2-C4</sub>, y<sub>H2S</sub>) were selected, with MMP serving as the output variable. Based on a comprehensive understanding of four models and discussions of the simulation results, the study&#x2019;s conclusions can be briefly described as follows:</p>
<p>(1) The developed fours models for predicting MMP can effectively predict MMP for both pure CO<sub>2</sub> and impure CO<sub>2</sub> gas injections, and they can all be utilized for MMP prediction in ultra-deep oil reservoirs.</p>
<p>(2) Among the four models studied in this article, the RFR model exhibits the best predictive performance, while the GPR model shows the worst predictive performance. However, the BPNN algorithm optimized with GA did not demonstrate its outstanding predictive ability, which may be due to its incomplete utilization of the advantages of big data fitting.</p>
<p>(3) In the comparison with experimental results with those from six forecasting MMP methods, the RFR model and GA-BPNN model showed excellent performance.</p>
<p>(4) Among the ten influencing factors studied, X<sub>int</sub>, X<sub>C5-C6</sub>, and y<sub>H2S</sub> are negatively correlated with MMP, while T<sub>R</sub>, MW<sub>C7&#x002B;</sub>, X<sub>vol</sub>, y<sub>C2-C4</sub>, y<sub>CO2</sub>, y<sub>C1</sub> and y<sub>N2</sub> are positively correlated with MMP. Among those, T<sub>R</sub> has the greatest impact on MMP, followed by X<sub>C5-C6</sub> and MW<sub>C7&#x002B;</sub>. The degree of influence of the ten influencing factors on MMP, in descending order, is as follows: T<sub>R</sub> &#x003E; X<sub>C5-C6</sub> &#x003E; MW<sub>C7&#x002B;</sub> &#x003E; X<sub>vol</sub> &#x003E; y<sub>H2S</sub> &#x003E; y<sub>C2-C4</sub> &#x003E; y<sub>CO2</sub> &#x003E; y<sub>C1</sub>, y<sub>N2</sub> &#x003E; X<sub>int</sub>.</p>
<p>(5) The difference between the properties of crude oil components and those of injected gas components impacts on MMP. The closer the properties of the component are to those of gas, the smaller the interfacial tension between oil and gas, and the lower the MMP.</p>
<p>(6) Although the four developed models have demonstrated promising performance in predicting the MMP of ultra-deep reservoirs, the limited sample size prevents a comprehensive analysis of interaction mechanisms among different influencing factors. Future research will focus on expanding the dataset and integrating experimental studies to systematically investigate the interaction mechanisms between various parameters affecting MMP, as well as to further improve and establish a high-performance MMP prediction model for ultra-deep reservoirs.</p>
</sec>
</body>
<back>
<ack>
<p>I express my gratitude to the anonymous reviewers for their comments and suggestions which have greatly helped me to improve the content, quality, organization, and presentation of this work.</p>
</ack>
<sec>
<title>Funding Statement</title>
<p>There is no funding support for this work.</p>
</sec>
<sec>
<title>Author Contributions</title>
<p>The authors confirm contribution to the paper as follows: study conception and design: Tianfu Li and Yuedong Yao; data collection: Kun Li and Xiuwei Wang; analysis and interpretation of results: Qingchun Meng and Zhenjie Wang; draft manuscript preparation: Jingyang Luo and Zhaohui Wang. All authors reviewed the results and approved the final version of the manuscript.</p>
</sec>
<sec sec-type="data-availability">
<title>Availability of Data and Materials</title>
<p>The data that support the findings of this study are available from author, Tianfu Li, upon reasonable request.</p>
</sec>
<sec>
<title>Ethics Approval</title>
<p>Not applicable.</p>
</sec>
<sec sec-type="COI-statement">
<title>Conflicts of Interest</title>
<p>The authors declare no conflicts of interest to report in the present study.</p>
</sec>
<glossary content-type="abbreviations" id="glossary-1">
<title>Nomenclature</title>
<def-list>
<def-item>
<term>CO<sub>2</sub></term>
<def>
<p>Carbon dioxide</p>
</def>
</def-item>
<def-item>
<term>EOR</term>
<def>
<p>Enhanced Oil Recovery</p>
</def>
</def-item>
<def-item>
<term>MMP</term>
<def>
<p>Minimum Miscible Pressure</p>
</def>
</def-item>
<def-item>
<term>AI</term>
<def>
<p>Artificial Intelligence</p>
</def>
</def-item>
<def-item>
<term>API</term>
<def>
<p>American Petroleum Institute</p>
</def>
</def-item>
<def-item>
<term>CCUS</term>
<def>
<p>Carbon Capture, Utilization and Storage</p>
</def>
</def-item>
<def-item>
<term>GA</term>
<def>
<p>Genetic Algorithm</p>
</def>
</def-item>
<def-item>
<term>BPNN</term>
<def>
<p>Back Propagation Neural Network</p>
</def>
</def-item>
<def-item>
<term>RFR</term>
<def>
<p>Random Forest Regression</p>
</def>
</def-item>
<def-item>
<term>P<sub>r</sub></term>
<def>
<p>Reservoir Pressure</p>
</def>
</def-item>
<def-item>
<term>SVR</term>
<def>
<p>Support Vector Regression</p>
</def>
</def-item>
<def-item>
<term>GPR</term>
<def>
<p>Gaussian Process Regression</p>
</def>
</def-item>
<def-item>
<term>LSSVM</term>
<def>
<p>Least Squares Support Vector Machine</p>
</def>
</def-item>
<def-item>
<term>GEP</term>
<def>
<p>Genetic Expression Programming</p>
</def>
</def-item>
<def-item>
<term>GA-BPNN</term>
<def>
<p>Genetic Algorithm Based Back Propagation Artificial Neural Network</p>
</def>
</def-item>
<def-item>
<term>ANN</term>
<def>
<p>Artificial Neural Network</p>
</def>
</def-item>
<def-item>
<term>PV</term>
<def>
<p>Pore Volume</p>
</def>
</def-item>
<def-item>
<term>ML</term>
<def>
<p>Machine Learning</p>
</def>
</def-item>
<def-item>
<term>DT</term>
<def>
<p>Decision Trees</p>
</def>
</def-item>
<def-item>
<term>y<sub>CO2</sub></term>
<def>
<p>Mole fraction of CO<sub>2</sub> in solution</p>
</def>
</def-item>
<def-item>
<term>STD</term>
<def>
<p>Standard Deviation</p>
</def>
</def-item>
<def-item>
<term>CV</term>
<def>
<p>Coefficient of Variation</p>
</def>
</def-item>
<def-item>
<term>APRE</term>
<def>
<p>Average Relative Error Percent</p>
</def>
</def-item>
<def-item>
<term>SVM</term>
<def>
<p>Support Vector Machines</p>
</def>
</def-item>
<def-item>
<term>AAPRE</term>
<def>
<p>Average Absolute Percent Relative Error</p>
</def>
</def-item>
<def-item>
<term>RMSE</term>
<def>
<p>Root Mean Square Error</p>
</def>
</def-item>
<def-item>
<term>R<sup>2</sup></term>
<def>
<p>Coefficient of Determination</p>
</def>
</def-item>
<def-item>
<term>R</term>
<def>
<p>Relevance Factor</p>
</def>
</def-item>
<def-item>
<term>T<sub>R</sub></term>
<def>
<p>Reservoir Temperature</p>
</def>
</def-item>
<def-item>
<term>T<sub>c</sub></term>
<def>
<p>Critical temperature</p>
</def>
</def-item>
<def-item>
<term>MW<sub>C5&#x002B;</sub></term>
<def>
<p>Molecular weight of C<sub>5&#x002B;</sub> oil fraction, g/mol</p>
</def>
</def-item>
<def-item>
<term>MW<sub>C7&#x002B;</sub></term>
<def>
<p>Molecular weight of C<sub>7&#x002B;</sub> oil fraction, g/mol</p>
</def>
</def-item>
<def-item>
<term>X<sub>vol</sub></term>
<def>
<p>Mole fraction of volatile oil components, including (C<sub>1</sub> and N<sub>2</sub>)</p>
</def>
</def-item>
<def-item>
<term>X<sub>int</sub></term>
<def>
<p>Mole fraction of intermediate oil components, including (C<sub>2</sub>-C<sub>4</sub>, CO<sub>2</sub>, and H<sub>2</sub>S)</p>
</def>
</def-item>
<def-item>
<term>X<sub>C5-C6</sub></term>
<def>
<p>Mole fraction of oil components, including(C<sub>5</sub> and C<sub>6</sub>)</p>
</def>
</def-item>
<def-item>
<term>y<sub>C1</sub></term>
<def>
<p>Mole fraction of C<sub>1</sub> in the CO<sub>2</sub> injection gas</p>
</def>
</def-item>
<def-item>
<term>y<sub>N2</sub></term>
<def>
<p>Mole fraction of N<sub>2</sub> in the injection gas</p>
</def>
</def-item>
<def-item>
<term>y<sub>H2S</sub></term>
<def>
<p>Mole fraction of H<sub>2</sub>S in the injection gas</p>
</def>
</def-item>
<def-item>
<term>y<sub>C2-C4</sub></term>
<def>
<p>Mole fraction of C<sub>2</sub>-C<sub>4</sub> in the injection gas</p>
</def>
</def-item>
<def-item>
<term>MLR</term>
<def>
<p>Multiple Linear Regression</p>
</def>
</def-item>
</def-list>
</glossary>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>[1]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Kumar</surname> <given-names>N</given-names></string-name>, <string-name><surname>Sampaio</surname> <given-names>MA</given-names></string-name>, <string-name><surname>Ojha</surname> <given-names>K</given-names></string-name>, <string-name><surname>Hoteit</surname> <given-names>H</given-names></string-name>, <string-name><surname>Mandal</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Fundamental aspects, mechanisms and emerging possibilities of CO<sub>2</sub> miscible flooding in enhanced oil recovery: a review</article-title>. <source>Fuel</source>. <year>2022</year>;<volume>330</volume>(<issue>3</issue>):<fpage>125633</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.fuel.2022.125633</pub-id>.</mixed-citation></ref>
<ref id="ref-2"><label>[2]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Al-Obaidi</surname> <given-names>DA</given-names></string-name>, <string-name><surname>Al-Mudhafar</surname> <given-names>WJ</given-names></string-name>, <string-name><surname>Hussein</surname> <given-names>HH</given-names></string-name>, <string-name><surname>Rao</surname> <given-names>DN</given-names></string-name></person-group>. <article-title>Experimental influence assessments of water drive and gas breakthrough through the CO<sub>2</sub>-assisted gravity drainage process in reservoirs with strong aquifers</article-title>. <source>Fuel</source>. <year>2024</year>;<volume>370</volume>:<fpage>131873</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.fuel.2024.131873</pub-id>.</mixed-citation></ref>
<ref id="ref-3"><label>[3]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ali</surname> <given-names>M</given-names></string-name>, <string-name><surname>Jha</surname> <given-names>NK</given-names></string-name>, <string-name><surname>Pal</surname> <given-names>N</given-names></string-name>, <string-name><surname>Keshavarz</surname> <given-names>A</given-names></string-name>, <string-name><surname>Hoteit</surname> <given-names>H</given-names></string-name>, <string-name><surname>Sarmadivaleh</surname> <given-names>M</given-names></string-name></person-group>. <article-title>Recent advances in carbon dioxide geological storage, experimental procedures, influencing parameters, and future outlook</article-title>. <source>Earth-Sci Rev</source>. <year>2022</year>;<volume>225</volume>(<issue>14</issue>):<fpage>103895</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.earscirev.2021.103895</pub-id>.</mixed-citation></ref>
<ref id="ref-4"><label>[4]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Liang</surname> <given-names>B</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>C</given-names></string-name>, <string-name><surname>Jia</surname> <given-names>C</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>C</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>X</given-names></string-name>, <string-name><surname>Zha</surname> <given-names>Y</given-names></string-name></person-group>. <article-title>Carbon capture, utilization and storage (CCUS) in oil and gas reservoirs in China: status, opportunities and challenges</article-title>. <source>Fuel</source>. <year>2024</year>;<volume>375</volume>(<issue>2</issue>):<fpage>132353</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.fuel.2024.132353</pub-id>.</mixed-citation></ref>
<ref id="ref-5"><label>[5]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Huang</surname> <given-names>T</given-names></string-name>, <string-name><surname>Li</surname> <given-names>X</given-names></string-name>, <string-name><surname>Cheng</surname> <given-names>L</given-names></string-name></person-group>. <article-title>Modeling of the flow and heat transfer of supercritical CO<sub>2</sub> flowing in serpentine tubes</article-title>. <source>Front Heat Mass Trans</source>. <year>2020</year>;<volume>15</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>8</lpage>. doi:<pub-id pub-id-type="doi">10.5098/hmt.15.15</pub-id>.</mixed-citation></ref>
<ref id="ref-6"><label>[6]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Kang</surname> <given-names>X</given-names></string-name>, <string-name><surname>Kang</surname> <given-names>W</given-names></string-name>, <string-name><surname>Li</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>H</given-names></string-name>, <string-name><surname>Xie</surname> <given-names>A</given-names></string-name>, <string-name><surname>Li</surname> <given-names>M</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>Stability influence factors and mechanism of produced emulsion from CO<sub>2</sub> flooding</article-title>. <source>J Molecular Liq</source>. <year>2021</year>;<volume>333</volume>:<fpage>115974</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.molliq.2021.115974</pub-id>.</mixed-citation></ref>
<ref id="ref-7"><label>[7]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Yu</surname> <given-names>H</given-names></string-name>, <string-name><surname>Feng</surname> <given-names>J</given-names></string-name>, <string-name><surname>Zeng</surname> <given-names>H</given-names></string-name>, <string-name><surname>Xie</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Song</surname> <given-names>J</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>A new empirical correlation of MMP prediction for oil&#x2014;impure CO<sub>2</sub> systems</article-title>. <source>Fuel</source>. <year>2024</year>;<volume>371</volume>:<fpage>132043</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.fuel.2024.132043</pub-id>.</mixed-citation></ref>
<ref id="ref-8"><label>[8]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Sun</surname> <given-names>J</given-names></string-name>, <string-name><surname>Sun</surname> <given-names>L</given-names></string-name>, <string-name><surname>Bao</surname> <given-names>B</given-names></string-name></person-group>. <article-title>Recovery mechanism of supercritical CO<sub>2</sub> miscible flooding in 30 nm nanomatrices with multi-scale and fracture heterogeneity</article-title>. <source>Fuel</source>. <year>2025</year>;<volume>392</volume>:<fpage>134835</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.fuel.2025.134835</pub-id>.</mixed-citation></ref>
<ref id="ref-9"><label>[9]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Flock</surname> <given-names>DL</given-names></string-name>, <string-name><surname>Nouar</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Parametric analysis on the determination of the minimum miscibility pressure in slim tube displacements</article-title>. <source>J Can Pet Technol</source>. <year>1984</year>;<volume>23</volume>(<issue>5</issue>). doi:<pub-id pub-id-type="doi">10.2118/84-05-12</pub-id>.</mixed-citation></ref>
<ref id="ref-10"><label>[10]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Song</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Song</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Mo</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Jing</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>F</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>Determination of minimum miscibility and near-miscibility pressures for CO<sub>2</sub>-oil mixtures in shale reservoirs</article-title>. <source>Fuel</source>. <year>2025</year>;<volume>388</volume>:<fpage>134531</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.fuel.2025.134531</pub-id>.</mixed-citation></ref>
<ref id="ref-11"><label>[11]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chen</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>J</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>X</given-names></string-name></person-group>. <article-title>A comprehensive review of minimum miscibility pressure determination and reduction strategies between CO<sub>2</sub> and crude oil in CCUS processes</article-title>. <source>Fuel</source>. <year>2025</year>;<volume>384</volume>:<fpage>134053</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.fuel.2024.134053</pub-id>.</mixed-citation></ref>
<ref id="ref-12"><label>[12]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Xian</surname> <given-names>B</given-names></string-name>, <string-name><surname>Hao</surname> <given-names>H</given-names></string-name>, <string-name><surname>Deng</surname> <given-names>S</given-names></string-name>, <string-name><surname>Wu</surname> <given-names>H</given-names></string-name>, <string-name><surname>Sun</surname> <given-names>T</given-names></string-name>, <string-name><surname>Cheng</surname> <given-names>L</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>Laboratory experiments of CO<sub>2</sub> flooding and its influencing factors on oil recovery in a low permeability reservoir with medium viscous oil</article-title>. <source>Fuel</source>. <year>2024</year>;<volume>371</volume>(<issue>4</issue>):<fpage>131871</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.fuel.2024.131871</pub-id>.</mixed-citation></ref>
<ref id="ref-13"><label>[13]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Holm</surname> <given-names>LW</given-names></string-name>, <string-name><surname>Josendal</surname> <given-names>VA</given-names></string-name></person-group>. <article-title>Mechanisms of oil displacement by carbon dioxide</article-title>. <source>SPE J Pet Technol</source>. <year>1974</year>;<volume>26</volume>:<fpage>1427</fpage>&#x2013;<lpage>38</lpage>. doi:<pub-id pub-id-type="doi">10.2118/4736-PA</pub-id>.</mixed-citation></ref>
<ref id="ref-14"><label>[14]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Mansour</surname> <given-names>EM</given-names></string-name>, <string-name><surname>Al-Sabagh</surname> <given-names>AM</given-names></string-name>, <string-name><surname>Desouky</surname> <given-names>SM</given-names></string-name>, <string-name><surname>Zawawy</surname> <given-names>FM</given-names></string-name>, <string-name><surname>Ramzi</surname> <given-names>M</given-names></string-name></person-group>. <article-title>A new estimating method of minimum miscibility pressure as a key parameter in designing CO<sub>2</sub> gas injection process</article-title>. <source>Egyptian J Petrol</source>. <year>2018</year>;<volume>27</volume>(<issue>4</issue>):<fpage>801</fpage>&#x2013;<lpage>10</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.ejpe.2017.12.002</pub-id>.</mixed-citation></ref>
<ref id="ref-15"><label>[15]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chen</surname> <given-names>G</given-names></string-name>, <string-name><surname>Gao</surname> <given-names>H</given-names></string-name>, <string-name><surname>Fu</surname> <given-names>K</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>H</given-names></string-name>, <string-name><surname>Liang</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Tontiwachwuthikul</surname> <given-names>P</given-names></string-name></person-group>. <article-title>An improved correlation to determine minimum miscibility pressure of CO<sub>2</sub>&#x2013;oil system</article-title>. <source>Green Ene Environ</source>. <year>2020</year>;<volume>5</volume>(<issue>1</issue>):<fpage>97</fpage>&#x2013;<lpage>104</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.gee.2018.12.003</pub-id>.</mixed-citation></ref>
<ref id="ref-16"><label>[16]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Yellig</surname> <given-names>WF</given-names></string-name>, <string-name><surname>Metcalfe</surname> <given-names>RS</given-names></string-name></person-group>. <article-title>Determination and prediction of CO<sub>2</sub> minimum miscibility pressures (includes associated paper 8876)</article-title>. <source>J Petrol Techo1</source>. <year>1980</year>;<volume>32</volume>(<issue>1</issue>):<fpage>160</fpage>&#x2013;<lpage>8</lpage>. doi:<pub-id pub-id-type="doi">10.2118/7477-PA</pub-id>.</mixed-citation></ref>
<ref id="ref-17"><label>[17]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Glas&#x00F8;</surname> <given-names>&#x00D8;</given-names></string-name></person-group>. <article-title>Generalized minimum miscibility pressure correlation</article-title>. <source>Soc Petrol Eng J</source>. <year>1985</year>;<volume>25</volume>(<issue>6</issue>):<fpage>927</fpage>&#x2013;<lpage>34</lpage>. doi:<pub-id pub-id-type="doi">10.2118/12893-PA</pub-id>.</mixed-citation></ref>
<ref id="ref-18"><label>[18]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Alston</surname> <given-names>RB</given-names></string-name>, <string-name><surname>Kokolis</surname> <given-names>GP</given-names></string-name>, <string-name><surname>James</surname> <given-names>CF</given-names></string-name></person-group>. <article-title>CO<sub>2</sub> minimum miscibility pressure: a correlation for impure CO<sub>2</sub> streams and live oil systems</article-title>. <source>Soc Petrol Eng J</source>. <year>1985</year>;<volume>25</volume>(<issue>2</issue>):<fpage>268</fpage>&#x2013;<lpage>74</lpage>. doi:<pub-id pub-id-type="doi">10.2118/11959-PA</pub-id>.</mixed-citation></ref>
<ref id="ref-19"><label>[19]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Abdurrahman</surname> <given-names>M</given-names></string-name>, <string-name><surname>Permadi</surname> <given-names>AK</given-names></string-name>, <string-name><surname>Bae</surname> <given-names>WS</given-names></string-name></person-group>. <article-title>An improved method for estimating minimum miscibility pressure through condensation-extraction process under swelling tests</article-title>. <source>J Petrol Sci Eng</source>. <year>2015</year>;<volume>131</volume>:<fpage>165</fpage>&#x2013;<lpage>71</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.petrol.2015.04.033</pub-id>.</mixed-citation></ref>
<ref id="ref-20"><label>[20]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ge</surname> <given-names>D</given-names></string-name>, <string-name><surname>Cheng</surname> <given-names>H</given-names></string-name>, <string-name><surname>Cai</surname> <given-names>M</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Dong</surname> <given-names>P</given-names></string-name></person-group>. <article-title>A new predictive method for CO<sub>2</sub>-oil minimum miscibility pressure</article-title>. <source>Geofluids</source>. <year>2021</year>;<volume>1</volume>:<fpage>1</fpage>&#x2013;<lpage>8</lpage>. doi:<pub-id pub-id-type="doi">10.1155/2021/8868592</pub-id>.</mixed-citation></ref>
<ref id="ref-21"><label>[21]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chen</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Li</surname> <given-names>D</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>S</given-names></string-name>, <string-name><surname>Liao</surname> <given-names>X</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>B</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>D</given-names></string-name></person-group>. <article-title>A well-test model for gas hydrate dissociation considering a dynamic interface</article-title>. <source>Fuel</source>. <year>2022</year>;<volume>314</volume>(<issue>1&#x2013;2</issue>):<fpage>123053</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.fuel.2021.123053</pub-id>.</mixed-citation></ref>
<ref id="ref-22"><label>[22]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Bagheri</surname> <given-names>H</given-names></string-name>, <string-name><surname>Mohebian</surname> <given-names>R</given-names></string-name>, <string-name><surname>Moradzadeh</surname> <given-names>A</given-names></string-name>, <string-name><surname>Olya</surname> <given-names>BAM</given-names></string-name></person-group>. <article-title>Pore size classification and prediction based on distribution of reservoir fluid volumes utilizing well logs and deep learning algorithm in a complex lithology</article-title>. <source>Artif Intell Geosci</source>. <year>2024</year>;<volume>5</volume>:<fpage>100094</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.aiig.2024.100094</pub-id>.</mixed-citation></ref>
<ref id="ref-23"><label>[23]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Bhattacherjee</surname> <given-names>R</given-names></string-name>, <string-name><surname>Botchway</surname> <given-names>K</given-names></string-name>, <string-name><surname>Pashin</surname> <given-names>JC</given-names></string-name>, <string-name><surname>Chakraborty</surname> <given-names>G</given-names></string-name>, <string-name><surname>Bikkina</surname> <given-names>P</given-names></string-name></person-group>. <article-title>Developing statistical and machine learning models for predicting CO<sub>2</sub> solubility in live crude oils</article-title>. <source>Fuel</source>. <year>2024</year>;<volume>368</volume>:<fpage>131577</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.fuel.2024.131577</pub-id>.</mixed-citation></ref>
<ref id="ref-24"><label>[24]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wu</surname> <given-names>B</given-names></string-name>, <string-name><surname>Xie</surname> <given-names>R</given-names></string-name>, <string-name><surname>Xiao</surname> <given-names>L</given-names></string-name>, <string-name><surname>Guo</surname> <given-names>J</given-names></string-name>, <string-name><surname>Jin</surname> <given-names>G</given-names></string-name>, <string-name><surname>Fu</surname> <given-names>J</given-names></string-name></person-group>. <article-title>Integrated classification method of tight sandstone reservoir based on principal component analysis&#x2014;simulated annealing genetic algorithm-fuzzy cluster means</article-title>. <source>Petrol Sci</source>. <year>2023</year>;<volume>20</volume>(<issue>5</issue>):<fpage>2747</fpage>&#x2013;<lpage>58</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.petsci.2023.04.014</pub-id>.</mixed-citation></ref>
<ref id="ref-25"><label>[25]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Huang</surname> <given-names>YF</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>GH</given-names></string-name>, <string-name><surname>Dong</surname> <given-names>MZ</given-names></string-name>, <string-name><surname>Feng</surname> <given-names>GM</given-names></string-name></person-group>. <article-title>Development of an artificial neural network model for predicting minimum miscibility pressure in CO<sub>2</sub> flooding</article-title>. <source>J Petrol Sci Engg</source>. <year>2003</year>;<volume>37</volume>(<issue>1</issue>):<fpage>83</fpage>&#x2013;<lpage>95</lpage>. doi:<pub-id pub-id-type="doi">10.1016/S0920-4105(02)00312-1</pub-id>.</mixed-citation></ref>
<ref id="ref-26"><label>[26]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Shokrollahi</surname> <given-names>A</given-names></string-name>, <string-name><surname>Arabloo</surname> <given-names>M</given-names></string-name>, <string-name><surname>Gharagheizi</surname> <given-names>F</given-names></string-name>, <string-name><surname>Mohammadi</surname> <given-names>AH</given-names></string-name></person-group>. <article-title>Intelligent model for prediction of CO<sub>2</sub>&#x2014;reservoir oil minimum miscibility pressure</article-title>. <source>Fuel</source>. <year>2013</year>;<volume>112</volume>:<fpage>375</fpage>&#x2013;<lpage>84</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.fuel.2013.04.036</pub-id>.</mixed-citation></ref>
<ref id="ref-27"><label>[27]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Kamari</surname> <given-names>A</given-names></string-name>, <string-name><surname>Arabloo</surname> <given-names>M</given-names></string-name>, <string-name><surname>Shokrollahi</surname> <given-names>A</given-names></string-name>, <string-name><surname>Gharagheizi</surname> <given-names>F</given-names></string-name>, <string-name><surname>Mohammadi</surname> <given-names>AH</given-names></string-name></person-group>. <article-title>Rapid method to estimate the minimum miscibility pressure (MMP) in live reservoir oil systems during CO<sub>2</sub> flooding</article-title>. <source>Fuel</source>. <year>2015</year>;<volume>153</volume>:<fpage>310</fpage>&#x2013;<lpage>9</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.fuel.2015.02.087</pub-id>.</mixed-citation></ref>
<ref id="ref-28"><label>[28]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Emera</surname> <given-names>MK</given-names></string-name>, <string-name><surname>Sarma</surname> <given-names>HK</given-names></string-name></person-group>. <article-title>Use of genetic algorithm to estimate CO<sub>2</sub>&#x2013;oil minimum miscibility pressure&#x2014;a key parameter in design of CO<sub>2</sub> miscible flood</article-title>. <source>J Petrol Sci Eng</source>. <year>2005</year>;<volume>46</volume>(<issue>1</issue>):<fpage>37</fpage>&#x2013;<lpage>52</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.petrol.2004.10.001</pub-id>.</mixed-citation></ref>
<ref id="ref-29"><label>[29]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Birang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Dinarvand</surname> <given-names>N</given-names></string-name>, <string-name><surname>Shariatpanahi</surname> <given-names>SF</given-names></string-name>, <string-name><surname>Edalat</surname> <given-names>M</given-names></string-name></person-group>. <chapter-title>Development of a new artificial-neural-network model for predicting minimum miscibility pressure in hydrocarbon gas injection</chapter-title>. Paper presented at: The <article-title>SPE Middle East Oil and Gas Show and Conference</article-title>; <year>2007 Mar</year>; <publisher-loc>Manama, Bahrain</publisher-loc>. doi:<pub-id pub-id-type="doi">10.2118/105407-MS</pub-id>.</mixed-citation></ref>
<ref id="ref-30"><label>[30]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Bian</surname> <given-names>X</given-names></string-name>, <string-name><surname>Han</surname> <given-names>B</given-names></string-name>, <string-name><surname>Du</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Jaubert</surname> <given-names>JN</given-names></string-name>, <string-name><surname>Li</surname> <given-names>M</given-names></string-name></person-group>. <article-title>Integrating support vector regression with genetic algorithm for CO<sub>2</sub>-oil minimum miscibility pressure (MMP) in pure and impure CO<sub>2</sub> streams</article-title>. <source>Fuel</source>. <year>2016</year>;<volume>182</volume>:<fpage>550</fpage>&#x2013;<lpage>7</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.fuel.2016.05.124</pub-id>.</mixed-citation></ref>
<ref id="ref-31"><label>[31]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Dehaghani</surname> <given-names>AHS</given-names></string-name>, <string-name><surname>Soleimani</surname> <given-names>R</given-names></string-name></person-group>. <article-title>Prediction of CO<sub>2</sub>-oil minimum miscibility pressure using soft computing methods</article-title>. <source>Chem Eng Technol</source>. <year>2020</year>;<volume>43</volume>(<issue>7</issue>):<fpage>1361</fpage>&#x2013;<lpage>71</lpage>. doi:<pub-id pub-id-type="doi">10.1002/ceat.201900411</pub-id>.</mixed-citation></ref>
<ref id="ref-32"><label>[32]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Amar</surname> <given-names>MN</given-names></string-name>, <string-name><surname>Zeraibi</surname> <given-names>N</given-names></string-name></person-group>. <article-title>Application of hybrid support vector regression artificial bee colony for prediction of MMP in CO<sub>2</sub>-EOR process</article-title>. <source>Petroleum</source>. <year>2020</year>;<volume>6</volume>(<issue>4</issue>):<fpage>415</fpage>&#x2013;<lpage>22</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.petlm.2018.08.001</pub-id>.</mixed-citation></ref>
<ref id="ref-33"><label>[33]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ghiasi</surname> <given-names>MM</given-names></string-name>, <string-name><surname>Mohammadi</surname> <given-names>AH</given-names></string-name>, <string-name><surname>Zendehboudi</surname> <given-names>S</given-names></string-name></person-group>. <article-title>Use of hybrid-ANFIS and ensemble methods to calculate minimum miscibility pressure of CO<sub>2</sub>&#x2014;reservoir oil system in miscible flooding process</article-title>. <source>J Molecul Liquids</source>. <year>2021</year>;<volume>331</volume>:<fpage>115369</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.molliq.2021.115369</pub-id>.</mixed-citation></ref>
<ref id="ref-34"><label>[34]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Al-Khafaji</surname> <given-names>HF</given-names></string-name>, <string-name><surname>Meng</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Hussain</surname> <given-names>W</given-names></string-name>, <string-name><surname>Mohammed</surname> <given-names>RK</given-names></string-name>, <string-name><surname>Harash</surname> <given-names>F</given-names></string-name>, <string-name><surname>AlFakey</surname> <given-names>SA</given-names></string-name></person-group>. <article-title>Predicting minimum miscible pressure in pure CO<sub>2</sub> flooding using machine learning: method comparison and sensitivity analysis</article-title>. <source>Fuel</source>. <year>2023</year>;<volume>354</volume>:<fpage>129263</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.fuel.2023.129263</pub-id>.</mixed-citation></ref>
<ref id="ref-35"><label>[35]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Sebastian</surname> <given-names>HM</given-names></string-name>, <string-name><surname>Wenger</surname> <given-names>RS</given-names></string-name>, <string-name><surname>Renner</surname> <given-names>TA</given-names></string-name></person-group>. <article-title>Correlation of minimum miscibility pressure for impure CO<sub>2</sub> streams</article-title>. <source>J Pet Technol</source>. <year>1985</year>;<volume>37</volume>:<fpage>2076</fpage>&#x2013;<lpage>82</lpage>. doi:<pub-id pub-id-type="doi">10.2118/12648-PA</pub-id>.</mixed-citation></ref>
<ref id="ref-36"><label>[36]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Fathaddin</surname> <given-names>MT</given-names></string-name>, <string-name><surname>Irawan</surname> <given-names>S</given-names></string-name>, <string-name><surname>Setiati</surname> <given-names>R</given-names></string-name>, <string-name><surname>Rakhmanto</surname> <given-names>PA</given-names></string-name>, <string-name><surname>Prakoso</surname> <given-names>S</given-names></string-name>, <string-name><surname>Mardiana</surname> <given-names>DA</given-names></string-name></person-group>. <article-title>Optimized artificial neural network application for estimating oil recovery factor of solution gas drive sandstone reservoirs</article-title>. <source>Heliyon</source>. <year>2024</year>;<volume>10</volume>(<issue>13</issue>):<fpage>e33824</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.heliyon.2024.e33824</pub-id>; <pub-id pub-id-type="pmid">39027583</pub-id></mixed-citation></ref>
<ref id="ref-37"><label>[37]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Zhang</surname> <given-names>R</given-names></string-name>, <string-name><surname>Tong</surname> <given-names>W</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>S</given-names></string-name>, <string-name><surname>Qiu</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Zhu</surname> <given-names>X</given-names></string-name></person-group>. <article-title>ANN model with feature selection to predict turbulent heat transfer characteristics of supercritical fluids: take CO<sub>2</sub> and H<sub>2</sub>O as examples</article-title>. <source>Int J Therm Sci</source>. <year>2023</year>;<volume>188</volume>(<issue>1</issue>):<fpage>108247</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.ijthermalsci.2023.108247</pub-id>.</mixed-citation></ref>
<ref id="ref-38"><label>[38]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Bayazitova</surname> <given-names>G</given-names></string-name>, <string-name><surname>Anastasiadou</surname> <given-names>M</given-names></string-name>, <string-name><surname>Santos</surname> <given-names>VDD</given-names></string-name></person-group>. <article-title>Oil and gas flow anomaly detection on offshore naturally flowing wells using deep neural networks</article-title>. <source>Geoene Sci Eng</source>. <year>2024</year>;<volume>242</volume>(<issue>4</issue>):<fpage>213240</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.geoen.2024.213240</pub-id>.</mixed-citation></ref>
<ref id="ref-39"><label>[39]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Yang</surname> <given-names>L</given-names></string-name>, <string-name><surname>Li</surname> <given-names>B</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>K</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>C</given-names></string-name>, <string-name><surname>Jing</surname> <given-names>F</given-names></string-name></person-group>. <article-title>Multi-objective shape optimization of offshore swirl-vane oil-water separators using genetic algorithm network coupled computational fluid dynamics</article-title>. <source>Ocean Eng</source>. <year>2025</year>;<volume>325</volume>:<fpage>120827</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.oceaneng.2025.120827</pub-id>.</mixed-citation></ref>
<ref id="ref-40"><label>[40]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Guo</surname> <given-names>D</given-names></string-name>, <string-name><surname>Kang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Zhao</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Li</surname> <given-names>S</given-names></string-name></person-group>. <article-title>Optimization of fracturing parameters for tight oil production based on genetic algorithm</article-title>. <source>Petroleum</source>. <year>2022</year>;<volume>8</volume>(<issue>2</issue>):<fpage>252</fpage>&#x2013;<lpage>63</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.petlm.2021.11.006</pub-id>.</mixed-citation></ref>
<ref id="ref-41"><label>[41]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wen</surname> <given-names>S</given-names></string-name>, <string-name><surname>Wei</surname> <given-names>B</given-names></string-name>, <string-name><surname>You</surname> <given-names>J</given-names></string-name>, <string-name><surname>He</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Xin</surname> <given-names>J</given-names></string-name>, <string-name><surname>Varfolomeev</surname> <given-names>MA</given-names></string-name></person-group>. <article-title>Forecasting oil production in unconventional reservoirs using long short term memory network coupled support vector regression method: a case study</article-title>. <source>Petroleum</source>. <year>2023</year>;<volume>9</volume>(<issue>4</issue>):<fpage>647</fpage>&#x2013;<lpage>57</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.petlm.2023.05.004</pub-id>.</mixed-citation></ref>
<ref id="ref-42"><label>[42]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Dargahi-Zarandi</surname> <given-names>A</given-names></string-name>, <string-name><surname>Hemmati-Sarapardeh</surname> <given-names>A</given-names></string-name>, <string-name><surname>Shateri</surname> <given-names>M</given-names></string-name>, <string-name><surname>Menad</surname> <given-names>NA</given-names></string-name>, <string-name><surname>Ahmadi</surname> <given-names>M</given-names></string-name></person-group>. <article-title>Modeling minimum miscibility pressure of pure/impure CO<sub>2</sub>-crude oil systems using adaptive boosting support vector regression: Application to gas injection processes</article-title>. <source> J Petrol Sci Eng</source>. <year>2020</year>;<volume>184</volume>:<fpage>106499</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.petrol.2019.106499</pub-id>.</mixed-citation></ref>
<ref id="ref-43"><label>[43]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Lv</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Rashidi-Khaniabadi</surname> <given-names>A</given-names></string-name>, <string-name><surname>Zheng</surname> <given-names>R</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>T</given-names></string-name>, <string-name><surname>Mohammadi</surname> <given-names>MR</given-names></string-name>, <string-name><surname>Hemmati-Sarapardeh</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Modelling CO<sub>2</sub> diffusion coefficient in heavy crude oils and bitumen using extreme gradient boosting and Gaussian process regression</article-title>. <source>Energy</source>. <year>2023</year>;<volume>275</volume>:<fpage>127396</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.energy.2023.127396</pub-id>.</mixed-citation></ref>
<ref id="ref-44"><label>[44]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Cui</surname> <given-names>X</given-names></string-name>, <string-name><surname>Zheng</surname> <given-names>L</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Cui</surname> <given-names>P</given-names></string-name>, <string-name><surname>Du</surname> <given-names>D</given-names></string-name></person-group>. <article-title>Determination of the minimum miscibility pressure of the CO<sub>2</sub>/oil system based on quantification of the oil droplet volume reduction behavior</article-title>. <source>Coll Surf A: Physicochem Eng Asp</source>. <year>2022</year>;<volume>653</volume>:<fpage>130058</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.colsurfa.2022.130058</pub-id>.</mixed-citation></ref>
<ref id="ref-45"><label>[45]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Zang</surname> <given-names>H</given-names></string-name>, <string-name><surname>Yan</surname> <given-names>S</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Li</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Du</surname> <given-names>D</given-names></string-name></person-group>. <article-title>Determination of the minimum miscibility pressures of CO<sub>2</sub> and multicomponent oil system with a novel diffusion coefficient fit line intersection methodology</article-title>. <source>Fuel</source>. <year>2025</year>;<volume>385</volume>(<issue>43</issue>):<fpage>134106</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.fuel.2024.134106</pub-id>.</mixed-citation></ref>
<ref id="ref-46"><label>[46]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Lee</surname> <given-names>IJ</given-names></string-name></person-group>. <article-title>Effectiveness of carbon dioxide displacement under miscible and immiscible conditions</article-title>. <publisher-loc>Calgary, AB, Canada: Petroleum Recovery Inst.</publisher-loc>; <year>1979</year>. PRI-7910; CE-02239.</mixed-citation></ref>
<ref id="ref-47"><label>[47]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Cronquist</surname> <given-names>C</given-names></string-name></person-group>. <article-title>Carbon dioxide dynamic displacement with light reservoir oils</article-title>. In: <conf-name>Fourth Annual U.S. DOE Symposium</conf-name>; <year>1978</year>; <publisher-loc>Tulsa, OK, USA</publisher-loc>.</mixed-citation></ref>
<ref id="ref-48"><label>[48]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Yuan</surname> <given-names>H</given-names></string-name>, <string-name><surname>Johns</surname> <given-names>RT</given-names></string-name>, <string-name><surname>Egwuenu</surname> <given-names>AM</given-names></string-name>, <string-name><surname>Dindoruk</surname> <given-names>B</given-names></string-name></person-group>. <chapter-title>Improved MMP correlations for CO<sub>2</sub> floods using analytical gas flooding theory</chapter-title>. Paper presented at: The <article-title>SPE/DOE Symposium on Improved Oil Recovery</article-title>; <year>2004 Apr</year>; <publisher-loc>Tulsa, Oklahoma</publisher-loc>. doi:<pub-id pub-id-type="doi">10.2118/89359-MS</pub-id>.</mixed-citation></ref>
<ref id="ref-49"><label>[49]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ma</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Cai</surname> <given-names>X</given-names></string-name>, <string-name><surname>Li</surname> <given-names>M</given-names></string-name>, <string-name><surname>Li</surname> <given-names>H</given-names></string-name>, <string-name><surname>Zhu</surname> <given-names>D</given-names></string-name>, <string-name><surname>Qiu</surname> <given-names>N</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>Research advances on the mechanisms of reservoir formation and hydrocarbon accumulation and the oil and gas development methods of deep and ultra-deep marine carbonates</article-title>. <source>Petrol Explorat Develop</source>. <year>2024</year>;<volume>51</volume>(<issue>4</issue>):<fpage>795</fpage>&#x2013;<lpage>812</lpage>. doi:<pub-id pub-id-type="doi">10.1016/S1876-3804(24)60507-0</pub-id>.</mixed-citation></ref>
<ref id="ref-50"><label>[50]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Lei</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Cai</surname> <given-names>B</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>X</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>L</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>Progress and development directions of stimulation techniques for ultra-deep oil and gas reservoirs</article-title>. <source>Petrol Explorat Develop</source>. <year>2021</year>;<volume>48</volume>:<fpage>221</fpage>&#x2013;<lpage>31</lpage>. doi:<pub-id pub-id-type="doi">10.1016/S1876-3804(21)60018-6</pub-id>.</mixed-citation></ref>
<ref id="ref-51"><label>[51]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Yu</surname> <given-names>H</given-names></string-name>, <string-name><surname>Lu</surname> <given-names>X</given-names></string-name>, <string-name><surname>Fu</surname> <given-names>W</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>H</given-names></string-name>, <string-name><surname>Xie</surname> <given-names>Q</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>Determination of minimum near miscible pressure region during CO<sub>2</sub> and associated gas injection for tight oil reservoir in Ordos Basin, China</article-title>. <source>Fuel</source>. <year>2020</year>;<volume>263</volume>:<fpage>116737</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.fuel.2019.116737</pub-id>.</mixed-citation></ref>
</ref-list>
</back></article>












