<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xml:lang="en" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">SDHM</journal-id>
<journal-id journal-id-type="nlm-ta">SDHM</journal-id>
<journal-id journal-id-type="publisher-id">SDHM</journal-id>
<journal-title-group>
<journal-title>Structural Durability &#x0026; Health Monitoring</journal-title>
</journal-title-group>
<issn pub-type="epub">1930-2991</issn>
<issn pub-type="ppub">1930-2983</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">52683</article-id>
<article-id pub-id-type="doi">10.32604/sdhm.2024.052683</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Intelligent Diagnosis of Highway Bridge Technical Condition Based on Defect Information</article-title><alt-title alt-title-type="left-running-head">Intelligent Diagnosis of Highway Bridge Technical Condition Based on Defect Information</alt-title><alt-title alt-title-type="right-running-head">Intelligent Diagnosis of Highway Bridge Technical Condition Based on Defect Information</alt-title>
</title-group>
<contrib-group>
<contrib id="author-1" contrib-type="author">
<name name-style="western"><surname>Ma</surname><given-names>Yanxue</given-names></name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-2" contrib-type="author" corresp="yes">
<name name-style="western"><surname>Liu</surname><given-names>Xiaoling</given-names></name>
<xref ref-type="aff" rid="aff-1">1</xref><email>liuxiaoling@nbu.edu.cn</email>
</contrib>
<contrib id="author-3" contrib-type="author">
<name name-style="western"><surname>Wang</surname><given-names>Bing</given-names></name>
<xref ref-type="aff" rid="aff-2">2</xref>
</contrib>
<contrib id="author-4" contrib-type="author">
<name name-style="western"><surname>Liu</surname><given-names>Ying</given-names></name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<aff id="aff-1"><label>1</label><institution>Faculty of Maritime and Transportation, Ningbo University</institution>, <addr-line>Ningbo, 315211</addr-line>, <country>China</country></aff>
<aff id="aff-2"><label>2</label><institution>School of Civil &#x0026; Environmental Engineering and Geography Science, Ningbo University</institution>, <addr-line>Ningbo, 315211</addr-line>, <country>China</country></aff>
</contrib-group><author-notes><corresp id="cor1"><label>&#x002A;</label>Corresponding Author: Xiaoling Liu. Email: <email>liuxiaoling@nbu.edu.cn</email></corresp></author-notes>
<pub-date date-type="collection" publication-format="electronic">
<year>2024</year></pub-date>
<pub-date date-type="pub" publication-format="electronic"><day>20</day><month>9</month><year>2024</year></pub-date>
<volume>18</volume>
<issue>6</issue>
<fpage>871</fpage>
<lpage>889</lpage>
<history>
<date date-type="received"><day>11</day><month>4</month><year>2024</year></date>
<date date-type="accepted"><day>04</day><month>7</month><year>2024</year></date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2024 The Authors.</copyright-statement>
<copyright-year>2024</copyright-year>
<copyright-holder>Published by Tech Science Press.</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="_SDHM_52683.pdf"></self-uri>
<abstract>
<p>In the bridge technical condition assessment standards, the evaluation of bridge conditions primarily relies on the defects identified through manual inspections, which are determined using the comprehensive hierarchical analysis method. However, the relationship between the defects and the technical condition of the bridges warrants further exploration. To address this situation, this paper proposes a machine learning-based intelligent diagnosis model for the technical condition of highway bridges. Firstly, collect the inspection records of highway bridges in a certain region of China, then standardize the severity of diverse defects in accordance with relevant specifications. Secondly, in order to enhance the independence between the defects, the key defect indicators were screened using Principal Component Analysis (PCA) in combination with the weights of the building blocks. Based on this, an enhanced Naive Bayesian Classification (NBC) algorithm is established for the intelligent diagnosis of technical conditions of highway bridges, juxtaposed with four other algorithms for comparison. Finally, key defect variables that affect changes in bridge grades are discussed. The results showed that the technical condition level of the superstructure had the highest correlation with cracks; the PCA-NBC algorithm achieved an accuracy of 93.50% of the predicted values, which was the highest improvement of 19.43% over other methods. The purpose of this paper is to provide inspectors with a convenient and predictive information-rich method to intelligently diagnose the technical condition of bridges based on bridge defects. The results of this research can help bridge inspectors and even non-specialists to better understand the condition of bridge defects.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>Highway bridges</kwd>
<kwd>defects</kwd>
<kwd>Naive Bayesian classification</kwd>
<kwd>principal component analysis</kwd>
<kwd>machine learning</kwd>
</kwd-group>
<funding-group>
<award-group id="awg1">
<funding-source>National Natural Science Foundation of China</funding-source>
<award-id>51808301</award-id>
</award-group>
<award-group id="awg2">
<funding-source>Zhejiang Provincial Education Department</funding-source>
<award-id>Y202248860</award-id>
</award-group>
<award-group id="awg3">
<funding-source>National &#x201C;111&#x201D; Centre on Safety and Intelligent Operation of Sea Bridge</funding-source>
<award-id>D21013</award-id>
</award-group>
</funding-group>
</article-meta>
</front>
<body>
<sec id="s1">
<label>1</label>
<title>Introduction</title>
<p>Bridges are subject to environmental corrosion and vehicle loads during operation, which can lead to defects such as concrete cracks and steel corrosion, thereby affecting the safety and durability of the bridge. Therefore, bridges inspection is very important. At present, bridge inspection mainly relies on on-site personnel investigation results, and then evaluates bridge levels according to regulations. However, the bridge defects investigated are often disconnected from the technical condition. Therefore, it is necessary to accurately grasp the relationship between bridge defects and technical conditions. This provides useful guidance for bridge inspectors to understand the technical conditions of bridges from a macro perspective. At the same time, it also provides reference for maintenance management decision-making.</p>
<p>Determining the technical condition level of a bridge based on its defect data is essentially an effective categorization of its data. Data classification is an important form of data analysis in the field of data mining [<xref ref-type="bibr" rid="ref-1">1</xref>], machine learning, and pattern recognition. The current mainstream classifiers include support vector machine (SVM), decision tree (DT), Naive Bayesian Classification (NBC), artificial neural network, and classification based on association rules [<xref ref-type="bibr" rid="ref-2">2</xref>&#x2013;<xref ref-type="bibr" rid="ref-4">4</xref>]. In the application of bridge engineering, Yang et al. [<xref ref-type="bibr" rid="ref-5">5</xref>] proposed an effective hybrid classification model to evaluate the health status of bridges, which solves the learning problem of the classification model on large-scale uncertain labels data. Bektas et al. [<xref ref-type="bibr" rid="ref-6">6</xref>] proposed a method to judge the bridge condition by using classification and regression trees. Chung et al. [<xref ref-type="bibr" rid="ref-7">7</xref>] proposed an estimation model for the safety level of highway bridges by comprehensively considering the basic specifications, year of completion, traffic, and safety rating of bridges. Martinez et al. [<xref ref-type="bibr" rid="ref-8">8</xref>] used a variety of classification models to predict the bridge condition index to determine the rehabilitation priority for the bridge. Feng et al. [<xref ref-type="bibr" rid="ref-9">9</xref>] used a combination of finite element simulation and support vector machine to classify the data, so as to achieve the purpose of providing a safety level for bridge scour warning. Mokalled et al. [<xref ref-type="bibr" rid="ref-10">10</xref>] proposed a strategy for multilevel damage classification of bridges using the Bayesian estimation technique based on drive-by health monitoring. Mangalathu et al. [<xref ref-type="bibr" rid="ref-11">11</xref>] used various machine learning methods of quadratic discriminant analysis to predict the failure modes of bridge components with an accuracy of 91%. Jootoo et al. [<xref ref-type="bibr" rid="ref-12">12</xref>] used Decision Trees (DT), Bayesian Networks, and Support Vector Machines (SVM) to predict bridge design types. Hu et al. [<xref ref-type="bibr" rid="ref-13">13</xref>] proposed a corrosion-damaged bridge abutment damage mode discrimination method considering probability based on Fisher&#x2019;s discriminant grouping, combined with Bayes&#x2019; formula and related theories. Zhang et al. [<xref ref-type="bibr" rid="ref-14">14</xref>] established a multi-indicator and multi-level bridge deck system reliability evaluation model in order to reasonably evaluate the reliability of the concrete bridge deck system, and gave the categorization criteria of the basic evaluation indexes. This shows that classification discrimination is an important tool widely used in the field of bridge engineering. Accurate categorization of bridge structures, materials, and defects can help engineers better understand the nature of the problem and develop appropriate solutions, both during the design phase and during construction and maintenance.</p>
<p>Such methods are also more widely used in other fields. For example, in medicine, many scholars use relevant classification algorithms, such as SVM, artificial neural networks, logistic regression, and other algorithms to predict and diagnose diseases in order to help clinicians [<xref ref-type="bibr" rid="ref-15">15</xref>&#x2013;<xref ref-type="bibr" rid="ref-18">18</xref>]. Zeng et al. [<xref ref-type="bibr" rid="ref-19">19</xref>] trained and fitted health data based on random forest and other related machine learning algorithms, improving the progress of health data classification. In addition, there are corresponding studies in the classification of fruits and vegetables, combining neural networks, SVM, and other related algorithms to classify the attributes of fruits and vegetables is a common and convenient method [<xref ref-type="bibr" rid="ref-20">20</xref>&#x2013;<xref ref-type="bibr" rid="ref-22">22</xref>]. And Gupta et al. [<xref ref-type="bibr" rid="ref-23">23</xref>] reviewed recent advances in the identification and classification of fruit and vegetable disease by combining cutting-edge techniques such as machine learning and deep learning.</p>
<p>This study mainly uses the Naive Bayesian classification algorithm. This algorithm is based on the Bayesian theorem and is a simple and effective statistical classification method. It can effectively handle small datasets, has low sensitivity to missing data, and can maintain stable classification efficiency in multiple tasks. However, in real life, the assumption of independence of sample attributes generally does not hold [<xref ref-type="bibr" rid="ref-24">24</xref>,<xref ref-type="bibr" rid="ref-25">25</xref>]. To effectively address the limitations of this algorithm, Wang et al. [<xref ref-type="bibr" rid="ref-26">26</xref>] established a semi-supervised adaptive discriminative discretization framework for Bayes through pseudo-labeling techniques and adaptive discriminative discretization scheme. This significantly reduces the information loss in the discretization process and greatly improves the discriminative ability of the classifier. Wu et al. [<xref ref-type="bibr" rid="ref-27">27</xref>] proposed to introduce the cultural algorithm into the weighted naive bayesian classification algorithm, which can effectively solve the problem that the algorithm lacks conditional independence assumptions and label class selection strategies. Zhang et al. [<xref ref-type="bibr" rid="ref-28">28</xref>], in order to further mitigate the conditional independence assumption, proposed a new improved model called attribute and instance weighted Naive Bayes, which combines attribute weighting with instance weighting into one uniform framework. Similarly, Wang et al. [<xref ref-type="bibr" rid="ref-29">29</xref>] used a combination of PCA and Naive Bayes to classify motor faults.</p>
<p>The number of variables of highway bridge defects is large and the correlation between variables is high. Therefore, this paper proposes to combine the principal component analysis method with the Naive Bayesian classification algorithm. It not only reduces the dimensionality of the variables but also ensures the precondition that the feature variables are independent of each other in the Naive Bayesian classification algorithm. In summary, this paper utilizes examples of highway bridge defects to predict and classify the technical condition level of highway bridges. The extraction of important characteristic variables is simplified by performing Pearson correlation analysis between defects and bridge levels and defects. The intelligent diagnosis model of highway bridge technical conditions based on defect information using the PCA-NBC algorithm is established and compared with other algorithms. In-depth study of key defect variables affect the change of bridge grade, linking the correlation findings in order to assist bridge inspectors in predicting other disease conditions as well as the overall bridge technical condition.</p>
</sec>
<sec id="s2">
<label>2</label>
<title>Data Description</title>
<p>At present, the standard for evaluating highway bridges in China is the &#x2018;Standards for Technical Condition Evaluation of Highway Bridges&#x2019; [<xref ref-type="bibr" rid="ref-30">30</xref>] (hereinafter referred to as TCEB). <xref ref-type="table" rid="table-1">Table 1</xref> demonstrates the categorization of the technical condition levels of highway bridges, which are categorized into five levels according to the highway bridge assessment criteria, with Level 1 being the best and Level 5 being the worst. In this paper, the characteristic condition of 137 highway bridges with all the data of defects in Ningbo is collected and organized, and the characteristic condition of the bridges is obtained as <xref ref-type="fig" rid="fig-1">Fig. 1</xref>. There are 23 bridges of Level 1 (16.79%), 92 bridges of Level 2 (67.15%), and 22 bridges of Level 3 (16.06%). The selected highway bridges are all beam bridges, including 133 plate girder bridges (97.08%) and 4 box girder bridges (2.92%). According to the classification of bridge scale by span length, there are 71 middle bridges (single span between 20 and 40 meters) (51.82%) and 66 small bridges (single span less than 20 meters) (48.18%).</p>
<table-wrap id="table-1"><label>Table 1</label>
<caption>
<title>Classification of bridge technical conditions</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th valign="top" rowspan="2">Technical condition evaluation</th>
<th align="center" colspan="5">Category of bridge technical condition</th>
</tr>
<tr>
<th>Level 1</th>
<th>Level 2</th>
<th>Level 3</th>
<th>Level 4</th>
<th>Level 5</th>
</tr>
</thead>
<tbody>
<tr>
<td>Description of bridge state</td>
<td>New state</td>
<td>Minor damage</td>
<td>Medium damage</td>
<td>Larger damage</td>
<td>severe damage</td>
</tr>
<tr>
<td>D</td>
<td>[95,100]</td>
<td>[80,95]</td>
<td>[60,80]</td>
<td>[40,60]</td>
<td>[0,40]</td>
</tr>
</tbody>
</table>
</table-wrap><fig id="fig-1">
<label>Figure 1</label>
<caption>
<title>Characteristics of 137 bridges</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="SDHM_52683-fig-1.tif"/>
</fig>
<sec id="s2_1">
<label>2.1</label>
<title>Defect Coding</title>
<p>Prior to the correlation analysis of bridge defects, it was necessary to code for the bridge level as well as the percentage of all the defect variables present in the bridges in this study, as shown in <xref ref-type="table" rid="table-2">Tables 2</xref> and <xref ref-type="table" rid="table-3">3</xref>. A total of 41 defects were present in all the bridges in this study.</p>
<table-wrap id="table-2"><label>Table 2</label>
<caption>
<title>Naming of bridges and component</title></caption>
<table><colgroup>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Name</th>
<th>Numbering</th>
</tr>
</thead>
<tbody>
<tr>
<td>Assessment level of the bridge</td>
<td>G</td>
</tr>
<tr>
<td>Level of superstructure</td>
<td>G<sub>1</sub></td>
</tr>
<tr>
<td>Level of substructure</td>
<td>G<sub>2</sub></td>
</tr>
<tr>
<td>Level of bridge deck system</td>
<td>G<sub>3</sub></td>
</tr>
</tbody>
</table>
</table-wrap><table-wrap id="table-3"><label>Table 3</label>
<caption>
<title>Naming of bridge defects</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Bridge member</th>
<th>Name of bridge defects</th>
<th>Numbering</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="11">Superstructure</td>
<td>Spalling in upper load-bearing structure</td>
<td>V<sub>1-1</sub></td>
</tr>
<tr>
<td>Corrosion of reinforcement in upper load-bearing structure</td>
<td>V<sub>1-2</sub></td>
</tr>
<tr>
<td>Cracks in upper load-bearing structure</td>
<td>V<sub>1-3</sub></td>
</tr>
<tr>
<td>Water seepage in upper load-bearing structure</td>
<td>V<sub>1-4</sub></td>
</tr>
<tr>
<td>Spalling in upper general structure</td>
<td>V<sub>1-5</sub></td>
</tr>
<tr>
<td>Corrosion of reinforcement in upper general structure</td>
<td>V<sub>1-6</sub></td>
</tr>
<tr>
<td>Water seepage in upper general structure</td>
<td>V<sub>1-7</sub></td>
</tr>
<tr>
<td>Aging deterioration and cracking in plate bearing</td>
<td>V<sub>1-8</sub></td>
</tr>
<tr>
<td>Position stringing, voiding or shear overrun in plate bearing</td>
<td>V<sub>1-9</sub></td>
</tr>
<tr>
<td>Bearing buried</td>
<td>V<sub>1-10</sub></td>
</tr>
<tr>
<td>Honeycomb and scale in upper load-bearing structure</td>
<td>V<sub>1-11</sub></td>
</tr>
<tr>
<td rowspan="2">Substructure</td>
<td>Honeycomb and scale in bridge pier</td>
<td>V<sub>2-1</sub></td>
</tr>
<tr>
<td>Spalling and exposed reinforcement in bridge pier</td>
<td>V<sub>2-2</sub></td>
</tr>
<tr>
<td rowspan="17"></td>
<td>Corrosion of reinforcement in bridge pier</td>
<td>V<sub>2-3</sub></td>
</tr>
<tr>
<td>Abrasion in bridge pier</td>
<td>V<sub>2-4</sub></td>
</tr>
<tr>
<td>Cracks in bridge pier</td>
<td>V<sub>2-5</sub></td>
</tr>
<tr>
<td>Water seepage in bridge pier</td>
<td>V<sub>2-6</sub></td>
</tr>
<tr>
<td>Spalling and exposed reinforcement in bent cap and tie beam</td>
<td>V<sub>2-7</sub></td>
</tr>
<tr>
<td>Corrosion of reinforcement in bent cap and tie beam</td>
<td>V<sub>2-8</sub></td>
</tr>
<tr>
<td>Cracks in bent cap and tie beam</td>
<td>V<sub>2-9</sub></td>
</tr>
<tr>
<td>Water seepage in bent cap and tie beam</td>
<td>V<sub>2-10</sub></td>
</tr>
<tr>
<td>Spalling in abutment</td>
<td>V<sub>2-11</sub></td>
</tr>
<tr>
<td>Cracks in abutment</td>
<td>V<sub>2-12</sub></td>
</tr>
<tr>
<td>Water seepage in abutment</td>
<td>V<sub>2-13</sub></td>
</tr>
<tr>
<td>Water seepage in abutment cap</td>
<td>V<sub>2-14</sub></td>
</tr>
<tr>
<td>Damage in abutment cap</td>
<td>V<sub>2-15</sub></td>
</tr>
<tr>
<td>Cracks in abutment cap</td>
<td>V<sub>2-16</sub></td>
</tr>
<tr>
<td>Damage in wing wall and ear wall</td>
<td>V<sub>2-17</sub></td>
</tr>
<tr>
<td>Water seepage in wing wall and ear wall</td>
<td>V<sub>2-18</sub></td>
</tr>
<tr>
<td>Defect in cone slope and slope protection</td>
<td>V<sub>2-19</sub></td>
</tr>
<tr>
<td rowspan="11">Bridge deck system</td>
<td>Damage in deck pavement</td>
<td>V<sub>3-1</sub></td>
</tr>
<tr>
<td>Cracks in deck pavement</td>
<td>V<sub>3-2</sub></td>
</tr>
<tr>
<td>Damage in expansion and contraction joint</td>
<td>V<sub>3-3</sub></td>
</tr>
<tr>
<td>Lose efficacy in expansion and contraction joint</td>
<td>V<sub>3-4</sub></td>
</tr>
<tr>
<td>Damage in railing and guardrail</td>
<td>V<sub>3-5</sub></td>
</tr>
<tr>
<td>Defect of waterproof and drainage discharge pipe and diversion tank</td>
<td>V<sub>3-6</sub></td>
</tr>
<tr>
<td>Deformation in deck pavement</td>
<td>V<sub>3-7</sub></td>
</tr>
<tr>
<td>Poor drainage of waterproof and drainage system</td>
<td>V<sub>3-8</sub></td>
</tr>
<tr>
<td>Defilement or damage in lighting and sign</td>
<td>V<sub>3-9</sub></td>
</tr>
<tr>
<td>Fall off and missing in lighting and sign</td>
<td>V<sub>3-10</sub></td>
</tr>
<tr>
<td>Sidewalk damage</td>
<td>V<sub>3-11</sub></td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Next, the defect needs to be unified. From the TCEB, depending on the highest level that can be achieved by the test indicator, points need to be deducted accordingly for the different indicator categories of the component, as shown in <xref ref-type="table" rid="table-4">Table 4</xref>. A defect index level is 1, indicating that the component has no such defects. The index level is 2, indicating that the defect has little effect on the bridge and has no effect on the use function of the bridge. The index level is 3, indicating that the component has a moderate defect, and the bridge can still maintain normal function.</p>
<table-wrap id="table-4"><label>Table 4</label>
<caption>
<title>The deduction score of each detection index of a component</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>The highest level</th>
<th align="center" colspan="5">Category of indicator</th>
</tr>
<tr>
<th>That a detection index can reach</th>
<th>Level 1</th>
<th>Level 2</th>
<th>Level 3</th>
<th>Level 4</th>
<th>Level 5</th>
</tr>
</thead>
<tbody>
<tr>
<td>Level 3</td>
<td>0</td>
<td>20</td>
<td>35</td>
<td>&#x2014;</td>
<td>&#x2014;</td>
</tr>
<tr>
<td>Level 4</td>
<td>0</td>
<td>25</td>
<td>40</td>
<td>50</td>
<td>&#x2014;</td>
</tr>
<tr>
<td>Level 5</td>
<td>0</td>
<td>35</td>
<td>45</td>
<td>60</td>
<td>100</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In the highway bridges in service, the defect levels existing in the components are mainly Level 2. Therefore, to facilitate the determination of the effect of bridge defects on bridge level. Based on the deduction score of each detection index in the TCEB, for all defects, grades other than Level 2 were converted to Level 2 with a deduction as a percentage of the grade. For example, assuming that the highest level achievable for a defect is Level 5. let the number of Level 1s for that indicator be <italic>n</italic><sub>1</sub>, the number of Level 2s be <italic>n</italic><sub>2</sub>, and so on. Then the number of Level 2 converted by the defect is then:<disp-formula id="eqn-1"><label>(1)</label>
<mml:math id="mml-eqn-1" display="block"><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mn>2</mml:mn></mml:msub></mml:mrow><mml:mo>+</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mn>45</mml:mn></mml:mrow><mml:mrow><mml:mn>35</mml:mn></mml:mrow></mml:mfrac></mml:mrow><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mn>3</mml:mn></mml:msub></mml:mrow><mml:mo>+</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mn>60</mml:mn></mml:mrow><mml:mrow><mml:mn>35</mml:mn></mml:mrow></mml:mfrac></mml:mrow><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mn>4</mml:mn></mml:msub></mml:mrow><mml:mo>+</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mn>100</mml:mn></mml:mrow><mml:mrow><mml:mn>35</mml:mn></mml:mrow></mml:mfrac></mml:mrow><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mn>5</mml:mn></mml:msub></mml:mrow></mml:mstyle></mml:mstyle></mml:mstyle></mml:math>
</disp-formula></p>
<p>In addition, the obtained data need to be normalized. If the number of a particular defect on a bridge component is <italic>n</italic>, the number of total members of that bridge component is <italic>N</italic>, and the percentage of bridge defects is <italic>P</italic>, then:<disp-formula id="eqn-2"><label>(2)</label>
<mml:math id="mml-eqn-2" display="block"><mml:mi>P</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mi>n</mml:mi><mml:mi>N</mml:mi></mml:mfrac></mml:mrow></mml:mstyle></mml:math>
</disp-formula></p>
</sec>
<sec id="s2_2">
<label>2.2</label>
<title>Correlation Analysis</title>
<p>This section examines the correlation between the presence of defects and defects in each structure, and the correlation between the defects and the technical condition of the bridge components, in order to provide a basis for the prediction of the technical condition of highway bridges. The results of the Pearson correlation calculation based on the above data are shown below:</p>
<p>The following conclusions can be drawn from the above correlation results:
<list list-type="order"><list-item>
<p>From <xref ref-type="fig" rid="fig-2">Fig. 2a</xref>, the level of superstructure (G<sub>1</sub>) has the highest correlation with cracks (V<sub>1-3</sub>) with a Pearson value of 0.7.</p>
</list-item><list-item>
<p>Due to the large number of components and defects in the substructure, the defects are more dispersed. Therefore, in <xref ref-type="fig" rid="fig-2">Fig. 2b</xref>, the level of substructure (G<sub>2</sub>) is less relevant to its defects.</p>
</list-item><list-item>
<p>In <xref ref-type="fig" rid="fig-2">Fig. 2c</xref>, the level of bridge deck system (G<sub>3</sub>) is positively correlated with cracks in deck pavement (V<sub>3-2</sub>) with a Pearson value of 0.5. This is due to the fact that the bridge deck system carries the largest weight, so its defects have a high correlation with the level of the bridge deck system.</p>
</list-item><list-item>
<p>The Pearson value between corrosion of reinforcement and cracks in superstructures is 0.5, which is marked with a blue circle in <xref ref-type="fig" rid="fig-2">Fig. 2a</xref>. When the bridge is in an open or wet environment, the corrosion of reinforcement at the cracks will be more serious. The corrosion of reinforcement will further lead to the enlargement of cracks, which ultimately affects the bearing capacity and durability of the bridge.</p>
</list-item><list-item>
<p>The Pearson value between spalling and corrosion of reinforcement in the superstructure is 0.6, which is marked with a green circle in <xref ref-type="fig" rid="fig-2">Fig. 2a</xref>. This is due to the compactness of the concrete is not enough and the thickness of the protective layer is insufficient or damaged, such as honeycomb, damage, spalling, cracks, etc. These defects make the steel bar directly exposed to external conditions, resulting in corrosion. Especially under the conditions of open air, humid environment, and dry-wet alternation, the corrosion rate of unprotected steel bars will be faster.</p>
</list-item></list></p>
<fig id="fig-2">
<label>Figure 2</label>
<caption>
<title>Correlation analysis. (a) The correlation between the level of the superstructure and its existing defect variables. (b) The correlation between the level of substructure and its existing defect variables. (c) The correlation between the level of the bridge deck system and its existing defect variables</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="SDHM_52683-fig-2.tif"/>
</fig>
<p>In summary, bridge defects often do not occur independently, but rather interact to affect the overall condition of the bridge. The bridge defect variables were screened based on the above correlation studies and the Weight value of each component of the beam bridge (<xref ref-type="table" rid="table-5">Table 5</xref>) in the TCEB. Remove components with low weight values: wing wall and ear wall, cone slope and slope protection, sidewalk, lighting, and signs. In the selected samples, there are no defects in the pier foundation, river bed, or modulation structure. Finally, a total of 30 variables are obtained, as shown in <xref ref-type="table" rid="table-6">Table 6</xref>.</p>
<table-wrap id="table-5"><label>Table 5</label>
<caption>
<title>Weight value of each component of the beam bridge</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Bridge member</th>
<th>Evaluated components</th>
<th>Weight</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="3">Superstructure</td>
<td>Upper load-bearing structure (main beam, hanging beam)</td>
<td>0.280</td>
</tr>
<tr>
<td>Upper general structure (wet joint, diaphragm, etc.)</td>
<td>0.072</td>
</tr>
<tr>
<td>Bearing</td>
<td>0.048</td>
</tr>
<tr>
<td rowspan="5">Substructure</td>
<td>Wing wall and ear wall</td>
<td>0.008</td>
</tr>
<tr>
<td>Cone slope and slope protection</td>
<td>0.004</td>
</tr>
<tr>
<td>Bridge pier</td>
<td>0.120</td>
</tr>
<tr>
<td>Abutment</td>
<td>0.120</td>
</tr>
<tr>
<td>Pier foundation</td>
<td>0.112</td>
</tr>
<tr>
<td rowspan="2"></td>
<td>River bed</td>
<td>0.028</td>
</tr>
<tr>
<td>Modulation structure</td>
<td>0.008</td>
</tr>
<tr>
<td rowspan="6">Bridge deck system</td>
<td>Deck pavement</td>
<td>0.080</td>
</tr>
<tr>
<td>Expansion and contraction joint</td>
<td>0.050</td>
</tr>
<tr>
<td>Sidewalk</td>
<td>0.020</td>
</tr>
<tr>
<td>Railing and guardrail</td>
<td>0.020</td>
</tr>
<tr>
<td>Drainage system</td>
<td>0.020</td>
</tr>
<tr>
<td>Lighting and signs</td>
<td>0.010</td>
</tr>
</tbody>
</table>
</table-wrap><table-wrap id="table-6"><label>Table 6</label>
<caption>
<title>Indicators of bridge defects after screening</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Bridge member</th>
<th>Name of bridge defects</th>
<th>Numbering</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="10">Superstructure</td>
<td>Spalling in upper load-bearing structure</td>
<td>V<sub>1-1</sub></td>
</tr>
<tr>
<td>Corrosion of reinforcement in upper load-bearing structure</td>
<td>V<sub>1-2</sub></td>
</tr>
<tr>
<td>Cracks in upper load-bearing structure</td>
<td>V<sub>1-3</sub></td>
</tr>
<tr>
<td>Water seepage in upper load-bearing structure</td>
<td>V<sub>1-4</sub></td>
</tr>
<tr>
<td>Spalling in upper general structure</td>
<td>V<sub>1-5</sub></td>
</tr>
<tr>
<td>Corrosion of reinforcement in upper general structure</td>
<td>V<sub>1-6</sub></td>
</tr>
<tr>
<td>Water seepage in upper general structure</td>
<td>V<sub>1-7</sub></td>
</tr>
<tr>
<td>Aging deterioration and cracking in plate bearing</td>
<td>V<sub>1-8</sub></td>
</tr>
<tr>
<td>Position stringing, voiding or shear overrun in plate bearing</td>
<td>V<sub>1-9</sub></td>
</tr>
<tr>
<td>Bearing buried</td>
<td>V<sub>1-10</sub></td>
</tr>
<tr>
<td rowspan="14">Substructure</td>
<td>Honeycomb and scale in bridge pier</td>
<td>V<sub>2-1</sub></td>
</tr>
<tr>
<td>Spalling and exposed reinforcement in bridge pier</td>
<td>V<sub>2-2</sub></td>
</tr>
<tr>
<td>Corrosion of reinforcement in bridge pier</td>
<td>V<sub>2-3</sub></td>
</tr>
<tr>
<td>Abrasion in bridge pier</td>
<td>V<sub>2-4</sub></td>
</tr>
<tr>
<td>Cracks in bridge pier</td>
<td>V<sub>2-5</sub></td>
</tr>
<tr>
<td>Water seepage in bridge pier</td>
<td>V<sub>2-6</sub></td>
</tr>
<tr>
<td>Spalling and exposed reinforcement in bent cap and tie beam</td>
<td>V<sub>2-7</sub></td>
</tr>
<tr>
<td>Corrosion of reinforcement in bent cap and tie beam</td>
<td>V<sub>2-8</sub></td>
</tr>
<tr>
<td>Cracks in bent cap and tie beam</td>
<td>V<sub>2-9</sub></td>
</tr>
<tr>
<td>Water seepage in bent cap and tie beam</td>
<td>V<sub>2-10</sub></td>
</tr>
<tr>
<td>Spalling in abutment</td>
<td>V<sub>2-11</sub></td>
</tr>
<tr>
<td>Cracks in abutment</td>
<td>V<sub>2-12</sub></td>
</tr>
<tr>
<td>Water seepage in abutment</td>
<td>V<sub>2-13</sub></td>
</tr>
<tr>
<td>Water seepage in abutment cap</td>
<td>V<sub>2-14</sub></td>
</tr>
<tr>
<td rowspan="6">Bridge deck system</td>
<td>Damage in deck pavement</td>
<td>V<sub>3-1</sub></td>
</tr>
<tr>
<td>Cracks in deck pavement</td>
<td>V<sub>3-2</sub></td>
</tr>
<tr>
<td>Damage in expansion and contraction joint</td>
<td>V<sub>3-3</sub></td>
</tr>
<tr>
<td>Lose efficacy in expansion and contraction joint</td>
<td>V<sub>3-4</sub></td>
</tr>
<tr>
<td>Damage in railing and guardrail</td>
<td>V<sub>3-5</sub></td>
</tr>
<tr>
<td>Defect of waterproof and drainage discharge pipe and diversion tank</td>
<td>V<sub>3-6</sub></td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec id="s3">
<label>3</label>
<title>Intelligent Diagnosis of Technical Condition of Highway Bridges</title>
<p>In order to realize intelligent diagnosis of the technical condition of highway bridges based on bridge defects, it is necessary to have a clear understanding of the mapping relationship between the screened defect data and the level of bridge technical condition. On this basis, this paper proposes to combine Principal Component Analysis with Naive Bayesian Classification. This method not only improves the efficiency and accuracy of discrimination but also eliminates the influence of correlation between bridge defect indicators by using principal component analysis, which makes the data conform to the law of normal distribution. At the same time, it is able to compress and reduce the dimensionality of the raw data information to extract the main factors of bridge defect indicators. Thus, an intelligent diagnostic model for the technical condition of highway bridges based on defect information using the PCA-NBC algorithm is established.</p>
<sec id="s3_1">
<label>3.1</label>
<title>Naive Bayesian Classification</title>
<p>Naive Bayesian Classification (NBC) is a classification method based on the assumption of independence between Bayes theorem and feature conditions. Compared with the original method, the calculation difficulty is simpler, and the classification process is simple and efficient. For a given training dataset, first, the joint probability distribution from input to output is learned based on the assumption of conditional independence of features. Then, based on the learned model, the input x is input, and the output y with the largest posterior probability is found using Bayes&#x2019; theorem.</p>
<p>Set the sample data set <inline-formula id="ieqn-1">
<mml:math id="mml-ieqn-1"><mml:mi>D</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mi>d</mml:mi><mml:mn>1</mml:mn></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:msub><mml:mi>d</mml:mi><mml:mn>2</mml:mn></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>,</mml:mo><mml:mrow><mml:msub><mml:mi>d</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:math>
</inline-formula>, the feature attribute set of the corresponding sample data is <inline-formula id="ieqn-2">
<mml:math id="mml-ieqn-2"><mml:mi>X</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn>1</mml:mn></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn>2</mml:mn></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>d</mml:mi></mml:msub></mml:mrow></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:math>
</inline-formula>, the class variable is <inline-formula id="ieqn-3">
<mml:math id="mml-ieqn-3"><mml:mi>Y</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn>1</mml:mn></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mn>2</mml:mn></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>,</mml:mo><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:math>
</inline-formula>, Then <italic>D</italic> can be divided into <italic>y</italic><sub>m</sub> categories. <inline-formula id="ieqn-4">
<mml:math id="mml-ieqn-4"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn>1</mml:mn></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn>2</mml:mn></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>d</mml:mi></mml:msub></mml:mrow></mml:math>
</inline-formula> are independent and random, the prior probability of <italic>Y</italic> is <inline-formula id="ieqn-5">
<mml:math id="mml-ieqn-5"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>p</mml:mi><mml:mi>r</mml:mi><mml:mi>i</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>=</mml:mo><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>Y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math>
</inline-formula>, and the posterior probability of <italic>Y</italic> is <inline-formula id="ieqn-6">
<mml:math id="mml-ieqn-6"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>p</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>=</mml:mo><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>Y</mml:mi><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>X</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math>
</inline-formula>. The posterior probability is calculated by the naive Bayesian algorithm:<disp-formula id="eqn-3"><label>(3)</label>
<mml:math id="mml-eqn-3" display="block"><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>Y</mml:mi><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>X</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>Y</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>X</mml:mi><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>Y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>X</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac></mml:mrow></mml:mstyle></mml:math>
</disp-formula></p>
<p>Then the class with the largest posterior probability is recorded as the prediction class, that is, <inline-formula id="ieqn-7">
<mml:math id="mml-ieqn-7"><mml:mi>arg</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mo movablelimits="true" form="prefix">max</mml:mo><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>Y</mml:mi><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>X</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math>
</inline-formula>.</p>
<p>Naive Bayes is based on the independence between features. In the case of a given category <italic>y</italic>, the above equation can be further expressed as the following:<disp-formula id="eqn-4"><label>(4)</label>
<mml:math id="mml-eqn-4" display="block"><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>X</mml:mi><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>Y</mml:mi><mml:mo>=</mml:mo><mml:mi>y</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">&#x220F;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>d</mml:mi></mml:munderover><mml:mrow><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>Y</mml:mi><mml:mo>=</mml:mo><mml:mi>y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>From the above two formulas, the posterior probability can be calculated as:<disp-formula id="eqn-5"><label>(5)</label>
<mml:math id="mml-eqn-5" display="block"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>p</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>=</mml:mo><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>Y</mml:mi><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>X</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>Y</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:munderover><mml:mo movablelimits="false">&#x220F;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>d</mml:mi></mml:munderover><mml:mrow><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>Y</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>X</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac></mml:mrow></mml:mstyle></mml:math>
</disp-formula></p>
<p>Since the size of <italic>P(X)</italic> is fixed, it is only necessary to compare the molecular part of the above formula when comparing the posterior probability. Therefore, we can get a Naive Bayesian calculation formula that the sample data belongs to the category <italic>y</italic><sub><italic>i</italic></sub>:<disp-formula id="eqn-6"><label>(6)</label>
<mml:math id="mml-eqn-6" display="block"><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn>1</mml:mn></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mn>2</mml:mn></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mo>&#x22EF;</mml:mo><mml:mo>,</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>d</mml:mi></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo><mml:munderover><mml:mo movablelimits="false">&#x220F;</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>d</mml:mi></mml:munderover><mml:mrow><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:munderover><mml:mo movablelimits="false">&#x220F;</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>d</mml:mi></mml:munderover><mml:mrow><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac></mml:mrow></mml:mstyle></mml:math>
</disp-formula></p>
<p>Thus, the naive Bayesian classifier can be expressed as:<disp-formula id="eqn-7"><label>(7)</label>
<mml:math id="mml-eqn-7" display="block"><mml:mi>f</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:munder><mml:mrow><mml:mi>arg</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mo movablelimits="true" form="prefix">max</mml:mo></mml:mrow><mml:mrow><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mrow></mml:munder><mml:mo>&#x2061;</mml:mo><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>X</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo><mml:munderover><mml:mo movablelimits="false">&#x220F;</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>d</mml:mi></mml:munderover><mml:mrow><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:munderover><mml:mo movablelimits="false">&#x220F;</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>d</mml:mi></mml:munderover><mml:mrow><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow></mml:mfrac></mml:mrow></mml:mstyle></mml:math>
</disp-formula></p>
<p>In practice, the appropriate model can be selected according to whether the data characteristics are continuous or not. When the features are continuous variables, the Gaussian model is used. When the features are discrete variables, a polynomial model is used; if the features are discrete, they are of Boolean type, i.e., true and false, and a Bernoulli model can be used. Since the data in this paper are continuous variables, a Gaussian model is used.</p>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>PCA-NBC Algorithm Steps</title>
<p>Because the hypothesis of the independence of the feature attributes of the NBC algorithm is often difficult to establish, it often results in suboptimal classification outcomes. Therefore, this paper combines the PCA with the NBC algorithm. PCA is one of the most widely used data dimensionality reduction algorithms. This method reduces the dimension of the original features, retains some of the most important features, and reduces the correlation between the feature variables while ensuring the minimum loss of information. By incorporating PCA, it becomes possible to satisfy the independence assumption of NBC and thereby enhance classification accuracy. On this basis, an intelligent diagnostic model for the technical condition of highway bridges is established. The steps are as follows, and the flowchart of the PCA-NBC algorithm is shown in <xref ref-type="fig" rid="fig-3">Fig. 3</xref>.</p>
<fig id="fig-3">
<label>Figure 3</label>
<caption>
<title>PCA-NBC algorithm flow</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="SDHM_52683-fig-3.tif"/>
</fig>
<p>Input: sample data set D, including feature attribute set X and class variable Y.</p><p>Step 1: According to the basic structure of the bridge, the data of defect variables were analyzed.</p>
<p>Step 2: According to the basic principle of PCA, appropriate feature variables are selected to form a new comprehensive feature attribute set.</p>
<p>Step 3: Validation of the dataset using 5-fold-cross validation.</p>
<p>Step 4: Establish the intelligent diagnosis model of highway bridge technical condition based on defect information using the PCA-NBC algorithm.</p><p>Step 5: The level of the sample to be tested is taken as the one with the largest <italic>a posteriori</italic> probability.</p>
<p>Step 6: Calculate the accuracy of model discrimination (ACC) according to the confusion matrix obtained.</p><p>Output: The ACC of the training set and the samples to be tested.</p>
<p>In addition to that, in order to verify the performance of the proposed model, ACC and F1 Score are selected as the evaluation metrics in this paper. Since this paper is solving a three-category problem, Macro-F1 is used in this paper, i.e., TP, FP, FN, and TN of each category are counted, and their respective Precision and Recall are calculated to get their respective F1 Scores, and then the average is taken to get Macro-F1. According to the confusion matrix (<xref ref-type="fig" rid="fig-4">Fig. 4</xref>), the calculation formula can be obtained as shown below:</p>
<p><disp-formula id="eqn-8"><label>(8)</label>
<mml:math id="mml-eqn-8" display="block"><mml:mi>A</mml:mi><mml:mi>C</mml:mi><mml:mi>C</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>T</mml:mi><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>N</mml:mi><mml:mo>+</mml:mo><mml:mi>T</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mfrac></mml:mrow></mml:mstyle></mml:math>
</disp-formula><disp-formula id="eqn-9"><label>(9)</label>
<mml:math id="mml-eqn-9" display="block"><mml:mi>P</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>i</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:mfrac></mml:mrow></mml:mstyle></mml:math>
</disp-formula><disp-formula id="eqn-10"><label>(10)</label>
<mml:math id="mml-eqn-10" display="block"><mml:mi>R</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi><mml:mi>l</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mfrac></mml:mrow></mml:mstyle></mml:math>
</disp-formula><disp-formula id="eqn-11"><label>(11)</label>
<mml:math id="mml-eqn-11" display="block"><mml:mi>F</mml:mi><mml:mn>1</mml:mn><mml:mrow></mml:mrow><mml:mtext>&#x00A0;</mml:mtext><mml:mi>S</mml:mi><mml:mi>c</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mn>2</mml:mn><mml:mo>&#x00D7;</mml:mo><mml:mi>P</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>i</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mo>&#x00D7;</mml:mo><mml:mi>R</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi><mml:mi>l</mml:mi></mml:mrow><mml:mrow><mml:mi>P</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>i</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mi>R</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:mfrac></mml:mrow></mml:mstyle></mml:math>
</disp-formula></p>
<fig id="fig-4">
<label>Figure 4</label>
<caption>
<title>Confusion matrix</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="SDHM_52683-fig-4.tif"/>
</fig>
</sec>
</sec>
<sec id="s4">
<label>4</label>
<title>Algorithm Validation</title>
<sec id="s4_1">
<label>4.1</label>
<title>Verification and Comparison</title>
<p>According to the established intelligent diagnostic model for the technical condition of highway bridges, the filtered variable data are input. According to steps 2 of <xref ref-type="sec" rid="s3_2">Section 3.2</xref>, a set of comprehensive feature attribute sets can be obtained. The number of characteristic variables is reduced to 21, which is named as <italic>x</italic><sub>1</sub>, <italic>x</italic><sub>2</sub>, <italic>x</italic><sub>3</sub>,&#x2026;, <italic>x</italic><sub>21</sub>. In turn, the degree of correlation of the comprehensive feature attribute set and whether the data are normally distributed are determined. From <xref ref-type="fig" rid="fig-5">Figs. 5</xref> and <xref ref-type="fig" rid="fig-6">6</xref>, the Pearson values between the characteristic attributes and after calculating the absolute values for them, the average value was found to be 0.15. And, the data conforms to a normal distribution, then the model can be solved using a Gaussian model.</p>
<fig id="fig-5">
<label>Figure 5</label>
<caption>
<title>Correlation graph of the variables</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="SDHM_52683-fig-5.tif"/>
</fig><fig id="fig-6">
<label>Figure 6</label>
<caption>
<title>Normal distribution diagram of 21 variables</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="SDHM_52683-fig-6.tif"/>
</fig>
<p>Establish the intelligent diagnostic model of the technical condition of highway bridges according to <xref ref-type="sec" rid="s3_2">Section 3.2</xref>, and determine the level of the technical condition of the samples to be tested. Since the model uses cross-validation for data categorization and randomly draws training samples, the ACC obtained from each test is slightly different. Therefore, 100 tests were conducted on the same set of data and averaged to get the final classification accuracy as shown in <xref ref-type="table" rid="table-7">Table 7</xref>. The highest value of ACC for bridge level discrimination was 95.62% and the lowest value was 90.51%, resulting in a final classification accuracy average of 93.50%.</p>
<table-wrap id="table-7"><label>Table 7</label>
<caption>
<title>Model ACC results</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Test times</th>
<th>1</th>
<th>2</th>
<th>3</th>
<th>&#x2026;</th>
<th>98</th>
<th>99</th>
<th>100</th>
<th>Average value</th>
</tr>
</thead>
<tbody>
<tr>
<td>Accuracy/%</td>
<td>94.16</td>
<td>95.62</td>
<td>91.97</td>
<td>&#x2026;</td>
<td>91.97</td>
<td>90.51</td>
<td>94.89</td>
<td>93.50</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>To explore whether the NBC algorithm is optimal, the decision tree algorithm, SVM algorithm, Fisher algorithm, and BP algorithm were also tested for their ability to intelligently diagnose the technical condition of bridges. All were tested 100 times and the results are shown in <xref ref-type="table" rid="table-8">Table 8</xref> and <xref ref-type="fig" rid="fig-7">Fig. 7</xref>. The average classification ACC of DT algorithm, SVM algorithm, Fisher algorithm, and BP algorithm are 92.39%, 88.52%, 81.80%, and 74.07%. In addition, <xref ref-type="table" rid="table-8">Table 8</xref> shows the F1 Score of these algorithms. The F1 Score of the model in this paper presents a better performance. In comparison, the PCA-NBC algorithm classifies better.</p>
<table-wrap id="table-8"><label>Table 8</label>
<caption>
<title>Comparison of evaluation criteria for algorithms</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Algorithms</th>
<th>PCA-NBC</th>
<th>DT</th>
<th>SVM</th>
<th>Fisher</th>
<th>BP</th>
</tr>
</thead>
<tbody>
<tr>
<td>ACC (%)</td>
<td>93.50</td>
<td>92.39</td>
<td>88.52</td>
<td>81.80</td>
<td>74.07</td>
</tr>
<tr>
<td>F1 Score</td>
<td>0.94</td>
<td>0.90</td>
<td>0.84</td>
<td>0.92</td>
<td>0.81</td>
</tr>
</tbody>
</table>
</table-wrap><fig id="fig-7">
<label>Figure 7</label>
<caption>
<title>ACC comparison chart</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="SDHM_52683-fig-7.tif"/>
</fig>
<p>The test results show that this method possesses high accuracy and is more applicable to the practical situation of this paper. Due to the limitations of the bridge data selected in this paper in terms of geography, sample size, and bridge type. The generalization ability of this method can be improved by appropriately adapting this method to other bridge data.</p>
</sec>
<sec id="s4_2">
<label>4.2</label>
<title>Discussion</title>
<p>In order to further explore the relationship between bridge defects and the level of bridge technical condition, this paper plotted the variation of the percentage P of the defect variables for the three levels of bridges for the 30 defect variables after screening (<xref ref-type="fig" rid="fig-8">Fig. 8</xref>). The similarities and differences among different classes of bridges in multiple defect variables can be clearly found through the images. In comparing these three levels of bridges, the changes in defects present in the superstructure and substructure were most pronounced, while the changes in defects in the bridge deck system showed similarity. This finding is consistent with the performance of the weights of the components of the TCEB, i.e., both the superstructure and substructure accounted for weight values greater than the weight values of the bridge deck system. Also, since this is an analysis of overall bridges and defects, the degree of influence of changes in individual variables differs somewhat from the correlation analysis above. For example, the V<sub>1-5</sub> variable in the correlation analysis Person is 0.3. and in this figure, it can be seen that this variable has a greater variation in the category 2 bridge data than in the category 1 and 3 bridge data. There is some inconsistency between the two.</p>
<fig id="fig-8">
<label>Figure 8</label>
<caption>
<title>Plot of changes in defect variables for three levels of bridges. (a) Variation diagram of defect variables of Level 1 bridges. (b) Variation diagram of defect variables of Level 2 bridges. (c) Variation diagram of defect variables of Level 3 bridges</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="SDHM_52683-fig-8.tif"/>
</fig>
<p>In summary, the defect variable V<sub>1-1</sub>, V<sub>1-2</sub>, V<sub>1-3</sub>, V<sub>1-4</sub>, V<sub>1-7</sub>, V<sub>1-8</sub>, V<sub>2-2</sub>, V<sub>2-3</sub>, V<sub>2-4</sub>, V<sub>2-7</sub>, and V<sub>2-9</sub> have undergone significant changes, as shown in <xref ref-type="fig" rid="fig-8">Fig. 8</xref>. Calculate the difference between the three levels based on the <italic>p</italic>-value of the variable in <xref ref-type="fig" rid="fig-9">Fig. 9</xref>, and then take the average value. The priority bridge defect types with mean values greater than 0.19 were selected as V<sub>1-2</sub>, V<sub>1-3</sub>, V<sub>1-7</sub>, V<sub>2-2</sub>, V<sub>2-3</sub>, V<sub>2-7</sub>, and V<sub>2-9</sub>. <xref ref-type="table" rid="table-9">Table 9</xref> documents the variation in peak P values for these key defect variables across three levels. Clearly, high-risk defects are concentrated in the upper load-bearing structure, upper general structure, and bridge pier. Attention should be focused on issues such as cracks, water seepage, spalling, exposed reinforcement, and corrosion of reinforcement in these components. Furthermore, by utilizing the correlation between defects, it is possible to predict other potential issues and evaluate the overall technical condition of the bridge. In addition, in TCEB, water seepage defects are not included in the standard. However, water seepage defects have a greater impact on bridges in actual inspections, and further adjustments to the specifications are needed.</p>
<fig id="fig-9">
<label>Figure 9</label>
<caption>
<title>Comparison of extracted bridge defect variables</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="SDHM_52683-fig-9.tif"/>
</fig><table-wrap id="table-9"><label>Table 9</label>
<caption>
<title>Indicators of key defects in the technical condition of highway bridges</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th rowspan="2">Name of bridge defects</th>
<th align="center" colspan="3">Range of data change</th>
</tr>
<tr>
<th>Level 1 (P peak)</th>
<th>Level 2 (P peak)</th>
<th>Level 3 (P peak)</th>
</tr>
</thead>
<tbody>
<tr>
<td>Corrosion of reinforcement in upper load-bearing structure (V<sub>1-2</sub>)</td>
<td>0.04</td>
<td>0.06</td>
<td>0.33</td>
</tr>
<tr>
<td>Cracks in upper load-bearing structure (V<sub>1-3</sub>)</td>
<td>0.01</td>
<td>0.38</td>
<td>1.00</td>
</tr>
<tr>
<td>Water seepage in upper general structure (V<sub>1-7</sub>)</td>
<td>0.01</td>
<td>0.54</td>
<td>1.00</td>
</tr>
<tr>
<td>Spalling and exposed reinforcement in bridge pier (V<sub>2-2</sub>)</td>
<td>0.00</td>
<td>0.65</td>
<td>1.00</td>
</tr>
<tr>
<td>Corrosion of reinforcement in bridge pier (V<sub>2-3</sub>)</td>
<td>0.00</td>
<td>0.13</td>
<td>0.29</td>
</tr>
<tr>
<td>Spalling and exposed reinforcement in bent cap and tie beam (V<sub>2-7</sub>)</td>
<td>0.03</td>
<td>0.25</td>
<td>0.50</td>
</tr>
<tr>
<td>Cracks in bent cap and tie beam (V<sub>2-9</sub>)</td>
<td>0.00</td>
<td>0.11</td>
<td>0.50</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec id="s5">
<label>5</label>
<title>Conclusions</title>
<p>This study conducted a disease correlation analysis based on disease detection information and the technical condition level of more than 100 bridges. Subsequently, an intelligent diagnostic model for the technical condition of highway bridges, employing defect information, was established. The following conclusions were predominantly reached:</p>
<p>(1) Analysis of the correlation between defects and component levels reveals that the superstructure level exhibits the highest correlation with cracks. The number of components and defects in the substructure is high, and the degree of correlation between the existing defects is low. The level of the bridge deck system is highly correlated with cracks in the deck pavement. At the same time, it was found that the three types of defects in bridges, spalling, corrosion of reinforcement, and cracks, interact with each other, which is consistent with the actual causes and developmental patterns of bridge defects.</p>
<p>(2) Considering the large number and complex composition of bridge defect data, an intelligent diagnostic model for the technical condition of highway bridges using the PCA-NBC algorithm is proposed. Through the research and testing of the model, it is obtained that the accuracy of bridge category discrimination is 93.50% on average. And compared with DT, SVM, Fisher, and BP multiple algorithms, the highest improvement is 19.43%. And the F1 Score reaches 0.94. The model is more robust and works best for the actual situation of this paper. The method can assist bridge inspectors and even non-professionals to have a better understanding of bridge defect conditions.</p>
<p>(3) By plotting the variation of the defect variables for the three levels of bridges, the key defect variables affecting the change in the level of the bridges were obtained. Namely, cracks and corrosion of reinforcement in the upper load-bearing structure; water seepage in the upper general structure; spalling, exposed reinforcement and corrosion of reinforcement in the bridge pier; spalling, exposed reinforcement, and cracks in bent cap and tie beam. This needs to be focused on by bridge inspectors and also helps in the initial determination of the technical condition level of the bridge.</p>
<p>(4) The bridge technical condition level discrimination model proposed in this paper has some limitations in terms of sample size, geographical area, bridge type, and bridge category. When other bridge data are substituted, the results may have a slight bias. Therefore, adjustments need to be made to the model to address the research errors stemming from data variability.</p>
</sec>
</body>
<back>
<ack>
<p>None.</p>
</ack>
<sec>
<title>Funding Statement</title>
<p>This research was financially supported by the National Natural Science Foundation of China (No. 51808301), the Scientific Research Fund of Zhejiang Provincial Education Department (No. Y202248860), and the National &#x201C;111&#x201D; Centre on Safety and Intelligent Operation of Sea Bridge (D21013).</p>
</sec>
<sec>
<title>Author Contributions</title>
<p>Study conception and design: Yanxue Ma, Xiaoling Liu; data collection: Bing Wang; analysis and interpretation of results: Ying Liu, Yanxue Ma, Xiaoling Liu; draft manuscript preparation: Xiaoling Liu, Yanxue Ma. All authors reviewed the results and approved the final version of the manuscript.</p>
</sec>
<sec sec-type="data-availability">
<title>Availability of Data and Materials</title>
<p>Most data and models generated and used during the study appear in the published article. However, some information is proprietary or confidential in nature and may only be provided with restrictions (e.g., bridge defect data).</p>
</sec>
<sec>
<title>Ethics Approval</title>
<p>Not applicable.</p>
</sec>
<sec sec-type="COI-statement">
<title>Conflicts of Interest</title>
<p>The authors declare that they have no conflicts of interest to report regarding the present study.</p>
</sec>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>1.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ge</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Song</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Ding</surname> <given-names>SX</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>B</given-names></string-name></person-group>. <article-title>Data mining and analytics in the process industry: the role of machine learning</article-title>. <source>IEEE Access</source>. <year>2017</year>;<volume>5</volume>:<fpage>20590</fpage>&#x2013;<lpage>616</lpage>.</mixed-citation></ref>
<ref id="ref-2"><label>2.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Alqahtani</surname> <given-names>A</given-names></string-name>, <string-name><surname>Ullah Khan</surname> <given-names>H</given-names></string-name>, <string-name><surname>Alsubai</surname> <given-names>S</given-names></string-name>, <string-name><surname>Sha</surname> <given-names>M</given-names></string-name>, <string-name><surname>Almadhor</surname> <given-names>A</given-names></string-name>, <string-name><surname>Iqbal</surname> <given-names>T</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>An efficient approach for textual data classification using deep learning</article-title>. <source>Front Comput Neurosci</source>. <year>2022</year>;<volume>16</volume>:<fpage>992296</fpage>; <pub-id pub-id-type="pmid">36185709</pub-id></mixed-citation></ref>
<ref id="ref-3"><label>3.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Koseler</surname> <given-names>K</given-names></string-name>, <string-name><surname>Stephan</surname> <given-names>M</given-names></string-name></person-group>. <article-title>Machine learning applications in baseball: a systematic literature review</article-title>. <source>Appl Artif Intell</source>. <year>2017</year>;<volume>31</volume>(<issue>9&#x2013;10</issue>):<fpage>745</fpage>&#x2013;<lpage>63</lpage>.</mixed-citation></ref>
<ref id="ref-4"><label>4.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Nguyen</surname> <given-names>TTS</given-names></string-name>, <string-name><surname>Do</surname> <given-names>PMT</given-names></string-name></person-group>. <article-title>Classification optimization for training a large dataset with Na&#x00EF;ve Bayes</article-title>. <source>J Comb Optim</source>. <year>2020</year>;<volume>40</volume>(<issue>1</issue>):<fpage>141</fpage>&#x2013;<lpage>69</lpage>.</mixed-citation></ref>
<ref id="ref-5"><label>5.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Yang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Nan</surname> <given-names>F</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>P</given-names></string-name></person-group>. <article-title>Effective multilayer hybrid classification approach for automatic bridge health assessment on large-scale uncertain data</article-title>. <source>J Ind Inf Integr</source>. <year>2021</year>;<volume>24</volume>:<fpage>100234</fpage>.</mixed-citation></ref>
<ref id="ref-6"><label>6.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Bektas</surname> <given-names>BA</given-names></string-name>, <string-name><surname>Carriquiry</surname> <given-names>A</given-names></string-name>, <string-name><surname>Smadi</surname> <given-names>O</given-names></string-name></person-group>. <article-title>Using classification trees for predicting national bridge inventory condition ratings</article-title>. <source>J Infrastruct Syst</source>. <year>2013</year>;<volume>19</volume>(<issue>4</issue>):<fpage>425</fpage>&#x2013;<lpage>33</lpage>. doi:<pub-id pub-id-type="doi">10.1061/(ASCE)IS.1943-555X.0000143</pub-id>.</mixed-citation></ref>
<ref id="ref-7"><label>7.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chung</surname> <given-names>S</given-names></string-name>, <string-name><surname>Lim</surname> <given-names>S</given-names></string-name>, <string-name><surname>Chi</surname> <given-names>S</given-names></string-name></person-group>. <article-title>Developing an estimation model for safety rating of road bridges using rule-based classification method</article-title>. <source>J KIBIM</source>. <year>2016</year>;<volume>6</volume>(<issue>2</issue>):<fpage>29</fpage>&#x2013;<lpage>38</lpage>. doi:<pub-id pub-id-type="doi">10.13161/kibim.2016.6.2.029</pub-id>.</mixed-citation></ref>
<ref id="ref-8"><label>8.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Martinez</surname> <given-names>P</given-names></string-name>, <string-name><surname>Mohamed</surname> <given-names>E</given-names></string-name>, <string-name><surname>Mohsen</surname> <given-names>O</given-names></string-name>, <string-name><surname>Mohamed</surname> <given-names>Y</given-names></string-name></person-group>. <article-title>Comparative study of data mining models for prediction of bridge future conditions</article-title>. <source>J Perform Constr Facil</source>. <year>2020</year>;<volume>34</volume>(<issue>1</issue>):<fpage>04019108</fpage>. doi:<pub-id pub-id-type="doi">10.1061/(ASCE)CF.1943-5509.0001395</pub-id>.</mixed-citation></ref>
<ref id="ref-9"><label>9.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Feng</surname> <given-names>CW</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>HY</given-names></string-name></person-group>. <article-title>Hybridization of finite-element method and support vector machine to determine scour bridge safety level</article-title>. <source>J Perform Constr Facil</source>. <year>2015</year>;<volume>29</volume>(<issue>3</issue>):<fpage>04014079</fpage>. doi:<pub-id pub-id-type="doi">10.1061/(ASCE)CF.1943-5509.0000505</pub-id>.</mixed-citation></ref>
<ref id="ref-10"><label>10.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Mokalled</surname> <given-names>S</given-names></string-name>, <string-name><surname>Locke</surname> <given-names>W</given-names></string-name>, <string-name><surname>Abuodeh</surname> <given-names>O</given-names></string-name>, <string-name><surname>Redmond</surname> <given-names>L</given-names></string-name>, <string-name><surname>McMahan</surname> <given-names>C</given-names></string-name></person-group>. <article-title>Drive-by health monitoring of highway bridges using Bayesian estimation technique for damage classification</article-title>. <source>Struct Control Health Monit</source>. <year>2022</year>;<volume>29</volume>(<issue>6</issue>):<fpage>e2944</fpage>. doi:<pub-id pub-id-type="doi">10.1002/stc.2944</pub-id>.</mixed-citation></ref>
<ref id="ref-11"><label>11.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Mangalathu</surname> <given-names>S</given-names></string-name>, <string-name><surname>Jeon</surname> <given-names>JS</given-names></string-name></person-group>. <article-title>Machine learning-based failure mode recognition of circular reinforced concrete bridge columns: comparative study</article-title>. <source>J Struct Eng</source>. <year>2019</year>;<volume>145</volume>(<issue>10</issue>):<fpage>04019104</fpage>.</mixed-citation></ref>
<ref id="ref-12"><label>12.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Jootoo</surname> <given-names>A</given-names></string-name>, <string-name><surname>Lattanzi</surname> <given-names>D</given-names></string-name></person-group>. <article-title>Bridge type classification: supervised learning on a modified NBI data set</article-title>. <source>J Comput Civ Eng</source>. <year>2017</year>;<volume>31</volume>(<issue>6</issue>):<fpage>04017063</fpage>.</mixed-citation></ref>
<ref id="ref-13"><label>13.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Hu</surname> <given-names>S</given-names></string-name>, <string-name><surname>Shao</surname> <given-names>K</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>X</given-names></string-name>, <string-name><surname>Ma</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>B</given-names></string-name></person-group>. <article-title>Predictions and evolution characteristics of failure modes of degenerate RC piers</article-title>. <source>Buildings</source>. <year>2023</year>;<volume>13</volume>(<issue>1</issue>):<fpage>113</fpage>.</mixed-citation></ref>
<ref id="ref-14"><label>14.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Zhang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>X</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>C</given-names></string-name>, <string-name><surname>Li</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Zheng</surname> <given-names>Y</given-names></string-name></person-group>. <article-title>Reliability assessment of the existing concrete bridge deck system based on the multi-index multilevel extension assessment</article-title>. <source>J Highway Transport Res Dev</source>. <year>2016</year>;<volume>10</volume>(<issue>3</issue>):<fpage>26</fpage>&#x2013;<lpage>33</lpage>.</mixed-citation></ref>
<ref id="ref-15"><label>15.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Li</surname> <given-names>GZ</given-names></string-name>, <string-name><surname>He</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Shao</surname> <given-names>FF</given-names></string-name>, <string-name><surname>Ou</surname> <given-names>AH</given-names></string-name>, <string-name><surname>Lin</surname> <given-names>XZ</given-names></string-name></person-group>. <article-title>Patient classification of hypertension in traditional Chinese medicine using multi-label learning techniques</article-title>. <source>BMC Med Genomics</source>. <year>2015</year>;<volume>8</volume>(<issue>3</issue>):<fpage>1</fpage>&#x2013;<lpage>6</lpage>.</mixed-citation></ref>
<ref id="ref-16"><label>16.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chen</surname> <given-names>F</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>NL</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>BX</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>ZR</given-names></string-name>, <string-name><surname>Jin</surname> <given-names>XL</given-names></string-name>, <string-name><surname>Guo</surname> <given-names>RJ</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Identification and classification of traditional Chinese medicine syndrome types among senior patients with vascular mild cognitive impairment using latent tree analysis</article-title>. <source>J Integr Med</source>. <year>2017</year>;<volume>15</volume>(<issue>3</issue>):<fpage>186</fpage>&#x2013;<lpage>200</lpage>. doi:<pub-id pub-id-type="doi">10.1016/S2095-4964(17)60335-2</pub-id>; <pub-id pub-id-type="pmid">28494849</pub-id></mixed-citation></ref>
<ref id="ref-17"><label>17.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Dong</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Li</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>Y</given-names></string-name></person-group>. <article-title>Breast cancer classification application based on QGA-SVM</article-title>. <source>J Intell Fuzzy Syst</source>. <year>2023</year>;<volume>44</volume>(<issue>4</issue>):<fpage>5559</fpage>&#x2013;<lpage>71</lpage>.</mixed-citation></ref>
<ref id="ref-18"><label>18.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Donghia</surname> <given-names>R</given-names></string-name>, <string-name><surname>Guerra</surname> <given-names>V</given-names></string-name>, <string-name><surname>Misciagna</surname> <given-names>G</given-names></string-name>, <string-name><surname>Loiacono</surname> <given-names>C</given-names></string-name>, <string-name><surname>Brunetti</surname> <given-names>A</given-names></string-name>, <string-name><surname>Bevilacqua</surname> <given-names>V</given-names></string-name></person-group>. <article-title>Prediction of gastrointestinal cancers in the ONCONUT cohort study: comparison between logistic regression and artificial neural network</article-title>. <source>Front Oncol</source>. <year>2023</year>;<volume>13</volume>:<fpage>1110999</fpage>. doi:<pub-id pub-id-type="doi">10.3389/fonc.2023.1110999</pub-id>; <pub-id pub-id-type="pmid">37168368</pub-id></mixed-citation></ref>
<ref id="ref-19"><label>19.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Zeng</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Cheng</surname> <given-names>F</given-names></string-name></person-group>. <article-title>Medical and health data classification method based on machine learning</article-title>. <source>J Healthc Eng</source>. <volume>2021</volume>;(<issue>12</issue>):<fpage>2722854</fpage>. doi:<pub-id pub-id-type="doi">10.1155/2021/2722854</pub-id>; <pub-id pub-id-type="pmid">34824763</pub-id></mixed-citation></ref>
<ref id="ref-20"><label>20.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Hameed</surname> <given-names>K</given-names></string-name>, <string-name><surname>Chai</surname> <given-names>D</given-names></string-name>, <string-name><surname>Rassau</surname> <given-names>A</given-names></string-name></person-group>. <article-title>A comprehensive review of fruit and vegetable classification techniques</article-title>. <source>Image Vis Comput</source>. <year>2018</year>;<volume>80</volume>(<issue>1</issue>):<fpage>24</fpage>&#x2013;<lpage>44</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.imavis.2018.09.016</pub-id>.</mixed-citation></ref>
<ref id="ref-21"><label>21.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Gupta</surname> <given-names>O</given-names></string-name>, <string-name><surname>Das</surname> <given-names>AJ</given-names></string-name>, <string-name><surname>Hellerstein</surname> <given-names>J</given-names></string-name>, <string-name><surname>Raskar</surname> <given-names>R</given-names></string-name></person-group>. <article-title>Machine learning approaches for large scale classification of produce</article-title>. <source>Sci Rep</source>. <year>2018</year>;<volume>8</volume>(<issue>1</issue>):<fpage>5226</fpage>; <pub-id pub-id-type="pmid">29588477</pub-id></mixed-citation></ref>
<ref id="ref-22"><label>22.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Yasmeen</surname> <given-names>U</given-names></string-name>, <string-name><surname>Khan</surname> <given-names>MA</given-names></string-name>, <string-name><surname>Tariq</surname> <given-names>U</given-names></string-name>, <string-name><surname>Khan</surname> <given-names>JA</given-names></string-name>, <string-name><surname>Yar</surname> <given-names>MAE</given-names></string-name>, <string-name><surname>Hanif</surname> <given-names>CA</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Citrus diseases recognition using deep improved genetic algorithm</article-title>. <source>Comput Mater Contin</source>. <year>2021</year>;<volume>71</volume>:<fpage>3667</fpage>&#x2013;<lpage>84</lpage>. doi:<pub-id pub-id-type="doi">10.32604/cmc.2022.022264</pub-id>.</mixed-citation></ref>
<ref id="ref-23"><label>23.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Gupta</surname> <given-names>S</given-names></string-name>, <string-name><surname>Tripathi</surname> <given-names>AK</given-names></string-name></person-group>. <article-title>Fruit and vegetable disease detection and classification: recent trends, challenges, and future opportunities</article-title>. <source>Eng Appl Artif Intell</source>. <year>2024</year>;<volume>133</volume>:<fpage>108260</fpage>.</mixed-citation></ref>
<ref id="ref-24"><label>24.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chen</surname> <given-names>S</given-names></string-name>, <string-name><surname>Webb</surname> <given-names>GI</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>L</given-names></string-name>, <string-name><surname>Ma</surname> <given-names>X</given-names></string-name></person-group>. <article-title>A novel selective na&#x00EF;ve Bayes algorithm</article-title>. <source>Knowl-Based Syst</source>. <year>2020</year>;<volume>192</volume>:<fpage>105361</fpage>.</mixed-citation></ref>
<ref id="ref-25"><label>25.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wang</surname> <given-names>S</given-names></string-name>, <string-name><surname>Ren</surname> <given-names>J</given-names></string-name>, <string-name><surname>Bai</surname> <given-names>R</given-names></string-name></person-group>. <article-title>A regularized attribute weighting framework for naive Bayes</article-title>. <source>IEEE Access</source>. <year>2020</year>;<volume>8</volume>:<fpage>225639</fpage>&#x2013;<lpage>49</lpage>.</mixed-citation></ref>
<ref id="ref-26"><label>26.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wang</surname> <given-names>S</given-names></string-name>, <string-name><surname>Ren</surname> <given-names>J</given-names></string-name>, <string-name><surname>Bai</surname> <given-names>R</given-names></string-name></person-group>. <article-title>A semi-supervised adaptive discriminative discretization method improving discrimination power of regularized naive Bayes</article-title>. <source>Expert Syst Appl</source>. <year>2023</year>;<volume>225</volume>(<issue>1</issue>):<fpage>120094</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.eswa.2023.120094</pub-id>.</mixed-citation></ref>
<ref id="ref-27"><label>27.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wu</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Wu</surname> <given-names>B</given-names></string-name>, <string-name><surname>Hu</surname> <given-names>C</given-names></string-name>, <string-name><surname>Yan</surname> <given-names>X</given-names></string-name></person-group>. <article-title>Evolutionary multilabel classification algorithm based on cultural algorithm</article-title>. <source>Symmetry</source>. <year>2021</year>;<volume>13</volume>(<issue>2</issue>):<fpage>322</fpage>. doi:<pub-id pub-id-type="doi">10.3390/sym13020322</pub-id>.</mixed-citation></ref>
<ref id="ref-28"><label>28.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Zhang</surname> <given-names>H</given-names></string-name>, <string-name><surname>Jiang</surname> <given-names>L</given-names></string-name>, <string-name><surname>Yu</surname> <given-names>L</given-names></string-name></person-group>. <article-title>Attribute and instance weighted naive Bayes</article-title>. <source>Pattern Recognit</source>. <year>2021</year>;<volume>111</volume>(<issue>2&#x2013;3</issue>):<fpage>107674</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.patcog.2020.107674</pub-id>.</mixed-citation></ref>
<ref id="ref-29"><label>29.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>K</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Luo</surname> <given-names>C</given-names></string-name></person-group>. <article-title>Generator fault classification method based on multi-source information fusion Naive Bayes classification algorithm</article-title>. <source>Energies</source>. <year>2022</year>;<volume>15</volume>(<issue>24</issue>):<fpage>9635</fpage>. doi:<pub-id pub-id-type="doi">10.3390/en15249635</pub-id>.</mixed-citation></ref>
<ref id="ref-30"><label>30.</label><mixed-citation publication-type="book"><person-group person-group-type="author"><collab>JTG/T H21-2011</collab></person-group>. <source>Standards for technical condition evaluation of highway bridges</source>. <publisher-loc>Beijing, China</publisher-loc>: <publisher-name>China Communication Press</publisher-name>; <year>2011</year>.</mixed-citation></ref>
</ref-list>
</back>
</article>

















