<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xml:lang="en" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">JIMH</journal-id>
<journal-id journal-id-type="nlm-ta">JIMH</journal-id>
<journal-id journal-id-type="publisher-id">JIMH</journal-id>
<journal-title-group>
<journal-title>Journal of Intelligent Medicine and Healthcare</journal-title>
</journal-title-group>
<issn pub-type="epub">2837-634X</issn>
<issn pub-type="ppub">2837-6331</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">75201</article-id>
<article-id pub-id-type="doi">10.32604/jimh.2026.075201</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>CardioForest: An Explainable Ensemble Learning Model for Automatic Wide QRS Complex Tachycardia Diagnosis from ECG</article-title>
<alt-title alt-title-type="left-running-head">CardioForest: An Explainable Ensemble Learning Model for Automatic Wide QRS Complex Tachycardia Diagnosis from ECG</alt-title>
<alt-title alt-title-type="right-running-head">CardioForest: An Explainable Ensemble Learning Model for Automatic Wide QRS Complex Tachycardia Diagnosis from ECG</alt-title>
</title-group>
<contrib-group>
<contrib id="author-1" contrib-type="author"><contrib-id contrib-id-type="orcid">https://orcid.org/0009-0003-3039-3175</contrib-id>
<name name-style="western"><surname>Chakma</surname><given-names>Vaskar</given-names></name><xref ref-type="aff" rid="aff-1">1</xref><xref ref-type="author-notes" rid="afn1">#</xref></contrib>
<contrib id="author-2" contrib-type="author"><contrib-id contrib-id-type="orcid">https://orcid.org/0000-0003-2579-5359</contrib-id>
<name name-style="western"><surname>Ju</surname><given-names>Xiaolin</given-names></name><xref ref-type="aff" rid="aff-1">1</xref><xref ref-type="author-notes" rid="afn1">#</xref></contrib>
<contrib id="author-3" contrib-type="author">
<name name-style="western"><surname>Cao</surname><given-names>Heling</given-names></name><xref ref-type="aff" rid="aff-2">2</xref></contrib>
<contrib id="author-4" contrib-type="author">
<name name-style="western"><surname>Feng</surname><given-names>Xue</given-names></name><xref ref-type="aff" rid="aff-3">3</xref></contrib>
<contrib id="author-5" contrib-type="author">
<name name-style="western"><surname>Ji</surname><given-names>Xiaodong</given-names></name><xref ref-type="aff" rid="aff-3">3</xref></contrib>
<contrib id="author-6" contrib-type="author" corresp="yes">
<name name-style="western"><surname>Pan</surname><given-names>Haiyan</given-names></name><xref ref-type="aff" rid="aff-3">3</xref><email>dr.phy@ntu.edu.cn</email></contrib>
<contrib id="author-7" contrib-type="author" corresp="yes">
<name name-style="western"><surname>Zhan</surname><given-names>Gao</given-names></name><xref ref-type="aff" rid="aff-1">1</xref><email>gaozhan@ntu.edu.cn</email></contrib>
<aff id="aff-1"><label>1</label><institution>School of Artificial Intelligence and Computer Science, Nantong University</institution>, <addr-line>Nantong, 226001</addr-line>, <country>China</country></aff>
<aff id="aff-2"><label>2</label><institution>College of Information Science and Engineering, Henan University of Technology</institution>, <addr-line>Zhengzhou, 450001</addr-line>, <country>China</country></aff>
<aff id="aff-3"><label>3</label><institution>Department of Information Center, Affiliated Hospital of Nantong University</institution>, <addr-line>Nantong, 226001</addr-line>, <country>China</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>&#x002A;</label>Corresponding Authors: Haiyan Pan. Email: <email>dr.phy@ntu.edu.cn</email>; Gao Zhan. Email: <email>gaozhan@ntu.edu.cn</email></corresp>
<fn id="afn1">
<p><sup>#</sup>These authors contributed equally to this work</p>
</fn>
</author-notes>
<pub-date date-type="collection" publication-format="electronic">
<year>2026</year></pub-date>
<pub-date date-type="pub" publication-format="electronic">
<day>22</day>
<month>1</month>
<year>2026</year>
</pub-date>
<volume>4</volume>
<issue>1</issue>
<fpage>37</fpage>
<lpage>86</lpage>
<history>
<date date-type="received">
<day>27</day>
<month>10</month>
<year>2025</year>
</date>
<date date-type="accepted">
<day>28</day>
<month>11</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2026 The Authors.</copyright-statement>
<copyright-year>2026</copyright-year>
<copyright-holder>Published by Tech Science Press.</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_JIMH_75201.pdf"></self-uri>
<abstract>
<p>Wide QRS Complex Tachycardia (WCT) is a life-threatening cardiac arrhythmia requiring rapid and accurate diagnosis. Traditional manual ECG interpretation is time-consuming and subject to inter-observer variability, while existing AI models often lack the clinical interpretability necessary for trusted deployment in emergency settings. We developed CardioForest, an optimized Random Forest ensemble model, for automated WCT detection from 12-lead ECG signals. The model was trained, tested, and validated using 10-fold cross-validation on 800,000 ten-second-long 12-lead Electrocardiogram (ECG) recordings from the MIMIC-IV dataset (15.46% WCT prevalence), with comparative evaluation against XGBoost, LightGBM, and Gradient Boosting models. Performance was assessed using accuracy, balanced accuracy, precision, recall, F1-score, ROC-AUC, RMSE, and MAE. SHAP (SHapley Additive exPlanations) analysis provided feature-level interpretability to ensure clinical validity. CardioForest achieved superior and consistent performance across all metrics: test accuracy 95.19% (<inline-formula id="ieqn-1"><mml:math id="mml-ieqn-1"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula>0.33%), balanced accuracy 88.76% (<inline-formula id="ieqn-2"><mml:math id="mml-ieqn-2"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula>0.79%), precision 95.26%, recall 78.42%, F1-score 86.02%, and ROC-AUC 0.8886, with the lowest error rates (RMSE: 0.2532, MAE: 0.1944). Statistical significance testing confirmed CardioForest&#x2019;s advantages over XGBoost (accuracy <inline-formula id="ieqn-3"><mml:math id="mml-ieqn-3"><mml:mo>+</mml:mo><mml:mn>6.75</mml:mn><mml:mi mathvariant="normal">&#x0025;</mml:mi></mml:math></inline-formula>, <italic>p</italic> &#x003C; 0.001), LightGBM (accuracy <inline-formula id="ieqn-4"><mml:math id="mml-ieqn-4"><mml:mo>+</mml:mo><mml:mn>10.85</mml:mn><mml:mi mathvariant="normal">&#x0025;</mml:mi></mml:math></inline-formula>, <italic>p</italic> &#x003C; 0.001), and Gradient Boosting (accuracy <inline-formula id="ieqn-5"><mml:math id="mml-ieqn-5"><mml:mo>+</mml:mo><mml:mn>2.69</mml:mn><mml:mi mathvariant="normal">&#x0025;</mml:mi></mml:math></inline-formula>, <italic>p</italic> &#x003D; 0.012). Notably, CardioForest demonstrated exceptional stability (coefficient of variation: 0.35%) compared to competing models. SHAP analysis revealed that QRS duration-the primary clinical diagnostic criterion&#x2014;dominated model predictions (mean SHAP value: 0.45), with additional contributions from QRS morphology and axis measurements, perfectly aligning with established cardiological knowledge. CardioForest represents a clinically validated, interpretable AI solution for WCT detection that balances diagnostic accuracy with transparent decision-making. With inference times under 10 milliseconds and comprehensive explainability through SHAP visualizations, the model is deployment-ready for real-time emergency department screening. By providing cardiologists with both accurate predictions and clinically interpretable feature attributions, CardioForest addresses the critical gap between AI performance and clinical trust, offering a practical tool for timely, evidence-based cardiac diagnosis in high-stakes scenarios.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>Wide QRS Complex Tachycardia (WCT)</kwd>
<kwd>ECG analysis</kwd>
<kwd>ensemble machine learning</kwd>
<kwd>explainable AI</kwd>
<kwd>artificial intelligence in healthcare</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<label>1</label>
<title>Introduction</title>
<p>Wide QRS Complex Tachycardia (WCT) is a severe and potentially lethal cardiac condition characterized by an exceedingly rapid heartbeat in combination with a widened QRS complex on the electrocardiogram (ECG) [<xref ref-type="bibr" rid="ref-1">1</xref>&#x2013;<xref ref-type="bibr" rid="ref-3">3</xref>]. Normally, the QRS complex&#x2014;a short, spiky waveform&#x2014;registers the process of ventricular depolarization, whereby the ventricles of the heart contract and effectively pump blood [<xref ref-type="bibr" rid="ref-4">4</xref>,<xref ref-type="bibr" rid="ref-5">5</xref>]. A regular narrow QRS complex indicates typical electrical conduction through the heart&#x2019;s normal pathways [<xref ref-type="bibr" rid="ref-6">6</xref>,<xref ref-type="bibr" rid="ref-7">7</xref>]. However, if the QRS complex is wide, then this is an indication of a disruption in electrical propagation [<xref ref-type="bibr" rid="ref-8">8</xref>], typically as a result of underlying structural disease, electrolyte imbalance, or an inherited disorder.</p>
<p>Untreated, WCT can significantly weaken the heart&#x2019;s function to circulate blood effectively, causing symptoms that range from palpitations, dizziness, and chest pain to, in extreme cases, sudden cardiac arrest [<xref ref-type="bibr" rid="ref-9">9</xref>,<xref ref-type="bibr" rid="ref-10">10</xref>]. Understanding these structural abnormalities underscores the clinical importance of accurate WCT diagnosis, as delayed recognition can lead to severe complications, including heart failure and arrhythmias. As such, the early and correct diagnosis of WCT is not only critical&#x2014;it is a matter of life and death. Diagnosis of WCT has traditionally depended to a large extent on manual ECG interpretation by experienced cardiologists. While still the gold standard, this process is time-consuming, labor-intensive, and subject to considerable variability [<xref ref-type="bibr" rid="ref-11">11</xref>,<xref ref-type="bibr" rid="ref-12">12</xref>]. Individual cardiologists may disagree in borderline or uncertain cases, postponing diagnosis and treatment [<xref ref-type="bibr" rid="ref-13">13</xref>,<xref ref-type="bibr" rid="ref-14">14</xref>]. In high-pressure clinical environments where minutes matter, delays can be detrimental. As healthcare systems globally face rising demands, the demand for faster, more accurate diagnostic support that augments, rather than replaces, clinical judgment is pressing. In the past few years, Artificial Intelligence (AI)-driven models have been demonstrated to achieve stellar performance in ECG interpretation with accuracy and speed [<xref ref-type="bibr" rid="ref-15">15</xref>]. Among them, deep learning methods&#x2014;particularly Convolutional Neural Networks (CNNs)&#x2014;have worked incredibly well in identifying complex patterns within ECG signals that are not easily visible to the naked eye [<xref ref-type="bibr" rid="ref-16">16</xref>&#x2013;<xref ref-type="bibr" rid="ref-18">18</xref>]. However, for all the high-accuracy deep learning models claim, they tend to behave like &#x201C;black boxes&#x201D; with little description of decision-making. This absence of transparency has been a significant barrier to clinical adoption because cardiologists and clinicians need not only accuracy but also transparency to trust AI recommendations. For AI to be successfully integrated into clinical practice, especially in life-critical conditions like WCT, interpretability is equally as important as accuracy [<xref ref-type="bibr" rid="ref-19">19</xref>].</p>
<p>Cardiologists must understand the rationale for AI predictions&#x2014;seeing not just the output, but the supporting evidence, e.g., what ECG features led to a specific classification. Without this transparency, clinicians will remain unconvinced and reluctant to trust AI for decision-making, especially when a patient&#x2019;s life hangs in the balance. Given these challenges, this research introduces a novel solution: <bold>CardioForest</bold>, an interpretable ensemble-based AI model for WCT detection. Based primarily on Random Forest architecture&#x2014;augmented with techniques such as XGBoost, LightGBM, and Gradient Boosting&#x2014;CardioForest leverages the strengths of ensemble machine learning to achieve both high diagnostic precision and clear interpretability [<xref ref-type="bibr" rid="ref-20">20</xref>]. CardioForest distinguishes itself from traditional rule-based diagnostic algorithms (Brugada criteria, Vereckei algorithm) through its data-driven learning approach combined with transparent decision-making. While rule-based methods rely on fixed thresholds and sequential decision trees with reported accuracies of 80%&#x2013;90% and significant inter-observer variability, CardioForest learns optimal decision boundaries directly from large-scale clinical data while maintaining interpretability through feature importance rankings and SHAP value analysis. Unlike the binary decision rules of traditional algorithms (e.g., &#x201C;if QRS duration &#x003E; 120 ms AND AV dissociation is present, then VT&#x201D;), CardioForest quantifies the continuous contribution of each ECG feature to the final prediction, providing clinicians with nuanced probability estimates and confidence scores rather than deterministic classifications. This hybrid approach&#x2014;combining machine learning&#x2019;s pattern recognition capabilities with the transparency clinicians expect from rule-based systems&#x2014;represents a significant advancement in bridging the gap between AI performance and clinical adoption. Unlike deep neural networks, Random Forest models are natively explainable through feature importance rankings and decision-tree visualization. This dual advantage ensures clinicians can rely on and interpret the model outputs. Our experiment design is focused not only on the evaluation of diagnostic accuracy but also on increasing clinical trust by employing explainable AI (XAI) methods, for example, SHAP (SHapley Additive exPlanations) [<xref ref-type="bibr" rid="ref-21">21</xref>] values and feature attribution analysis [<xref ref-type="bibr" rid="ref-22">22</xref>]. CardioForest bridges the crucial gap between AI&#x2019;s computational power and the clinician&#x2019;s need for interpretability by highlighting which features most significantly influenced the model&#x2019;s decisions. Preliminary results suggest that CardioForest outperforms traditional manual approaches and competes favorably with state-of-the-art deep learning models while offering superior transparency, an essential quality for clinical adoption. For all these developments, we acknowledge that challenges remain. Future research should also explore continuous learning frameworks, where AI models learn incrementally from new data, thereby improving their diagnostic acumen over time without compromising explainability. <xref ref-type="fig" rid="fig-1">Fig. 1</xref> presents <italic>CardioForest</italic> as a pioneering WCT detection solution by providing cardiologists with speedy, interpretable, and reliable AI support. We hope to enhance cardiac diagnosis, reduce diagnostic latency, and ultimately save more lives. After that, <xref ref-type="fig" rid="fig-2">Fig. 2</xref> illustrates the anatomical differences between normal cardiac structure and pathological conditions such as left ventricular noncompaction, which can manifest as wide QRS complexes on ECG. The presence of prominent trabeculations in the left ventricle (right panel) disrupts normal electrical conduction pathways, contributing to the wide QRS morphology that CardioForest is designed to detect. We include this clinical reference textbook [<xref ref-type="bibr" rid="ref-6">6</xref>] as it represents the current gold standard for ECG education used in training cardiologists who serve as the benchmark for AI system comparison. Looking ahead, we envision the expansion of explainable ensemble model applications beyond WCT toward general arrhythmia detection with the inclusion of real-time ECG monitoring for preemptive cardiac health management.</p>
<fig id="fig-1">
<label>Figure 1</label>
<caption>
<title>An overview of the WCT prediction system using the MIMIC-IV ECG database, featuring preprocessing, ensemble machine learning models, cross-validation, and final prediction</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-1.tif"/>
</fig><fig id="fig-2">
<label>Figure 2</label>
<caption>
<title>Anatomical comparison between normal cardiac structure (center) and left ventricular noncompaction (right). The normal heart shows regular ventricular muscle wall thickness and organized electrical pathways. In contrast, left ventricular noncompaction exhibits prominent trabeculations and disorganized myocardial structure, which disrupts electrical conduction and manifests as wide QRS complexes on ECG. Such structural abnormalities represent one category of conditions that CardioForest must accurately identify to guide appropriate clinical management</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-2.tif"/>
</fig>
</sec>
<sec id="s2">
<label>2</label>
<title>Related Works</title>
<p>Accurate and timely prediction of Wide Complex Tachycardia (WCT) remains a major focus in cardiovascular research, driven by the need to distinguish between ventricular tachycardia (VT) and supraventricular tachycardia (SVT) with aberrant conduction. Machine learning (ML) and deep learning (DL) techniques have gained prominence in this domain, offering new avenues for improved diagnostic performance compared to traditional criteria-based methods.</p>
<p>Li et al. [<xref ref-type="bibr" rid="ref-23">23</xref>] proposed a Gradient Boosting Machine (GBM) model for differentiating VT from SVT using surface ECG features. Their approach leveraged a rich set of ECG-derived parameters, leading to an impressive classification performance with an overall accuracy of 91.2%, sensitivity of 89.5%, specificity of 92.8%, and an area under the ROC curve (AUC) of 0.94. This study highlighted the importance of carefully selected ECG features in enhancing machine learning model performance. Building on the trend of AI-driven diagnosis, Chow et al. [<xref ref-type="bibr" rid="ref-24">24</xref>] developed a specialized AI model to interpret WCT directly from ECGs. Their system, designed with clinical applicability in mind, demonstrated an overall accuracy of 93%, with sensitivity and specificity exceeding 91%. This work showcased the potential of deep learning models in outperforming traditional rule-based algorithms for arrhythmia classification, particularly for ambiguous WCT cases.</p>
<p>Focusing on high-risk populations, Bhattacharya et al. [<xref ref-type="bibr" rid="ref-25">25</xref>] introduced the HCM-VAr-Risk model, which applies machine learning techniques to predict ventricular arrhythmias in patients with hypertrophic cardiomyopathy (HCM). Their model achieved a C-index of 0.83, reflecting strong discriminative ability. The study underscored the utility of ML for risk stratification in structurally abnormal hearts, offering a more individualized approach to arrhythmia prediction. Hong et al. [<xref ref-type="bibr" rid="ref-26">26</xref>] provided a broader perspective by conducting a systematic review of deep learning applications for ECG analysis, including arrhythmia detection and classification tasks. The review covered a range of architectures, such as convolutional neural networks (CNNs), recurrent neural networks (RNNs), and hybrid models, illustrating the high accuracy and generalizability of DL models when trained on large, diverse ECG datasets. Their findings support the growing consensus that deep learning can significantly enhance the detection of complex arrhythmias, including WCT.</p>
<p>Addressing diagnostic challenges from a different angle, May et al. [<xref ref-type="bibr" rid="ref-27">27</xref>] introduced the QRS Polarity Shift (QRS-PS) method, which focuses on changes in QRS polarity between baseline ECGs and WCT episodes. By simplifying the interpretation of polarity shifts, their algorithm achieved AUC values ranging from 0.90 to 0.93. This technique provides a pragmatic and explainable tool that can be readily integrated into clinical workflows, assisting clinicians in making rapid and accurate diagnoses. Machine learning classification models have also shown remarkable potential in SVT detection. Howladar and Sahoo [<xref ref-type="bibr" rid="ref-28">28</xref>] developed a decision-tree-based model specifically tailored for SVT identification. Their model attained a striking 97% accuracy, demonstrating the effectiveness of even relatively simple ML algorithms when paired with relevant feature selection. Deep learning models have further pushed the boundaries of arrhythmia prediction. Rajpurkar et al. [<xref ref-type="bibr" rid="ref-29">29</xref>] designed a CNN-based model, trained on a large annotated ECG dataset, that achieved recall and precision rates exceeding those of board-certified cardiologists. Their work set a new benchmark for DL-based arrhythmia detection and provided strong evidence for adopting AI-assisted ECG interpretation tools in clinical practice.</p>
<p>In addition, Frausto-Avila et al. [<xref ref-type="bibr" rid="ref-30">30</xref>] presented a compact neural network architecture enhanced with advanced feature engineering techniques. Their model achieved an accuracy of 97.36% in arrhythmia classification tasks, suggesting that lightweight models can maintain high predictive performance while offering advantages in computational efficiency, making them suitable for deployment in real-time or resource-constrained environments. The reviewed studies demonstrate that both machine learning and deep learning approaches have significantly advanced WCT prediction and arrhythmia classification. The diversity of methods&#x2014;from feature-driven models like GBM and decision trees to sophisticated deep learning architectures like CNNs&#x2014;reflects the rich potential of AI technologies to improve clinical outcomes. All abbreviations referenced in this paper, along with their definitions, are compiled in <xref ref-type="table" rid="table-1">Table 1</xref> and a detailed comparison of these related works, including their methodologies, key innovations, and achieved performance metrics, is presented in <xref ref-type="table" rid="table-2">Table 2</xref>.</p>
<table-wrap id="table-1">
<label>Table 1</label>
<caption>
<title>List of abbreviations</title>
</caption>
<table>
<colgroup>
<col align="center"/>
<col align="center"/>
</colgroup>
<thead>
<tr>
<th>Abbreviation</th>
<th>Definition</th>
</tr>
</thead>
<tbody>
<tr>
<td>AI</td>
<td>Artificial Intelligence</td>
</tr>
<tr>
<td>ANN</td>
<td>Artificial Neural Network</td>
</tr>
<tr>
<td>AUC</td>
<td>Area Under the Curve</td>
</tr>
<tr>
<td>AV</td>
<td>Atrioventricular</td>
</tr>
<tr>
<td>bpm</td>
<td>Beats Per Minute</td>
</tr>
<tr>
<td>CNN</td>
<td>Convolutional Neural Network</td>
</tr>
<tr>
<td>CPU</td>
<td>Central Processing Unit</td>
</tr>
<tr>
<td>CSV</td>
<td>Comma-Separated Values</td>
</tr>
<tr>
<td>CV</td>
<td>Cross-Validation/Coefficient of Variation</td>
</tr>
<tr>
<td>DL</td>
<td>Deep Learning</td>
</tr>
<tr>
<td>ECG</td>
<td>Electrocardiogram</td>
</tr>
<tr>
<td>ECE</td>
<td>Expected Calibration Error</td>
</tr>
<tr>
<td>GBM</td>
<td>Gradient Boosting Machine</td>
</tr>
<tr>
<td>HCM</td>
<td>Hypertrophic Cardiomyopathy</td>
</tr>
<tr>
<td>HR</td>
<td>Heart Rate</td>
</tr>
<tr>
<td>IQR</td>
<td>Interquartile Range</td>
</tr>
<tr>
<td>LGBM</td>
<td>Light Gradient Boosting Machine</td>
</tr>
<tr>
<td>MAE</td>
<td>Mean Absolute Error</td>
</tr>
<tr>
<td>MIMIC</td>
<td>Medical Information Mart for Intensive Care</td>
</tr>
<tr>
<td>ML</td>
<td>Machine Learning</td>
</tr>
<tr>
<td>ms</td>
<td>Milliseconds</td>
</tr>
<tr>
<td>NPV</td>
<td>Negative Predictive Value</td>
</tr>
<tr>
<td>OOB</td>
<td>Out-of-Bag</td>
</tr>
<tr>
<td>PCA</td>
<td>Principal Component Analysis</td>
</tr>
<tr>
<td>PPV</td>
<td>Positive Predictive Value</td>
</tr>
<tr>
<td>PR</td>
<td>Precision-Recall</td>
</tr>
<tr>
<td>QRS</td>
<td>Q-wave, R-wave, S-wave Complex</td>
</tr>
<tr>
<td>RF</td>
<td>Random Forest</td>
</tr>
<tr>
<td>RMSE</td>
<td>Root Mean Square Error</td>
</tr>
<tr>
<td>RNN</td>
<td>Recurrent Neural Network</td>
</tr>
<tr>
<td>ROC</td>
<td>Receiver Operating Characteristic</td>
</tr>
<tr>
<td>SD</td>
<td>Standard Deviation</td>
</tr>
<tr>
<td>SHAP</td>
<td>SHapley Additive exPlanations</td>
</tr>
<tr>
<td>SMOTE</td>
<td>Synthetic Minority Over-sampling Technique</td>
</tr>
<tr>
<td>SVT</td>
<td>Supraventricular Tachycardia</td>
</tr>
<tr>
<td>SVT-AC</td>
<td>Supraventricular Tachycardia with Aberrant Conduction</td>
</tr>
<tr>
<td>VT</td>
<td>Ventricular Tachycardia</td>
</tr>
<tr>
<td>WCT</td>
<td>Wide QRS Complex Tachycardia</td>
</tr>
<tr>
<td>WFDB</td>
<td>WaveForm DataBase</td>
</tr>
<tr>
<td>XAI</td>
<td>Explainable Artificial Intelligence</td>
</tr>
<tr>
<td>XGBoost</td>
<td>Extreme Gradient Boosting</td>
</tr>
</tbody>
</table>
</table-wrap><table-wrap id="table-2">
<label>Table 2</label>
<caption>
<title>Summary of diverse studies on WCT prediction</title>
</caption>
<table>
<colgroup>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
</colgroup>
<thead>
<tr>
<th>Study</th>
<th>Model/Method</th>
<th>Key features</th>
<th>Performance metrics</th>
<th>References</th>
</tr>
</thead>
<tbody>
<tr>
<td>Li et al. (2024)</td>
<td>Gradient Boosting Machine (GBM)</td>
<td>Differentiates VT from SVT using surface ECG features</td>
<td>Accuracy: 91.2%, Sensitivity: 89.5%, Specificity: 92.8%, AUC: 0.94</td>
<td>[<xref ref-type="bibr" rid="ref-23">23</xref>]</td>
</tr>
<tr>
<td>Chow et al. (2024)</td>
<td>Artificial Intelligence (AI) Model</td>
<td>AI algorithm interpreting WCT ECGs</td>
<td>Accuracy: 93%, Sensitivity: 91.9%, Specificity: 93.4%</td>
<td>[<xref ref-type="bibr" rid="ref-24">24</xref>]</td>
</tr>
<tr>
<td>Bhattacharya et al. (2021)</td>
<td>HCM-VAr-Risk Model</td>
<td>ML model for ventricular arrhythmias in HCM patients</td>
<td>Sensitivity: 73%, Specificity: 76%, C-index: 0.83</td>
<td>[<xref ref-type="bibr" rid="ref-25">25</xref>]</td>
</tr>
<tr>
<td>Hong et al. (2020)</td>
<td>Deep Learning Review</td>
<td>Systematic review of DL methods for ECG data</td>
<td>Various models achieving high accuracy and AUC</td>
<td>[<xref ref-type="bibr" rid="ref-26">26</xref>]</td>
</tr>
<tr>
<td>May et al. (2024)</td>
<td>QRS Polarity Shift (QRS-PS)</td>
<td>Algorithm based on QRS polarity shifts between WCT and baseline ECGs</td>
<td>AUC: 0.90&#x2013;0.93</td>
<td>[<xref ref-type="bibr" rid="ref-27">27</xref>]</td>
</tr>
<tr>
<td>Howladar &#x0026; Sahoo (2021)</td>
<td>Classification Model</td>
<td>ML-based SVT detection &#x0026; decision-tree-based model for SVT classification</td>
<td>Accuracy: 97%</td>
<td>[<xref ref-type="bibr" rid="ref-28">28</xref>]</td>
</tr>
<tr>
<td>Rajpurkar et al. (2017)</td>
<td>CNN-based Model</td>
<td>Deep learning model for arrhythmia detection</td>
<td>Exceeds average cardiologist performance in recall and precision</td>
<td>[<xref ref-type="bibr" rid="ref-29">29</xref>]</td>
</tr>
<tr>
<td>Frausto-Avila et al. (2024)</td>
<td>Compact neural network</td>
<td>ANN with feature enhancement for arrhythmia classification</td>
<td>Accuracy: 97.36%</td>
<td>[<xref ref-type="bibr" rid="ref-30">30</xref>]</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s3">
<label>3</label>
<title>Diagnostic Data Resources</title>
<p>This study utilizes the MIMIC-IV-ECG [<xref ref-type="bibr" rid="ref-31">31</xref>] Module (a statistical summary of the dataset has been shown in <xref ref-type="table" rid="table-3">Table 3</xref>), a comprehensive database of diagnostic electrocardiogram (ECG) waveforms [<xref ref-type="bibr" rid="ref-32">32</xref>] integrated with the broader MIMIC-IV Clinical Database. The dataset contains approximately 800,000 ten-second-long 12-lead ECG recordings sampled at 500 Hz, collected from nearly 160,000 unique patients. For computational efficiency while maintaining statistical validity, we utilized a stratified random sample of recordings (see <xref ref-type="sec" rid="s4_2">Section 4.2</xref>), preserving the original class distribution (15.46% WCT prevalence). Each electrocardiogram (ECG) record is stored in the standard WaveForm DataBase (WFDB)<xref ref-type="fn" rid="fn-1"><sup>1</sup></xref><fn id="fn-1">
<label>1</label>
<p><ext-link ext-link-type="uri" xlink:href="https://physionet.org/lightwave/">https://physionet.org/lightwave/</ext-link></p>
</fn> format (Accessed: 29 December 2024), which includes a header file (.hea) and a binary data file (.dat). The records are organized in a structured directory hierarchy based on the subject identifier, allowing for efficient data retrieval. For example, a subject with ID 10001725 would be stored under the path files/p1000/p10001725/, with each diagnostic study within a subdirectory labeled by a randomly generated study ID. Approximately 55% of the ECGs in the dataset overlap with a hospital admission and 25% with an emergency department visit, while the remaining records were collected outside traditional inpatient or emergency settings. This diversity in acquisition context allows for a wide range of use cases, from acute event analysis to routine monitoring assessments. However, it is important to note that the ECG timestamps are derived from the internal clock of the acquisition device and are not synchronized with the hospital&#x2019;s clinical information systems.</p>
<table-wrap id="table-3">
<label>Table 3</label>
<caption>
<title>Statistics of the MIMIC-IV real-world datasets used in this paper</title>
</caption>
<table>
<colgroup>
<col align="center"/>
<col align="center"/>
</colgroup>
<thead>
<tr>
<th>Metric</th>
<th>Value</th>
</tr>
</thead>
<tbody>
<tr>
<td>Total records</td>
<td>800,035</td>
</tr>
<tr>
<td>Unique subjects</td>
<td>161,352</td>
</tr>
<tr>
<td>Unique studies</td>
<td>800,035</td>
</tr>
<tr>
<td>Unique carts</td>
<td>156</td>
</tr>
<tr>
<td>Average RR Interval (ms)</td>
<td>865.60</td>
</tr>
<tr>
<td>Average QRS Duration (ms)</td>
<td>108.51</td>
</tr>
<tr>
<td>Average P Onset (ms)</td>
<td>4702.88</td>
</tr>
<tr>
<td>Average P End (ms)</td>
<td>8745.08</td>
</tr>
<tr>
<td>Average QRS Onset (ms)</td>
<td>283.42</td>
</tr>
<tr>
<td>Average QRS End (ms)</td>
<td>391.66</td>
</tr>
<tr>
<td>Average T End (ms)</td>
<td>688.65</td>
</tr>
<tr>
<td>Average P Axis (degrees)</td>
<td>4973.35</td>
</tr>
<tr>
<td>Average QRS Axis (degrees)</td>
<td>107.37</td>
</tr>
<tr>
<td>Average T Axis (degrees)</td>
<td>192.55</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Consequently, temporal alignment between the ECGs and clinical events in the MIMIC-IV database may require additional validation. Each ECG waveform is accompanied by machine-generated summary measurements [<xref ref-type="bibr" rid="ref-31">31</xref>] stored in the machine_measurements.csv file. These include standard parameters such as RR interval, QRS onset and end, and filter settings, along with textual machine-generated interpretation notes across columns report_0 to report_17. The accompanying data dictionary in machine_measurements-_data_dictionary.csv describes the technical and clinical meaning of each column. Each record includes a subject_id, study_id, and ecg_time, enabling direct linkage to clinical data in the MIMIC-IV hospital and emergency department modules. Cardiologist interpretations are also available for over 600,000 ECG studies. These free-text reports are stored in the MIMIC-IV-Note module and are linked to the ECG waveforms via the waveform_note_links.csv file. Each entry in this linkage file includes the subject ID, study ID, waveform path, and a note_id that can be used to retrieve the corresponding cardiologist report. A sequential integer (note_seq) is also provided to determine the order of ECG collection for individual patients. This linkage enables researchers to perform comparative analyses between machine-generated and clinician-interpreted findings. To support large-scale analysis, key metadata from record_list.csv, machine_measurements.csv, and waveform_note_links.csv have been made available through Google BigQuery. This facilitates efficient querying and integration with other clinical tables in the MIMIC-IV ecosystem. As a practical illustration, using BigQuery, a researcher can identify a patient&#x2019;s hospital admissions and correlate them with the timing of their ECGs, determine whether a given ECG occurred during a hospital stay, and check for the presence of associated cardiologist notes [<xref ref-type="bibr" rid="ref-33">33</xref>,<xref ref-type="bibr" rid="ref-34">34</xref>]. For waveform visualization and signal processing, the dataset supports standard PhysioNet WFDB toolkits in Python, MATLAB, and C. Researchers can read and visualize ECG waveforms using the wfdb Python package [<xref ref-type="bibr" rid="ref-35">35</xref>]. For instance, using wfdb.rdrecord() and wfdb.plot_wfdb(), one can extract and display the raw signal for any given ECG study. This compatibility makes the dataset highly accessible for both signal processing and clinical informatics researchers. Despite its richness, the dataset has some limitations. Notably, the ECG device timestamps may be inaccurate due to a lack of clock synchronization. Additionally, some ECGs were recorded outside the hospital or emergency department, limiting direct temporal correlation with clinical events [<xref ref-type="bibr" rid="ref-36">36</xref>]. Nonetheless, MIMIC-IV-ECG is invaluable for studying cardiac health, machine learning applications in ECG interpretation, and cross-modal linkage with comprehensive clinical records [<xref ref-type="bibr" rid="ref-37">37</xref>].</p>
</sec>
<sec id="s4">
<label>4</label>
<title>Data Preparation and Processing Pipeline</title>
<sec id="s4_1">
<label>4.1</label>
<title>Data Cleaning and Preprocessing Techniques</title>
<p>Duplicate entries were identified using <monospace>subject_id</monospace> and <monospace>study_id</monospace>, ensuring each ECG was uniquely represented. Categorical variables (e.g., wct_label) were encoded numerically using label encoding, and floating-point precision errors were truncated. This ensured compatibility with machine learning algorithms and improved computational efficiency. The complete preprocessing pipeline is detailed in Algorithm 1. Pandas&#x2019; duplicated() function [<xref ref-type="bibr" rid="ref-38">38</xref>] detected redundant records, which were subsequently removed. Post-cleaning verification confirmed the dataset retained its integrity, with zero duplicate records. From the cleaned dataset of 800,035 records, we extracted a stratified sample of records for analysis (see <xref ref-type="sec" rid="s4_2">Section 4.2</xref>), ensuring representative distribution across all clinical variables while maintaining computational tractability for comprehensive cross-validation and explainability analyses. Biologically implausible values (e.g., negative RR intervals) were corrected using interpolation [<xref ref-type="bibr" rid="ref-39">39</xref>], while extreme outliers were adjusted or removed. Visualizations like boxplots and histograms validated the corrections, showing normalized distributions for key features such as RR intervals and QRS durations. Timestamps (ecg_time_x and ecg_time_y) were converted to a uniform format using Python&#x2019;s datetime module, ensuring consistency for time-series analysis. This step addressed discrepancies arising from unsynchronized ECG machine clocks.</p>
</sec>
<sec id="s4_2">
<label>4.2</label>
<title>Sampling Strategy and Validation</title>
<p>Given the computational demands of comprehensive cross-validation and explainability analysis on the full MIMIC-IV-ECG dataset (800,035 records), we employed stratified random sampling to create a representative subset of records. This sample size exceeds the minimum required for a 95% confidence level with a <inline-formula id="ieqn-6"><mml:math id="mml-ieqn-6"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula>1% margin of error by a factor of 18, ensuring robust statistical validity [<xref ref-type="bibr" rid="ref-40">40</xref>]. The stratification was performed on the target variable (<monospace>wct_label_encoded</monospace>) to maintain the original class distribution (84.54% normal rhythm, 15.46% WCT). This approach preserves the clinical prevalence observed in the full dataset while enabling:
<list list-type="bullet">
<list-item>
<p>Comprehensive 10-fold cross-validation without computational bottlenecks</p></list-item>
<list-item>
<p>Detailed SHAP analysis for model explainability</p></list-item>
<list-item>
<p>Extensive hyperparameter optimization across multiple models</p></list-item>
<list-item>
<p>Real-time inference speed suitable for clinical deployment</p></list-item>
</list></p>
<p>The sampling process used a fixed random seed (42) to ensure reproducibility, and validation confirmed that all feature distributions in the sample matched those of the full dataset
(Kolmogorov-Smirnov test, <italic>p</italic> &#x003E; 0.05 for all features). The data preparation workflow is formalized
in Algorithm 1.</p>
<fig id="fig-26">
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-26.tif"/>
</fig>
</sec>
<sec id="s4_3">
<label>4.3</label>
<title>Data Merging, Feature Selection, and Extraction</title>
<p>Data merging, feature selection, and extraction represent a critical phase in transforming the cleaned MIMIC-IV-ECG dataset [<xref ref-type="bibr" rid="ref-31">31</xref>] into a format optimized for machine learning analysis. This stage begins with integrating multiple data sources, including the raw ECG waveforms, machine-generated measurements, and cardiologist reports, into a unified dataframe. The merging process leverages key identifiers such as <monospace>subject_id</monospace> and <monospace>study_id</monospace> to ensure accurate alignment of records across different tables. Special attention is paid to temporal consistency, as the timestamp discrepancies between ECG recordings and hospital events require careful reconciliation to maintain the integrity of time-series analyses. Feature selection constitutes the next crucial step, where we systematically evaluate the clinical relevance and statistical properties of each potential predictor [<xref ref-type="bibr" rid="ref-41">41</xref>]. The dataset&#x2019;s extensive collection of ECG parameters&#x2014;including temporal intervals (RR, PR, QT), wave amplitudes (P, QRS, T), and axis measurements&#x2014;presents both opportunities and challenges. Distribution plots in <xref ref-type="fig" rid="fig-3">Fig. 3</xref> for key features like RR interval and QRS duration provide insights into their statistical properties, highlighting skewness that may require transformation. In <xref ref-type="fig" rid="fig-4">Fig. 4</xref>, we employ correlation analysis to identify redundant features, using heatmap visualizations to detect strong linear relationships between variables. For instance, the analysis revealed a high correlation between specific lead-specific measurements, prompting the removal of redundant leads to reduce dimensionality while preserving diagnostic information. Features demonstrating minimal variability or near-constant values across the population are flagged for potential exclusion, as they offer limited discriminatory power for classification tasks.</p>
<fig id="fig-3">
<label>Figure 3</label>
<caption>
<title>Temporal dynamics of ECG features showing rolling statistics (mean <inline-formula id="ieqn-18"><mml:math id="mml-ieqn-18"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> standard deviation and skewness) for RR interval (milliseconds) and QRS duration (milliseconds) across the time sequence. The <italic>y</italic>-axis represents feature values in their respective units</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-3.tif"/>
</fig><fig id="fig-4">
<label>Figure 4</label>
<caption>
<title>Initialization parameters and preprocessing metadata for ECG signal analysis, showing default values (0.00&#x2013;0.01) for subject identifiers, report fields, filtering parameters, and waveform annotation markers (P-onset, QRS complex). The WCT (Wide Complex Tachycardia) label indicators suggest the beginning of arrhythmia classification preprocessing</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-4.tif"/>
</fig>
<p>The feature extraction phase employs advanced techniques to derive more informative representations of the raw data. Principal Component Analysis (PCA) [<xref ref-type="bibr" rid="ref-42">42</xref>] proves particularly valuable for condensing the multidimensional ECG features into a smaller set of orthogonal components that capture the majority of variance in the data [<xref ref-type="bibr" rid="ref-43">43</xref>]. Prior to PCA application (<xref ref-type="fig" rid="fig-5">Fig. 5</xref>), we standardize all features to zero mean and unit variance to prevent variables with larger scales from dominating the component calculation. The resulting principal components not only reduce computational complexity but also help visualize the underlying structure of the data in two or three dimensions. Boxplot analyses complement this approach by comparing feature distributions across different clinical conditions, such as normal sinus rhythm vs. wide complex tachycardia [<xref ref-type="bibr" rid="ref-44">44</xref>].</p>
<fig id="fig-5">
<label>Figure 5</label>
<caption>
<title>Relationship between Principal Component 1 (<italic>x</italic>-axis) and Principal Component 2 (<italic>y</italic>-axis). The axis scaling (0&#x2013;70) indicates the relative variance explained by each component in this dimensionality reduction visualization</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-5.tif"/>
</fig>
<p>These visualizations help identify features that show significant separation between classes, making them prime candidates for inclusion in predictive models. The final stage involves creating derived features that may enhance model performance. For example, we calculate heart rate variability metrics from RR intervals and compute ratios between various wave durations that clinicians frequently use in practice. The feature engineering process shown in <xref ref-type="fig" rid="fig-6">Fig. 6</xref> remains grounded in clinical knowledge to ensure the biological plausibility of all derived measures. Throughout this entire process, we maintain rigorous documentation of all feature selection decisions and transformations applied, enabling full reproducibility of the analysis pipeline. The output of this comprehensive feature selection and extraction workflow is a refined dataset where each feature carries maximum informational value while minimizing redundancy, providing an optimal foundation for subsequent machine learning model development.</p>
<fig id="fig-6">
<label>Figure 6</label>
<caption>
<title>This boxplot illustrates the statistical distribution of QRS complex durations across all ECG recordings, showing median values, interquartile ranges, and outliers. The visualization helped validate measurement quality and identify extreme values requiring clinical review before feature selection</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-6.tif"/>
</fig>
</sec>
<sec id="s4_4">
<label>4.4</label>
<title>Clinical Outcome Definition</title>
<p>Wide QRS Complex Tachycardia (WCT) was defined according to standard electrophysiological criteria as the presence of both: (1) ventricular rate exceeding 100 beats per minute (tachycardia), and (2) QRS complex duration <inline-formula id="ieqn-19"><mml:math id="mml-ieqn-19"><mml:mo>&#x2265;</mml:mo></mml:math></inline-formula>120 milliseconds (wide QRS). This definition encompasses both ventricular tachycardia (VT) originating from ventricular tissue and supraventricular tachycardia with aberrant conduction (SVT-AC) exhibiting widened QRS complexes. ECG records in the MIMIC-IV-ECG dataset were labeled as WCT based on machine-generated measurements (<monospace>qrs_duration</monospace> field) combined with cardiologist interpretation reports linked through the <monospace>waveform_note_links.csv</monospace> file. Specifically, records meeting the QRS duration criterion (<inline-formula id="ieqn-20"><mml:math id="mml-ieqn-20"><mml:mo>&#x2265;</mml:mo></mml:math></inline-formula>120 ms) and associated with documented tachyarrhythmia in clinical notes were classified as WCT-positive (label &#x003D; 1), while records with normal QRS duration (&#x003C;120 ms) and no documented arrhythmia were classified as WCT-negative (label &#x003D; 0). Borderline cases (QRS duration 115&#x2013;125 ms) were reviewed against cardiologist reports to ensure label accuracy. This approach yielded a dataset with 15.46% WCT prevalence, reflecting realistic clinical incidence rates for this life-threatening arrhythmia.</p>
</sec>
<sec id="s4_5">
<label>4.5</label>
<title>Handling Missing Values and Categorical Variables</title>
<p>In this section, two crucial data preprocessing tasks have been focused on and shown in <xref ref-type="fig" rid="fig-7">Fig. 7</xref>: handling missing values and encoding categorical variables, both of which are vital steps to ensure that the dataset is suitable for machine learning models. Missing values are a common issue in many real-world datasets. If not appropriately addressed, they can negatively impact the performance of machine learning models by introducing bias or reducing the dataset&#x2019;s size. We chose median imputation [<xref ref-type="bibr" rid="ref-45">45</xref>] as the strategy to handle missing values in the dataset. Median imputation involves replacing missing values with the median value of a column [<xref ref-type="bibr" rid="ref-46">46</xref>]. The median is particularly useful because it is less sensitive to extreme values or outliers than the mean, making it a more robust choice when working with data that might have such anomalies. For example, in the ECG dataset, some numerical columns, such as the &#x2018;rr_interval&#x2019; or &#x2018;qrs_duration&#x2019;, may contain missing values for various reasons, such as data collection issues or measurement errors. Instead of discarding rows with missing values, which could result in a loss of important information, median imputation replaces these missing values with the central value of the column, preserving the overall distribution of the data. This approach helps maintain the integrity of the dataset, ensuring that the analysis and modeling processes are not disrupted by missing entries. We used Scikit-learn&#x2019;s &#x2018;SimpleImputer&#x2019; with the &#x2018;median&#x2019; strategy to perform this imputation across all relevant numerical columns in the dataset [<xref ref-type="bibr" rid="ref-47">47</xref>].</p>
<fig id="fig-7">
<label>Figure 7</label>
<caption>
<title>Organized dataframe structure showing the format of ECG records with 28 clinically relevant features after preprocessing. The dataframe architecture represents the structure used for highlighting preserved temporal measurements, electrical axis values, and metadata for machine learning applications</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-7.tif"/>
</fig>
<p>Once the missing values were handled, the next step was to address the categorical variables in the dataset. Many machine learning algorithms require numerical inputs, so categorical data, often represented as text labels, needs to be converted into a numerical format. In this case, the dataset contains several categorical columns, such as &#x2018;report_0&#x2019;, &#x2018;report_1&#x2019;, &#x2018;report_2&#x2019;, &#x2018;report_3&#x2019;, &#x2018;report_4&#x2019;, &#x2018;report_5&#x2019;, &#x2018;report_6&#x2019;, &#x2018;filtering&#x2019;, and &#x2018;wct_label&#x2019;, all of which contain textual labels that represent different categories or classifications of the data. Label encoding [<xref ref-type="bibr" rid="ref-48">48</xref>] was chosen to convert these categorical text values into numerical labels. Label encoding assigns a unique integer to each category within a given column. For instance, if the &#x2018;report_0&#x2019; column contains the values &#x2018;Normal&#x2019;, &#x2018;Abnormal&#x2019;, and &#x2018;Pending&#x2019;, the Label Encoder would transform these labels into numerical values like 0, 1, and 2. This transformation makes the data usable by machine learning algorithms, which can only process numerical inputs. Scikit-learn&#x2019;s &#x2018;LabelEncoder&#x2019; has been used for this task, applying it to each categorical column in the dataset [<xref ref-type="bibr" rid="ref-49">49</xref>]. Both preprocessing steps&#x2014;median imputation for handling missing values and label encoding for categorical variables&#x2014;ensure the dataset is ready for analysis and modeling. The complete preprocessing procedure is formalized in Algorithm 1.</p>
</sec>
</sec>
<sec id="s5">
<label>5</label>
<title>Methods, Experiments, and Results</title>
<sec id="s5_1">
<label>5.1</label>
<title>Model Selection and Justification</title>
<p>The analysis of electrocardiogram (ECG) [<xref ref-type="bibr" rid="ref-50">50</xref>] signals demands robust methodologies capable of navigating noise, patient-specific variability, and subtle morphological changes. These challenges are particularly acute when diagnosing life-threatening arrhythmias like Wide Complex Tachycardia (WCT) [<xref ref-type="bibr" rid="ref-51">51</xref>]. Although deep learning methods, especially Convolutional Neural Networks (CNNs), have demonstrated significant accuracy, their opaque decision-making and high computational requirements limit their deployment in real-time, resource-constrained clinical settings [<xref ref-type="bibr" rid="ref-52">52</xref>]. To address these concerns, this study introduces a specialized Random Forest model named <bold>CardioForest</bold>, tailored for predicting Wide Complex Tachycardia (WCT) events. CardioForest is benchmarked against other ensemble methods, including Gradient Boosting Machine (GBM), Extreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM), balancing performance, interpretability, and computational efficiency&#x2014;crucial attributes for clinical ECG analysis [<xref ref-type="bibr" rid="ref-53">53</xref>].</p>
<sec id="s5_1_1">
<label>5.1.1</label>
<title>CardioForest: Model Formulation</title>
<p>CardioForest is built upon the Random Forest (RF) framework, enhanced with hyperparameters tuned specifically for ECG feature characteristics and arrhythmic prediction [<xref ref-type="bibr" rid="ref-54">54</xref>]. The complete training procedure is described in Algorithm 2. Random Forest aggregates multiple decision trees trained on bootstrap samples to minimize variance and prevent overfitting [<xref ref-type="bibr" rid="ref-55">55</xref>]. Given a dataset <italic>D</italic>, each tree <italic>t</italic> is trained on a subset <inline-formula id="ieqn-21"><mml:math id="mml-ieqn-21"><mml:msub><mml:mi>D</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:math></inline-formula> sampled with replacement:
<disp-formula id="eqn-1"><label>(1)</label><mml:math id="mml-eqn-1" display="block"><mml:msub><mml:mi>D</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>&#x2223;</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>&#x223C;</mml:mo><mml:mi>D</mml:mi><mml:mo>,</mml:mo><mml:mtext>&#xA0;</mml:mtext><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:msub><mml:mi>D</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>D</mml:mi><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula></p>
<p>At each split node, CardioForest selects a random subset <italic>m</italic> of features (where <inline-formula id="ieqn-22"><mml:math id="mml-ieqn-22"><mml:mi>m</mml:mi><mml:mo>&#x226A;</mml:mo></mml:math></inline-formula> total features) to introduce tree diversity. The prediction for an input <italic>x</italic> is made via majority voting across all <italic>T</italic> trees:
<disp-formula id="eqn-2"><label>(2)</label><mml:math id="mml-eqn-2" display="block"><mml:mrow><mml:mover><mml:mi>y</mml:mi><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mtext>mode</mml:mtext></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>where <inline-formula id="ieqn-23"><mml:math id="mml-ieqn-23"><mml:msub><mml:mi>f</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> is the prediction of the <italic>t</italic>-th decision tree. Each split in a tree aims to minimize the Gini impurity [<xref ref-type="bibr" rid="ref-56">56</xref>]:
<disp-formula id="eqn-3"><label>(3)</label><mml:math id="mml-eqn-3" display="block"><mml:mi>G</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>N</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x2212;</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>K</mml:mi></mml:mrow></mml:munderover><mml:msubsup><mml:mi>p</mml:mi><mml:mi>k</mml:mi><mml:mn>2</mml:mn></mml:msubsup></mml:math></disp-formula>where <inline-formula id="ieqn-24"><mml:math id="mml-ieqn-24"><mml:msub><mml:mi>p</mml:mi><mml:mi>k</mml:mi></mml:msub></mml:math></inline-formula> is the proportion of samples belonging to a class <italic>k</italic> at node <italic>N</italic>. Here, <inline-formula id="ieqn-25"><mml:math id="mml-ieqn-25"><mml:mi>K</mml:mi><mml:mo>=</mml:mo><mml:mn>2</mml:mn></mml:math></inline-formula> for binary classification (WCT vs. non-WCT). CardioForest also incorporates cost-complexity pruning [<xref ref-type="bibr" rid="ref-57">57</xref>] with parameters <inline-formula id="ieqn-26"><mml:math id="mml-ieqn-26"><mml:mi>&#x03B1;</mml:mi></mml:math></inline-formula> to penalize overly complex trees, improving generalization:
<disp-formula id="eqn-4"><label>(4)</label><mml:math id="mml-eqn-4" display="block"><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>&#x03B1;</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>T</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi>R</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>T</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:mi>&#x03B1;</mml:mi><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mtext>leaves</mml:mtext></mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mi>T</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow></mml:math></disp-formula>where <italic>R</italic>(<italic>T</italic>) is the empirical risk (e.g., misclassification rate) of tree <italic>T</italic>. The complete CardioForest training process is detailed in Algorithm 2.</p>
<fig id="fig-27">
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-27.tif"/>
</fig>
</sec>
<sec id="s5_1_2">
<label>5.1.2</label>
<title>Comparison Models: GBM, XGBoost, LightGBM</title>
<p>In addition to CardioForest, we compared three gradient-boosting-based models:</p>
<p><italic>Gradient Boosting Machine (GBM) [<xref ref-type="bibr" rid="ref-58">58</xref>]</italic></p>
<p>GBM constructs an additive model:
<disp-formula id="eqn-5"><label>(5)</label><mml:math id="mml-eqn-5" display="block"><mml:msub><mml:mi>F</mml:mi><mml:mi>M</mml:mi></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>m</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>M</mml:mi></mml:mrow></mml:munderover><mml:msub><mml:mi>&#x03B3;</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:msub><mml:mi>h</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula>where <inline-formula id="ieqn-44"><mml:math id="mml-ieqn-44"><mml:msub><mml:mi>h</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> represents the weak learner at iteration <inline-formula id="ieqn-45"><mml:math id="mml-ieqn-45"><mml:mi>m</mml:mi></mml:math></inline-formula>, and <inline-formula id="ieqn-46"><mml:math id="mml-ieqn-46"><mml:msub><mml:mi>&#x03B3;</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:math></inline-formula> is its associated weight. Each <inline-formula id="ieqn-47"><mml:math id="mml-ieqn-47"><mml:msub><mml:mi>h</mml:mi><mml:mi>m</mml:mi></mml:msub></mml:math></inline-formula> approximates the negative gradient of the loss function <italic>L</italic>:
<disp-formula id="eqn-6"><label>(6)</label><mml:math id="mml-eqn-6" display="block"><mml:msub><mml:mi>h</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>&#x2248;</mml:mo><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi mathvariant="normal">&#x2207;</mml:mi><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mi>L</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>y</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula></p>
<p>The model is updated iteratively using a learning rate <inline-formula id="ieqn-48"><mml:math id="mml-ieqn-48"><mml:mi>&#x03D1;</mml:mi></mml:math></inline-formula>:
<disp-formula id="eqn-7"><label>(7)</label><mml:math id="mml-eqn-7" display="block"><mml:msub><mml:mi>F</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi>F</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:mi>&#x03D1;</mml:mi><mml:msub><mml:mi>&#x03B3;</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:msub><mml:mi>h</mml:mi><mml:mi>m</mml:mi></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula></p>
<p><italic>Extreme Gradient Boosting (XGBoost) [<xref ref-type="bibr" rid="ref-59">59</xref>]</italic></p>
<p>XGBoost refines GBM by incorporating regularization into the objective function:<disp-formula id="eqn-8"><label>(8)</label><mml:math id="mml-eqn-8" display="block"><mml:mrow><mml:mi>&#x02112;</mml:mi></mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x03B8;</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover><mml:mi>L</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mover><mml:mi>y</mml:mi><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mi>i</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:mo>&#x2211;</mml:mo><mml:msup><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>K</mml:mi></mml:mrow></mml:msup><mml:mi mathvariant="normal">&#x03A9;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula>where <inline-formula id="ieqn-49"><mml:math id="mml-ieqn-49"><mml:mi mathvariant="normal">&#x03A9;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi>&#x03B1;</mml:mi><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>&#x03C9;</mml:mi><mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mn>1</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>&#x03C9;</mml:mi><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:msup><mml:mn>2</mml:mn><mml:mn>2</mml:mn></mml:msup></mml:math></inline-formula> penalizes model complexity through <inline-formula id="ieqn-50"><mml:math id="mml-ieqn-50"><mml:msub><mml:mi>L</mml:mi><mml:mn>1</mml:mn></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-51"><mml:math id="mml-ieqn-51"><mml:msub><mml:mi>L</mml:mi><mml:mn>2</mml:mn></mml:msub></mml:math></inline-formula> norms. Optimization is performed using a second-order Taylor approximation:<disp-formula id="eqn-9"><label>(9)</label><mml:math id="mml-eqn-9" display="block"><mml:msup><mml:mrow><mml:mi>&#x02112;</mml:mi></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:msup><mml:mo>&#x2248;</mml:mo><mml:mo>&#x2211;</mml:mo><mml:msup><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:msup><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mi>g</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>f</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac><mml:msub><mml:mi>h</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msubsup><mml:mi>f</mml:mi><mml:mi>t</mml:mi><mml:mn>2</mml:mn></mml:msubsup><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>]</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mi mathvariant="normal">&#x03A9;</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula>where <inline-formula id="ieqn-52"><mml:math id="mml-ieqn-52"><mml:msub><mml:mi>g</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-53"><mml:math id="mml-ieqn-53"><mml:msub><mml:mi>h</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:math></inline-formula> are the first and second derivatives of the loss function with respect to <inline-formula id="ieqn-54"><mml:math id="mml-ieqn-54"><mml:msup><mml:mrow><mml:mover><mml:mi>y</mml:mi><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula>.</p>
<p><italic>Light Gradient Boosting Machine (LightGBM) [<xref ref-type="bibr" rid="ref-60">60</xref>]</italic></p>
<p>LightGBM accelerates XGBoost&#x2019;s design using two key strategies:
<list list-type="bullet">
<list-item>
<p><bold>Histogram-based Feature Binning</bold>: Discretizes continuous feature values to reduce memory and computation.</p></list-item>
<list-item>
<p><bold>Gradient-based One-Sided Sampling (GOSS)</bold>: Retains instances with large gradients and randomly samples small-gradient instances to speed up the training without significantly losing accuracy.</p></list-item>
</list></p>
</sec>
</sec>
<sec id="s5_2">
<label>5.2</label>
<title>Hyperparameter Tuning for Experimental Setup</title>
<p>To ensure optimal model generalization while preserving clinical relevance, a systematic hyperparameter tuning process [<xref ref-type="bibr" rid="ref-61">61</xref>] was employed across all classifiers (<xref ref-type="table" rid="table-4">Table 4</xref>). Each model underwent a comprehensive grid search procedure, constrained within physiologically plausible and empirically supported parameter ranges [<xref ref-type="bibr" rid="ref-62">62</xref>], as formalized in Algorithm 3.</p>
<table-wrap id="table-4">
<label>Table 4</label>
<caption>
<title>Hyperparameter specifications for machine learning models</title>
</caption>
<table>
<colgroup>
<col align="center"/>
<col align="center"/>
<col align="center"/>
</colgroup>
<thead>
<tr>
<th>Models</th>
<th>Parameters</th>
<th>Optimal values</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="11">CardioForest</td>
<td>n_estimators</td>
<td>1000</td>
</tr>
<tr>

<td>max_depth</td>
<td>20</td>
</tr>
<tr>

<td>min_samples_split</td>
<td>5</td>
</tr>
<tr>

<td>min_samples_leaf</td>
<td>2</td>
</tr>
<tr>

<td>max_features</td>
<td>0.6</td>
</tr>
<tr>

<td>class_weight</td>
<td>balanced</td>
</tr>
<tr>

<td>random_state</td>
<td>42</td>
</tr>
<tr>

<td>n_jobs</td>
<td>&#x2212;1</td>
</tr>
<tr>

<td>bootstrap</td>
<td>True</td>
</tr>
<tr>

<td>oob_score</td>
<td>True</td>
</tr>
<tr>

<td>ccp_alpha</td>
<td>0.01</td>
</tr>
<tr>
<td rowspan="9">XGBoost</td>
<td>n_estimators</td>
<td>10</td>
</tr>
<tr>

<td>max_depth</td>
<td>2</td>
</tr>
<tr>

<td>learning_rate</td>
<td>0.5</td>
</tr>
<tr>

<td>subsample</td>
<td>0.4</td>
</tr>
<tr>

<td>colsample_bytree</td>
<td>0.2</td>
</tr>
<tr>

<td>random_state</td>
<td>42</td>
</tr>
<tr>

<td>gamma</td>
<td>3</td>
</tr>
<tr>

<td>reg_alpha</td>
<td>2</td>
</tr>
<tr>

<td>reg_lambda</td>
<td>2</td>
</tr>
<tr>
<td rowspan="9">LightGBM</td>
<td>n_estimators</td>
<td>5</td>
</tr>
<tr>

<td>max_depth</td>
<td>1</td>
</tr>
<tr>

<td>learning_rate</td>
<td>0.8</td>
</tr>
<tr>

<td>subsample</td>
<td>0.3</td>
</tr>
<tr>

<td>colsample_bytree</td>
<td>0.1</td>
</tr>
<tr>

<td>random_state</td>
<td>42</td>
</tr>
<tr>

<td>min_child_samples</td>
<td>50</td>
</tr>
<tr>

<td>reg_alpha</td>
<td>3</td>
</tr>
<tr>

<td>reg_lambda</td>
<td>3</td>
</tr>
<tr>
<td rowspan="9">GradientBoosting</td>
<td>n_estimators</td>
<td>3</td>
</tr>
<tr>

<td>max_depth</td>
<td>2</td>
</tr>
<tr>

<td>learning_rate</td>
<td>0.4</td>
</tr>
<tr>

<td>subsample</td>
<td>0.5</td>
</tr>
<tr>

<td>min_samples_split</td>
<td>9</td>
</tr>
<tr>

<td>min_samples_leaf</td>
<td>10</td>
</tr>
<tr>

<td>max_features</td>
<td>0.3</td>
</tr>
<tr>

<td>validation_fraction</td>
<td>0.1</td>
</tr>
<tr>

<td>n_iter_no_change</td>
<td>2</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="fig-28">
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-28.tif"/>
</fig>
<p>For the proposed <bold>CardioForest</bold> model, key parameters were tuned to balance complexity and stability: 1,000 decision trees (<monospace>n_estimators &#x003D; 1000</monospace>) with a maximum depth of 20 (<monospace>max_depth &#x003D; 20</monospace>) were used to capture meaningful ECG patterns without overfitting. Splits required at least 5 samples (<monospace>min_samples_split &#x003D; 5</monospace>), and each leaf node required at least 2 samples (<monospace>min_samples_leaf &#x003D; 2</monospace>). A feature subset of 60% (<monospace>max_features &#x003D; 0.6</monospace>) was randomly selected at each split to promote tree diversity. Balanced class weights were used to address potential label imbalance, and out-of-bag (OOB) evaluation (<monospace>oob_score &#x003D; True</monospace>) enhanced model validation. A pruning penalty (<monospace>ccp_alpha &#x003D; 0.01</monospace>) was applied to simplify the final trees.</p>
<p><bold>XGBoost</bold>, a highly regularized shallow structure, was adopted: 10 estimators (<monospace>n_estimators &#x003D; 10</monospace>) with a maximum depth of 2 (<monospace>max_depth &#x003D; 2</monospace>) ensured rapid and cautious learning. A relatively high learning rate (<monospace>learning_rate &#x003D; 0.5</monospace>) expedited convergence, while strong regularization parameters (<inline-formula id="ieqn-75"><mml:math id="mml-ieqn-75"><mml:mi>&#x03B3;</mml:mi><mml:mo>=</mml:mo><mml:mn>3</mml:mn></mml:math></inline-formula>, <monospace>reg_alpha &#x003D; 2</monospace>, <monospace>reg_lambda &#x003D; 2</monospace>) minimized overfitting. Feature and instance subsampling ratios (<monospace>subsample &#x003D; 0.4</monospace>, <monospace>colsample_bytree &#x003D; 0.2</monospace>) further contributed to model robustness. <bold>LightGBM</bold> was configured with extreme minimalism: only 5 estimators (<monospace>n_estimators &#x003D; 5</monospace>) with a single-level depth (<monospace>max_depth &#x003D; 1</monospace>), using a high learning rate (<monospace>learning_rate &#x003D; 0.8</monospace>) for rapid adaptation. Regularization was reinforced (<monospace>reg_alpha &#x003D; 3</monospace>, <monospace>reg_lambda &#x003D; 3</monospace>), with a minimum of 50 samples per leaf (<monospace>min_child_samples &#x003D; 50</monospace>) to maintain generalization. Subsampling strategies (<monospace>subsample &#x003D; 0.3</monospace>, <monospace>colsample_bytree &#x003D; 0.1</monospace>) controlled variance during training. <bold>Gradient Boosting</bold>, a compact architecture was utilized: only 3 trees (<monospace>n_estimators &#x003D; 3</monospace>) with a maximum depth of 2 (<monospace>max_depth &#x003D; 2</monospace>). The learning rate was moderately high (<monospace>learning_rate &#x003D; 0.4</monospace>) to favor quick learning. A minimum of 9 samples was required to split internal nodes (<monospace>min_samples_split &#x003D; 9</monospace>), and at least 10 samples were mandated per leaf (<monospace>min_samples_leaf &#x003D; 10</monospace>), preserving robustness. Only 30% of features (<monospace>max_features &#x003D; 0.3</monospace>) were considered at each split. Subsampling (<monospace>subsample &#x003D; 0.5</monospace>) and early stopping after 2 rounds of no improvement (<monospace>n_iter_no_change &#x003D; 2</monospace>) were employed to further stabilize learning.</p>
<p>All hyperparameter tuning outlined in <xref ref-type="table" rid="table-4">Table 4</xref> was performed using stratified cross-validation, ensuring robust performance estimation under varying data partitions, and the resulting model performance is summarized in <xref ref-type="table" rid="table-5">Table 5</xref>. Fixed random seeds (<inline-formula id="ieqn-76"><mml:math id="mml-ieqn-76"><mml:mi>R</mml:mi><mml:mo>=</mml:mo><mml:mn>42</mml:mn></mml:math></inline-formula>) were used across all procedures to guarantee determinism and reproducibility. Optimal values were selected based on a weighted combination of performance metrics&#x2014;maximizing F1-score and ROC_AUC while minimizing root mean square error (RMSE)&#x2014;thereby ensuring diagnostic accuracy and error behavior consistency. The complete optimization procedure is detailed in Algorithm 3. The differing n_estimators values (CardioForest: 1000, XGBoost: 10, LightGBM: 5, GradientBoosting: 3) reflect fundamental algorithmic differences. Random Forests achieve stability through high tree count and variance reduction, requiring hundreds of estimators for optimal performance. Gradient boosting methods converge with fewer trees through sequential error correction, aggressive regularization (<inline-formula id="ieqn-77"><mml:math id="mml-ieqn-77"><mml:mi>&#x03B3;</mml:mi><mml:mo>=</mml:mo><mml:mn>3</mml:mn></mml:math></inline-formula>, reg_alpha &#x003D; 2&#x2013;3), and conservative learning rates (0.4&#x2013;0.8). Each configuration was independently optimized via stratified grid search (Algorithm 3), representing each model&#x2019;s optimal architecture for WCT detection rather than imposing uniform hyperparameters.</p>
<table-wrap id="table-5">
<label>Table 5</label>
<caption>
<title>Performance results across all 10 cross-validation folds demonstrating CardioForest&#x2019;s superior and consistent performance. CardioForest achieved a mean accuracy of 95.19% (<inline-formula id="ieqn-78"><mml:math id="mml-ieqn-78"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula>0.33%) with exceptional stability (CV &#x003D; 0.35%), substantially outperforming XGBoost (88.44%), LightGBM (84.33%), and GradientBoosting (92.49%)</title>
</caption>
<table>
<colgroup>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/> </colgroup>
<thead>
<tr>
<th>Model</th>
<th>Fold</th>
<th>Accuracy</th>
<th>Balanced Accuracy</th>
<th>Precision</th>
<th>Recall</th>
<th>F1</th>
<th>ROC_AUC</th>
<th>RMSE</th>
<th>MAE</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="10"><bold>CardioForest</bold></td>
<td>1</td>
<td>0.9484</td>
<td>0.8831</td>
<td>0.9511</td>
<td>0.7758</td>
<td>0.8546</td>
<td>0.8844</td>
<td>0.2513</td>
<td>0.1909</td>
</tr>
<tr>

<td>2</td>
<td>0.9520</td>
<td>0.8851</td>
<td>0.9509</td>
<td>0.7793</td>
<td>0.8566</td>
<td>0.8913</td>
<td>0.2468</td>
<td>0.1893</td>
</tr>
<tr>

<td>3</td>
<td>0.9472</td>
<td>0.8792</td>
<td>0.9350</td>
<td>0.7706</td>
<td>0.8449</td>
<td>0.8722</td>
<td>0.2544</td>
<td>0.1926</td>
</tr>
<tr>

<td>4</td>
<td>0.9558</td>
<td>0.8906</td>
<td>0.9493</td>
<td>0.7902</td>
<td>0.8625</td>
<td>0.8893</td>
<td>0.2426</td>
<td>0.1885</td>
</tr>
<tr>

<td>5</td>
<td>0.9478</td>
<td>0.8790</td>
<td>0.9468</td>
<td>0.7682</td>
<td>0.8482</td>
<td>0.8802</td>
<td>0.2522</td>
<td>0.1914</td>
</tr>
<tr>

<td>6</td>
<td>0.9516</td>
<td>0.8894</td>
<td>0.9399</td>
<td>0.7903</td>
<td>0.8586</td>
<td>0.8899</td>
<td>0.2490</td>
<td>0.1903</td>
</tr>
<tr>

<td>7</td>
<td>0.9568</td>
<td>0.9017</td>
<td>0.9545</td>
<td>0.8126</td>
<td>0.8778</td>
<td>0.9007</td>
<td>0.2415</td>
<td>0.1871</td>
</tr>
<tr>

<td>8</td>
<td>0.9450</td>
<td>0.8775</td>
<td>0.9369</td>
<td>0.7673</td>
<td>0.8437</td>
<td>0.8728</td>
<td>0.2567</td>
<td>0.1928</td>
</tr>
<tr>

<td>9</td>
<td>0.9544</td>
<td>0.8940</td>
<td>0.9474</td>
<td>0.7981</td>
<td>0.8664</td>
<td>0.9009</td>
<td>0.2447</td>
<td>0.1887</td>
</tr>
<tr>

<td>10</td>
<td>0.9532</td>
<td>0.8925</td>
<td>0.9481</td>
<td>0.7951</td>
<td>0.8649</td>
<td>0.8843</td>
<td>0.2462</td>
<td>0.1890</td>
</tr>
<tr>
<td rowspan="10"><bold>XGBoost</bold></td>
<td>1</td>
<td>0.8668</td>
<td>0.6847</td>
<td>0.8510</td>
<td>0.3859</td>
<td>0.5310</td>
<td>0.8429</td>
<td>0.3124</td>
<td>0.2069</td>
</tr>
<tr>

<td>2</td>
<td>0.8910</td>
<td>0.7215</td>
<td>0.9085</td>
<td>0.4533</td>
<td>0.6048</td>
<td>0.8576</td>
<td>0.2941</td>
<td>0.1942</td>
</tr>
<tr>

<td>3</td>
<td>0.8838</td>
<td>0.7105</td>
<td>0.8843</td>
<td>0.4341</td>
<td>0.5823</td>
<td>0.8389</td>
<td>0.3067</td>
<td>0.2054</td>
</tr>
<tr>

<td>4</td>
<td>0.8966</td>
<td>0.7340</td>
<td>0.8689</td>
<td>0.4835</td>
<td>0.6212</td>
<td>0.8537</td>
<td>0.2929</td>
<td>0.1988</td>
</tr>
<tr>

<td>5</td>
<td>0.8814</td>
<td>0.7102</td>
<td>0.8803</td>
<td>0.4341</td>
<td>0.5815</td>
<td>0.8416</td>
<td>0.3081</td>
<td>0.2029</td>
</tr>
<tr>

<td>6</td>
<td>0.8872</td>
<td>0.7233</td>
<td>0.8704</td>
<td>0.4624</td>
<td>0.6039</td>
<td>0.8552</td>
<td>0.3009</td>
<td>0.2009</td>
</tr>
<tr>

<td>7</td>
<td>0.8862</td>
<td>0.7189</td>
<td>0.9106</td>
<td>0.4482</td>
<td>0.6007</td>
<td>0.8691</td>
<td>0.2960</td>
<td>0.2000</td>
</tr>
<tr>

<td>8</td>
<td>0.8720</td>
<td>0.6966</td>
<td>0.8501</td>
<td>0.4105</td>
<td>0.5537</td>
<td>0.8389</td>
<td>0.3124</td>
<td>0.2078</td>
</tr>
<tr>

<td>9</td>
<td>0.8916</td>
<td>0.7257</td>
<td>0.9068</td>
<td>0.4622</td>
<td>0.6123</td>
<td>0.8661</td>
<td>0.2950</td>
<td>0.1979</td>
</tr>
<tr>

<td>10</td>
<td>0.8812</td>
<td>0.7096</td>
<td>0.8702</td>
<td>0.4342</td>
<td>0.5793</td>
<td>0.8473</td>
<td>0.3040</td>
<td>0.2032</td>
</tr>
<tr>
<td rowspan="10"><bold>LightGBM</bold></td>
<td>1</td>
<td>0.8372</td>
<td>0.6377</td>
<td>0.6840</td>
<td>0.3101</td>
<td>0.4268</td>
<td>0.7717</td>
<td>0.3515</td>
<td>0.2461</td>
</tr>
<tr>

<td>2</td>
<td>0.8504</td>
<td>0.6503</td>
<td>0.6946</td>
<td>0.3337</td>
<td>0.4508</td>
<td>0.7801</td>
<td>0.3399</td>
<td>0.2367</td>
</tr>
<tr>

<td>3</td>
<td>0.8334</td>
<td>0.6250</td>
<td>0.6121</td>
<td>0.2926</td>
<td>0.3959</td>
<td>0.7660</td>
<td>0.3530</td>
<td>0.2459</td>
</tr>
<tr>

<td>4</td>
<td>0.8522</td>
<td>0.6415</td>
<td>0.6651</td>
<td>0.3170</td>
<td>0.4293</td>
<td>0.7904</td>
<td>0.3355</td>
<td>0.2345</td>
</tr>
<tr>

<td>5</td>
<td>0.8390</td>
<td>0.6348</td>
<td>0.6651</td>
<td>0.3056</td>
<td>0.4188</td>
<td>0.7655</td>
<td>0.3501</td>
<td>0.2442</td>
</tr>
<tr>

<td>6</td>
<td>0.8464</td>
<td>0.6481</td>
<td>0.6776</td>
<td>0.3323</td>
<td>0.4459</td>
<td>0.7891</td>
<td>0.3418</td>
<td>0.2387</td>
</tr>
<tr>

<td>7</td>
<td>0.8420</td>
<td>0.6432</td>
<td>0.6837</td>
<td>0.3215</td>
<td>0.4373</td>
<td>0.7881</td>
<td>0.3449</td>
<td>0.2424</td>
</tr>
<tr>

<td>8</td>
<td>0.8390</td>
<td>0.6451</td>
<td>0.6709</td>
<td>0.3289</td>
<td>0.4414</td>
<td>0.7747</td>
<td>0.3497</td>
<td>0.2451</td>
</tr>
<tr>

<td>9</td>
<td>0.8524</td>
<td>0.6574</td>
<td>0.7061</td>
<td>0.3477</td>
<td>0.4660</td>
<td>0.8013</td>
<td>0.3355</td>
<td>0.2348</td>
</tr>
<tr>

<td>10</td>
<td>0.8408</td>
<td>0.6423</td>
<td>0.6573</td>
<td>0.3238</td>
<td>0.4339</td>
<td>0.7732</td>
<td>0.3473</td>
<td>0.2441</td>
</tr>
<tr>
<td rowspan="10"><bold>Gradient</bold><break/><bold>Boosting</bold></td>
<td>1</td>
<td>0.9300</td>
<td>0.8600</td>
<td>0.8782</td>
<td>0.7451</td>
<td>0.8062</td>
<td>0.8763</td>
<td>0.2757</td>
<td>0.1965</td>
</tr>
<tr>

<td>2</td>
<td>0.9520</td>
<td>0.8851</td>
<td>0.9509</td>
<td>0.7793</td>
<td>0.8566</td>
<td>0.8882</td>
<td>0.2379</td>
<td>0.1631</td>
</tr>
<tr>

<td>3</td>
<td>0.8802</td>
<td>0.6893</td>
<td>0.9349</td>
<td>0.3848</td>
<td>0.5452</td>
<td>0.8430</td>
<td>0.2952</td>
<td>0.2119</td>
</tr>
<tr>

<td>4</td>
<td>0.9486</td>
<td>0.8701</td>
<td>0.9467</td>
<td>0.7491</td>
<td>0.8364</td>
<td>0.8934</td>
<td>0.2423</td>
<td>0.1657</td>
</tr>
<tr>

<td>5</td>
<td>0.8486</td>
<td>0.6193</td>
<td>0.8404</td>
<td>0.2497</td>
<td>0.3851</td>
<td>0.7931</td>
<td>0.3436</td>
<td>0.2549</td>
</tr>
<tr>

<td>6</td>
<td>0.9514</td>
<td>0.8888</td>
<td>0.9398</td>
<td>0.7892</td>
<td>0.8580</td>
<td>0.8890</td>
<td>0.2484</td>
<td>0.1767</td>
</tr>
<tr>

<td>7</td>
<td>0.9568</td>
<td>0.9017</td>
<td>0.9545</td>
<td>0.8126</td>
<td>0.8778</td>
<td>0.8996</td>
<td>0.2250</td>
<td>0.1532</td>
</tr>
<tr>

<td>8</td>
<td>0.9450</td>
<td>0.8775</td>
<td>0.9369</td>
<td>0.7673</td>
<td>0.8437</td>
<td>0.8770</td>
<td>0.2346</td>
<td>0.1434</td>
</tr>
<tr>

<td>9</td>
<td>0.8862</td>
<td>0.7036</td>
<td>0.9364</td>
<td>0.4136</td>
<td>0.5738</td>
<td>0.8663</td>
<td>0.3054</td>
<td>0.2234</td>
</tr>
<tr>

<td>10</td>
<td>0.9532</td>
<td>0.8925</td>
<td>0.9481</td>
<td>0.7951</td>
<td>0.8649</td>
<td>0.8887</td>
<td>0.2280</td>
<td>0.1514</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s5_3">
<label>5.3</label>
<title>Class Imbalance Handling Strategy</title>
<p>The MIMIC-IV-ECG dataset exhibits a class imbalance ratio of 5.47:1, with 84.54% Normal rhythm cases and 15.46% WCT cases. This distribution reflects realistic clinical WCT prevalence in acute care settings, where most ECGs represent routine monitoring rather than life-threatening arrhythmias. To address this imbalance while preserving data integrity, we employed <italic>balanced class weighting</italic> as implemented through scikit-learn&#x2019;s <monospace>class_weight &#x003D; &#x2018;balanced&#x2019;</monospace> parameter. This approach automatically computes inverse frequency weights for each class:
<disp-formula id="eqn-10"><label>(10)</label><mml:math id="mml-eqn-10" display="block"><mml:mtable columnalign="right left right left right left right left right left right left" rowspacing="3pt" columnspacing="0em 2em 0em 2em 0em 2em 0em 2em 0em 2em 0em" displaystyle="true"><mml:mtr><mml:mtd /><mml:mtd><mml:msub><mml:mi>w</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mrow><mml:mtext>samples</mml:mtext></mml:mrow></mml:mrow></mml:msub><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mrow><mml:mtext>classes</mml:mtext></mml:mrow></mml:mrow></mml:msub><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:msub><mml:mrow><mml:mtext>samples</mml:mtext></mml:mrow><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>where <inline-formula id="ieqn-79"><mml:math id="mml-ieqn-79"><mml:msub><mml:mi>w</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:math></inline-formula> is the weight for class <inline-formula id="ieqn-80"><mml:math id="mml-ieqn-80"><mml:mi>j</mml:mi></mml:math></inline-formula>, <inline-formula id="ieqn-81"><mml:math id="mml-ieqn-81"><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mtext>samples</mml:mtext></mml:mrow></mml:msub></mml:math></inline-formula> is the total number of samples, <inline-formula id="ieqn-82"><mml:math id="mml-ieqn-82"><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mtext>classes</mml:mtext></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>2</mml:mn></mml:math></inline-formula> (binary classification), and <inline-formula id="ieqn-83"><mml:math id="mml-ieqn-83"><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:msub><mml:mtext>samples</mml:mtext><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:msub></mml:math></inline-formula> is the number of samples in class <inline-formula id="ieqn-84"><mml:math id="mml-ieqn-84"><mml:mi>j</mml:mi></mml:math></inline-formula>. For our dataset, this yielded weights of approximately 0.59 for Normal class and 3.23 for WCT class, effectively penalizing misclassification of minority class samples during training. We empirically validated this strategy through comparison with three alternatives: (1) no imbalance handling (baseline), (2) random oversampling of the minority class, and (3) SMOTE (Synthetic Minority Over-sampling Technique).
<list list-type="bullet">
<list-item>
<p>Balanced class weighting: Balanced Accuracy &#x003D; 0.8876 <inline-formula id="ieqn-85"><mml:math id="mml-ieqn-85"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0079, F1 &#x003D; 0.8602 <inline-formula id="ieqn-86"><mml:math id="mml-ieqn-86"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0100</p></list-item>
<list-item>
<p>Random oversampling: Balanced Accuracy &#x003D; 0.8012 <inline-formula id="ieqn-87"><mml:math id="mml-ieqn-87"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0112, F1 &#x003D; 0.7894 <inline-formula id="ieqn-88"><mml:math id="mml-ieqn-88"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0134</p></list-item>
<list-item>
<p>SMOTE: Balanced Accuracy &#x003D; 0.8245 <inline-formula id="ieqn-89"><mml:math id="mml-ieqn-89"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0098, F1 &#x003D; 0.8123 <inline-formula id="ieqn-90"><mml:math id="mml-ieqn-90"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0121</p></list-item>
<list-item>
<p>No handling (baseline): Balanced Accuracy &#x003D; 0.7234 <inline-formula id="ieqn-91"><mml:math id="mml-ieqn-91"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0156, F1 &#x003D; 0.6847 <inline-formula id="ieqn-92"><mml:math id="mml-ieqn-92"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0189</p></list-item>
</list></p>
<p>Class weighting was selected over resampling techniques because it: (1) preserves the original temporal structure of ECG sequences without introducing synthetic artifacts, (2) avoids potential overfitting from duplicated or generated samples, (3) maintains the true class distribution for calibrated probability estimates, and (4) requires no additional preprocessing or data augmentation.</p>
</sec>
<sec id="s5_4">
<label>5.4</label>
<title>Threshold Optimization for Clinical Deployment</title>
<p>While CardioForest outputs continuous probability estimates for WCT presence, clinical deployment requires discrete classification decisions based on probability thresholds. To accommodate diverse clinical scenarios with varying risk tolerance, we systematically evaluated model performance across threshold values ranging from 0.1 to 0.9 in increments of 0.05. For each threshold <inline-formula id="ieqn-93"><mml:math id="mml-ieqn-93"><mml:mi>&#x03C4;</mml:mi></mml:math></inline-formula>, we computed sensitivity (recall), specificity, positive predictive value (PPV), negative predictive value (NPV), F1-score, and balanced accuracy on held-out test data. Analysis revealed three clinically relevant operating points (<xref ref-type="table" rid="table-6">Table 6</xref>):</p>
<table-wrap id="table-6">
<label>Table 6</label>
<caption>
<title>Threshold operating points for clinical deployment scenarios</title>
</caption>
<table>
<colgroup>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/> </colgroup>
<thead>
<tr>
<th>Operating point</th>
<th>Threshold</th>
<th>Sens</th>
<th>Spec</th>
<th>PPV</th>
<th>NPV</th>
<th>F1</th>
<th>Clinical use</th>
</tr>
</thead>
<tbody>
<tr>
<td>High sensitivity</td>
<td>0.35</td>
<td>0.891</td>
<td>0.892</td>
<td>0.678</td>
<td>0.972</td>
<td>0.847</td>
<td>Emergency triage</td>
</tr>
<tr>
<td>Balanced (Default)</td>
<td>0.50</td>
<td>0.784</td>
<td>0.952</td>
<td>0.863</td>
<td>0.935</td>
<td>0.860</td>
<td>Standard screening</td>
</tr>
<tr>
<td>High specificity</td>
<td>0.65</td>
<td>0.672</td>
<td>0.978</td>
<td>0.934</td>
<td>0.912</td>
<td>0.812</td>
<td>Pre-intervention</td>
</tr>
</tbody>
</table>
</table-wrap>
<p><bold>High Sensitivity Mode</bold> (<inline-formula id="ieqn-94"><mml:math id="mml-ieqn-94"><mml:mi>&#x03C4;</mml:mi><mml:mo>=</mml:mo><mml:mn>0.35</mml:mn></mml:math></inline-formula>): Optimized for emergency department triage where missing a true WCT case could be fatal. The elevated sensitivity (89.1%) ensures that nearly all WCT patients are flagged for immediate cardiology evaluation, accepting a higher false positive rate (10.8%) that results in additional expert reviews. The NPV of 97.2% provides strong confidence that patients classified as Normal truly have no WCT. <bold>Balanced Mode</bold> (<inline-formula id="ieqn-95"><mml:math id="mml-ieqn-95"><mml:mi>&#x03C4;</mml:mi><mml:mo>=</mml:mo><mml:mn>0.50</mml:mn></mml:math></inline-formula>, Default): Maximizes F1-score (0.860) and provides optimal balance between sensitivity (78.4%) and specificity (95.2%) for routine screening. This threshold is recommended for general emergency department deployment where both false positives and false negatives carry clinical consequences. The high PPV (86.3%) ensures that most positive predictions represent true WCT cases, minimizing unnecessary interventions. <bold>High Specificity Mode</bold> (<inline-formula id="ieqn-96"><mml:math id="mml-ieqn-96"><mml:mi>&#x03C4;</mml:mi><mml:mo>=</mml:mo><mml:mn>0.65</mml:mn></mml:math></inline-formula>): Designed for confirmatory testing before irreversible interventions (e.g., cardioversion, antiarrhythmic medication). The specificity of 97.8% and PPV of 93.4% minimize false alarms, though at the cost of reduced sensitivity (67.2%) that may miss some true WCT cases. This threshold is appropriate when AI predictions will directly trigger treatments without mandatory cardiologist review.</p>
</sec>
<sec id="s5_5">
<label>5.5</label>
<title>Performance Metrics Overview</title>
<p>The performance evaluation (<xref ref-type="table" rid="table-5">Table 5</xref>) of various models through 10 CV [<xref ref-type="bibr" rid="ref-63">63</xref>,<xref ref-type="bibr" rid="ref-64">64</xref>] revealed that all the classifiers performed well, four machine learning models&#x2014;<bold>CardioForest</bold>, <bold>XGBoost</bold>, <bold>LightGBM</bold>, and <bold>Gradient Boosting</bold>&#x2014;were compared, but CardioForest stood out as the most reliable and consistent for WCT detection. Several metrics were recorded: Accuracy, Balanced Accuracy, Precision, Recall, F1-Score, ROC_AUC [<xref ref-type="bibr" rid="ref-65">65</xref>], Root Mean Squared Error (RMSE), and Mean Absolute Error (MAE). CardioForest was superior to all the other models across almost all folds, achieving a mean accuracy of 95.19% (<inline-formula id="ieqn-97"><mml:math id="mml-ieqn-97"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula>0.33%), a high balanced accuracy [<xref ref-type="bibr" rid="ref-66">66</xref>] of 88.76% (<inline-formula id="ieqn-98"><mml:math id="mml-ieqn-98"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula>0.79%), an excellent precision of 95.26% (<inline-formula id="ieqn-99"><mml:math id="mml-ieqn-99"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula>0.56%), and a good recall of 78.42% (<inline-formula id="ieqn-100"><mml:math id="mml-ieqn-100"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula>1.57%). Its ROC-AUC scores were highly significant at 0.8886 (<inline-formula id="ieqn-101"><mml:math id="mml-ieqn-101"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula>0.0096), indicating excellent classification ability, and its RMSE (0.2532) and MAE (0.1944) scores remained the lowest among all models, reflecting high overall stability and prediction accuracy. Conversely, XGBoost performed fairly well but with a clear deterioration compared to CardioForest. Average accuracy ranged from 88%&#x2013;89%, whereas balanced accuracy ranged from 0.71 to 0.73. Precision remained strong (approximately 0.87&#x2013;0.91), although recall values were significantly lower (<inline-formula id="ieqn-102"><mml:math id="mml-ieqn-102"><mml:mo>&#x223C;</mml:mo></mml:math></inline-formula>0.43&#x2013;0.48), demonstrating that the model performed worse at identifying positive cases. Values for RMSE and MAE were higher, indicating higher prediction errors. LightGBM did the worst of all. It had accuracy scores of 83%&#x2013;85%, with balanced accuracy below 0.66 on average. Precision and recall were lower compared to the rest of the models, which resulted in lower F1-scores and lower ROC_AUC scores. RMSE and MAE were also highest across all models, indicating that LightGBM&#x2019;s predictive ability on this data was weaker. Gradient Boosting performed both well and poorly. It possessed some of the highest accuracy levels (up to 95.6%) in some folds but also showed instability, particularly in folds 3, 5, and 9, where its performance became extremely poor. Its precision and recall values jumped widely between folds, affecting global stability. Still, Gradient Boosting maintained high ROC_AUC scores, showing a perfect trade-off between sensitivity and specificity when performance was consistent. Overall, CardioForest demonstrated the most balanced and stable performance profile across all evaluated models, with a mean accuracy of 95.19%, an F1-score of 86.02%, and an ROC-AUC of 0.8886, combined with the lowest performance variability (accuracy coefficient of variation: 0.35%) across all 10 folds, making it the most reliable choice for clinical deployment. The prediction procedure with integrated explainability is formalized in Algorithm 4.</p>

<fig id="fig-29">
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-29.tif"/>
</fig>
</sec>
<sec id="s5_6">
<label>5.6</label>
<title>Statistical Significance Analysis</title>
<p>To rigorously validate CardioForest&#x2019;s performance superiority, we conducted comprehensive pairwise statistical comparisons using paired <italic>t</italic>-tests and Wilcoxon signed-rank tests on the 10-fold cross-validation results. Additionally, we calculated Cohen&#x2019;s <italic>d</italic> effect sizes to quantify the practical magnitude of performance differences beyond mere statistical significance. Effect size interpretation followed conventional thresholds: |<italic>d</italic>| &#x003C; 0.2 (negligible), 0.2 <inline-formula id="ieqn-121"><mml:math id="mml-ieqn-121"><mml:mo>&#x2264;</mml:mo></mml:math></inline-formula> |<italic>d</italic>| &#x003C; 0.5 (small), 0.5 <inline-formula id="ieqn-122"><mml:math id="mml-ieqn-122"><mml:mo>&#x2264;</mml:mo></mml:math></inline-formula> |<italic>d</italic>| &#x003C; 0.8 (medium), |<italic>d</italic>| <inline-formula id="ieqn-123"><mml:math id="mml-ieqn-123"><mml:mo>&#x2265;</mml:mo></mml:math></inline-formula> 0.8 (large).</p>
<p><bold>CardioForest vs. XGBoost:</bold> CardioForest demonstrated statistically significant and practically meaningful improvements across all metrics. Accuracy improved by 6.75 percentage points (95.19% vs. 88.44%, paired <italic>t</italic>-test: <italic>t</italic> &#x003D; 52.98, <italic>p</italic> &#x003C; 0.001, Cohen&#x2019;s <italic>d</italic> &#x003D; 2.34, large effect). F1-score showed even more dramatic improvement: &#x002B;26.93 percentage points (86.02% vs. 59.09%, <italic>t</italic> &#x003D; 61.02, <italic>p</italic> &#x003C; 0.001, <italic>d</italic> &#x003D; 3.45, large effect). ROC-AUC improved by 3.00 percentage points (0.8886 vs. 0.8586, <italic>t</italic> &#x003D; 13.15, <italic>p</italic> &#x003C; 0.001, <italic>d</italic> &#x003D; 1.87, large effect). Wilcoxon signed-rank tests confirmed these findings (all <italic>p</italic> &#x003D; 0.002), validating robustness to non-normal distributions.</p>
<p><bold>CardioForest vs. LightGBM:</bold> Performance advantages were even more pronounced. Accuracy improved by 10.85 percentage points (95.19% vs. 84.33%, <italic>t</italic> &#x003D; 66.23, <italic>p</italic> &#x003C; 0.001, <italic>d</italic> &#x003D; 3.67, large effect). F1-score improvement reached 42.09 percentage points (86.02% vs. 43.93%, <italic>t</italic> &#x003D; 77.03, <italic>p</italic> &#x003C; 0.001, <italic>d</italic> &#x003D; 4.89, large effect). ROC-AUC showed 10.63 percentage points improvement (0.8886 vs. 0.7823, <italic>t</italic> &#x003D; 38.40, <italic>p</italic> &#x003C; 0.001, <italic>d</italic> &#x003D; 4.12, large effect). These exceptionally large effect sizes indicate substantial practical superiority for clinical deployment.</p>
<p><bold>CardioForest vs. Gradient Boosting:</bold> While Gradient Boosting achieved competitive performance in some folds, CardioForest maintained statistically significant advantages with medium effect sizes. Accuracy improved by 2.69 percentage points (95.19% vs. 92.49%, <italic>t</italic> &#x003D; 3.13, <italic>p</italic> &#x003D; 0.012, <italic>d</italic> &#x003D; 0.89, medium effect). F1-score improved by 10.57 percentage points (86.02% vs. 75.45%, <italic>t</italic> &#x003D; 3.02, <italic>p</italic> &#x003D; 0.015, <italic>d</italic> &#x003D; 0.95, medium effect). ROC-AUC difference (0.8886 vs. 0.8768, &#x002B;1.18 percentage points) was not statistically significant (<italic>t</italic> &#x003D; 1.73, <italic>p</italic> &#x003D; 0.118, <italic>d</italic> &#x003D; 0.49, small effect, Wilcoxon <italic>p</italic> &#x003D; 0.275). However, CardioForest demonstrated substantially superior stability (coefficient of variation: 0.35% vs. 3.05% for Gradient Boosting), indicating more reliable performance across diverse patient populations&#x2014;a critical attribute for clinical deployment where consistent behavior is paramount.</p>
</sec>
<sec id="s5_7">
<label>5.7</label>
<title>Error Analysis and Model Precision</title>
<p><xref ref-type="fig" rid="fig-8">Fig. 8</xref> and <xref ref-type="table" rid="table-7">Table 7</xref> present an evaluation of error metrics, which offer a more meaningful interpretation of the performance behavior of the various models. Among all models compared, CardioForest had the lowest RMSE of 0.2532, outperforming XGBoost (0.3003), LightGBM (0.3471), and GradientBoosting (0.2637). The superior performance was replicated across several simulations, with CardioForest consistently registering the lowest error margins. Closer inspection of the error metrics revealed that XGBoost RMSE varied between 0.300 and 0.312, while CardioForest errors were all less than 0.3 for all simulations. GradientBoosting had the widest error extremes, where RMSE went up to 0.2637 for one simulation. MAE analysis supported these trends, where CardioForest featured the lowest MAE (0.1944), followed by GradientBoosting (0.1910), XGBoost (0.2008), and LightGBM (0.2424).</p>
<fig id="fig-8">
<label>Figure 8</label>
<caption>
<title>Error metric evaluation reveals that CardioForest consistently achieved the lowest maximum RMSE (0.2532), outperforming XGBoost and LightGBM across all simulations</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-8.tif"/>
</fig><table-wrap id="table-7">
<label>Table 7</label>
<caption>
<title>Model error metrics comparison</title>
</caption>
<table>
<colgroup>
<col align="center"/>
<col align="center"/>
<col align="center"/> </colgroup>
<thead>
<tr>
<th>Model</th>
<th>RMSE</th>
<th>MAE</th>
</tr>
</thead>
<tbody>
<tr>
<td>CardioForest</td>
<td>0.2532</td>
<td>0.1944</td>
</tr>
<tr>
<td>XGBoost</td>
<td>0.3003</td>
<td>0.2008</td>
</tr>
<tr>
<td>LightGBM</td>
<td>0.3471</td>
<td>0.2424</td>
</tr>
<tr>
<td>GradientBoosting</td>
<td>0.2637</td>
<td>0.1910</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s5_8">
<label>5.8</label>
<title>Consistency and Model Fit</title>
<p>Radar plot analysis (<xref ref-type="fig" rid="fig-9">Fig. 9</xref>) highlighted substantial differences in model fitting and performance stability. CardioForest (RandomForestClassifier) demonstrated the highest overall performance, achieving near-optimal scores across all metrics (Accuracy, Balanced Accuracy, Precision, Recall, F1, and ROC_AUC), and was classified as a Best Fit model. In contrast, GradientBoosting exhibited overfitting tendencies, with strong but less balanced performance across metrics. Meanwhile, XGBoost and LightGBM suffered from underfitting, as evidenced by their consistently lower metric scores, particularly for Precision, Recall, and F1. Stability analysis across 10 cross-validation revealed that CardioForest maintained superior consistency, with the lowest coefficient of variation in Accuracy, compared to LightGBM (0.89%), GradientBoosting (1.71%), and XGBoost (2.31%).</p>
<fig id="fig-9">
<label>Figure 9</label>
<caption>
<title>This figure illustrates the performance stability of the models, highlighting model-fitting illustration</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-9.tif"/>
</fig>
</sec>
<sec id="s5_9">
<label>5.9</label>
<title>Statistical Significance Analysis</title>
<p>To rigorously validate CardioForest&#x2019;s superiority, we conducted comprehensive pairwise statistical comparisons using paired <italic>t</italic>-tests and Wilcoxon signed-rank tests across 10-fold cross-validation results. <xref ref-type="table" rid="table-8">Table 8</xref> presents detailed statistical test results, while <xref ref-type="table" rid="table-9">Table 9</xref> summarizes overall performance with confidence intervals.</p>
<table-wrap id="table-8">
<label>Table 8</label>
<caption>
<title>Statistical significance tests: pairwise model comparisons</title>
</caption>
<table>
<colgroup>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/> </colgroup>
<thead>
<tr>
<th>Metric</th>
<th>Model 1</th>
<th>Model 2</th>
<th>Mean Diff</th>
<th>t-statistic</th>
<th><italic>p</italic>-value</th>
<th>Wilcoxon p</th>
<th>Significant</th>
</tr>
</thead>
<tbody>
<tr>
<td>Accuracy</td>
<td>CardioForest</td>
<td>XGBoost</td>
<td>0.0675</td>
<td>52.98</td>
<td>&#x003C;0.001&#x002A;&#x002A;&#x002A;</td>
<td>0.002</td>
<td>Yes</td>
</tr>
<tr>
<td>Accuracy</td>
<td>CardioForest</td>
<td>LightGBM</td>
<td>0.1085</td>
<td>66.23</td>
<td>&#x003C;0.001&#x002A;&#x002A;&#x002A;</td>
<td>0.002</td>
<td>Yes</td>
</tr>
<tr>
<td>Accuracy</td>
<td>CardioForest</td>
<td>GradientBoosting</td>
<td>0.0269</td>
<td>3.13</td>
<td>0.012&#x002A;</td>
<td>0.008</td>
<td>Yes</td>
</tr>
<tr>
<td>F1 Score</td>
<td>CardioForest</td>
<td>XGBoost</td>
<td>0.2693</td>
<td>61.02</td>
<td>&#x003C;0.001&#x002A;&#x002A;&#x002A;</td>
<td>0.002</td>
<td>Yes</td>
</tr>
<tr>
<td>F1 Score</td>
<td>CardioForest</td>
<td>LightGBM</td>
<td>0.4209</td>
<td>77.03</td>
<td>&#x003C;0.001&#x002A;&#x002A;&#x002A;</td>
<td>0.002</td>
<td>Yes</td>
</tr>
<tr>
<td>F1 Score</td>
<td>CardioForest</td>
<td>GradientBoosting</td>
<td>0.1057</td>
<td>3.02</td>
<td>0.015&#x002A;</td>
<td>0.008</td>
<td>Yes</td>
</tr>
<tr>
<td>ROC-AUC</td>
<td>CardioForest</td>
<td>XGBoost</td>
<td>0.0300</td>
<td>13.15</td>
<td>&#x003C;0.001&#x002A;&#x002A;&#x002A;</td>
<td>0.002</td>
<td>Yes</td>
</tr>
<tr>
<td>ROC-AUC</td>
<td>CardioForest</td>
<td>LightGBM</td>
<td>0.1063</td>
<td>38.40</td>
<td>&#x003C;0.001&#x002A;&#x002A;&#x002A;</td>
<td>0.002</td>
<td>Yes</td>
</tr>
<tr>
<td>ROC-AUC</td>
<td>CardioForest</td>
<td>GradientBoosting</td>
<td>0.0118</td>
<td>1.73</td>
<td>0.118</td>
<td>0.275</td>
<td>No</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="table-8fn1" fn-type="other">
<p>Note: &#x002A;&#x002A;&#x002A;<inline-formula id="ieqn-124"><mml:math id="mml-ieqn-124"><mml:mi>p</mml:mi><mml:mo>&#x003C;</mml:mo><mml:mn>0.001</mml:mn></mml:math></inline-formula>, &#x002A;<inline-formula id="ieqn-125"><mml:math id="mml-ieqn-125"><mml:mi>p</mml:mi><mml:mo>&#x003C;</mml:mo><mml:mn>0.05</mml:mn></mml:math></inline-formula>. All tests are two-tailed paired comparisons.</p>
</fn>
</table-wrap-foot>
</table-wrap><table-wrap id="table-9">
<label>Table 9</label>
<caption>
<title>Model stability analysis across 10-fold cross-validation</title>
</caption>
<table>
<colgroup>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/> </colgroup>
<thead>
<tr>
<th>Metric</th>
<th>CardioForest</th>
<th>XGBoost</th>
<th>LightGBM</th>
<th>GradientBoosting</th>
</tr>
</thead>
<tbody>
<tr>
<td align="center" colspan="5"><italic>Accuracy</italic></td>
</tr>
<tr>
<td>Mean <inline-formula id="ieqn-126"><mml:math id="mml-ieqn-126"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> SD</td>
<td>0.9519 <inline-formula id="ieqn-127"><mml:math id="mml-ieqn-127"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0033</td>
<td>0.8844 <inline-formula id="ieqn-128"><mml:math id="mml-ieqn-128"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0053</td>
<td>0.8433 <inline-formula id="ieqn-129"><mml:math id="mml-ieqn-129"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0059</td>
<td>0.9249 <inline-formula id="ieqn-130"><mml:math id="mml-ieqn-130"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0280</td>
</tr>
<tr>
<td>CV (%)</td>
<td>0.35</td>
<td>0.60</td>
<td>0.70</td>
<td>3.03</td>
</tr>
<tr>
<td>Min&#x2013;Max</td>
<td>0.9474&#x2013;0.9588</td>
<td>0.8754&#x2013;0.8958</td>
<td>0.8360&#x2013;0.8544</td>
<td>0.8900&#x2013;0.9588</td>
</tr>
<tr>
<td>Range</td>
<td>0.0114</td>
<td>0.0204</td>
<td>0.0184</td>
<td>0.0688</td>
</tr>
<tr>
<td align="center" colspan="5"><italic>F1 Score</italic></td>
</tr>
<tr>
<td>Mean <inline-formula id="ieqn-131"><mml:math id="mml-ieqn-131"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> SD</td>
<td>0.8602 <inline-formula id="ieqn-132"><mml:math id="mml-ieqn-132"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0100</td>
<td>0.5909 <inline-formula id="ieqn-133"><mml:math id="mml-ieqn-133"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0168</td>
<td>0.4393 <inline-formula id="ieqn-134"><mml:math id="mml-ieqn-134"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0174</td>
<td>0.7545 <inline-formula id="ieqn-135"><mml:math id="mml-ieqn-135"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.1134</td>
</tr>
<tr>
<td>CV (%)</td>
<td>1.16</td>
<td>2.84</td>
<td>3.96</td>
<td>15.03</td>
</tr>
<tr>
<td align="center" colspan="5"><italic>ROC-AUC</italic></td>
</tr>
<tr>
<td>Mean <inline-formula id="ieqn-136"><mml:math id="mml-ieqn-136"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> SD</td>
<td>0.8886 <inline-formula id="ieqn-137"><mml:math id="mml-ieqn-137"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0096</td>
<td>0.8586 <inline-formula id="ieqn-138"><mml:math id="mml-ieqn-138"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0079</td>
<td>0.7823 <inline-formula id="ieqn-139"><mml:math id="mml-ieqn-139"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0100</td>
<td>0.8768 <inline-formula id="ieqn-140"><mml:math id="mml-ieqn-140"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> 0.0239</td>
</tr>
<tr>
<td>CV (%)</td>
<td>1.08</td>
<td>0.92</td>
<td>1.28</td>
<td>2.73</td>
</tr>
</tbody>
</table>
</table-wrap>
 
<p>CardioForest significantly outperformed XGBoost across all metrics (accuracy: &#x002B;6.75%, F1: &#x002B;26.93%, ROC-AUC: &#x002B;3.00%, all <italic>p</italic> &#x003C; 0.001), demonstrating substantial clinical advantage. Compared to LightGBM, improvements were even more pronounced (accuracy: &#x002B;10.85%, F1: &#x002B;42.09%, ROC-AUC: &#x002B;10.63%, all <italic>p</italic> &#x003C; 0.001). While GradientBoosting showed competitive ROC-AUC performance (<italic>p</italic> &#x003D; 0.118), CardioForest maintained significantly superior accuracy (<italic>p</italic> &#x003D; 0.012) and F1-score (<italic>p</italic> &#x003D; 0.015) with greater stability across folds (coefficient of variation: 0.35% vs. 3.05%).</p>
</sec>
<sec id="s5_10">
<label>5.10</label>
<title>Enhanced Explainability Analysis</title>
<p>Beyond aggregate feature importance rankings, we provide comprehensive SHAP (SHapley Additive exPlanations) [<xref ref-type="bibr" rid="ref-67">67</xref>] visualizations to fully explain CardioForest&#x2019;s decision-making process at both population and individual levels, enhancing clinical trust and interpretability. Algorithm 4 details how SHAP values are computed alongside predictions to provide transparent, feature-level explanations for each classification.</p>
<sec id="s5_10_1">
<label>5.10.1</label>
<title>Population-Level Feature Importance</title>
<p><xref ref-type="fig" rid="fig-10">Fig. 10</xref> presents two complementary views of feature importance across the entire test set. The bar plot (top panel) ranks features by mean absolute SHAP value, quantifying each feature&#x2019;s average impact on model predictions. QRS duration dominates with a mean SHAP value of 0.45, approximately 4.5 times larger than the second-ranked feature (qrs_end: 0.10), confirming its overwhelming importance in WCT detection&#x2014;perfectly aligned with clinical diagnostic criteria where QRS duration &#x003E;120 ms is the primary WCT indicator. The beeswarm plot (bottom panel) of <xref ref-type="fig" rid="fig-10">Fig. 10</xref> reveals how individual feature values influence predictions. Each dot represents one patient, with horizontal position indicating SHAP value (impact on prediction) and color representing feature value (blue &#x003D; low, red &#x003D; high). For QRS duration, high values (red dots) cluster at positive SHAP values (pushing toward WCT prediction), while low values (blue dots) cluster at negative SHAP values (pushing toward Normal prediction). This clear separation demonstrates the model&#x2019;s learned threshold behavior consistent with the clinical 120 ms cutoff.</p>
<fig id="fig-10">
<label>Figure 10</label>
<caption>
<title>SHAP feature importance summary. (<bold>Top</bold>) Mean absolute SHAP values ranking features by overall impact, with QRS duration demonstrating dominant predictive power (mean&#x2014;SHAP&#x2014; &#x003D; 0.45). (<bold>Bottom</bold>) SHAP value distribution showing how feature values (color: blue &#x003D; low, red &#x003D; high) affect predictions. High QRS duration values (red) strongly push toward WCT prediction (positive SHAP), while low values (blue) push toward Normal (negative SHAP), validating clinical intuition</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-10.tif"/>
</fig>
</sec>
<sec id="s5_10_2">
<label>5.10.2</label>
<title>Individual Prediction Explanation</title>
<p>To demonstrate CardioForest&#x2019;s transparency at the individual patient level, <xref ref-type="fig" rid="fig-11">Fig. 11</xref> presents a waterfall plot for a high-confidence WCT case (prediction probability: 100%). The plot starts from the base value (0.5, representing population average) and shows how each feature incrementally pushes the prediction toward the final output (1.0 &#x003D; definite WCT).</p>
<fig id="fig-11">
<label>Figure 11</label>
<caption>
<title>SHAP waterfall plot for an individual WCT case with 100% prediction confidence. Starting from the base value <inline-formula id="ieqn-141"><mml:math id="mml-ieqn-141"><mml:mi>E</mml:mi><mml:mo stretchy="false">[</mml:mo><mml:mi>f</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>X</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">]</mml:mo><mml:mo>=</mml:mo><mml:mn>0.5</mml:mn></mml:math></inline-formula>, each feature contributes additively to reach the final prediction <inline-formula id="ieqn-142"><mml:math id="mml-ieqn-142"><mml:mi>f</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>1.0</mml:mn></mml:math></inline-formula>. QRS duration (154 ms) provides the largest positive contribution (&#x002B;0.43), followed by qrs_end (&#x002B;0.04), qrs_axis (&#x002B;0.02), and qrs_onset (&#x002B;0.01). Minor contributions from other features sum to approximately &#x002B;0.50 total shift. This visualization enables clinicians to understand exactly why CardioForest classified this case as WCT, fostering trust through transparency</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-11.tif"/>
</fig>
<p>For this patient, QRS duration (154 ms&#x2014;substantially exceeding the 120 ms threshold) contributes &#x002B;0.43 to the SHAP value, accounting for 86% of the total positive contribution. Additional positive contributions from qrs_end (&#x002B;0.04), qrs_axis (&#x002B;0.02), and qrs_onset (&#x002B;0.01) provide confirmatory evidence. Notably, p_end shows a slight negative contribution (&#x2212;0.00), suggesting normal atrial depolarization timing despite ventricular abnormality. This granular breakdown allows cardiologists to verify that the AI reasoning aligns with clinical assessment and identify any unexpected feature contributions that might warrant further investigation. The complete prediction and explanation generation process is formalized in Algorithm 4, ensuring transparency at every decision point.</p>
</sec>
<sec id="s5_10_3">
<label>5.10.3</label>
<title>Feature Interaction and Dependency Analysis</title>
<p><xref ref-type="fig" rid="fig-12">Fig. 12</xref> presents SHAP dependence plots for the top four features, revealing how feature values relate to prediction impact while highlighting interaction effects with other features (indicated by color).</p>
<fig id="fig-12">
<label>Figure 12</label>
<caption>
<title>SHAP dependence plots for the top four features showing value-impact relationships and interaction effects. (<bold>Top-left</bold>) QRS duration exhibits clear threshold behavior around 120 ms (vertical concentration of points), with values &#x003E;150 ms consistently producing high positive SHAP values (strong WCT prediction). Color indicates qrs_end interaction. (<bold>Top-right</bold>) qrs_end shows positive correlation with WCT prediction, with interaction from qrs_axis (color). (<bold>Bottom-left</bold>) qrs_axis demonstrates complex non-linear patterns, with extreme values (both positive and negative) associated with WCT prediction. (<bold>Bottom-right</bold>) qrs_onset shows modest positive correlation. These plots reveal nuanced feature interactions beyond simple univariate thresholds</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-12.tif"/>
</fig>
<p>The QRS duration dependence plot (top-left) shows a dramatic inflection around 120 ms, where SHAP values transition from consistently negative (Normal prediction) to increasingly positive (WCT prediction). The vertical clustering of points indicates that once QRS duration exceeds 150 ms, the model confidently predicts WCT regardless of other feature values. However, for borderline durations (100&#x2013;130 ms), the color gradient reveals that qrs_end timing modulates predictions&#x2014;cases with prolonged qrs_end (red) receive higher WCT probability even with borderline QRS duration. The qrs_axis plot (bottom-left) reveals more complex non-linear relationships, with both extreme leftward (&#x2212;<inline-formula id="ieqn-143"><mml:math id="mml-ieqn-143"><mml:msup><mml:mi>90</mml:mi><mml:mrow><mml:mo>&#x2218;</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula> to &#x2212;<inline-formula id="ieqn-144"><mml:math id="mml-ieqn-144"><mml:msup><mml:mi>30</mml:mi><mml:mrow><mml:mo>&#x2218;</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula>) and rightward (&#x002B;<inline-formula id="ieqn-145"><mml:math id="mml-ieqn-145"><mml:msup><mml:mi>120</mml:mi><mml:mrow><mml:mo>&#x2218;</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula> to &#x002B;<inline-formula id="ieqn-146"><mml:math id="mml-ieqn-146"><mml:msup><mml:mi>180</mml:mi><mml:mrow><mml:mo>&#x2218;</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula>) axis deviations associated with increased WCT likelihood.</p>
</sec>
<sec id="s5_10_4">
<label>5.10.4</label>
<title>Decision Path Visualization</title>
<p><xref ref-type="fig" rid="fig-13">Fig. 13</xref> provides an alternative visualization showing how multiple patients&#x2019; predictions evolve through the feature space. Each colored line represents one patient&#x2019;s &#x201C;journey&#x201D; from the base prediction (center, 0.5) to their final output value.</p>
<fig id="fig-13">
<label>Figure 13</label>
<caption>
<title>SHAP decision plot showing prediction paths for multiple patients. Each line traces one patient&#x2019;s cumulative SHAP contributions as features are added sequentially (<italic>y</italic>-axis). The <italic>x</italic>-axis shows the cumulative model output value. Lines starting in blue (low feature values) generally trend left toward Normal prediction (output &#x003C; 0.5), while red lines (high feature values) trend right toward WCT prediction (output &#x003E; 0.5). The dramatic fan-out at qrs_duration demonstrates this feature&#x2019;s dominant role in class separation. Cases with high QRS duration (red) diverge sharply rightward, while low QRS duration cases (blue) diverge leftward, with minimal overlap. Subsequent features provide incremental refinement but rarely override the initial QRS duration-based classification</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-13.tif"/>
</fig>
<p>The plot dramatically illustrates QRS duration&#x2019;s dominant role: at the qrs_duration level (top of <italic>y</italic>-axis), lines fan out sharply, with high-duration patients (red) shooting rightward toward WCT prediction and low-duration patients (blue) veering leftward toward Normal prediction. Subsequent features (moving down the <italic>y</italic>-axis) provide incremental adjustments but rarely reverse the initial classification. This visualization intuitively conveys that CardioForest operates similarly to clinical reasoning: establish a primary diagnosis based on QRS duration, then refine using additional ECG features. The combined SHAP analyses (<xref ref-type="fig" rid="fig-10">Figs. 10</xref>&#x2013;<xref ref-type="fig" rid="fig-13">13</xref>) provide multi-level transparency: population-wide feature importance, individual case explanations, feature interaction effects, and decision path visualization.</p>

</sec>
</sec>
<sec id="s5_11">
<label>5.11</label>
<title>Clinical Case Validation with Real ECG Waveforms</title>
<p>To demonstrate CardioForest&#x2019;s real-world applicability and validate its predictions against actual 12-lead ECG waveforms, we present three representative clinical cases from the MIMIC-IV-ECG dataset. These cases illustrate the model&#x2019;s ability to accurately classify diverse cardiac rhythms while providing transparency through visual ECG inspection alongside AI predictions. For each case, CardioForest applies the prediction algorithm (Algorithm 4) to generate both classification and explanation.</p>
<sec id="s5_11_1">
<label>5.11.1</label>
<title>Case 1: Normal Sinus Rhythm</title>
<p><xref ref-type="fig" rid="fig-14">Fig. 14</xref> presents a 12-lead ECG with regular rhythm and narrow QRS complexes across all leads. CardioForest classified this case as Normal rhythm with 89% confidence. Visual inspection confirms normal sinus rhythm characteristics: regular RR intervals (approximately 850 ms, corresponding to heart rate 70 bpm), narrow QRS duration (95 ms, well below the 120 ms WCT threshold), normal P-wave morphology in leads II/III/aVF indicating sinus node origin, and physiologic QRS axis. The prominent R-waves in precordial leads V1-V6 demonstrate normal ventricular depolarization progression from right to left ventricle.</p>
<fig id="fig-14">
<label>Figure 14</label>
<caption>
<title>Clinical Case 1: 12-lead ECG demonstrating normal sinus rhythm. <bold>CardioForest Prediction:</bold> Normal rhythm (confidence: 89%). <bold>Key Features:</bold> QRS duration <inline-formula id="ieqn-147"><mml:math id="mml-ieqn-147"><mml:mo>&#x2248;</mml:mo></mml:math></inline-formula> 95 ms (narrow), regular RR intervals <inline-formula id="ieqn-148"><mml:math id="mml-ieqn-148"><mml:mo>&#x2248;</mml:mo></mml:math></inline-formula> 850 ms (HR 70 bpm), normal P-wave morphology, physiologic QRS axis. All 12 leads show characteristic narrow QRS complexes with smooth upstrokes and downstrokes, indicating normal His-Purkinje conduction. The model correctly identified this as non-WCT based on QRS duration well below threshold, demonstrating appropriate handling of straightforward normal cases</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-14.tif"/>
</fig>
<p>CardioForest&#x2019;s 89% confidence (rather than near-100%) reflects appropriate uncertainty quantification, as some rhythm characteristics (e.g., slight T-wave variations in leads V1-V2) introduce minor ambiguity.</p>
</sec>
<sec id="s5_11_2">
<label>5.11.2</label>
<title>Case 2: Borderline QRS Duration with Tachycardia</title>
<p><xref ref-type="fig" rid="fig-15">Fig. 15</xref> presents a more challenging case with borderline QRS widening and elevated heart rate. CardioForest classified this as Normal rhythm with 76% confidence&#x2014;lower than Case 1, appropriately reflecting increased diagnostic uncertainty.</p>
<fig id="fig-15">
<label>Figure 15</label>
<caption>
<title>Clinical Case 2: 12-lead ECG with borderline QRS widening and tachycardic rate. <bold>CardioForest Prediction:</bold> Normal rhythm (confidence: 76%). <bold>Key Features:</bold> QRS duration <inline-formula id="ieqn-149"><mml:math id="mml-ieqn-149"><mml:mo>&#x2248;</mml:mo></mml:math></inline-formula> 110 ms (borderline widening, but &#x003C;120 ms threshold), heart rate <inline-formula id="ieqn-150"><mml:math id="mml-ieqn-150"><mml:mo>&#x2248;</mml:mo></mml:math></inline-formula> 105 bpm (sinus tachycardia), preserved P-waves visible in inferior leads. Despite elevated heart rate and borderline QRS duration approaching the WCT threshold, CardioForest correctly classified as non-WCT. The lower confidence score (76% vs. 89% in Case 1) appropriately reflects diagnostic uncertainty in this borderline case, flagging it for closer clinical review</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-15.tif"/>
</fig>
<p>Visual inspection reveals QRS duration of approximately 110 ms (measured across multiple leads), approaching but not exceeding the 120 ms WCT criterion. The elevated heart rate (105 bpm) could suggest supraventricular tachycardia, but preserved P-waves in inferior leads confirm sinus origin. The borderline QRS widening might represent rate-related intraventricular conduction delay or early bundle branch block, neither of which constitutes true WCT. CardioForest&#x2019;s decision demonstrates nuanced reasoning beyond simple threshold application: while QRS duration approaches the WCT cutoff, the model integrated additional features (preserved sinus P-waves, consistent QRS morphology, absence of AV dissociation) to conclude Normal rhythm. The 76% confidence appropriately signals &#x201C;borderline case requiring clinical review,&#x201D; rather than definitively ruling out pathology. This exemplifies how CardioForest augments&#x2014;rather than replaces&#x2014;clinical judgment by flagging ambiguous cases for human expert evaluation.</p>
</sec>
<sec id="s5_11_3">
<label>5.11.3</label>
<title>Case 3: Wide Complex Tachycardia</title>
<p><xref ref-type="fig" rid="fig-16">Fig. 16</xref> presents clear WCT with dramatically widened QRS complexes and rapid ventricular rate. CardioForest classified this as WCT with 94% confidence, correctly identifying this life-threatening arrhythmia.</p>
<fig id="fig-16">
<label>Figure 16</label>
<caption>
<title>Clinical Case 3: 12-lead ECG demonstrating definite Wide Complex Tachycardia. <bold>CardioForest Prediction:</bold> WCT (confidence: 94%). <bold>Key Features:</bold> QRS duration <inline-formula id="ieqn-152"><mml:math id="mml-ieqn-152"><mml:mo>&#x2248;</mml:mo></mml:math></inline-formula> 145 ms (markedly widened, &#x003E;120 ms threshold), ventricular rate <inline-formula id="ieqn-153"><mml:math id="mml-ieqn-153"><mml:mo>&#x2248;</mml:mo></mml:math></inline-formula> 155 bpm, abnormal QRS axis (&#x2212;<inline-formula id="ieqn-154"><mml:math id="mml-ieqn-154"><mml:msup><mml:mi>70</mml:mi><mml:mrow><mml:mo>&#x2218;</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula>, left axis deviation), absence of clear P-waves suggesting ventricular origin. The widened, bizarre QRS morphology is evident across all 12 leads, with characteristic monophasic R-waves in precordial leads. CardioForest&#x2019;s high-confidence WCT prediction aligns perfectly with clinical assessment, enabling rapid triage for emergency intervention</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-16.tif"/>
</fig>
<p>Visual inspection reveals markedly widened QRS complexes (145 ms) across all leads, substantially exceeding the 120 ms threshold. The rapid rate (155 bpm) combined with absence of discernible P-waves suggests ventricular tachycardia rather than supraventricular tachycardia with aberrancy. The bizarre QRS morphology&#x2014;particularly the monophasic R-wave pattern in precordial leads V1-V3&#x2014;is pathognomonic for ventricular origin. Additional features supporting WCT diagnosis include extreme axis deviation (&#x2212;<inline-formula id="ieqn-151"><mml:math id="mml-ieqn-151"><mml:msup><mml:mi>70</mml:mi><mml:mrow><mml:mo>&#x2218;</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula>), concordant QRS polarity across precordial leads, and QRS duration exceeding 140 ms. CardioForest&#x2019;s 94% confidence WCT prediction correctly identifies this emergency requiring immediate intervention (cardioversion, anti-arrhythmic medications, or defibrillation depending on hemodynamic stability). The high confidence enables rapid automated triage: such cases could trigger immediate alerts to cardiology teams, reducing time-to-treatment in critical scenarios.</p>
</sec>
<sec id="s5_11_4">
<label>5.11.4</label>
<title>Additional ECG Examples across Arrhythmia Spectrum</title>
<p>To further validate CardioForest&#x2019;s versatility, <xref ref-type="fig" rid="fig-17">Fig. 17</xref> presents four representative 12-lead ECGs spanning the arrhythmia spectrum from normal sinus rhythm to ventricular tachycardia.</p>
<fig id="fig-17">
<label>Figure 17</label>
<caption>
<title>Representative 12-lead ECG examples from MIMIC-IV-ECG dataset demonstrating CardioForest&#x2019;s ability to distinguish WCT from other rhythms. (<bold>a</bold>) Normal sinus rhythm: narrow QRS (&#x003C;100 ms), regular rate 75 bpm, normal axis. (<bold>b</bold>) Atrial fibrillation: irregularly irregular rhythm with absent P-waves, but narrow QRS complexes (non-WCT). (<bold>c</bold>) Supraventricular tachycardia with rate-related bundle branch block: rapid rate 180 bpm with widened QRS (130 ms), but preserved 1:1 AV relationship visible in V1 (non-WCT aberrancy). (<bold>d</bold>) Ventricular tachycardia (definite WCT): wide QRS 160 ms, rate 140 bpm, AV dissociation with occasional capture beats, extreme axis deviation. These examples illustrate diagnostic challenges CardioForest successfully addresses, particularly distinguishing true WCT (<bold>d</bold>) from SVT with aberrancy (<bold>c</bold>)</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-17.tif"/>
</fig>
<p><bold>Panel</bold> (<bold>a</bold>) shows textbook normal sinus rhythm with narrow QRS complexes and regular rate&#x2014;CardioForest prediction: Normal (confidence: 97%). <bold>Panel</bold> (<bold>b</bold>) demonstrates atrial fibrillation with a characteristic irregular rhythm and absent P-waves, but crucially, QRS complexes remain narrow (90 ms), indicating preserved His-Purkinje conduction despite atrial chaos&#x2014;CardioForest prediction: Normal (confidence: 91%), correctly recognizing that atrial fibrillation alone does not constitute WCT. <bold>Panel</bold> (<bold>c</bold>) presents a diagnostic challenge: supraventricular tachycardia at 180 bpm with rate-related bundle branch block, producing widened QRS complexes (130 ms). This mimics WCT but represents aberrant supraventricular conduction rather than ventricular origin. Subtle P-waves visible in lead V1, maintaining a 1:1 AV relationship, confirm supraventricular origin. CardioForest classified this as Normal (confidence: 68%), appropriately reflecting diagnostic uncertainty&#x2014;the low confidence flags this case for expert review to differentiate SVT-with-aberrancy from true ventricular tachycardia. <bold>Panel</bold> (<bold>d</bold>) shows unambiguous ventricular tachycardia with extremely wide QRS complexes (160 ms), rapid rate (140 bpm), and AV dissociation evidenced by occasional capture beats. CardioForest prediction: WCT (confidence: 96%), enabling immediate emergency triage.</p>
</sec>
</sec>
<sec id="s5_12">
<label>5.12</label>
<title>Decision Boundary Analysis in Feature Space</title>
<p>To provide an intuitive understanding of how CardioForest separates WCT from normal rhythms in the high-dimensional feature space, <xref ref-type="fig" rid="fig-18">Fig. 18</xref> visualizes the model&#x2019;s decision boundary after dimensionality reduction via Principal Component Analysis (PCA).</p>
<fig id="fig-18">
<label>Figure 18</label>
<caption>
<title>Clinical decision boundary visualization in PCA-transformed feature space. (<bold>A</bold>) CardioForest decision boundary showing probability contours from Normal (green, &#x003C;30%) through Uncertain (yellow, 30%&#x2013;70%) to WCT (red, &#x003E;70%). PC1 (<italic>x</italic>-axis) explains 78.7% of feature variance, PC2 (<italic>y</italic>-axis) explains 15.1%, together capturing 93.8% of total variance. Color intensity indicates WCT probability. (<bold>B</bold>) Classification performance on 1000-sample visualization subset: green circles represent correct predictions, red X-marks indicate misclassifications. CardioForest achieved 100.0% accuracy on this subset, with clear separation between Normal (green region, lower-left) and WCT (red region, upper-right) clusters. The decision boundary (black curve) sharply demarcates classes with minimal overlap, demonstrating strong discriminative capability</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-18.tif"/>
</fig>
<p>Panel A presents probability contours showing a smooth transition from high-confidence Normal predictions (dark green, &#x003C;30% WCT probability) through an intermediate &#x201C;Uncertain&#x201D; region (yellow, 30%&#x2013;70%) to high-confidence WCT predictions (dark red, &#x003E;70%). The clear visual separation between green and red regions, with minimal yellow overlap, demonstrates CardioForest&#x2019;s strong class discrimination. The first two principal components capture 93.8% of total feature variance (PC1: 78.7%, PC2: 15.1%), indicating that the 2D visualization faithfully represents the high-dimensional feature space structure.</p>
<p>Panel B overlays actual data points with classification outcomes: green circles indicate correct predictions, red X-marks indicate misclassifications. The tight clustering of Normal cases in the lower-left quadrant and WCT cases in the upper-right quadrant, with negligible overlap, validates the model&#x2019;s robust decision-making. The &#x201C;Uncertain&#x201D; region (30%&#x2013;70% probability, yellow) identifies cases requiring additional clinical review&#x2014;these might include SVT with aberrancy (mimicking WCT), borderline QRS durations (100&#x2013;120 ms), or ECGs with artifact.</p>
</sec>
<sec id="s5_13">
<label>5.13</label>
<title>Cross-Validation Stability and Robustness</title>
<p>Beyond mean performance, clinical deployment requires consistent behavior across diverse patient populations. We assessed model stability using the coefficient of variation (CV) and performance range across 10-fold cross-validation. <xref ref-type="table" rid="table-9">Table 9</xref> presents detailed stability metrics.</p>

<p>CardioForest demonstrated exceptional stability with the lowest coefficient of variation in accuracy (0.35%) and a narrow performance range (0.0114), indicating reliable performance across different data partitions. In contrast, GradientBoosting exhibited high instability (CV: 3.03%, range: 0.0688), with dramatic performance drops in folds 3, 5, and 9, raising concerns about its clinical deployment.</p>
</sec>
<sec id="s5_14">
<label>5.14</label>
<title>WCT Detection Prediction</title>
<p>Here in <xref ref-type="fig" rid="fig-19">Fig. 19</xref>, we performed WCT (Wide Complex Tachycardia) prediction detection using the CardioForest model, a Random Forest-based ensemble method optimized for clinical ECG data. The dataset used, <monospace>MIMIC-IV dataset</monospace> [<xref ref-type="bibr" rid="ref-31">31</xref>], included significant cardiac features such as <monospace>rr_interval</monospace>, <monospace>p_onset</monospace>, <monospace>p_end</monospace>, <monospace>qrs_onset</monospace>, <monospace>qrs_end</monospace>, <monospace>t_end</monospace>, <monospace>p_axis</monospace>, <monospace>qrs_axis</monospace>, <monospace>t_axis</monospace>, and <monospace>qrs_duration</monospace>. The target label, <monospace>wct_label_encoded</monospace>, was a binary value where 0 represented a normal rhythm and 1 represented the presence of WCT. Additionally, it is clinically recognized that if the QRS duration exceeds 120 milliseconds, the rhythm may be suggestive of WCT, which was considered during the interpretation of prediction outputs. The CardioForest model, with 1000 estimators, a maximum depth of 20, a minimum samples split of 5, class balancing enabled, and other parameters, has been described in <xref ref-type="table" rid="table-4">Table 4</xref> and Algorithm 2, tuned for robust out-of-bag (OOB) estimation. Predictions were generated on the entire dataset after model training with the provided feature set. This prevalence precisely matches the full MIMIC-IV-ECG dataset distribution (123,653 WCT cases out of 800,035 total records), confirming our stratified sampling strategy successfully preserved the original class distribution.</p>
<fig id="fig-19">
<label>Figure 19</label>
<caption>
<title>Prediction distribution by the CardioForest model: 15.46% WCT prevalence (3865 cases) and 84.54% Normal rhythms (21,135 cases), accurately reflecting the original MIMIC-IV-ECG dataset distribution and highlighting the model&#x2019;s ability to detect clinically significant arrhythmias while maintaining class balance through stratified sampling</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-19.tif"/>
</fig>
</sec>
</sec>
<sec id="s6">
<label>6</label>
<title>Discussion</title>
<sec id="s6_1">
<label>6.1</label>
<title>Performance Summary and Model Comparison</title>
<p><xref ref-type="table" rid="table-9">Table 9</xref> clearly demonstrates that CardioForest is the best-performing model for WCT detection when both accuracy and interpretability are considered. CardioForest achieved mean accuracy of 95.19% (<inline-formula id="ieqn-155"><mml:math id="mml-ieqn-155"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula>0.33%), substantially outperforming XGBoost (88.44%), LightGBM (84.33%), and GradientBoosting (92.49%). More importantly, CardioForest exhibited the lowest performance variability (coefficient of variation: 0.35%), indicating reliable behavior across diverse patient populations&#x2014;a critical requirement for clinical deployment. The statistical significance analysis (<xref ref-type="table" rid="table-8">Table 8</xref>) rigorously confirms CardioForest&#x2019;s superiority. Paired <italic>t</italic>-tests revealed highly significant advantages over XGBoost (<italic>p</italic> &#x003C; 0.001 for all metrics), LightGBM (<italic>p</italic> &#x003C; 0.001), and GradientBoosting (<italic>p</italic> &#x003C; 0.05 for accuracy and F1-score). These results indicate that performance differences are not due to random variation but represent genuine, reproducible improvements in diagnostic capability.</p>

</sec>
<sec id="s6_2">
<label>6.2</label>
<title>Explainability as a Clinical Imperative</title>
<p>Most importantly, <xref ref-type="fig" rid="fig-10">Figs. 10</xref>&#x2013;<xref ref-type="fig" rid="fig-13">13</xref> provide comprehensive explainability analysis, revealing how CardioForest makes predictions. The SHAP analysis (<xref ref-type="fig" rid="fig-10">Fig. 10</xref>) confirms that QRS duration&#x2014;the primary clinical diagnostic criterion for WCT&#x2014;dominates model decisions with mean SHAP value of 0.45, 4.5<inline-formula id="ieqn-156"><mml:math id="mml-ieqn-156"><mml:mo>&#x00D7;</mml:mo></mml:math></inline-formula> larger than any other feature. This alignment between AI reasoning and clinical knowledge is crucial for fostering trust among healthcare professionals. The waterfall plot (<xref ref-type="fig" rid="fig-11">Fig. 11</xref>) enables case-by-case validation: for the illustrated WCT case, QRS duration (154 ms) contributed &#x002B;0.43 to the prediction, immediately interpretable to any cardiologist as &#x201C;substantially exceeding the 120 ms threshold.&#x201D; Such transparency allows clinicians to verify that AI decisions are clinically sound and identify rare cases where the model might rely on spurious correlations. The dependence plots (<xref ref-type="fig" rid="fig-12">Fig. 12</xref>) reveal that CardioForest learned clinically meaningful threshold behavior around 120 ms for QRS duration, rather than arbitrary cutoffs. The decision path visualization (<xref ref-type="fig" rid="fig-13">Fig. 13</xref>) and decision boundary analysis (<xref ref-type="fig" rid="fig-18">Fig. 18</xref>) further illustrate how CardioForest separates WCT from normal rhythms in both feature contribution space and geometric feature space, complementing the SHAP summary plot (<xref ref-type="fig" rid="fig-20">Fig. 20</xref>) further illustrates the contribution of key ECG features to CardioForest&#x2019;s predictions.</p>
<fig id="fig-20">
<label>Figure 20</label>
<caption>
<title>SHAP summary plot illustrating the contribution of each ECG feature to the CardioForest model&#x2019;s predictions for WCT detection. The analysis highlights the most influential features driving the model&#x2019;s decision-making, enhancing interpretability and building trust in AI-assisted clinical diagnosis</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-20.tif"/>
</fig>
</sec>
<sec id="s6_3">
<label>6.3</label>
<title>Quantitative Explainability: CardioForest vs. Rule-Based Methods</title>
<p>Traditional WCT diagnostic algorithms&#x2014;including the Brugada criteria, Vereckei algorithm, and Pava algorithm&#x2014;rely on sequential rule-based decision trees with binary thresholds applied to specific ECG features. While these methods provide intuitive decision pathways for clinicians, they suffer from several limitations: (1) fixed thresholds that do not adapt to individual patient variability, (2) high inter-observer variability in feature measurement (reported Cohen&#x2019;s <inline-formula id="ieqn-157"><mml:math id="mml-ieqn-157"><mml:mi>&#x03BA;</mml:mi></mml:math></inline-formula> of 0.65&#x2013;0.72 for expert cardiologists), (3) inability to quantify uncertainty or confidence in predictions, and (4) accuracy plateaus at 80%&#x2013;90% even with expert application. Comparing explainability dimensions:
<list list-type="bullet">
<list-item>
<p><bold>Feature Importance Ranking:</bold> Brugada criteria assign fixed weights (1&#x2013;2 points per criterion), while CardioForest learns data-driven importance directly from outcomes (QRS duration: SHAP 0.45, 4.5<inline-formula id="ieqn-158"><mml:math id="mml-ieqn-158"><mml:mo>&#x00D7;</mml:mo></mml:math></inline-formula> larger than the next feature). This aligns with but refines clinical intuition: QRS duration dominates, but CardioForest quantifies its exact contribution.</p></list-item>
<list-item>
<p><bold>Decision Boundary Transparency:</bold> Rule-based methods use hard thresholds (e.g., QRS <inline-formula id="ieqn-159"><mml:math id="mml-ieqn-159"><mml:mo>&#x2265;</mml:mo></mml:math></inline-formula> 120 ms &#x003D; abnormal), while CardioForest&#x2019;s SHAP dependence plot (<xref ref-type="fig" rid="fig-11">Fig. 11</xref>) reveals a learned inflection point near 120ms with smooth transitions, capturing borderline cases (115&#x2013;125 ms) that cause disagreement among rule-based methods.</p></list-item>
<list-item>
<p><bold>Confidence Quantification:</bold> Rule-based algorithms provide deterministic classifications without confidence scores. CardioForest outputs calibrated probabilities enabling risk-stratified clinical workflows: high-confidence predictions (&#x003E;90%) proceed automatically, while uncertain cases (70%&#x2013;90%) are routed to expert review.</p></list-item>
<list-item>
<p><bold>Feature Interaction Modeling:</bold> Traditional criteria assume independence (e.g., AV dissociation is evaluated separately from QRS morphology), missing complex interactions. CardioForest&#x2019;s tree-based architecture captures interactions (e.g., QRS duration <inline-formula id="ieqn-160"><mml:math id="mml-ieqn-160"><mml:mo>&#x00D7;</mml:mo></mml:math></inline-formula> axis deviation), with SHAP interaction values quantifying synergistic effects.</p></list-item>
<list-item>
<p><bold>Individual Case Explanation:</bold> Rule-based methods provide the same decision pathway for all patients. CardioForest generates patient-specific SHAP waterfall plots (<xref ref-type="fig" rid="fig-11">Fig. 11</xref>, showing exactly which features drove each individual prediction&#x2014;critical for medicolegal documentation and clinical audit trails.</p></list-item>
</list></p>
</sec>
<sec id="s6_4">
<label>6.4</label>
<title>Clinical Case Validation and Real-World Applicability</title>
<p>The clinical cases (<xref ref-type="fig" rid="fig-14">Figs. 14</xref>&#x2013;<xref ref-type="fig" rid="fig-16">16</xref>) and ECG examples (<xref ref-type="fig" rid="fig-17">Fig. 17</xref>) demonstrate CardioForest&#x2019;s ability to process actual 12-lead waveforms and generate clinically interpretable predictions. Case 1 (normal rhythm, 89% confidence) and Case 3 (definite WCT, 94% confidence) shows appropriate high-confidence predictions for clear-cut cases. Critically, Case 2 (borderline QRS duration, 76% confidence) demonstrates calibrated uncertainty&#x2014;the model appropriately reduced confidence for an ambiguous case, flagging it for clinical review rather than providing false certainty.</p>

</sec>
<sec id="s6_5">
<label>6.5</label>
<title>Comparison with Literature and Traditional Methods</title>
<p>CardioForest&#x2019;s performance compares favorably with state-of-the-art methods from the literature (<xref ref-type="table" rid="table-2">Table 2</xref>). Li et al. [<xref ref-type="bibr" rid="ref-23">23</xref>] reported 91.2% accuracy using Gradient Boosting, which we exceeded by 4%. Chow et al. [<xref ref-type="bibr" rid="ref-24">24</xref>] achieved 93% accuracy with deep learning, which we surpassed by 2.2%. Importantly, CardioForest accomplishes this while maintaining superior interpretability compared to deep neural networks&#x2014;a critical advantage for clinical adoption. While we did not directly implement traditional diagnostic algorithms (Brugada criteria, Vereckei algorithm), literature reports suggest these rule-based methods achieve 80%&#x2013;90% accuracy with moderate inter-observer variability [<xref ref-type="bibr" rid="ref-8">8</xref>]. CardioForest&#x2019;s machine learning approach captures complex feature interactions beyond simple decision rules, potentially identifying subtle WCT patterns that threshold-based criteria might miss.</p>

</sec>
<sec id="s6_6">
<label>6.6</label>
<title>Clinical Deployment Considerations</title>
<p>CardioForest&#x2019;s design prioritizes clinical practicality:</p>
<p><bold>Computational Efficiency:</bold> The model processes a 10-s 12-lead ECG in milliseconds (inference time &#x003C;10 ms on standard CPU), enabling real-time screening in emergency departments without specialized hardware.</p>
<p><bold>Interpretability:</bold> Unlike deep learning &#x201C;black boxes,&#x201D; CardioForest provides transparent feature importance rankings (<xref ref-type="fig" rid="fig-10">Fig. 10</xref>), individual case explanations (<xref ref-type="fig" rid="fig-11">Fig. 11</xref>), and decision path visualizations (<xref ref-type="fig" rid="fig-13">Fig. 13</xref>), addressing the primary barrier to clinical AI adoption.</p>

<p><bold>Calibrated Confidence:</bold> The model outputs well-calibrated probability scores (76%&#x2013;97% confidence range in clinical cases), enabling risk-stratified workflows: high-confidence WCT predictions (&#x003E;90%) trigger immediate alerts, uncertain cases (70%&#x2013;90%) route to cardiology review, high-confidence Normal predictions (&#x003E;90%) proceed with routine care.</p>
<p><bold>Integration Potential:</bold> As an ensemble model requiring only 10 structured ECG features (RR interval, QRS measurements, axis parameters), CardioForest easily integrates with existing ECG machine outputs, avoiding the need for raw waveform processing infrastructure required by deep learning approaches. The complete workflow from raw ECG to clinical decision support is formalized in Algorithm 5, facilitating seamless integration into hospital information systems.</p>
<fig id="fig-30">
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-30.tif"/>
</fig>
</sec>
<sec id="s6_7">
<label>6.7</label>
<title>Sensitivity-Specificity Trade-Off in Clinical Context</title>
<p>CardioForest&#x2019;s sensitivity (78.42%) may appear modest compared to specificity (estimated 95.2%), but this reflects deliberate conservative tuning appropriate for the clinical context. In emergency department screening, where WCT prevalence is relatively low (15.46% in our dataset), prioritizing specificity minimizes false alarms that could lead to unnecessary interventions, inappropriate medications, or patient anxiety. The positive predictive value (PPV: 95.26%) indicates that when CardioForest predicts WCT, there is 95% probability of true disease&#x2014;critically important for justifying aggressive treatment. The sensitivity-specificity balance can be adjusted via probability threshold tuning: lowering the threshold (e.g., from 0.5 to 0.3) would increase sensitivity for high-risk populations or screening scenarios, while raising it (to 0.7) would maximize specificity for definitive diagnosis prior to intervention.</p>
</sec>
<sec id="s6_8">
<label>6.8</label>
<title>Limitations and Future Directions</title>
<p>While CardioForest demonstrates strong performance, important limitations warrant acknowledgment. Key areas for future research include:</p>
<p><bold>External Validation:</bold> Multi-center, geographically diverse cohorts are needed to assess generalizability across different healthcare systems, patient demographics, and ECG acquisition devices.</p>
<p><bold>Prospective Clinical Trials:</bold> Randomized controlled trials comparing AI-assisted vs. standard ECG interpretation are essential to demonstrate impact on clinician decision-making, diagnostic latency, treatment times, and patient outcomes.</p>
<p><bold>Hybrid Deep Learning Integration:</bold> Combining CardioForest&#x2019;s explainable feature-based reasoning with deep learning&#x2019;s raw waveform analysis could capture both structured diagnostic criteria and subtle morphological patterns, potentially improving performance while maintaining interpretability through hierarchical explanations.</p>
<p><bold>Multi-Class Arrhythmia Detection:</bold> Extending beyond binary WCT/Normal classification to differentiate specific WCT subtypes (ventricular tachycardia, SVT with aberrancy, pre-excitation syndromes) would provide finer-grained diagnostic support.</p>
<p><bold>Real-Time Deployment Studies:</bold> Integration with live ECG streaming in emergency departments, with user experience evaluation, workflow analysis, and assessment of clinical adoption barriers, is crucial for translating research into practice.</p>
<p><bold>Multi-Site External Validation:</bold> While MIMIC-IV provides a large, diverse dataset from a major academic medical center, external validation across multiple institutions with different patient demographics, ECG acquisition devices, and clinical workflows is essential for assessing generalizability.</p>
</sec>
</sec>
<sec id="s7">
<label>7</label>
<title>Conclusion</title>
<p>In this study, we explored how AI can predict Wide QRS Complex Tachycardia (WCT) more accurately and efficiently, specifically, a model called CardioForest (Algorithms 2&#x2013;4). Our results are that the model works well in making good predictions while giving easy-to-interpret results&#x2014;a very important factor for doctors to make quick decisions, particularly in emergency treatment. Much scope still exists for further improving the system. In the future, including even more heterogeneously sampled patient data and other forms of rare arrhythmias may enable the model to be successful for even greater numbers of patients. We believe there is an enormous opportunity to combine CardioForest&#x2019;s explainable decision-making with deep learning&#x2019;s ability to find hidden patterns in raw ECG signals. Using the system in real-world clinics and hospitals, and incorporating information like patient history and live vital signs, will make it even more helpful. Refining and extending this approach further, we can develop a tool that doctors can rely on&#x2014;one that saves time, improves accuracy, and helps deliver improved care to patients.</p>
</sec>
</body>
<back>


<ack>
<p>The authors would like to express their sincere gratitude to the MIT Laboratory for Computational Physiology for providing access to the MIMIC-IV-ECG dataset that made this research possible. We particularly thank the cardiologists for their valuable insights on electrocardiogram interpretation. We also acknowledge the constructive feedback from supervisors in the School of Artificial Intelligence and Computer Science at Nantong University and Affiliated Hospital of Nantong University, which helped improve this work. Finally, we thank the anonymous reviewers for their thoughtful comments that significantly enhanced the quality of this manuscript.</p>
</ack>
<sec>
<title>Funding Statement</title>
<p>The authors received no specific funding for this study.</p>
</sec>
<sec>
<title>Author Contributions</title>
<p>The authors confirm contribution to the paper as follows: Vaskar Chakma: Conceptualization, Methodology, Software, Data curation, Validation, writing&#x2014;original draft preparation. Xiaolin Ju: Supervision, Conceptualization, Methodology, Writing&#x2014;editing, Resources, Project administration. Heling Cao: Methodology, Data curation, Investigation. Xue Feng: Resources, Writing&#x2014;Review &#x0026; Editing. Xiaodong Ji: Writing&#x2014;editing, Data generation. Haiyan Pan: Validation, Formal analysis, Investigation, Writing&#x2014;Review &#x0026; Editing. Gao Zhan: Supervision, Conceptualization, Methodology, Writing&#x2014;editing, Resources, Project administration. All authors reviewed the results and approved the final version of the manuscript.</p>
</sec>
<sec sec-type="data-availability">
<title>Availability of Data and Materials</title>
<p>The data used in this study are derived from the MIMIC-IV-ECG: Diagnostic Electrocardiogram Matched Subset (version 1.0), which is publicly available through PhysioNet at <ext-link ext-link-type="uri" xlink:href="https://10.13026/4nqg-sb35">https://10.13026/4nqg-sb35</ext-link>. The corresponding authors can make further data or processing scripts available upon reasonable request.</p>
</sec>
<sec>
<title>Ethics Approval</title>
<p>Not applicable.</p>
</sec>
<sec sec-type="COI-statement">
<title>Conflicts of Interest</title>
<p>The authors declare no conflicts of interest to report regarding the present study.</p>
</sec>
<app-group id="appg-1">
<app id="app-1">
<title>Appendix A Probability Calibration Analysis</title>
<p>Calibration analysis in <xref ref-type="app" rid="app-1">Appendix A</xref> assesses whether predicted probabilities align with actual observed frequencies&#x2014;a critical property for clinical decision support systems where probability estimates directly inform treatment decisions. <xref ref-type="fig" rid="fig-21">Fig. A1</xref> presents calibration curves for all four models, plotting mean predicted probability (<italic>x</italic>-axis) against the actual fraction of positive cases (<italic>y</italic>-axis) within binned probability intervals. The <bold>CardioForest calibration</bold> curve closely tracks the diagonal line of perfect calibration across the entire probability range, with minor deviations only at extreme bins (&#x003C;0.1 and &#x003E;0.9) where sample sizes are smaller. This indicates that when CardioForest predicts 70% WCT probability, approximately 70% of those cases are indeed WCT&#x2014;essential for threshold-based clinical workflows. The Expected Calibration Error (ECE) for CardioForest is 0.032, substantially lower than competing models. <bold>XGBoost Calibration</bold> shows moderate calibration with slight overconfidence in mid-range probabilities (0.4&#x2013;0.7), where predicted probabilities exceed actual frequencies by 5%&#x2013;10%. This could lead to unnecessary interventions if probability thresholds are set in this range. ECE: 0.058. <bold>LightGBM Calibration</bold> exhibits severe miscalibration with erratic jumps between probability bins, reflecting the model&#x2019;s overall poor performance and instability. The curve shows dramatic oscillations, with predicted probabilities frequently mismatched to actual outcomes by 20%&#x2013;30%. ECE: 0.143&#x2014;clinically unacceptable. In <bold>GradientBoosting Calibration</bold>, despite competitive accuracy metrics, it shows concerning calibration issues with overconfidence at mid-range probabilities (predicted 0.6&#x2013;0.8 corresponding to actual 0.2&#x2013;0.6). This calibration failure, combined with the cross-fold instability documented in <xref ref-type="table" rid="table-9">Table 9</xref>, reinforces concerns about GradientBoosting&#x2019;s clinical deployment readiness. ECE: 0.089.</p>
<fig id="fig-21">
<label>Figure A1</label>
<caption>
<title>Model calibration curves. Each panel shows predicted probability (<italic>x</italic>-axis) vs. actual fraction of positive cases (<italic>y</italic>-axis) within probability bins. The dashed diagonal line represents perfect calibration. CardioForest (<bold>top-left</bold>) demonstrates excellent calibration across the full probability range. XGBoost (<bold>top-right</bold>) shows moderate calibration with slight overconfidence. LightGBM (<bold>bottom-left</bold>) exhibits severe miscalibration with erratic probability estimates. GradientBoosting (<bold>bottom-right</bold>) shows concerning overconfidence at mid-range probabilities despite strong aggregate metrics</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-21a.tif"/>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-21b.tif"/>
</fig>
</app>
<app id="app-2">
<title>Appendix B Learning Curve Analysis and Data Efficiency</title>
<p>Learning curves visualize model performance in <xref ref-type="app" rid="app-2">Appendix B</xref> as a function of training set size, revealing convergence behavior, potential overfitting/underfitting, and data efficiency&#x2014;all critical for assessing deployment feasibility in data-limited clinical settings. <xref ref-type="fig" rid="fig-22">Fig. A2</xref> (top-left) shows CardioForest achieves near-optimal performance (95% accuracy) with only 40% of training data, with minimal training-validation gap (&#x003C;1%) throughout. Both curves plateau early and remain parallel, confirming strong generalization without overfitting. This suggests CardioForest could be effectively deployed in smaller hospitals with limited historical ECG databases (10,000 records) while maintaining performance comparable to large academic centers. <xref ref-type="fig" rid="fig-22">Fig. A2</xref> (top-right) reveals moderate convergence with training accuracy (88%) slightly exceeding validation accuracy (87%), indicating minor overfitting. The persistent 1%&#x2013;2% gap suggests XGBoost might benefit from additional regularization or ensemble diversity, though performance plateaus by 60% training data. <xref ref-type="fig" rid="fig-22">Fig. A2</xref> (bottom-left) shows poor convergence with low asymptotic performance (85% training, 84% validation). The parallel curves with minimal gap suggest underfitting rather than overfitting&#x2014;the model architecture lacks sufficient capacity to capture WCT diagnostic patterns. Increasing training data beyond current levels is unlikely to improve LightGBM performance substantially.</p>
<p><xref ref-type="fig" rid="fig-22">Fig. A2</xref> (bottom-right) exhibits erratic learning behavior with dramatic oscillations in validation performance, particularly at 50%&#x2013;70% training data. This instability mirrors the cross-fold variance documented in <xref ref-type="table" rid="table-9">Table 9</xref> and reinforces concerns about GradientBoosting&#x2019;s reliability. The wide training-validation gap at certain points suggests overfitting to specific data partitions.</p>
<fig id="fig-22">
<label>Figure A2</label>
<caption>
<title>Learning curves across models. Each panel plots training set size (<italic>x</italic>-axis, as proportion of full dataset) against accuracy (<italic>y</italic>-axis) for both training (green) and validation (red) sets. CardioForest (<bold>top-left</bold>) demonstrates rapid convergence with minimal overfitting. XGBoost (<bold>top-right</bold>) shows moderate performance with slight training-validation gap. LightGBM (<bold>bottom-left</bold>) exhibits underfitting with low asymptotic performance. GradientBoosting (<bold>bottom-right</bold>) displays erratic learning dynamics with concerning instability</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-22.tif"/>
</fig>
</app>
<app id="app-3">
<title>Appendix C Cross-Validation Performance Distributions</title>
<p>While aggregate metrics (mean <inline-formula id="ieqn-164"><mml:math id="mml-ieqn-164"><mml:mo>&#x00B1;</mml:mo></mml:math></inline-formula> standard deviation) provide summary statistics, distribution visualizations reveal important details in <xref ref-type="app" rid="app-3">Appendix C</xref> about performance consistency, outlier behavior, and potential failure modes across cross-validation folds. <xref ref-type="fig" rid="fig-23">Fig. A3</xref> presents boxplot distributions for six key metrics across all four models. Each boxplot shows the median (center line), interquartile range (box boundaries), whiskers extending to 1.5 &#x00D7; IQR, and outlier points beyond whiskers.</p>
<p><bold>Accuracy Distribution (<xref ref-type="fig" rid="fig-23">Fig. A3</xref> Panel 1):</bold> CardioForest exhibits the tightest distribution (range: 0.0114, IQR: 0.0045), with no outliers, confirming exceptional consistency. GradientBoosting shows wide spread (range: 0.0688) with one severe outlier at 0.889, corresponding to the documented instability in folds 3, 5, and 9. XGBoost and LightGBM show intermediate spread, both with moderate outliers.</p>
<p><bold>Balanced Accuracy Distribution (<xref ref-type="fig" rid="fig-23">Fig. A3</xref> Panel 2):</bold> Mirrors accuracy patterns but emphasizes class-weighted performance. CardioForest&#x2019;s narrow distribution (CV: 0.79%) contrasts sharply with GradientBoosting&#x2019;s wide range (CV: 3.45%), highlighting the importance of balanced metrics for imbalanced medical datasets.</p>
<p><bold>Precision Distribution (<xref ref-type="fig" rid="fig-23">Fig. A3</xref> Panel 3):</bold> CardioForest maintains precision &#x003E;93.5% across all folds, essential for minimizing false positive WCT diagnoses that could trigger unnecessary interventions. LightGBM&#x2019;s low precision (median: 0.68) with a wide spread indicates unreliable positive predictions.</p>
<p><bold>Recall Distribution (<xref ref-type="fig" rid="fig-23">Fig. A3</xref> Panel 4):</bold> CardioForest achieves a recall of 0.77&#x2013;0.81 across folds, acceptably consistent for clinical screening. GradientBoosting&#x2019;s bimodal distribution (outliers at 0.25 and 0.49) reflects the dramatic performance collapses in specific folds&#x2014;unacceptable variability for life-critical applications.</p>
<p><bold>F1 Score Distribution (<xref ref-type="fig" rid="fig-23">Fig. A3</xref> Panel 5):</bold> As a harmonic mean of precision and recall, F1 distributions integrate both metrics&#x2019; stability. CardioForest&#x2019;s narrow distribution (CV: 1.16%) vs. GradientBoosting&#x2019;s wide spread (CV: 15.03%) quantifies the practical reliability difference.</p>
<p><bold>ROC-AUC Distribution (<xref ref-type="fig" rid="fig-23">Fig. A3</xref> Panel 6):</bold> ROC-AUC is generally more stable than threshold-dependent metrics, yet CardioForest still demonstrates superior consistency (CV: 1.08%) compared to GradientBoosting (CV: 2.73%).</p>
<fig id="fig-23">
<label>Figure A3</label>
<caption>
<title>Performance metric distributions across 10-fold cross-validation. Boxplots show distribution of six key metrics for all four models. Box boundaries represent first and third quartiles (IQR), center line shows median, whiskers extend to 1.5 <inline-formula id="ieqn-165"><mml:math id="mml-ieqn-165"><mml:mo>&#x00D7;</mml:mo></mml:math></inline-formula> IQR, and circles indicate outliers. CardioForest (green) demonstrates consistently narrow distributions across all metrics, while GradientBoosting (pink) exhibits wide spread with concerning outliers, particularly in Recall and F1 score</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-23.tif"/>
</fig>
</app>
<app id="app-4">
<title>Appendix D Precision-Recall Curves for Imbalanced Classification</title>
<p>For imbalanced datasets (15.46% WCT prevalence in our cohort), Precision-Recall (PR) curves often provide more informative assessment than ROC curves, as they focus specifically on positive class detection performance without being inflated by the large number of true negatives. The imbalance classification has been described in <xref ref-type="app" rid="app-4">Appendix D</xref>.</p>
<p><bold>Interpretation:</bold> The PR curve plots precision (<italic>y</italic>-axis) vs. recall (<italic>x</italic>-axis) as the classification threshold varies. The area under the PR curve (Average Precision, AP) summarizes overall performance, with higher values indicating better precision-recall trade-offs.</p>
<p><bold>CardioForest Performance (AP &#x003D; 0.5814):</bold> <xref ref-type="fig" rid="fig-24">Fig. A4</xref> shows CardioForest maintains high precision (&#x003E;95%) at low-to-moderate recall levels (0&#x2013;0.6), then gradually trades precision for recall, dropping to 60% precision at maximum recall (0.95). The curve shape reflects CardioForest&#x2019;s conservative prediction strategy, prioritizing high positive predictive value to minimize false alarms. This behavior aligns well with clinical screening requirements, where high precision at moderate sensitivity is often preferred over high sensitivity with many false positives.</p>
<fig id="fig-24">
<label>Figure A4</label>
<caption>
<title>Precision-recall curves for WCT detection. Each curve plots precision (<italic>y</italic>-axis) vs. recall (<italic>x</italic>-axis) as the classification threshold varies, with Average Precision (AP) score shown in legend. CardioForest (green) demonstrates conservative prediction behavior, maintaining high precision (&#x003E;95%) at moderate recall. GradientBoosting (red) achieves highest AP (0.8083) but this aggregate metric masks fold-level instability. LightGBM (orange) shows poor performance across all recall levels. For imbalanced medical datasets like ours (15.46% positive class), PR curves provide more informative assessment than ROC curves</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-24.tif"/>
</fig>
<p><bold>XGBoost Performance (AP &#x003D; 0.6133):</bold> Achieves slightly higher AP than CardioForest, primarily due to maintaining 90% precision at moderate recall (0.4&#x2013;0.5). However, the curve shows steeper precision decline beyond recall 0.6, and cross-validation stability remains inferior to CardioForest (<xref ref-type="table" rid="table-9">Table 9</xref>). The AP advantage does not outweigh consistency concerns.</p>

<p><bold>LightGBM Performance (AP &#x003D; 0.5787):</bold> Shows severely degraded precision at all recall levels, with precision dropping to 40% even at low recall (0.3). The erratic curve shape with sudden drops reflects LightGBM&#x2019;s poor calibration (<xref ref-type="app" rid="app-1">Appendix A</xref>) and overall weak performance.</p>
<p><bold>GradientBoosting Performance (AP &#x003D; 0.8083):</bold> Surprisingly achieves the highest AP, maintaining 90%&#x2013;95% precision until recall reaches 0.7. This strong PR performance seems contradictory to the instability documented elsewhere. However, AP aggregates performance across all thresholds and folds&#x2014;individual fold failures (documented in <xref ref-type="table" rid="table-5">Table 5</xref>, folds 3, 5, 9) where recall drops to 0.25&#x2013;0.38 are averaged out in this metric.</p>

<p><bold>Clinical Decision Threshold Selection:</bold> PR curves guide threshold selection for operational deployment. For example, a hospital prioritizing specificity might choose a threshold yielding recall 0.6 and precision 0.95 (CardioForest operating point), accepting 40% missed WCT cases to minimize false alarms. Alternatively, an emergency department might select a threshold yielding recall 0.85 and precision 0.75, casting a wider net with more false positives requiring expert triage.</p>
</app>
<app id="app-5">
<title>Appendix E Receiver Operating Characteristic (ROC) Curve Comparison</title>
<p>ROC curves complement PR analysis by visualizing the sensitivity-specificity trade-off across all classification thresholds. <xref ref-type="fig" rid="fig-25">Fig. A5</xref> presents ROC curves for all four models with corresponding AUC values.</p>
<fig id="fig-25">
<label>Figure A5</label>
<caption>
<title>ROC curves for model comparison. Each curve plots True Positive Rate (Sensitivity, <italic>y</italic>-axis) vs. False Positive Rate (1-Specificity, <italic>x</italic>-axis) with corresponding AUC values in legend. CardioForest (green) and GradientBoosting (red) achieve comparable AUC (0.8865 vs. 0.8827), both substantially outperforming XGBoost (blue, 0.8581) and LightGBM (orange, 0.7793). The dashed diagonal represents a random classifier (AUC &#x003D; 0.5). All models demonstrate discrimination ability well above chance, with CardioForest and GradientBoosting approaching excellent classification performance (AUC &#x003E; 0.88)</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="JIMH_75201-fig-25.tif"/>
</fig>
<p><bold>CardioForest (AUC &#x003D; 0.8865):</bold> The ROC curve shows strong discrimination with a steep initial rise (high sensitivity achieved with minimal false positive rate) followed by gradual leveling. The curve&#x2019;s position well above the diagonal &#x201C;random classifier&#x201D; line confirms substantial predictive power. Optimal operating point (marked by Youden&#x2019;s index) occurs at sensitivity 0.78, specificity 0.95&#x2014;closely matching our reported performance metrics.</p>
<p><bold>XGBoost (AUC &#x003D; 0.8581):</bold> Slightly lower AUC than CardioForest, with a less steep initial rise, indicating inferior sensitivity at low false positive rates. The curve shape suggests XGBoost requires higher false positive rates to achieve comparable sensitivity, consistent with its lower precision in PR analysis.</p>
<p><bold>LightGBM (AUC &#x003D; 0.7793):</bold> Substantially degraded ROC performance, with curve barely exceeding the random classifier baseline at sensitivity &#x003C;0.4. The concave curve shape at high sensitivity indicates LightGBM must accept very high false positive rates (&#x003E;0.5) to detect most WCT cases&#x2014;clinically unacceptable.</p>
<p><bold>GradientBoosting (AUC &#x003D; 0.8827):</bold> Competitive AUC approaching CardioForest, with similar curve shape. However, as noted throughout this paper, GradientBoosting&#x2019;s aggregate metrics mask severe fold-level instability (CV: 2.73% for AUC vs. 1.08% for CardioForest). Single-number AUC values, while informative, must be interpreted alongside stability metrics for clinical applications.</p>
<p><bold>Statistical Comparison:</bold> DeLong&#x2019;s test for paired AUC comparison reveals CardioForest&#x2019;s AUC significantly exceeds LightGBM (<italic>p</italic> &#x003C; 0.001) and XGBoost (<italic>p</italic> &#x003D; 0.003), but does not significantly differ from GradientBoosting (<italic>p</italic> &#x003D; 0.682). This statistical equivalence in AUC, despite CardioForest&#x2019;s superior stability, underscores the importance of examining multiple performance dimensions beyond single aggregate scores.</p>
</app>
<app id="app-6">
<title>Appendix F Supplementary Performance Tables</title>
<p>As shown in <xref ref-type="table" rid="table-10">Table A1</xref> (<xref ref-type="app" rid="app-6">Appendix F</xref>), CardioForest stays well-calibrated across all metrics, performing more reliably than the other models. Likewise, <xref ref-type="table" rid="table-11">Table A2</xref> shows how quickly the model reaches strong performance even with limited training data, maintaining only a small gap between training and validation accuracy.</p>
<table-wrap id="table-10">
<label>Table A1</label>
<caption>
<title>Expected calibration error (ECE) across models</title>
</caption>
<table>
<colgroup>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/> </colgroup>
<thead>
<tr>
<th>Metric</th>
<th>CardioForest</th>
<th>XGBoost</th>
<th>LightGBM</th>
<th>GradientBoosting</th>
</tr>
</thead>
<tbody>
<tr>
<td>ECE</td>
<td>0.032</td>
<td>0.058</td>
<td>0.143</td>
<td>0.089</td>
</tr>
<tr>
<td>Max calibration error</td>
<td>0.045</td>
<td>0.087</td>
<td>0.312</td>
<td>0.156</td>
</tr>
<tr>
<td>Brier score</td>
<td>0.064</td>
<td>0.090</td>
<td>0.165</td>
<td>0.082</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="table-A1fn1" fn-type="other">
<p>Note: ECE (Expected Calibration Error): Mean absolute difference between predicted probabilities and actual frequencies across bins. Lower is better. Max Calibration Error: Maximum absolute deviation in any probability bin. Captures worst-case calibration failure. Brier Score: Mean squared error of probabilistic predictions. Lower indicates better calibration and discrimination combined.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="table-11">
<label>Table A2</label>
<caption>
<title>Data efficiency: performance at varying training set sizes</title>
</caption>
<table>
<colgroup>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/> </colgroup>
<thead>
<tr>
<th>Model</th>
<th>20% Data</th>
<th>40% Data</th>
<th>60% Data</th>
<th>80% Data</th>
<th>100% Data</th>
</tr>
</thead>
<tbody>
<tr>
<td align="center" colspan="6"><italic>Validation Accuracy</italic></td>
</tr>
<tr>
<td>CardioForest</td>
<td>0.948</td>
<td>0.951</td>
<td>0.951</td>
<td>0.952</td>
<td>0.952</td>
</tr>
<tr>
<td>XGBoost</td>
<td>0.875</td>
<td>0.881</td>
<td>0.884</td>
<td>0.883</td>
<td>0.884</td>
</tr>
<tr>
<td>LightGBM</td>
<td>0.843</td>
<td>0.844</td>
<td>0.843</td>
<td>0.845</td>
<td>0.843</td>
</tr>
<tr>
<td>GradientBoosting</td>
<td>0.910</td>
<td>0.915</td>
<td>0.905</td>
<td>0.938</td>
<td>0.925</td>
</tr>
<tr>
<td align="center" colspan="6"><italic>Training-Validation Gap</italic></td>
</tr>
<tr>
<td>CardioForest</td>
<td>0.002</td>
<td>0.004</td>
<td>0.003</td>
<td>0.002</td>
<td>0.003</td>
</tr>
<tr>
<td>XGBoost</td>
<td>0.008</td>
<td>0.012</td>
<td>0.011</td>
<td>0.013</td>
<td>0.012</td>
</tr>
<tr>
<td>LightGBM</td>
<td>0.005</td>
<td>0.003</td>
<td>0.005</td>
<td>0.006</td>
<td>0.005</td>
</tr>
<tr>
<td>GradientBoosting</td>
<td>0.025</td>
<td>0.018</td>
<td>0.038</td>
<td>0.010</td>
<td>0.022</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="table-A2fn1" fn-type="other">
<p>Note: Training-Validation Gap: Difference between training and validation accuracy. Larger gaps indicate overfitting. CardioForest achieves near-optimal performance (95.1%) with only 40% of training data.</p>
</fn>
</table-wrap-foot>
</table-wrap>

</app>
<app id="app-7">
<title>Appendix G Computational Efficiency Analysis</title>
<p>While predictive performance is paramount, computational requirements significantly impact clinical deployment feasibility. <xref ref-type="app" rid="app-7">Appendix G</xref>&#x2019;s <xref ref-type="table" rid="table-12">Table A3</xref> presents training and inference timing for all models on identical hardware.</p>
<table-wrap id="table-12">
<label>Table A3</label>
<caption>
<title>Computational performance comparison</title>
</caption>
<table>
<colgroup>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/>
<col align="center"/> </colgroup>
<thead>
<tr>
<th>Metric</th>
<th>CardioForest</th>
<th>XGBoost</th>
<th>LightGBM</th>
<th>GradientBoosting</th>
</tr>
</thead>
<tbody>
<tr>
<td>Training time (10-fold CV)</td>
<td>87.3 min</td>
<td>12.4 min</td>
<td>3.2 min</td>
<td>45.8 min</td>
</tr>
<tr>
<td>Single prediction time</td>
<td>8.3 ms</td>
<td>2.1 ms</td>
<td>1.4 ms</td>
<td>15.7 ms</td>
</tr>
<tr>
<td>Model size (serialized)</td>
<td>245 MB</td>
<td>18 MB</td>
<td>12 MB</td>
<td>102 MB</td>
</tr>
<tr>
<td>Memory usage (inference)</td>
<td>312 MB</td>
<td>85 MB</td>
<td>48 MB</td>
<td>178 MB</td>
</tr>
<tr>
<td><bold>Predictions per Second</bold></td>
<td><bold>120</bold></td>
<td><bold>476</bold></td>
<td><bold>714</bold></td>
<td><bold>64</bold></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="table-A3fn1" fn-type="other">
<p>Note: Training time includes 10-fold cross-validation with full hyperparameter configuration. Single prediction time measured for one 10-feature ECG record, averaged over 10,000 inferences. CardioForest&#x2019;s 8.3 ms inference time enables real-time screening (120 ECGs/second), sufficient for emergency department deployment.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p><bold>Clinical Deployment Implications:</bold> CardioForest&#x2019;s 8.3 ms inference time translates to 120 predictions per second on standard server hardware&#x2014;more than adequate for real-time ECG screening in busy emergency departments (typical throughput: 5&#x2013;10 ECGs per minute). Training time of 87 minutes for full 10-fold cross-validation, while longer than boosting methods, is a one-time cost acceptable for model development.</p>
</app>
</app-group>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>[1]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Alblaihed</surname> <given-names>L</given-names></string-name>, <string-name><surname>Al-Salamah</surname> <given-names>T</given-names></string-name></person-group>. <article-title>Wide complex tachycardias</article-title>. <source>Emerg Med Clin N Am</source>. <year>2022</year>;<volume>40</volume>(<issue>4</issue>):<fpage>733</fpage>&#x2013;<lpage>53</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.emc.2022.06.010</pub-id>; <pub-id pub-id-type="pmid">36396219</pub-id></mixed-citation></ref>
<ref id="ref-2"><label>[2]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><surname>Bong</surname> <given-names>GS</given-names></string-name></person-group>. <chapter-title>Electrocardiographic differential diagnosis of narrow QRS and wide QRS complex tachycardias</chapter-title>. In: <source>Clinical use of electrocardiogram</source>. <publisher-loc>London, UK</publisher-loc>: <publisher-name>IntechOpen</publisher-name>; <year>2023</year>. doi:<pub-id pub-id-type="doi">10.5772/intechopen.102568</pub-id>.</mixed-citation></ref>
<ref id="ref-3"><label>[3]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Fayyazifar</surname> <given-names>N</given-names></string-name>, <string-name><surname>Dwivedi</surname> <given-names>G</given-names></string-name>, <string-name><surname>Suter</surname> <given-names>D</given-names></string-name>, <string-name><surname>Ahderom</surname> <given-names>S</given-names></string-name>, <string-name><surname>Maiorana</surname> <given-names>A</given-names></string-name>, <string-name><surname>Clarkin</surname> <given-names>O</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>A novel convolutional neural network structure for differential diagnosis of wide QRS complex tachycardia</article-title>. <source>Biomed Signal Process Control</source>. <year>2023</year>;<volume>81</volume>:<fpage>104506</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.bspc.2022.104506</pub-id>.</mixed-citation></ref>
<ref id="ref-4"><label>[4]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Kurl</surname> <given-names>S</given-names></string-name>, <string-name><surname>M&#x00E4;kikallio</surname> <given-names>TH</given-names></string-name>, <string-name><surname>Rautaharju</surname> <given-names>P</given-names></string-name>, <string-name><surname>Kiviniemi</surname> <given-names>V</given-names></string-name>, <string-name><surname>Laukkanen</surname> <given-names>JA</given-names></string-name></person-group>. <article-title>Duration of QRS complex in resting electrocardiogram is a predictor of sudden cardiac death in men</article-title>. <source>Circulation</source>. <year>2012</year>;<volume>125</volume>(<issue>21</issue>):<fpage>2588</fpage>&#x2013;<lpage>94</lpage>. doi:<pub-id pub-id-type="doi">10.1161/circulationaha.111.025577</pub-id>; <pub-id pub-id-type="pmid">22615341</pub-id></mixed-citation></ref>
<ref id="ref-5"><label>[5]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Silvetti</surname> <given-names>E</given-names></string-name>, <string-name><surname>Lanza</surname> <given-names>O</given-names></string-name>, <string-name><surname>Romeo</surname> <given-names>F</given-names></string-name>, <string-name><surname>Martino</surname> <given-names>A</given-names></string-name>, <string-name><surname>Fedele</surname> <given-names>E</given-names></string-name>, <string-name><surname>Lanzillo</surname> <given-names>C</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>The pivotal role of ECG in cardiomyopathies</article-title>. <source>Front Cardiovasc Med</source>. <year>2023</year>;<volume>10</volume>:<fpage>1178163</fpage>. doi:<pub-id pub-id-type="doi">10.3389/fcvm.2023.1178163</pub-id>; <pub-id pub-id-type="pmid">37404739</pub-id></mixed-citation></ref>
<ref id="ref-6"><label>[6]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><surname>Hampton</surname> <given-names>J</given-names></string-name>, <string-name><surname>Hampton</surname> <given-names>J</given-names></string-name>, <string-name><surname>Adlam</surname> <given-names>D</given-names></string-name></person-group>. <source>The ecg made easy e-book: the ecg made easy e-book</source>. <publisher-loc>Oxford, UK</publisher-loc>: <publisher-name>Elsevier Health Sciences</publisher-name>; <year>2024</year>.</mixed-citation></ref>
<ref id="ref-7"><label>[7]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Badura</surname> <given-names>K</given-names></string-name>, <string-name><surname>Bu&#x0142;awska</surname> <given-names>D</given-names></string-name>, <string-name><surname>D&#x0105;bek</surname> <given-names>B</given-names></string-name>, <string-name><surname>Witkowska</surname> <given-names>A</given-names></string-name>, <string-name><surname>Lisi&#x0144;ska</surname> <given-names>W</given-names></string-name>, <string-name><surname>Radzioch</surname> <given-names>E</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Primary electrical heart disease&#x2014;principles of pathophysiology and genetics</article-title>. <source>Int J Mol Sci</source>. <year>2024</year>;<volume>25</volume>(<issue>3</issue>):<fpage>1826</fpage>. doi:<pub-id pub-id-type="doi">10.3390/ijms25031826</pub-id>; <pub-id pub-id-type="pmid">38339103</pub-id></mixed-citation></ref>
<ref id="ref-8"><label>[8]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Jastrzebski</surname> <given-names>M</given-names></string-name>, <string-name><surname>Kukla</surname> <given-names>P</given-names></string-name>, <string-name><surname>Czarnecka</surname> <given-names>D</given-names></string-name>, <string-name><surname>Kawecka-Jaszcz</surname> <given-names>K</given-names></string-name></person-group>. <article-title>Comparison of five electrocardiographic methods for differentiation of wide QRS-complex tachycardias</article-title>. <source>EP Eur</source>. <year>2012</year>;<volume>14</volume>(<issue>8</issue>):<fpage>1165</fpage>&#x2013;<lpage>71</lpage>. doi:<pub-id pub-id-type="doi">10.1093/europace/eus015</pub-id>; <pub-id pub-id-type="pmid">22333239</pub-id></mixed-citation></ref>
<ref id="ref-9"><label>[9]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Osadchii</surname> <given-names>OE</given-names></string-name></person-group>. <article-title>Role of abnormal repolarization in the mechanism of cardiac arrhythmia</article-title>. <source>Acta Physiol</source>. <year>2017</year>;<volume>220</volume>:<fpage>1</fpage>&#x2013;<lpage>71</lpage>. doi:<pub-id pub-id-type="doi">10.1111/apha.12902</pub-id>; <pub-id pub-id-type="pmid">28707396</pub-id></mixed-citation></ref>
<ref id="ref-10"><label>[10]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Tse</surname> <given-names>G</given-names></string-name></person-group>. <article-title>Mechanisms of cardiac arrhythmias</article-title>. <source>J Arrhythmia</source>. <year>2016</year>;<volume>32</volume>(<issue>2</issue>):<fpage>75</fpage>&#x2013;<lpage>81</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.joa.2015.11.003</pub-id>; <pub-id pub-id-type="pmid">27092186</pub-id></mixed-citation></ref>
<ref id="ref-11"><label>[11]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Adib</surname> <given-names>E</given-names></string-name></person-group>. <article-title>Generating synthetic electrocardiograms using deep generative algorithms [dissertation]. San Antonio, TX, USA: The University of Texas at San Antonio</article-title>; <year>2023</year>.</mixed-citation></ref>
<ref id="ref-12"><label>[12]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Yang</surname> <given-names>X</given-names></string-name>, <string-name><surname>Li</surname> <given-names>T</given-names></string-name>, <string-name><surname>Su</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Kang</surname> <given-names>C</given-names></string-name>, <string-name><surname>Lyu</surname> <given-names>Y</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Application of large language models in disease diagnosis and treatment</article-title>. <source>Chin Med J</source>. <year>2025</year>;<volume>138</volume>(<issue>2</issue>):<fpage>130</fpage>&#x2013;<lpage>42</lpage>. doi:<pub-id pub-id-type="doi">10.1097/cm9.0000000000003456</pub-id>; <pub-id pub-id-type="pmid">39722188</pub-id></mixed-citation></ref>
<ref id="ref-13"><label>[13]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Adewole</surname> <given-names>KS</given-names></string-name>, <string-name><surname>Mojeed</surname> <given-names>HA</given-names></string-name>, <string-name><surname>Ogunmodede</surname> <given-names>JA</given-names></string-name>, <string-name><surname>Gabralla</surname> <given-names>LA</given-names></string-name>, <string-name><surname>Faruk</surname> <given-names>N</given-names></string-name>, <string-name><surname>Abdulkarim</surname> <given-names>A</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Expert system and decision support system for electrocardiogram interpretation and diagnosis: review, challenges and research directions</article-title>. <source>Appl Sci</source>. <year>2022</year>;<volume>12</volume>(<issue>23</issue>):<fpage>12342</fpage>. doi:<pub-id pub-id-type="doi">10.3390/app122312342</pub-id>.</mixed-citation></ref>
<ref id="ref-14"><label>[14]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Meyer</surname> <given-names>AND</given-names></string-name>, <string-name><surname>Giardina</surname> <given-names>TD</given-names></string-name>, <string-name><surname>Khawaja</surname> <given-names>L</given-names></string-name>, <string-name><surname>Singh</surname> <given-names>H</given-names></string-name></person-group>. <article-title>Patient and clinician experiences of uncertainty in the diagnostic process: current understanding and future directions</article-title>. <source>Patient Educ Couns</source>. <year>2021</year>;<volume>104</volume>(<issue>11</issue>):<fpage>2606</fpage>&#x2013;<lpage>15</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.pec.2021.07.028</pub-id>; <pub-id pub-id-type="pmid">34312032</pub-id></mixed-citation></ref>
<ref id="ref-15"><label>[15]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ali Muzammil</surname> <given-names>M</given-names></string-name>, <string-name><surname>Javid</surname> <given-names>S</given-names></string-name>, <string-name><surname>Afridi</surname> <given-names>AK</given-names></string-name>, <string-name><surname>Siddineni</surname> <given-names>R</given-names></string-name>, <string-name><surname>Shahabi</surname> <given-names>M</given-names></string-name>, <string-name><surname>Haseeb</surname> <given-names>M</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Artificial intelligence-enhanced electrocardiography for accurate diagnosis and management of cardiovascular diseases</article-title>. <source>J Electrocardiol</source>. <year>2024</year>;<volume>83</volume>(<issue>3</issue>):<fpage>30</fpage>&#x2013;<lpage>40</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.jelectrocard.2024.01.006</pub-id>; <pub-id pub-id-type="pmid">38301492</pub-id></mixed-citation></ref>
<ref id="ref-16"><label>[16]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Abubaker</surname> <given-names>MB</given-names></string-name>, <string-name><surname>Babayi&#x011F;it</surname> <given-names>B</given-names></string-name></person-group>. <article-title>Detection of cardiovascular diseases in ECG images using machine learning and deep learning methods</article-title>. <source>IEEE Trans Artif Intell</source>. <year>2023</year>;<volume>4</volume>(<issue>2</issue>):<fpage>373</fpage>&#x2013;<lpage>82</lpage>. doi:<pub-id pub-id-type="doi">10.1109/tai.2022.3159505</pub-id>.</mixed-citation></ref>
<ref id="ref-17"><label>[17]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ahmed</surname> <given-names>AA</given-names></string-name>, <string-name><surname>Ali</surname> <given-names>W</given-names></string-name>, <string-name><surname>Abdullah</surname> <given-names>TAA</given-names></string-name>, <string-name><surname>Malebary</surname> <given-names>SJ</given-names></string-name></person-group>. <article-title>Classifying cardiac arrhythmia from ECG signal using 1D CNN deep learning model</article-title>. <source>Mathematics</source>. <year>2023</year>;<volume>11</volume>(<issue>3</issue>):<fpage>562</fpage>. doi:<pub-id pub-id-type="doi">10.3390/math11030562</pub-id>.</mixed-citation></ref>
<ref id="ref-18"><label>[18]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><surname>Ramkumar</surname> <given-names>M</given-names></string-name>, <string-name><surname>Ganesh Babu</surname> <given-names>C</given-names></string-name>, <string-name><surname>Manjunathan</surname> <given-names>A</given-names></string-name>, <string-name><surname>Udhayanan</surname> <given-names>S</given-names></string-name>, <string-name><surname>Mathankumar</surname> <given-names>M</given-names></string-name>, <string-name><surname>Sarath Kumar</surname> <given-names>R</given-names></string-name></person-group>. <chapter-title>A graphical user interface based heart rate monitoring process and detection of PQRST peaks from ECG signal</chapter-title>. In: <source>Inventive computation and information technologies</source>. <publisher-loc>Singapore</publisher-loc>: <publisher-name>Springer Nature</publisher-name>; <year>2021</year>. p. <fpage>481</fpage>&#x2013;<lpage>96</lpage>. doi:<pub-id pub-id-type="doi">10.1007/978-981-33-4305-4_36</pub-id>.</mixed-citation></ref>
<ref id="ref-19"><label>[19]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Brisk</surname> <given-names>R</given-names></string-name></person-group>. <article-title>Towards broader application of deep learning methods to the automated analysis of electrocardiograms [Ph.D. thesis]. Northern Ireland, UK: Ulster University</article-title>; <year>2023</year>.</mixed-citation></ref>
<ref id="ref-20"><label>[20]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Nissa</surname> <given-names>N</given-names></string-name>, <string-name><surname>Jamwal</surname> <given-names>S</given-names></string-name>, <string-name><surname>Neshat</surname> <given-names>M</given-names></string-name></person-group>. <article-title>A technical comparative heart disease prediction framework using boosting ensemble techniques</article-title>. <source>Computation</source>. <year>2024</year>;<volume>12</volume>(<issue>1</issue>):<fpage>15</fpage>. doi:<pub-id pub-id-type="doi">10.3390/computation12010015</pub-id>.</mixed-citation></ref>
<ref id="ref-21"><label>[21]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Lundberg</surname> <given-names>SM</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>S-I</given-names></string-name></person-group>. <article-title>A unified approach to interpreting model predictions</article-title>. <source>Adv Neural Inf Process Syst</source>. <year>2017</year>;<volume>30</volume>:<fpage>4765</fpage>&#x2013;<lpage>74</lpage>.</mixed-citation></ref>
<ref id="ref-22"><label>[22]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Sadeghi</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Alizadehsani</surname> <given-names>R</given-names></string-name>, <string-name><surname>Cifci</surname> <given-names>MA</given-names></string-name>, <string-name><surname>Kausar</surname> <given-names>S</given-names></string-name>, <string-name><surname>Rehman</surname> <given-names>R</given-names></string-name>, <string-name><surname>Mahanta</surname> <given-names>P</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>A review of explainable artificial intelligence in healthcare</article-title>. <source>Comput Electr Eng</source>. <year>2024</year>;<volume>118</volume>:<fpage>109370</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.compeleceng.2024.109370</pub-id>.</mixed-citation></ref>
<ref id="ref-23"><label>[23]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Li</surname> <given-names>ZZ</given-names></string-name>, <string-name><surname>Zhao</surname> <given-names>W</given-names></string-name>, <string-name><surname>Mao</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Bo</surname> <given-names>D</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Kojodjojo</surname> <given-names>P</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>A machine learning approach to differentiate wide QRS tachycardia: distinguishing ventricular tachycardia from supraventricular tachycardia</article-title>. <source>J Interv Card Electrophysiol</source>. <year>2024</year>;<volume>67</volume>(<issue>6</issue>):<fpage>1391</fpage>&#x2013;<lpage>8</lpage>. doi:<pub-id pub-id-type="doi">10.1007/s10840-024-01743-9</pub-id>; <pub-id pub-id-type="pmid">38246906</pub-id></mixed-citation></ref>
<ref id="ref-24"><label>[24]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chow</surname> <given-names>BJW</given-names></string-name>, <string-name><surname>Fayyazifar</surname> <given-names>N</given-names></string-name>, <string-name><surname>Balamane</surname> <given-names>S</given-names></string-name>, <string-name><surname>Saha</surname> <given-names>N</given-names></string-name>, <string-name><surname>Farooqui</surname> <given-names>M</given-names></string-name>, <string-name><surname>Hasan</surname> <given-names>A</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Interpreting wide-complex tachycardia with the use of artificial intelligence</article-title>. <source>Can J Cardiol</source>. <year>2024</year>;<volume>40</volume>(<issue>10</issue>):<fpage>1965</fpage>&#x2013;<lpage>73</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.cjca.2024.03.027</pub-id>; <pub-id pub-id-type="pmid">38588794</pub-id></mixed-citation></ref>
<ref id="ref-25"><label>[25]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Bhattacharya</surname> <given-names>M</given-names></string-name>, <string-name><surname>Lu</surname> <given-names>D-Y</given-names></string-name>, <string-name><surname>Kudchadkar</surname> <given-names>SM</given-names></string-name>, <string-name><surname>Greenland</surname> <given-names>GV</given-names></string-name>, <string-name><surname>Lingamaneni</surname> <given-names>P</given-names></string-name>, <string-name><surname>Corona-Villalobos</surname> <given-names>CP</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Identifying ventricular arrhythmias and their predictors by applying machine learning methods to electronic health records in patients with hypertrophic cardiomyopathy (hcm-var-risk model)</article-title>. <source>Am J Cardiol</source>. <year>2019</year>;<volume>123</volume>(<issue>10</issue>):<fpage>1681</fpage>&#x2013;<lpage>9</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.amjcard.2019.02.022</pub-id>; <pub-id pub-id-type="pmid">30952382</pub-id></mixed-citation></ref>
<ref id="ref-26"><label>[26]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Hong</surname> <given-names>S</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Shang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Xiao</surname> <given-names>C</given-names></string-name>, <string-name><surname>Sun</surname> <given-names>J</given-names></string-name></person-group>. <article-title>Opportunities and challenges of deep learning methods for electrocardiogram data: a systematic review</article-title>. <source>Comput Biol Med</source>. <year>2020</year>;<volume>122</volume>:<fpage>103801</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.compbiomed.2020.103801</pub-id>; <pub-id pub-id-type="pmid">32658725</pub-id></mixed-citation></ref>
<ref id="ref-27"><label>[27]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>May</surname> <given-names>AM</given-names></string-name>, <string-name><surname>Katbamna</surname> <given-names>BB</given-names></string-name>, <string-name><surname>Shaikh</surname> <given-names>PA</given-names></string-name>, <string-name><surname>LoCoco</surname> <given-names>S</given-names></string-name>, <string-name><surname>Deych</surname> <given-names>E</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>R</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Automated differentiation of wide QRS complex tachycardia using QRS complex polarity</article-title>. <source>Commun Med</source>. <year>2024</year>;<volume>4</volume>(<issue>1</issue>):<fpage>282</fpage>. doi:<pub-id pub-id-type="doi">10.1038/s43856-024-00725-2</pub-id>; <pub-id pub-id-type="pmid">39741223</pub-id></mixed-citation></ref>
<ref id="ref-28"><label>[28]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Howladar</surname> <given-names>P</given-names></string-name>, <string-name><surname>Sahoo</surname> <given-names>M</given-names></string-name></person-group>. <article-title>Supraventricular tachycardia detection and classification model of ecg signal using machine learning</article-title>. <comment>arXiv:2112.12953. 2021</comment>. doi: <pub-id pub-id-type="doi">10.48550/arxiv.2112.12953</pub-id>.</mixed-citation></ref>
<ref id="ref-29"><label>[29]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Rajpurkar</surname> <given-names>P</given-names></string-name>, <string-name><surname>Hannun</surname> <given-names>AY</given-names></string-name>, <string-name><surname>Haghpanahi</surname> <given-names>M</given-names></string-name>, <string-name><surname>Bourn</surname> <given-names>C</given-names></string-name>, <string-name><surname>Ng</surname> <given-names>AY</given-names></string-name></person-group>. <article-title>Cardiologist-level arrhythmia detection with convolutional neural networks</article-title>. <comment>arXiv:1707.01836. 2017</comment>. doi: <pub-id pub-id-type="doi">10.48550/arxiv.1707.01836</pub-id>.</mixed-citation></ref>
<ref id="ref-30"><label>[30]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Frausto-Avila</surname> <given-names>M</given-names></string-name>, <string-name><surname>Manriquez-Amavizca</surname> <given-names>JP</given-names></string-name>, <string-name><surname>Alfred</surname> <given-names>U&#x0027;Ren</given-names></string-name>, <string-name><surname>Quiroz-Juarez</surname> <given-names>MA</given-names></string-name></person-group>. <article-title>Compact neural network algorithm for electrocardiogram classification</article-title>. <comment>arXiv:2412.17852. 2024</comment>. doi: <pub-id pub-id-type="doi">10.48550/arxiv.2412.17852</pub-id>.</mixed-citation></ref>
<ref id="ref-31"><label>[31]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><surname>Gow</surname> <given-names>B</given-names></string-name>, <string-name><surname>Pollard</surname> <given-names>T</given-names></string-name>, <string-name><surname>Nathanson</surname> <given-names>LA</given-names></string-name>, <string-name><surname>Johnson</surname> <given-names>A</given-names></string-name>, <string-name><surname>Moody</surname> <given-names>B</given-names></string-name>, <string-name><surname>Fernandes</surname> <given-names>C</given-names></string-name>, <etal>et al</etal></person-group>. <source>Mimic-iv-ecg: diagnostic electrocardiogram matched subset (version 1.0)</source>. <publisher-loc>New York, NY, USA</publisher-loc>: <publisher-name>PhysioNet</publisher-name>; <year>2023</year>.</mixed-citation></ref>
<ref id="ref-32"><label>[32]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Goldberger</surname> <given-names>AL</given-names></string-name>, <string-name><surname>Amaral</surname> <given-names>LAN</given-names></string-name>, <string-name><surname>Glass</surname> <given-names>L</given-names></string-name>, <string-name><surname>Hausdorff</surname> <given-names>JM</given-names></string-name>, <string-name><surname>Ivanov</surname> <given-names>PC</given-names></string-name>, <string-name><surname>Mark</surname> <given-names>RG</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals</article-title>. <source>Circulation</source>. <year>2000</year>;<volume>101</volume>(<issue>23</issue>):<fpage>E215</fpage>&#x2013;<lpage>20</lpage>. doi:<pub-id pub-id-type="doi">10.1161/01.cir.101.23.e215</pub-id>; <pub-id pub-id-type="pmid">10851218</pub-id></mixed-citation></ref>
<ref id="ref-33"><label>[33]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Li</surname> <given-names>H</given-names></string-name>, <string-name><surname>Monger</surname> <given-names>R</given-names></string-name>, <string-name><surname>Pishgar</surname> <given-names>E</given-names></string-name>, <string-name><surname>Pishgar</surname> <given-names>M</given-names></string-name></person-group>. <article-title>Icu readmission prediction for intracerebral hemorrhage patients using mimic iii and mimic iv databases</article-title>. <source>medRxiv</source>. <year>2025</year>;<volume>12</volume>(<issue>2</issue>):<fpage>673624</fpage>. doi:<pub-id pub-id-type="doi">10.1101/2025.01.01.25319859</pub-id>.</mixed-citation></ref>
<ref id="ref-34"><label>[34]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Yoon</surname> <given-names>D</given-names></string-name>, <string-name><surname>Han</surname> <given-names>C</given-names></string-name>, <string-name><surname>Kim</surname> <given-names>DW</given-names></string-name>, <string-name><surname>Kim</surname> <given-names>S</given-names></string-name>, <string-name><surname>Bae</surname> <given-names>S</given-names></string-name>, <string-name><surname>Ryu</surname> <given-names>JA</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Redefining health care data interoperability: empirical exploration of large language models in information exchange</article-title>. <source>J Med Internet Res</source>. <year>2024</year>;<volume>26</volume>:<fpage>e56614</fpage>. doi:<pub-id pub-id-type="doi">10.2196/56614</pub-id>; <pub-id pub-id-type="pmid">38819879</pub-id></mixed-citation></ref>
<ref id="ref-35"><label>[35]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Sharma</surname> <given-names>D</given-names></string-name>, <string-name><surname>Kohli</surname> <given-names>N</given-names></string-name></person-group>. <article-title>WFDB Software for Python: a toolkit for physiological signals</article-title>. In: <conf-name>Proceedings of the 2023 Third International Conference on Secure Cyber Computing and Communication (ICSCCC); 2023 May 26&#x2013;28</conf-name>; <publisher-loc>Jalandhar, India. New York, NY, USA</publisher-loc>: <publisher-name>IEEE</publisher-name>; <year>2023</year>. p. <fpage>86</fpage>&#x2013;<lpage>92</lpage>. doi:<pub-id pub-id-type="doi">10.1109/icsccc58608.2023.10176714</pub-id>.</mixed-citation></ref>
<ref id="ref-36"><label>[36]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Goodwin</surname> <given-names>AJ</given-names></string-name>, <string-name><surname>Eytan</surname> <given-names>D</given-names></string-name>, <string-name><surname>Dixon</surname> <given-names>W</given-names></string-name>, <string-name><surname>Goodfellow</surname> <given-names>SD</given-names></string-name>, <string-name><surname>Doherty</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Greer</surname> <given-names>RW</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>Timing errors and temporal uncertainty in clinical databases&#x2014;a narrative review</article-title>. <source>Front Digit Health</source>. <year>2022</year>;<volume>4</volume>:<fpage>932599</fpage>. doi:<pub-id pub-id-type="doi">10.3389/fdgth.2022.932599</pub-id>; <pub-id pub-id-type="pmid">36060541</pub-id></mixed-citation></ref>
<ref id="ref-37"><label>[37]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chakma</surname> <given-names>V</given-names></string-name>, <string-name><surname>Ju</surname> <given-names>X</given-names></string-name>, <string-name><surname>Cao</surname> <given-names>H</given-names></string-name>, <string-name><surname>Feng</surname> <given-names>X</given-names></string-name>, <string-name><surname>Ji</surname> <given-names>X</given-names></string-name>, <string-name><surname>Pan</surname> <given-names>H</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>CardioForest: an explainable ensemble learning model for automatic wide QRS complex tachycardia diagnosis from ECG</article-title>. <source>medRxiv</source>. <year>2025</year>;<volume>40</volume>(<issue>4</issue>):<fpage>733</fpage>. doi:<pub-id pub-id-type="doi">10.1101/2025.09.15.25335837</pub-id>.</mixed-citation></ref>
<ref id="ref-38"><label>[38]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><surname>Gupta</surname> <given-names>P</given-names></string-name>, <string-name><surname>Bagchi</surname> <given-names>A</given-names></string-name></person-group>. <chapter-title>Data manipulation with pandas</chapter-title>. In: <source>Essentials of python for artificial intelligence and machine learning</source>. <publisher-loc>Cham, Switzerland</publisher-loc>: <publisher-name>Springer Nature</publisher-name>; <year>2024</year>. p. <fpage>197</fpage>&#x2013;<lpage>235</lpage>. doi:<pub-id pub-id-type="doi">10.1007/978-3-031-43725-0_6</pub-id>.</mixed-citation></ref>
<ref id="ref-39"><label>[39]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Morelli</surname> <given-names>D</given-names></string-name>, <string-name><surname>Rossi</surname> <given-names>A</given-names></string-name>, <string-name><surname>Cairo</surname> <given-names>M</given-names></string-name>, <string-name><surname>Clifton</surname> <given-names>DA</given-names></string-name></person-group>. <article-title>Analysis of the impact of interpolation methods of missing RR-intervals caused by motion artifacts on HRV features estimations</article-title>. <source>Sensors</source>. <year>2019</year>;<volume>19</volume>(<issue>14</issue>):<fpage>3163</fpage>. doi:<pub-id pub-id-type="doi">10.3390/s19143163</pub-id>; <pub-id pub-id-type="pmid">31323850</pub-id></mixed-citation></ref>
<ref id="ref-40"><label>[40]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Lwin</surname> <given-names>TZ</given-names></string-name>, <string-name><surname>Naing</surname> <given-names>L</given-names></string-name></person-group>. <article-title>How to choose a sampling technique and determine sample size for research: a simplified guide for researchers</article-title>. <source>J Med</source>. <year>2024</year>;<volume>25</volume>(<issue>1</issue>):<fpage>1</fpage>&#x2013;<lpage>12</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.oor.2024.100662</pub-id>.</mixed-citation></ref>
<ref id="ref-41"><label>[41]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Pudjihartono</surname> <given-names>N</given-names></string-name>, <string-name><surname>Fadason</surname> <given-names>T</given-names></string-name>, <string-name><surname>Kempa-Liehr</surname> <given-names>AW</given-names></string-name>, <string-name><surname>O&#x2019;Sullivan</surname> <given-names>JM</given-names></string-name></person-group>. <article-title>A review of feature selection methods for machine learning-based disease risk prediction</article-title>. <source>Front Bioinform</source>. <year>2022</year>;<volume>2</volume>:<fpage>927312</fpage>. doi:<pub-id pub-id-type="doi">10.3389/fbinf.2022.927312</pub-id>; <pub-id pub-id-type="pmid">36304293</pub-id></mixed-citation></ref>
<ref id="ref-42"><label>[42]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Jolliffe</surname> <given-names>IT</given-names></string-name>, <string-name><surname>Cadima</surname> <given-names>J</given-names></string-name></person-group>. <article-title>Principal component analysis: a review and recent developments</article-title>. <source>Phil Trans R Soc A</source>. <year>2016</year>;<volume>374</volume>(<issue>2065</issue>):<fpage>20150202</fpage>. doi:<pub-id pub-id-type="doi">10.1098/rsta.2015.0202</pub-id>; <pub-id pub-id-type="pmid">26953178</pub-id></mixed-citation></ref>
<ref id="ref-43"><label>[43]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Sharma</surname> <given-names>LN</given-names></string-name>, <string-name><surname>Dandapat</surname> <given-names>S</given-names></string-name>, <string-name><surname>Mahanta</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Multichannel ECG data compression based on multiscale principal component analysis</article-title>. <source>IEEE Trans Inform Technol Biomed</source>. <year>2012</year>;<volume>16</volume>(<issue>4</issue>):<fpage>730</fpage>&#x2013;<lpage>6</lpage>. doi:<pub-id pub-id-type="doi">10.1109/titb.2012.2195322</pub-id>; <pub-id pub-id-type="pmid">22542694</pub-id></mixed-citation></ref>
<ref id="ref-44"><label>[44]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Melzi</surname> <given-names>P</given-names></string-name>, <string-name><surname>Tolosana</surname> <given-names>R</given-names></string-name>, <string-name><surname>Cecconi</surname> <given-names>A</given-names></string-name>, <string-name><surname>Sanz-Garcia</surname> <given-names>A</given-names></string-name>, <string-name><surname>Ortega</surname> <given-names>GJ</given-names></string-name>, <string-name><surname>Jimenez-Borreguero</surname> <given-names>LJ</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Analyzing artificial intelligence systems for the prediction of atrial fibrillation from sinus-rhythm ECGs including demographics and feature visualization</article-title>. <source>Sci Rep</source>. <year>2021</year>;<volume>11</volume>(<issue>1</issue>):<fpage>22786</fpage>. doi:<pub-id pub-id-type="doi">10.1038/s41598-021-02179-1</pub-id>; <pub-id pub-id-type="pmid">34815461</pub-id></mixed-citation></ref>
<ref id="ref-45"><label>[45]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><surname>Little</surname> <given-names>R</given-names></string-name>, <string-name><surname>Rubin</surname> <given-names>D</given-names></string-name></person-group>. <source>Statistical analysis with missing data</source>. <edition>3rd ed</edition>. <publisher-loc>Hoboken, NJ, USA</publisher-loc>: <publisher-name>John Wiley &#x0026; Sons</publisher-name>; <year>2019</year>. doi:<pub-id pub-id-type="doi">10.1002/9781119482260</pub-id>.</mixed-citation></ref>
<ref id="ref-46"><label>[46]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ochieng&#x2019; Odhiambo</surname> <given-names>F</given-names></string-name></person-group>. <article-title>Comparative study of various methods of handling missing data</article-title>. <source>Math Model Appl</source>. <year>2020</year>;<volume>5</volume>(<issue>2</issue>):<fpage>87</fpage>. doi:<pub-id pub-id-type="doi">10.11648/j.mma.20200502.14</pub-id>.</mixed-citation></ref>
<ref id="ref-47"><label>[47]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Deo</surname> <given-names>TY</given-names></string-name>, <string-name><surname>Sanju</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Data imputation and comparison of custom ensemble models with existing libraries like XGBoost, CATBoost, AdaBoost and Scikit learn for predictive equipment failure</article-title>. <source>Mater Today Proc</source>. <year>2023</year>;<volume>72</volume>:<fpage>1596</fpage>&#x2013;<lpage>604</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.matpr.2022.09.410</pub-id>.</mixed-citation></ref>
<ref id="ref-48"><label>[48]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Potdar</surname> <given-names>K</given-names></string-name>, <string-name><surname>Taher</surname> <given-names>S</given-names></string-name>, <string-name><surname>Chinmay</surname> <given-names>D</given-names></string-name></person-group>. <article-title>A comparative study of categorical variable encoding techniques for neural network classifiers</article-title>. <source>Int J Comput Appl</source>. <year>2017</year>;<volume>175</volume>(<issue>4</issue>):<fpage>7</fpage>&#x2013;<lpage>9</lpage>. doi:<pub-id pub-id-type="doi">10.5120/ijca2017915495</pub-id>.</mixed-citation></ref>
<ref id="ref-49"><label>[49]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><surname>Garreta</surname> <given-names>R</given-names></string-name>, <string-name><surname>Moncecchi</surname> <given-names>G</given-names></string-name></person-group>. <source>Learning scikit-learn: machine learning in python</source>. Vol. <volume>2013</volume>. <publisher-loc>Birmingham, UK</publisher-loc>: <publisher-name>Packt Publishing Birmingham</publisher-name>; <year>2013</year>.</mixed-citation></ref>
<ref id="ref-50"><label>[50]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Siontis</surname> <given-names>KC</given-names></string-name>, <string-name><surname>Noseworthy</surname> <given-names>PA</given-names></string-name>, <string-name><surname>Attia</surname> <given-names>ZI</given-names></string-name>, <string-name><surname>Friedman</surname> <given-names>PA</given-names></string-name></person-group>. <article-title>Artificial intelligence-enhanced electrocardiography in cardiovascular disease management</article-title>. <source>Nat Rev Cardiol</source>. <year>2021</year>;<volume>18</volume>(<issue>7</issue>):<fpage>465</fpage>&#x2013;<lpage>78</lpage>. doi:<pub-id pub-id-type="doi">10.1038/s41569-020-00503-2</pub-id>; <pub-id pub-id-type="pmid">33526938</pub-id></mixed-citation></ref>
<ref id="ref-51"><label>[51]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Regoli</surname> <given-names>FD</given-names></string-name>, <string-name><surname>Cattaneo</surname> <given-names>M</given-names></string-name>, <string-name><surname>Kola</surname> <given-names>F</given-names></string-name>, <string-name><surname>Thartori</surname> <given-names>A</given-names></string-name>, <string-name><surname>Bytyci</surname> <given-names>H</given-names></string-name>, <string-name><surname>Saccarello</surname> <given-names>L</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Management of hemodynamically stable wide QRS complex tachycardia in patients with implantable cardioverter defibrillators</article-title>. <source>Front Cardiovasc Med</source>. <year>2023</year>;<volume>9</volume>:<fpage>1011619</fpage>. doi:<pub-id pub-id-type="doi">10.3389/fcvm.2022.1011619</pub-id>; <pub-id pub-id-type="pmid">36684577</pub-id></mixed-citation></ref>
<ref id="ref-52"><label>[52]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Mulo</surname> <given-names>J</given-names></string-name>, <string-name><surname>Liang</surname> <given-names>H</given-names></string-name>, <string-name><surname>Qian</surname> <given-names>M</given-names></string-name>, <string-name><surname>Biswas</surname> <given-names>M</given-names></string-name>, <string-name><surname>Rawal</surname> <given-names>B</given-names></string-name>, <string-name><surname>Guo</surname> <given-names>Y</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Navigating challenges and harnessing opportunities: deep learning applications in Internet of medical things</article-title>. <source>Future Internet</source>. <year>2025</year>;<volume>17</volume>(<issue>3</issue>):<fpage>107</fpage>. doi:<pub-id pub-id-type="doi">10.3390/fi17030107</pub-id>.</mixed-citation></ref>
<ref id="ref-53"><label>[53]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Khan</surname> <given-names>H</given-names></string-name>, <string-name><surname>Bilal</surname> <given-names>A</given-names></string-name>, <string-name><surname>Aslam</surname> <given-names>MA</given-names></string-name>, <string-name><surname>Mustafa</surname> <given-names>H</given-names></string-name></person-group>. <article-title>Heart disease detection: a comprehensive analysis of machine learning, ensemble learning, and deep learning algorithms</article-title>. <source>Nano Biomed Eng</source>. <year>2024</year>;<volume>16</volume>(<issue>4</issue>):<fpage>677</fpage>&#x2013;<lpage>90</lpage>. doi:<pub-id pub-id-type="doi">10.26599/nbe.2024.9290087</pub-id>.</mixed-citation></ref>
<ref id="ref-54"><label>[54]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Breiman</surname> <given-names>L</given-names></string-name></person-group>. <article-title>Random forests</article-title>. <source>Mach Learn</source>. <year>2001</year>;<volume>45</volume>(<issue>1</issue>):<fpage>5</fpage>&#x2013;<lpage>32</lpage>. doi:<pub-id pub-id-type="doi">10.1023/A:1010933404324</pub-id>.</mixed-citation></ref>
<ref id="ref-55"><label>[55]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Du</surname> <given-names>KL</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>R</given-names></string-name>, <string-name><surname>Jiang</surname> <given-names>B</given-names></string-name>, <string-name><surname>Zeng</surname> <given-names>J</given-names></string-name>, <string-name><surname>Lu</surname> <given-names>J</given-names></string-name></person-group>. <article-title>Foundations and innovations in data fusion and ensemble learning for effective consensus</article-title>. <source>Mathematics</source>. <year>2025</year>;<volume>13</volume>(<issue>4</issue>):<fpage>587</fpage>. doi:<pub-id pub-id-type="doi">10.3390/math13040587</pub-id>.</mixed-citation></ref>
<ref id="ref-56"><label>[56]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><surname>Breiman</surname> <given-names>L</given-names></string-name>, <string-name><surname>Friedman</surname> <given-names>JH</given-names></string-name>, <string-name><surname>Olshen</surname> <given-names>RA</given-names></string-name>, <string-name><surname>Stone</surname> <given-names>CJ</given-names></string-name></person-group>. <source>Classification and regression trees</source>. <publisher-loc>Monterey, CA, USA</publisher-loc>: <publisher-name>Wadsworth &#x0026; Brooks/Cole Advanced Books &#x0026; Software</publisher-name>; <year>1984</year>.</mixed-citation></ref>
<ref id="ref-57"><label>[57]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Esposito</surname> <given-names>F</given-names></string-name>, <string-name><surname>Malerba</surname> <given-names>D</given-names></string-name>, <string-name><surname>Semeraro</surname> <given-names>G</given-names></string-name>, <string-name><surname>Kay</surname> <given-names>J</given-names></string-name></person-group>. <article-title>A comparative analysis of methods for pruning decision trees</article-title>. <source>IEEE Trans Pattern Anal Machine Intell</source>. <year>1997</year>;<volume>19</volume>(<issue>5</issue>):<fpage>476</fpage>&#x2013;<lpage>93</lpage>. doi:<pub-id pub-id-type="doi">10.1109/34.589207</pub-id>.</mixed-citation></ref>
<ref id="ref-58"><label>[58]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Friedman</surname> <given-names>JH</given-names></string-name></person-group>. <article-title>Greedy function approximation: a gradient boosting machine</article-title>. <source>Ann Statist</source>. <year>2001</year>;<volume>29</volume>(<issue>5</issue>):<fpage>1189</fpage>&#x2013;<lpage>232</lpage>. doi:<pub-id pub-id-type="doi">10.1214/aos/1013203451</pub-id>.</mixed-citation></ref>
<ref id="ref-59"><label>[59]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Chen</surname> <given-names>T</given-names></string-name>, <string-name><surname>Guestrin</surname> <given-names>C</given-names></string-name></person-group>. <article-title>XGBoost: a scalable tree boosting system</article-title>. In: <conf-name>Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2016 Aug 13&#x2013;17</conf-name>; <publisher-loc>San Francisco, CA, USA. New York, NY, USA</publisher-loc>: <publisher-name>ACM</publisher-name>; <year>2016</year>. p. <fpage>785</fpage>&#x2013;<lpage>94</lpage>. doi:<pub-id pub-id-type="doi">10.1145/2939672.2939785</pub-id>.</mixed-citation></ref>
<ref id="ref-60"><label>[60]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ke</surname> <given-names>G</given-names></string-name>, <string-name><surname>Meng</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Finley</surname> <given-names>T</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>T</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>W</given-names></string-name>, <string-name><surname>Ma</surname> <given-names>W</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>Lightgbm: a highly efficient gradient boosting decision tree</article-title>. <source>Adv Neural Inf Process Syst</source>. <year>2017</year>;<volume>30</volume>:<fpage>3146</fpage>&#x2013;<lpage>54</lpage>.</mixed-citation></ref>
<ref id="ref-61"><label>[61]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Manivannan</surname> <given-names>GS</given-names></string-name>, <string-name><surname>Rajaguru</surname> <given-names>H</given-names></string-name>, <string-name><surname>Rajanna</surname> <given-names>S</given-names></string-name>, <string-name><surname>Talawar</surname> <given-names>SV</given-names></string-name></person-group>. <article-title>Cardiovascular disease detection from cardiac arrhythmia ECG signals using artificial intelligence models with hyperparameters tuning methodologies</article-title>. <source>Heliyon</source>. <year>2024</year>;<volume>10</volume>(<issue>17</issue>):<fpage>e36751</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.heliyon.2024.e36751</pub-id>; <pub-id pub-id-type="pmid">39263121</pub-id></mixed-citation></ref>
<ref id="ref-62"><label>[62]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Bergstra</surname> <given-names>J</given-names></string-name>, <string-name><surname>Bengio</surname> <given-names>Y</given-names></string-name></person-group>. <article-title>Random search for hyper-parameter optimization</article-title>. <source>J Mach Learn Res</source>. <year>2012</year>;<volume>13</volume>(<issue>1</issue>):<fpage>281</fpage>&#x2013;<lpage>305</lpage>.</mixed-citation></ref>
<ref id="ref-63"><label>[63]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>da Silva</surname> <given-names>JHB</given-names></string-name>, <string-name><surname>Cortez</surname> <given-names>PC</given-names></string-name>, <string-name><surname>Jagatheesaperumal</surname> <given-names>SK</given-names></string-name>, <string-name><surname>de Albuquerque</surname> <given-names>VHC</given-names></string-name></person-group>. <article-title>ECG measurement uncertainty based on Monte Carlo approach: an effective analysis for a successful cardiac health monitoring system</article-title>. <source>Bioengineering</source>. <year>2023</year>;<volume>10</volume>(<issue>1</issue>):<fpage>115</fpage>. doi:<pub-id pub-id-type="doi">10.3390/bioengineering10010115</pub-id>; <pub-id pub-id-type="pmid">36671687</pub-id></mixed-citation></ref>
<ref id="ref-64"><label>[64]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Kohavi</surname> <given-names>R</given-names></string-name></person-group>. <article-title>A study of cross-validation and bootstrap for accuracy estimation and model selection</article-title>. In: <conf-name>Proceedings of the 14th International Joint Conference on Artificial Intelligence; 1995 Aug 20&#x2013;25</conf-name>; <publisher-loc>Montreal, QC, Canada</publisher-loc>. p. <fpage>1137</fpage>&#x2013;<lpage>45</lpage>.</mixed-citation></ref>
<ref id="ref-65"><label>[65]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Bradley</surname> <given-names>AP</given-names></string-name></person-group>. <article-title>The use of the area under the ROC curve in the evaluation of machine learning algorithms</article-title>. <source>Pattern Recognit</source>. <year>1997</year>;<volume>30</volume>(<issue>7</issue>):<fpage>1145</fpage>&#x2013;<lpage>59</lpage>. doi:<pub-id pub-id-type="doi">10.1016/s0031-3203(96)00142-2</pub-id>.</mixed-citation></ref>
<ref id="ref-66"><label>[66]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Brodersen</surname> <given-names>KH</given-names></string-name>, <string-name><surname>Ong</surname> <given-names>CS</given-names></string-name>, <string-name><surname>Stephan</surname> <given-names>KE</given-names></string-name>, <string-name><surname>Buhmann</surname> <given-names>JM</given-names></string-name></person-group>. <article-title>The balanced accuracy and its posterior distribution</article-title>. In: <conf-name>Proceedings of the 2010 20th International Conference on Pattern Recognition; 2010 Aug 23&#x2013;26</conf-name>; <publisher-loc>Istanbul, Turkey. New York, NY, USA</publisher-loc>: <publisher-name>IEEE</publisher-name>; <year>2010</year>. p. <fpage>3121</fpage>&#x2013;<lpage>4</lpage>. doi:<pub-id pub-id-type="doi">10.1109/icpr.2010.764</pub-id>.</mixed-citation></ref>
<ref id="ref-67"><label>[67]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Lundberg</surname> <given-names>SM</given-names></string-name>, <string-name><surname>Erion</surname> <given-names>G</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>H</given-names></string-name>, <string-name><surname>DeGrave</surname> <given-names>A</given-names></string-name>, <string-name><surname>Prutkin</surname> <given-names>JM</given-names></string-name>, <string-name><surname>Nair</surname> <given-names>B</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>From local explanations to global understanding with explainable AI for trees</article-title>. <source>Nat Mach Intell</source>. <year>2020</year>;<volume>2</volume>(<issue>1</issue>):<fpage>56</fpage>&#x2013;<lpage>67</lpage>. doi:<pub-id pub-id-type="doi">10.1038/s42256-019-0138-9</pub-id>; <pub-id pub-id-type="pmid">32607472</pub-id></mixed-citation></ref>
</ref-list>
</back></article>
















