<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">CMES</journal-id>
<journal-id journal-id-type="nlm-ta">CMES</journal-id>
<journal-id journal-id-type="publisher-id">CMES</journal-id>
<journal-title-group>
<journal-title>Computer Modeling in Engineering &#x0026; Sciences</journal-title>
</journal-title-group>
<issn pub-type="epub">1526-1506</issn>
<issn pub-type="ppub">1526-1492</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">22647</article-id>
<article-id pub-id-type="doi">10.32604/cmes.2023.022647</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>A Two-Step Algorithm to Estimate Variable Importance for Multi-State Data: An Application to COVID-19</article-title>
<alt-title alt-title-type="left-running-head">A Two-Step Algorithm to Estimate Variable Importance for Multi-State Data: An Application to COVID-19</alt-title>
<alt-title alt-title-type="right-running-head">A Two-Step Algorithm to Estimate Variable Importance for Multi-State Data: An Application to COVID-19</alt-title>
</title-group>
<contrib-group content-type="authors">
<contrib id="author-1" contrib-type="author">
<name name-style="western"><surname>Alafchi</surname><given-names>Behnaz</given-names></name><xref ref-type="aff" rid="aff-1">1</xref></contrib>
<contrib id="author-2" contrib-type="author" corresp="yes">
<name name-style="western"><surname>Tapak</surname><given-names>Leili</given-names></name><xref ref-type="aff" rid="aff-1">1</xref><email>l.tapak@umsha.ac.ir</email></contrib>
<contrib id="author-3" contrib-type="author">
<name name-style="western"><surname>Doosti</surname><given-names>Hassan</given-names></name><xref ref-type="aff" rid="aff-2">2</xref></contrib>
<contrib id="author-4" contrib-type="author">
<name name-style="western"><surname>Chesneau</surname><given-names>Christophe</given-names></name><xref ref-type="aff" rid="aff-3">3</xref></contrib>
<contrib id="author-5" contrib-type="author">
<name name-style="western"><surname>Roshanaei</surname><given-names>Ghodratollah</given-names></name><xref ref-type="aff" rid="aff-1">1</xref></contrib>
<aff id="aff-1"><label>1</label><institution>Department of Biostatistics, School of Public Health, Modeling of Noncommunicable Diseases Research Center, Hamadan University of Medical Sciences</institution>, <addr-line>Hamadan</addr-line>, <country>Iran</country></aff>
<aff id="aff-2"><label>2</label><institution>School of Mathematical and Physical Sciences, Macquarie University</institution>, <addr-line>Sydney</addr-line>, <country>Australia</country></aff>
<aff id="aff-3"><label>3</label><institution>Department of Mathematics, LMNO, University of Caen-Normandie</institution>, <addr-line>Caen</addr-line>, <country>France</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>&#x002A;</label>Corresponding Author: Leili Tapak. Email: <email>l.tapak@umsha.ac.ir</email></corresp>
</author-notes>
<pub-date pub-type="epub" date-type="pub" iso-8601-date="2022-11-22"><day>22</day>
<month>11</month>
<year>2022</year></pub-date>
<volume>135</volume>
<issue>3</issue>
<fpage>2047</fpage>
<lpage>2064</lpage>
<history>
<date date-type="received"><day>18</day><month>3</month><year>2022</year></date>
<date date-type="accepted"><day>07</day><month>7</month><year>2022</year></date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2023 Alafchi et al.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Alafchi et al.</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_CMES_22647.pdf"></self-uri>
<abstract>
<p>Survival data with a multi-state structure are frequently observed in follow-up studies. An analytic approach based on a multi-state model (MSM) should be used in longitudinal health studies in which a patient experiences a sequence of clinical progression events. One main objective in the MSM framework is variable selection, where attempts are made to identify the risk factors associated with the transition hazard rates or probabilities of disease progression. The usual variable selection methods, including stepwise and penalized methods, do not provide information about the importance of variables. In this context, we present a two-step algorithm to evaluate the importance of variables for multi-state data. Three different machine learning approaches (random forest, gradient boosting, and neural network) as the most widely used methods are considered to estimate the variable importance in order to identify the factors affecting disease progression and rank these factors according to their importance. The performance of our proposed methods is validated by simulation and applied to the COVID-19 data set. The results revealed that the proposed two-stage method has promising performance for estimating variable importance.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>Multi-state data</kwd>
<kwd>deviance residual</kwd>
<kwd>martingale residual</kwd>
<kwd>gradient boosting</kwd>
<kwd>random forest</kwd>
<kwd>neural network</kwd>
<kwd>variable importance</kwd>
<kwd>variable selection</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1"><label>1</label><title>Introduction</title>
<p>In longitudinal health studies, a patient may experience a sequence of clinical progression events. For instance, the local recurrence may be followed by a distant recurrence and then death [<xref ref-type="bibr" rid="ref-1">1</xref>]. Also, in the renal transplantation process of patients with end-stage kidney disease, renal allograft failure is considered an intermediate event that can affect the whole survival of the patient [<xref ref-type="bibr" rid="ref-2">2</xref>]. For another instance, the state of the COVID-19 patients who are admitted to the intensive care unit (ICU) at the time of admission can influence their survival time. A patient may need invasive or non-invasive ICU ventilation and extracorporeal membrane oxygenation and may be discharged from the hospital or die eventually [<xref ref-type="bibr" rid="ref-3">3</xref>]. Moreover, the need for ICU before death or discharge from the hospital among COVID-19 patients who are admitted to the ward could be another state. In such studies, the progression of the disease is considered as a multi-state process, and multi-state models (MSM) are used to assess the impact of different covariates on the transition between different health states [<xref ref-type="bibr" rid="ref-1">1</xref>,<xref ref-type="bibr" rid="ref-4">4</xref>]. Ferrer&#x00A0;et&#x00A0;al.&#x00A0;[<xref ref-type="bibr" rid="ref-1">1</xref>] developed a multi-state model for the analysis of prostate cancer. Beesley&#x00A0;et&#x00A0;al.&#x00A0;[<xref ref-type="bibr" rid="ref-4">4</xref>] also applied the multi-state approach for modeling the progression of prostate cancer.</p>
<p>One main objective in the MSM framework is variable selection, where attempts are made to identify the risk factors associated with the transition hazard rates or probabilities of the disease progression. The classic approach is to use stepwise methods, where steps are taken by sequentially adding (forward) or removing (backward) variables at each step. However, the stepwise method suffers from some drawbacks, such as instability of the selected variables. In the context of MSMs, few attempts have been made at the variable selection. For example, Dang&#x00A0;et&#x00A0;al.&#x00A0;[<xref ref-type="bibr" rid="ref-5">5</xref>] suggested L1-Regularized multi-state models using a Least Absolute Shrinkage and Selection Operator (LASSO) penalty for parameter estimation and variable selection, simultaneously. The LASSO method has several advantages for variable selection, including sparseness avoiding overfitting and a lower burden of computations. It is also applicable in the presence of a large number of covariates. Nevertheless, the selected model by the LASSO is unstable in terms of selected features. Moreover, for a set of highly correlated features, the LASSO selects one of them randomly. Penalized methods neglect the interaction between variables and their complex relationships or their unknown functional form. Most importantly, these techniques were unable to provide a quantitative assessment of their significance [<xref ref-type="bibr" rid="ref-6">6</xref>]. While some of the features measured in a study may be associated with the sojourn times in the entry states of each transition, they may have different degrees of importance, and some of them may be more important for some transitions than others [<xref ref-type="bibr" rid="ref-7">7</xref>]. The variable importance can be estimated as a real-valued parameter using an optimal estimating function [<xref ref-type="bibr" rid="ref-8">8</xref>].</p>
<p>In the context of COVID-19, several studies have investigated factors associated with various outcomes using machine learning models through variable importance. For example, Stachel&#x00A0;et&#x00A0;al.&#x00A0;utilized machine learning methods for variables associated with mortality among COVID-19 patients [<xref ref-type="bibr" rid="ref-9">9</xref>]. Also, Snider&#x00A0;et&#x00A0;al.&#x00A0;investigated variables affecting mortality in COVID-19 patients using artificial intelligence methods [<xref ref-type="bibr" rid="ref-10">10</xref>]. However, most of these studies considered binary outcomes such as survival or death. In various studies, random survival forest has been used to predict mortality in hospitalized COVID-19 patients. They considered time to hospital discharge and mortality as the outcomes of interest and competing risks, respectively. More complex structures were also considered by some authors. Hazard&#x00A0;et&#x00A0;al.&#x00A0;[<xref ref-type="bibr" rid="ref-11">11</xref>] considered a multistate structure for joint analysis of the duration of ventilation, length of intensive care, and mortality of COVID-19 patients. To our knowledge, no study has dealt with estimating variable importance in multi-state data structures such as COVID-19 data.</p>
<p>Breiman [<xref ref-type="bibr" rid="ref-12">12</xref>] introduced the random forest technique for classification and regression problems, where it has been widely used for variable selection. Gradient boosting [<xref ref-type="bibr" rid="ref-13">13</xref>] is also a commonly used method for variable selection, showing satisfactory performance in many problems. Negassa&#x00A0;et&#x00A0;al.&#x00A0;[<xref ref-type="bibr" rid="ref-14">14</xref>] investigated model selection in tree-structured subgroup analysis using the RECursive Partition and Amalgamation algorithm. They found that there is no single model selection criterion with uniformly superior performance, and they proposed a two-stage approach for model selection with promising performance in variable selection. Recently, Duan&#x00A0;et&#x00A0;al.&#x00A0;[<xref ref-type="bibr" rid="ref-6">6</xref>] proposed a machine learning-based approach to estimate variable importance using martingale and deviance residuals and their standardized counterparts which resulted in promising performances based on simulation studies. In the present study, we proposed a two-step algorithm to estimate the importance of variables for multi-state data based on three different machine learning approaches: random forest, gradient boosting, and neural network as the most widely used method. Our main contribution to this study is to extend the proposed approach to multi-state data to estimate the variable importance. The remaining sections of this paper are organized as follows: In <xref ref-type="sec" rid="s2">Section 2</xref>, we describe the multi-state model and its related residuals. In <xref ref-type="sec" rid="s3">Section 3</xref>, we define three risk indices based on the residuals mentioned in <xref ref-type="sec" rid="s2">Section 2</xref>. In <xref ref-type="sec" rid="s4">Section 4</xref>, we describe our proposed two-step algorithm to evaluate the variable&#x2019;s importance for multi-state data and introduce the loss function criteria to compare the performance of different methods. In <xref ref-type="sec" rid="s5">Section 5</xref>, the performance of our algorithm is evaluated via simulation studies, and we apply our method to the COVID-19 data set through an illness-death multi-state model, and a brief discussion is finally given in <xref ref-type="sec" rid="s5">Section 5</xref>.</p>
</sec>
<sec id="s2"><label>2</label><title>Methods</title>
<sec id="s2_1"><label>2.1</label><title>Multi-State Model</title>
<p>A multi-state model is a stochastic process (<inline-formula id="ieqn-1"><mml:math id="mml-ieqn-1"><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mi>X</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:mi>t</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mi>&#x03C2;</mml:mi><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>) with a finite state space <inline-formula id="ieqn-2"><mml:math id="mml-ieqn-2"><mml:mi>S</mml:mi><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:mi>p</mml:mi><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>, where <inline-formula id="ieqn-3"><mml:math id="mml-ieqn-3"><mml:mi>&#x03C2;</mml:mi><mml:mo>=</mml:mo><mml:mo stretchy="false">[</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mi>&#x03C4;</mml:mi><mml:mo stretchy="false">]</mml:mo></mml:math></inline-formula> for <inline-formula id="ieqn-4"><mml:math id="mml-ieqn-4"><mml:mi>&#x03C4;</mml:mi><mml:mo>&#x2264;</mml:mo><mml:mo>+</mml:mo><mml:mi mathvariant="normal">&#x221E;</mml:mi></mml:math></inline-formula>. The multi-state process is assumed to be continuous-time with the right-continuous sample paths, <inline-formula id="ieqn-5"><mml:math id="mml-ieqn-5"><mml:mi>X</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msup><mml:mi>t</mml:mi><mml:mrow><mml:mo>+</mml:mo></mml:mrow></mml:msup><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi>X</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. The transition probabilities are given as follows:
<disp-formula id="ueqn-1">
<mml:math id="mml-ueqn-1" display="block"><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>X</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi>k</mml:mi><mml:mo fence="false" stretchy="false">|</mml:mo><mml:mi>X</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>s</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi>h</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03C7;</mml:mi><mml:mrow><mml:mrow><mml:msup><mml:mi>s</mml:mi><mml:mrow><mml:mo>&#x2212;</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo></mml:math></disp-formula>for <inline-formula id="ieqn-6"><mml:math id="mml-ieqn-6"><mml:mrow><mml:mtext>h</mml:mtext></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:mtext>k</mml:mtext></mml:mrow><mml:mo>&#x2208;</mml:mo><mml:mi>S</mml:mi></mml:math></inline-formula>, <inline-formula id="ieqn-7"><mml:math id="mml-ieqn-7"><mml:mi>s</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mi>&#x03C2;</mml:mi><mml:mo>,</mml:mo><mml:mi>s</mml:mi><mml:mo>&#x2264;</mml:mo><mml:mi>t</mml:mi></mml:math></inline-formula>, and <inline-formula id="ieqn-8"><mml:math id="mml-ieqn-8"><mml:msub><mml:mi>&#x03C7;</mml:mi><mml:mrow><mml:mi>s</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> be a &#x03C3;-algebra that stands for everything that happened up to time <italic>s</italic>. Then, the transition intensities are defined as follows:
<disp-formula id="ueqn-2">
<mml:math id="mml-ueqn-2" display="block"><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:munder><mml:mo movablelimits="true" form="prefix">lim</mml:mo><mml:mrow><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mi>t</mml:mi><mml:mo stretchy="false">&#x2192;</mml:mo><mml:mn>0</mml:mn></mml:mrow></mml:munder><mml:mfrac><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mi>t</mml:mi><mml:mo fence="false" stretchy="false">|</mml:mo><mml:msub><mml:mi>&#x03C7;</mml:mi><mml:mrow><mml:mrow><mml:msup><mml:mi>s</mml:mi><mml:mrow><mml:mo>&#x2212;</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:mfrac><mml:mo>.</mml:mo></mml:math></disp-formula></p>
<p>Suppose, there are <italic>n</italic> subjects and let <inline-formula id="ieqn-9"><mml:math id="mml-ieqn-9"><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mrow><mml:msub><mml:mi>m</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> represents the vector of the <inline-formula id="ieqn-10"><mml:math id="mml-ieqn-10"><mml:msub><mml:mi>m</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2265;</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula> observed transition times for the <italic>i</italic>th subject, with <inline-formula id="ieqn-11"><mml:math id="mml-ieqn-11"><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub><mml:mo>&#x003C;</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:msub></mml:math></inline-formula>, <inline-formula id="ieqn-12"><mml:math id="mml-ieqn-12"><mml:mi>r</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>m</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>. When the last observed state is an absorbing state (i.e., a state that once entered cannot be never left; e.g., death), then the <italic>i</italic>th subject will experience <inline-formula id="ieqn-13"><mml:math id="mml-ieqn-13"><mml:msub><mml:mi>m</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> direct transitions. Otherwise, <inline-formula id="ieqn-14"><mml:math id="mml-ieqn-14"><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mrow><mml:msub><mml:mi>m</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mrow></mml:msub></mml:math></inline-formula> is equal to right censoring time <inline-formula id="ieqn-15"><mml:math id="mml-ieqn-15"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and there are <inline-formula id="ieqn-16"><mml:math id="mml-ieqn-16"><mml:msub><mml:mi>m</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula> direct transitions.</p>
<p>The counting process for the subject <italic>i</italic> (<inline-formula id="ieqn-17"><mml:math id="mml-ieqn-17"><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:mi>n</mml:mi></mml:math></inline-formula>) can be defined as (a multivariate counting process) by <inline-formula id="ieqn-18"><mml:math id="mml-ieqn-18"><mml:mrow><mml:mo>{</mml:mo><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mi>h</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mi>S</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi><mml:mo>&#x2260;</mml:mo><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi><mml:mo>&#x2264;</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>}</mml:mo></mml:mrow></mml:math></inline-formula> counting the number of direct transitions from the state <italic>h</italic> to the state <italic>k</italic> for the subject <italic>i</italic> over the interval <inline-formula id="ieqn-19"><mml:math id="mml-ieqn-19"><mml:mo stretchy="false">[</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">]</mml:mo></mml:math></inline-formula> and <inline-formula id="ieqn-20"><mml:math id="mml-ieqn-20"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x2264;</mml:mo><mml:mi>&#x03C4;</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. The intensity function for the subject <italic>i</italic> based on the Cox regression model, by using counting process formulation of the model, is given as follows:
<disp-formula id="eqn-1"><label>(1)</label><mml:math id="mml-eqn-1" display="block"><mml:mtable columnalign="right left right left right left right left right left right left" rowspacing="3pt" columnspacing="0em 2em 0em 2em 0em 2em 0em 2em 0em 2em 0em" displaystyle="true"><mml:mtr><mml:mtd><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mi></mml:mi><mml:mo>=</mml:mo><mml:munder><mml:mo movablelimits="true" form="prefix">lim</mml:mo><mml:mrow><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mi>t</mml:mi><mml:mo stretchy="false">&#x2192;</mml:mo><mml:mn>0</mml:mn></mml:mrow></mml:munder><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mfrac><mml:mrow><mml:mi>P</mml:mi><mml:mo fence="false" stretchy="false">{</mml:mo><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo>,</mml:mo><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo fence="false" stretchy="false">|</mml:mo><mml:msub><mml:mi>&#x03C7;</mml:mi><mml:mrow><mml:mrow><mml:msup><mml:mi>s</mml:mi><mml:mrow><mml:mo>&#x2212;</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">}</mml:mo></mml:mrow><mml:mrow><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:mfrac></mml:mstyle></mml:mtd></mml:mtr><mml:mtr><mml:mtd /><mml:mtd><mml:mi></mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mi>exp</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>&#x03B2;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mi>Z</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>}</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula></p>
<p>Here, <inline-formula id="ieqn-21"><mml:math id="mml-ieqn-21"><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi>I</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:msup><mml:mi>t</mml:mi><mml:mrow><mml:mo>&#x2212;</mml:mo></mml:mrow></mml:msup><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi>h</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> is an indicator specifying whether <inline-formula id="ieqn-22"><mml:math id="mml-ieqn-22"><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>.</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> is in the state <italic>h</italic> at time <inline-formula id="ieqn-23"><mml:math id="mml-ieqn-23"><mml:msup><mml:mi>t</mml:mi><mml:mrow><mml:mo>&#x2212;</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula> (some <italic>h</italic> and <italic>k</italic> combinations may not be possible). Specifically, <inline-formula id="ieqn-24"><mml:math id="mml-ieqn-24"><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula> means that the individual <italic>i</italic> is at risk for transition from <italic>h</italic> to <italic>k</italic> immediately before time <italic>t</italic>. The <inline-formula id="ieqn-25"><mml:math id="mml-ieqn-25"><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> only jumps when <inline-formula id="ieqn-26"><mml:math id="mml-ieqn-26"><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula>. Also, in the above equation <inline-formula id="ieqn-27"><mml:math id="mml-ieqn-27"><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>.</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> is the baseline intensity function, and <inline-formula id="ieqn-28"><mml:math id="mml-ieqn-28"><mml:msub><mml:mi>Z</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> is the vector of predictors associated with the vector of coefficients <inline-formula id="ieqn-29"><mml:math id="mml-ieqn-29"><mml:msub><mml:mi>&#x03B2;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> for the transition from <italic>h</italic> to <italic>k</italic>.</p>
<p>Note that, under the Markov assumption, future evolution of the process only depends on the current state and, therefore, <inline-formula id="ieqn-30"><mml:math id="mml-ieqn-30"><mml:msub><mml:mi>&#x03C7;</mml:mi><mml:mrow><mml:mrow><mml:msup><mml:mi>s</mml:mi><mml:mrow><mml:mo>&#x2212;</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:mrow></mml:msub></mml:math></inline-formula> has been omitted from the formulas.</p>
</sec>
<sec id="s2_2"><label>2.2</label><title>Risk Indices</title>
<sec id="s2_2_1"><label>2.2.1</label><title>Martingale Residuals</title>
<p>Using the Doob&#x2013;Meyer decomposition [<xref ref-type="bibr" rid="ref-15">15</xref>], in multi-state models, the martingale residual for the subject <italic>i</italic> on the time interval of <inline-formula id="ieqn-31"><mml:math id="mml-ieqn-31"><mml:mo stretchy="false">[</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">]</mml:mo></mml:math></inline-formula> is defined as follows:
<disp-formula id="ueqn-3">
<mml:math id="mml-ueqn-3" display="block"><mml:mtable columnalign="right left right left right left right left right left right left" rowspacing="3pt" columnspacing="0em 2em 0em 2em 0em 2em 0em 2em 0em 2em 0em" displaystyle="true"><mml:mtr><mml:mtd><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mtd><mml:mtd><mml:mi></mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi mathvariant="normal">&#x039B;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd /><mml:mtd><mml:mi></mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>&#x2212;</mml:mo><mml:msubsup><mml:mrow><mml:mo>&#x222B;</mml:mo></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>u</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mi>exp</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>&#x03B2;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mi>Z</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>u</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo fence="false" stretchy="false">}</mml:mo><mml:mi>d</mml:mi><mml:msub><mml:mi mathvariant="normal">&#x039B;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>u</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula></p>
<p>In the above equation, <inline-formula id="ieqn-32"><mml:math id="mml-ieqn-32"><mml:msub><mml:mi mathvariant="normal">&#x039B;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> represents the cumulative intensity function and is a predictable process called the compensator of <inline-formula id="ieqn-33"><mml:math id="mml-ieqn-33"><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. Also, <inline-formula id="ieqn-34"><mml:math id="mml-ieqn-34"><mml:msub><mml:mi mathvariant="normal">&#x039B;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msubsup><mml:mrow><mml:mo>&#x222B;</mml:mo></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>u</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mi>d</mml:mi><mml:mi>u</mml:mi></mml:math></inline-formula> indicates the cumulative baseline intensity function. The <inline-formula id="ieqn-35"><mml:math id="mml-ieqn-35"><mml:msub><mml:mi>&#x03B2;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-36"><mml:math id="mml-ieqn-36"><mml:msub><mml:mi mathvariant="normal">&#x039B;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> are unknown parameters that should be estimated. Let <inline-formula id="ieqn-37"><mml:math id="mml-ieqn-37"><mml:msub><mml:mrow><mml:mover><mml:mi>&#x03B2;</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-38"><mml:math id="mml-ieqn-38"><mml:msub><mml:mrow><mml:mover><mml:mi mathvariant="normal">&#x039B;</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> represent the maximum likelihood estimator of <inline-formula id="ieqn-39"><mml:math id="mml-ieqn-39"><mml:msub><mml:mi>&#x03B2;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> [<xref ref-type="bibr" rid="ref-16">16</xref>,<xref ref-type="bibr" rid="ref-17">17</xref>] and the Breslow estimator of <inline-formula id="ieqn-40"><mml:math id="mml-ieqn-40"><mml:msub><mml:mi mathvariant="normal">&#x039B;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> [<xref ref-type="bibr" rid="ref-17">17</xref>,<xref ref-type="bibr" rid="ref-18">18</xref>], respectively. Then, the martingale residual for the transition <inline-formula id="ieqn-41"><mml:math id="mml-ieqn-41"><mml:mi>h</mml:mi><mml:mo stretchy="false">&#x2192;</mml:mo><mml:mi>k</mml:mi></mml:math></inline-formula> is defined as follows:
<disp-formula id="ueqn-4">
<mml:math id="mml-ueqn-4" display="block"><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:msubsup><mml:mrow><mml:mo>&#x222B;</mml:mo></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>u</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mi>exp</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mi>&#x03B2;</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mi>Z</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>u</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>}</mml:mo></mml:mrow><mml:mi>d</mml:mi><mml:msub><mml:mrow><mml:mover><mml:mi mathvariant="normal">&#x039B;</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>u</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:math></disp-formula></p>
<p>In the multi-state data, suppose that the last observed time of transition from the state <italic>h</italic> to the state <italic>k</italic> (h &#x2192; k) for the subject <italic>i</italic> is called <inline-formula id="ieqn-42"><mml:math id="mml-ieqn-42"><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, (the minimum of the administrative censoring time <inline-formula id="ieqn-43"><mml:math id="mml-ieqn-43"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>a</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, the individual censoring time (loss to follow-up) <inline-formula id="ieqn-44"><mml:math id="mml-ieqn-44"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and the time of transition to the state <italic>k</italic>, <inline-formula id="ieqn-45"><mml:math id="mml-ieqn-45"><mml:mi mathvariant="normal">&#x2200;</mml:mi><mml:mi>k</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>m</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>). Therefore, the martingale residual for the subject i at time <inline-formula id="ieqn-46"><mml:math id="mml-ieqn-46"><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> has the following form:
<disp-formula id="eqn-2"><label>(2)</label><mml:math id="mml-eqn-2" display="block"><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi>exp</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mi>&#x03B2;</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mi>Z</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>}</mml:mo></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mi mathvariant="normal">&#x039B;</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:math></disp-formula></p>
<p>Note that, we use <inline-formula id="ieqn-47"><mml:math id="mml-ieqn-47"><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> to represent <inline-formula id="ieqn-48"><mml:math id="mml-ieqn-48"><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.</p>
</sec>
<sec id="s2_2_2"><label>2.2.2</label><title>Deviance</title>
<p>The deviance defined in Therneau&#x00A0;et&#x00A0;al.&#x00A0;[<xref ref-type="bibr" rid="ref-19">19</xref>] is as follows:</p>
<p>D&#x2009;&#x003D;&#x2009;2&#x007B;log likelihood (saturated model)-log likelihood (<inline-formula id="ieqn-49"><mml:math id="mml-ieqn-49"><mml:mrow><mml:mover><mml:mi>&#x03B2;</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow></mml:math></inline-formula>)&#x007D;.</p>
<p>The saturated model is a model with a parameter for every observation. Here, we extend the definition of the deviance for univariate survival data to the multi-state data. Under mild regularity conditions, the log-likelihood contribution for each individual can be obtained using counting process theory as follows:
<disp-formula id="ueqn-5">
<mml:math id="mml-ueqn-5" display="block"><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>m</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munderover><mml:mrow><mml:mo>[</mml:mo><mml:msubsup><mml:mrow><mml:mo>&#x222B;</mml:mo></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msubsup><mml:mi>log</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo><mml:mi>d</mml:mi><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>&#x2212;</mml:mo><mml:msubsup><mml:mrow><mml:mo>&#x222B;</mml:mo></mml:mrow><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msubsup><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>r</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mi>d</mml:mi><mml:mi>t</mml:mi><mml:mo>]</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:math></disp-formula></p>
<p>Note that <inline-formula id="ieqn-50"><mml:math id="mml-ieqn-50"><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>h</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi mathvariant="normal">&#x03A3;</mml:mi><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> for <inline-formula id="ieqn-51"><mml:math id="mml-ieqn-51"><mml:mi>k</mml:mi><mml:mo>&#x2260;</mml:mo><mml:mi>h</mml:mi></mml:math></inline-formula>, and <inline-formula id="ieqn-52"><mml:math id="mml-ieqn-52"><mml:msub><mml:mi>&#x03B4;</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is a transition indicator (for the subject <italic>i</italic>) which equals one if a direct transition is observed at time <inline-formula id="ieqn-53"><mml:math id="mml-ieqn-53"><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>r</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and zero otherwise.</p>
<p>Then, the definition of deviance for multi-state data with the intensity function in <xref ref-type="disp-formula" rid="eqn-1">Eq. (1)</xref>, which is an extension of the deviance provided by Therneau&#x00A0;et&#x00A0;al.&#x00A0;[<xref ref-type="bibr" rid="ref-19">19</xref>] for survival data, is as follows:
<disp-formula id="eqn-3"><label>(3)</label><mml:math id="mml-eqn-3" display="block"><mml:mtable columnalign="right left right left right left right left right left right left" rowspacing="3pt" columnspacing="0em 2em 0em 2em 0em 2em 0em 2em 0em 2em 0em" displaystyle="true"><mml:mtr><mml:mtd><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo></mml:mtd><mml:mtd><mml:mn>2</mml:mn><mml:munder><mml:mo form="prefix">sup</mml:mo><mml:msub><mml:mi>&#x03B3;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:munder><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>n</mml:mi></mml:mrow></mml:munderover><mml:mrow><mml:mo>[</mml:mo><mml:mo>&#x222B;</mml:mo><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>&#x03B3;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mi>Z</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mi>&#x03B2;</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mi>Z</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>}</mml:mo></mml:mrow><mml:mi>d</mml:mi><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo fence="true" stretchy="true" symmetric="true"></mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd /><mml:mtd><mml:mi></mml:mi><mml:mo>&#x2212;</mml:mo><mml:mrow><mml:mo fence="true" stretchy="true" symmetric="true"></mml:mo><mml:mo>&#x222B;</mml:mo><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mo>{</mml:mo><mml:mi>exp</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>&#x03B3;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mi>Z</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi>exp</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mi>&#x03B2;</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mi>Z</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>}</mml:mo></mml:mrow><mml:mi>d</mml:mi><mml:msub><mml:mi mathvariant="normal">&#x039B;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula></p>
<p>Let <inline-formula id="ieqn-54"><mml:math id="mml-ieqn-54"><mml:msub><mml:mi>&#x03B3;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> be the per-subject estimates of <inline-formula id="ieqn-55"><mml:math id="mml-ieqn-55"><mml:msub><mml:mi>&#x03B2;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>. By taking the first derivation of the above summation part with respect to <inline-formula id="ieqn-56"><mml:math id="mml-ieqn-56"><mml:msub><mml:mi>&#x03B3;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and setting it equal to 0, we have:
<disp-formula id="eqn-4"><label>(4)</label><mml:math id="mml-eqn-4" display="block"><mml:mo>&#x222B;</mml:mo><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mi>exp</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msup><mml:mrow><mml:msub><mml:mi>&#x03B3;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:msub><mml:mi>Z</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mi>d</mml:mi><mml:msub><mml:mi mathvariant="normal">&#x039B;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mo>&#x222B;</mml:mo><mml:mi>d</mml:mi><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>.</mml:mo></mml:math></disp-formula></p>
<p>By subtracting <xref ref-type="disp-formula" rid="eqn-3">Eqs. (3)</xref> and <xref ref-type="disp-formula" rid="eqn-4">(4)</xref>, <inline-formula id="ieqn-57"><mml:math id="mml-ieqn-57"><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> has the following form:
<disp-formula id="ueqn-6">
<mml:math id="mml-ueqn-6" display="block"><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mn>2</mml:mn><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>t</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mi>log</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:math></disp-formula></p>
<p>As mentioned before, we supposed <inline-formula id="ieqn-58"><mml:math id="mml-ieqn-58"><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the observed time of transition to the state k (for the transition from state h to state k). Therefore, the deviance at time <inline-formula id="ieqn-59"><mml:math id="mml-ieqn-59"><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> has the following form:
<disp-formula id="eqn-5"><label>(5)</label><mml:math id="mml-eqn-5" display="block"><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mn>2</mml:mn><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mi>log</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>.</mml:mo></mml:math></disp-formula></p>
<p>Note that <inline-formula id="ieqn-60"><mml:math id="mml-ieqn-60"><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>&#x2260;</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula>, and <inline-formula id="ieqn-61"><mml:math id="mml-ieqn-61"><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mn>2</mml:mn><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> when <inline-formula id="ieqn-62"><mml:math id="mml-ieqn-62"><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula>.</p>
</sec>
<sec id="s2_2_3"><label>2.2.3</label><title>Deviance Residuals</title>
<p>The deviance residuals are defined as follows:
<disp-formula id="eqn-6"><label>(6)</label><mml:math id="mml-eqn-6" display="block"><mml:msubsup><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>s</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>g</mml:mi><mml:mi>n</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x00D7;</mml:mo><mml:msqrt><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:msqrt><mml:mo>.</mml:mo></mml:math></disp-formula></p>
<p>The deviance residual <inline-formula id="ieqn-63"><mml:math id="mml-ieqn-63"><mml:msubsup><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>s</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> is equal to zero if the martingale residual is zero (<inline-formula id="ieqn-64"><mml:math id="mml-ieqn-64"><mml:msub><mml:mi>M</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula>). The variables are considered to be fixed over time (not time-dependent). In this paper, the martingale residuals, the deviances, and the deviance residuals are called risk indices.</p>
</sec>
</sec>
<sec id="s2_3"><label>2.3</label><title>Variable Importance for Multi-State Data</title>
<p>In this section, we introduce an algorithm to evaluate the variable&#x2019;s importance for multi-state data. Therneau&#x00A0;et&#x00A0;al.&#x00A0;[<xref ref-type="bibr" rid="ref-19">19</xref>] mentioned that using martingale residuals from a null Cox model as the input of classification and regression trees worked well for survival data. Then, Duan&#x00A0;et&#x00A0;al.&#x00A0;[<xref ref-type="bibr" rid="ref-6">6</xref>] extended their proposed algorithm to recurrent event data. Here, we extend their approach to multi-state data.</p>
<sec id="s2_3_1"><label>2.3.1</label><title>Two-Step Variable Selection Algorithm</title>
<p>The proposed algorithm has two steps. In the first step, a null multi-state model is fitted as follows:
<disp-formula id="ueqn-7">
<mml:math id="mml-ueqn-7" display="block"><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>.</mml:mo></mml:math></disp-formula></p>
<p>This model does not include any variables, and only the baseline intensity functions should be estimated, which can be estimated using the Breslow estimator [<xref ref-type="bibr" rid="ref-17">17</xref>,<xref ref-type="bibr" rid="ref-18">18</xref>]. Then, the martingale residuals, the deviances, and the deviance residuals are obtained as three risk indices for each subject. For a multi-state model with three states (depicted in <xref ref-type="fig" rid="fig-1">Fig. 1</xref>) (<inline-formula id="ieqn-65"><mml:math id="mml-ieqn-65"><mml:msub><mml:mi>m</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>3</mml:mn></mml:math></inline-formula>), the martingale residuals are defined as follows:
<disp-formula id="ueqn-8">
<mml:math id="mml-ueqn-8" display="block"><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>12</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mn>12</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mi mathvariant="normal">&#x039B;</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>120</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:math></disp-formula>
<disp-formula id="ueqn-9">
<mml:math id="mml-ueqn-9" display="block"><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>13</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mn>3</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mn>13</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mn>3</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mi mathvariant="normal">&#x039B;</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>130</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mn>3</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:math></disp-formula>
<disp-formula id="ueqn-10">
<mml:math id="mml-ueqn-10" display="block"><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>23</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x03C4;</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mn>23</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x03C4;</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mi mathvariant="normal">&#x039B;</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>230</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x03C4;</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:math></disp-formula>where <inline-formula id="ieqn-66"><mml:math id="mml-ieqn-66"><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> is the observed time of the transition from the state 1 to the state 2, <inline-formula id="ieqn-67"><mml:math id="mml-ieqn-67"><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mn>3</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mn>3</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> is the observed time of the transition from the state 1 to the state 3, and <inline-formula id="ieqn-68"><mml:math id="mml-ieqn-68"><mml:msubsup><mml:mrow><mml:mi>&#x03C4;</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> is the observed time of the transition from the state 2 to the state 3. Then, the deviances are as follows:
<disp-formula id="ueqn-11">
<mml:math id="mml-ueqn-11" display="block"><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mn>12</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mn>2</mml:mn><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>12</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mn>12</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mi>log</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mn>12</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>12</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mn>12</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:math></disp-formula>
<disp-formula id="ueqn-12">
<mml:math id="mml-ueqn-12" display="block"><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mn>13</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mn>2</mml:mn><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>13</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mn>13</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mn>3</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mi>log</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mn>13</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mn>3</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>13</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mn>13</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mrow><mml:mn>3</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:math></disp-formula>
<disp-formula id="ueqn-13">
<mml:math id="mml-ueqn-13" display="block"><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mn>23</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo>&#x2212;</mml:mo><mml:mn>2</mml:mn><mml:mrow><mml:mo>[</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>23</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mn>23</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x03C4;</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow><mml:mi>log</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mn>23</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x03C4;</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>23</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mn>23</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mrow><mml:mi>&#x03C4;</mml:mi></mml:mrow><mml:mrow><mml:mn>3</mml:mn><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mfrac><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:math></disp-formula>and the deviance residuals are defined as follows:
<disp-formula id="ueqn-14">
<mml:math id="mml-ueqn-14" display="block"><mml:msubsup><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>12</mml:mn><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>s</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>g</mml:mi><mml:mi>n</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>12</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x00D7;</mml:mo><mml:msqrt><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mn>12</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:msqrt><mml:mo>,</mml:mo></mml:math></disp-formula>
<disp-formula id="ueqn-15">
<mml:math id="mml-ueqn-15" display="block"><mml:msubsup><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>13</mml:mn><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>s</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>g</mml:mi><mml:mi>n</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>13</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x00D7;</mml:mo><mml:msqrt><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mn>13</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:msqrt><mml:mo>,</mml:mo></mml:math></disp-formula>
<disp-formula id="ueqn-16">
<mml:math id="mml-ueqn-16" display="block"><mml:msubsup><mml:mrow><mml:mi>D</mml:mi></mml:mrow><mml:mrow><mml:mn>23</mml:mn><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>s</mml:mi></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>g</mml:mi><mml:mi>n</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mrow><mml:mover><mml:mi>M</mml:mi><mml:mo>&#x2322;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>23</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x00D7;</mml:mo><mml:msqrt><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mn>23</mml:mn><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:msqrt><mml:mo>.</mml:mo></mml:math></disp-formula></p>
<fig id="fig-1"><label>Figure 1</label><caption><title>The multi-state representation of the disease progression of COVID-19 hospitalized patients</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMES_22647-fig-1.png"/></fig>
<p>In the second step, the random forest, the gradient boosting, and the neural network methods are applied to the risk indices generated in the first step to evaluate the importance of each variable in different transitions.</p>
</sec>
</sec>
<sec id="s2_4"><label>2.4</label><title>Machine Learning Models</title>
<sec id="s2_4_1"><label>2.4.1</label><title>Random Forest</title>
<p>The random forest method is a supervised ensemble learning algorithm developed by Brieman in 2001. It is constructed by combining several decision tree algorithms to create solutions for complex problems. In this method, a series of simple unpruned regression trees are constructed by using random bootstrapped samples that are obtained from the original data sample. The results of the simple trees are then accumulated to produce a final prediction of the response for the subjects in regression problems, which is the average of the predictions given by all trees. In the random forest algorithm, in the first step, for b&#x2009;&#x003D;&#x2009;1 to B, a bootstrapped sample K&#x002A; of size N (the original sample size) is drawn from the training data. Each tree (T<sub>b</sub>), in the random forest, is grown for the bootstrapped data by repeating the following recursive steps for each of the terminal nodes in the tree: a) A set of m variables is randomly selected from a set of p original variables; b) the best variable/split-point is selected from the set of m variables chosen in Step (a); and c) the node is split into two daughter nodes. These three steps are repeated until the minimum predefined node size is achieved (n<sub>min</sub>). In the second step, the output of all trees (B) is aggregated. A prediction for a new observation is obtained by calculating <inline-formula id="ieqn-69"><mml:math id="mml-ieqn-69"><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mfrac><mml:mn>1</mml:mn><mml:mi>B</mml:mi></mml:mfrac></mml:mstyle><mml:msubsup><mml:mrow><mml:mo>&#x2211;</mml:mo></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>B</mml:mi></mml:mrow></mml:msubsup><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>.</p>
</sec>
<sec id="s2_4_2"><label>2.4.2</label><title>Gradient Boosting Machine</title>
<p>Gradient boosting machines (GBMs) consist of a group of powerful machine-learning methods with substantial accomplishments in many practical applications. GBMs can be very adaptable to the needs of the application. The boosting works based on sequentially adding new models to the ensemble, so that in each particular iteration, &#x201C;a new weak, base-learner model is trained with respect to the error of the whole ensemble learnt so far&#x201D; [<xref ref-type="bibr" rid="ref-20">20</xref>]. A GBM has been established to connect boosting with a statistical framework [<xref ref-type="bibr" rid="ref-21">21</xref>&#x2013;<xref ref-type="bibr" rid="ref-23">23</xref>], which provides the required justification for the hyperparameters in the model and the methodological foundation for developing further gradient boosting models [<xref ref-type="bibr" rid="ref-20">20</xref>].</p>
<p>In GBMs, successive new models are fitted based on the chosen learning technique; so that more accurate estimates of the outcome are produced. The main idea underlying this algorithm is to create &#x201C;the new base-learners to be maximally correlated with the negative gradient of the loss function, associated with the whole ensemble&#x201D; [<xref ref-type="bibr" rid="ref-20">20</xref>]. The loss functions used can be researcher-defined or standard loss functions derived through trial and error in the past.</p>
</sec>
<sec id="s2_4_3"><label>2.4.3</label><title>Neural Network</title>
<p>Artificial neural networks (ANNs) are machine learning methods that consist of an input layer of neurons (or nodes, units), at least one hidden layer of neurons, and a layer of output neurons. Connections between layers are illustrated by numeric values (e.g., weights) through activation functions. The most widely used form of ANN is the multi-layer perceptron, where the data flows in a forward direction from the input layer to the output layer. So, the neurons are trained with the back propagation learning algorithm. In this study, the regression weights were estimated using a tangent hyperbolic activation function and an identity activation function, which provided better results than other settings [<xref ref-type="bibr" rid="ref-24">24</xref>].</p>
</sec>
</sec>
<sec id="s2_5"><label>2.5</label><title>Performance Measurement</title>
<p>Here, the loss function (introduced by Cox [<xref ref-type="bibr" rid="ref-17">17</xref>,<xref ref-type="bibr" rid="ref-18">18</xref>]) was considered to assess the performance of the different variable importance evaluation methods for multi-state data. Suppose that there are q standardized covariates <inline-formula id="ieqn-70"><mml:math id="mml-ieqn-70"><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>q</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> for the transition from state <italic>h</italic> to state <italic>k</italic> that are associated with the coefficients <inline-formula id="ieqn-71"><mml:math id="mml-ieqn-71"><mml:msub><mml:mi>&#x03B7;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03B7;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>q</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, respectively. The absolute values of the coefficients are related to their importance, such that, for the transition <inline-formula id="ieqn-72"><mml:math id="mml-ieqn-72"><mml:mi>h</mml:mi><mml:mo stretchy="false">&#x2192;</mml:mo><mml:mi>k</mml:mi></mml:math></inline-formula>, if the ranks of the absolute value of the coefficients are 1, 2,&#x2026;, q (<inline-formula id="ieqn-73"><mml:math id="mml-ieqn-73"><mml:mo fence="false" stretchy="false">|</mml:mo><mml:msub><mml:mi>&#x03B7;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">|</mml:mo></mml:math></inline-formula> has the maximum value), the rank of their corresponding variable importance values should be 1, 2,&#x2026;, q (from the most to the least important). However, the estimated rank may not be absolutely correct. In such a situation, if <inline-formula id="ieqn-74"><mml:math id="mml-ieqn-74"><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>q</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> represent the estimated rank of the covariates for the transition <inline-formula id="ieqn-75"><mml:math id="mml-ieqn-75"><mml:mi>h</mml:mi><mml:mo stretchy="false">&#x2192;</mml:mo><mml:mi>k</mml:mi></mml:math></inline-formula>, at least one pair of parameters satisfies the <inline-formula id="ieqn-76"><mml:math id="mml-ieqn-76"><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x003E;</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> for <inline-formula id="ieqn-77"><mml:math id="mml-ieqn-77"><mml:mi>j</mml:mi><mml:mo>&#x003C;</mml:mo><mml:mi>i</mml:mi></mml:math></inline-formula>. The loss function of the ranking results for the transition <inline-formula id="ieqn-78"><mml:math id="mml-ieqn-78"><mml:mi>h</mml:mi><mml:mo stretchy="false">&#x2192;</mml:mo><mml:mi>k</mml:mi></mml:math></inline-formula> is defined as follows:
<disp-formula id="ueqn-17">
<mml:math id="mml-ueqn-17" display="block"><mml:mi>L</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>s</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>q</mml:mi></mml:mrow></mml:msub><mml:mo>;</mml:mo><mml:msub><mml:mi>&#x03B7;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03B7;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>q</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>q</mml:mi></mml:mrow></mml:munderover><mml:munder><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>&#x003C;</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:munder><mml:mi>I</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x003E;</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo fence="false" stretchy="false">|</mml:mo><mml:msub><mml:mi>&#x03B7;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>&#x03B7;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">|</mml:mo><mml:mo>.</mml:mo></mml:math></disp-formula></p>
<p>Here, <inline-formula id="ieqn-79"><mml:math id="mml-ieqn-79"><mml:mi>I</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x003E;</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> indicates whether there is an incorrect ranking in the transition <inline-formula id="ieqn-80"><mml:math id="mml-ieqn-80"><mml:mi>h</mml:mi><mml:mo stretchy="false">&#x2192;</mml:mo><mml:mi>k</mml:mi></mml:math></inline-formula>, and <inline-formula id="ieqn-81"><mml:math id="mml-ieqn-81"><mml:mo fence="false" stretchy="false">|</mml:mo><mml:msub><mml:mi>&#x03B7;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>&#x2212;</mml:mo><mml:msub><mml:mi>&#x03B7;</mml:mi><mml:mrow><mml:mi>h</mml:mi><mml:mi>k</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">|</mml:mo></mml:math></inline-formula> is a weighted function that takes into account the severity of the ranking mistake as mentioned by Cox&#x00A0;et&#x00A0;al.&#x00A0;[<xref ref-type="bibr" rid="ref-17">17</xref>,<xref ref-type="bibr" rid="ref-18">18</xref>].</p>
</sec>
</sec>
<sec id="s3"><label>3</label><title>Simulation Study and Application</title>
<sec id="s3_1"><label>3.1</label><title>Simulation Study</title>
<p>Simulation studies are presented in this section to evaluate the model performance and to compare different proposed algorithms.</p>
<sec id="s3_1_1"><label>3.1.1</label><title>Data Generation</title>
<p>Mimicking the structure of the real data (depicted in <xref ref-type="fig" rid="fig-1">Fig. 1</xref>), we consider a three-state process (<inline-formula id="ieqn-82"><mml:math id="mml-ieqn-82"><mml:mi>h</mml:mi><mml:mo>,</mml:mo><mml:mi>k</mml:mi><mml:mo>&#x2208;</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mn>3</mml:mn><mml:mo fence="false" stretchy="false">}</mml:mo></mml:math></inline-formula>) with three possible transitions. The vector of the observed times of transitions of <inline-formula id="ieqn-83"><mml:math id="mml-ieqn-83"><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>12</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>13</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>23</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>, was generated according to similar studies [<xref ref-type="bibr" rid="ref-25">25</xref>,<xref ref-type="bibr" rid="ref-26">26</xref>]. For the <italic>i</italic>th subject, the censoring time was generated according to a uniform distribution, <inline-formula id="ieqn-84"><mml:math id="mml-ieqn-84"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>&#x223C;<inline-formula id="ieqn-85"><mml:math id="mml-ieqn-85"><mml:mrow><mml:mtext>Uniform</mml:mtext></mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mi>T</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>, where T is the largest time-point in the study (here, two different values of 3 and 12 months were assumed).</p>
<p>The vector of <inline-formula id="ieqn-86"><mml:math id="mml-ieqn-86"><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo>=</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>12</mml:mn></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>13</mml:mn></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>23</mml:mn></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> (indicating true transition times) was generated as follows:
<list list-type="simple">
<list-item><label>I.</label><p>Three random numbers of <inline-formula id="ieqn-87"><mml:math id="mml-ieqn-87"><mml:msub><mml:mi>u</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>12</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>u</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>13</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>u</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>23</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula> were generated from standard uniform distribution.</p></list-item>
<list-item><label>II.</label><p>The values of <inline-formula id="ieqn-88"><mml:math id="mml-ieqn-88"><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>12</mml:mn></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup></mml:math></inline-formula> and <inline-formula id="ieqn-89"><mml:math id="mml-ieqn-89"><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>13</mml:mn></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup></mml:math></inline-formula> were generated by solving the following equations: <inline-formula id="ieqn-90"><mml:math id="mml-ieqn-90"><mml:msubsup><mml:mrow><mml:mo>&#x222B;</mml:mo></mml:mrow><mml:mrow><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>1</mml:mn><mml:mi>k</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:msubsup><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>1</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>&#x03C5;</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mi>d</mml:mi><mml:msub><mml:mi>&#x03C5;</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mi>log</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>u</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>1</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula>, for k&#x2009;&#x003D;&#x2009;2, 3. The Brent&#x2019;s root funding method [<xref ref-type="bibr" rid="ref-27">27</xref>] was employed.</p></list-item>
<list-item><label>III.</label><p>Finally, the value of <inline-formula id="ieqn-91"><mml:math id="mml-ieqn-91"><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>23</mml:mn></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup></mml:math></inline-formula> was generated by solving <inline-formula id="ieqn-92"><mml:math id="mml-ieqn-92"><mml:msubsup><mml:mrow><mml:mo>&#x222B;</mml:mo></mml:mrow><mml:mrow><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>12</mml:mn></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>23</mml:mn></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:msubsup><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>23</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>&#x03C5;</mml:mi><mml:mrow><mml:mn>23</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mi>d</mml:mi><mml:msub><mml:mi>&#x03C5;</mml:mi><mml:mrow><mml:mn>23</mml:mn></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mi>log</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>u</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>23</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:math></inline-formula>.</p></list-item>
</list></p>
<p>Eventually, the observed transition times <inline-formula id="ieqn-93"><mml:math id="mml-ieqn-93"><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> were considered by the following relationships:
<list list-type="bullet">
<list-item><p><inline-formula id="ieqn-94"><mml:math id="mml-ieqn-94"><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>12</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo movablelimits="true" form="prefix">min</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>12</mml:mn></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>,</p></list-item>
<list-item><p><inline-formula id="ieqn-95"><mml:math id="mml-ieqn-95"><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>13</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo movablelimits="true" form="prefix">min</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>13</mml:mn></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>,</p></list-item>
<list-item><p><inline-formula id="ieqn-96"><mml:math id="mml-ieqn-96"><mml:msub><mml:mi>T</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>23</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mtable columnalign="left left" rowspacing=".2em" columnspacing="1em" displaystyle="false"><mml:mtr><mml:mtd><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>23</mml:mn></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup></mml:mtd><mml:mtd><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>12</mml:mn></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>23</mml:mn></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo>&#x2264;</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mtd><mml:mtd><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>12</mml:mn></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo>+</mml:mo><mml:msubsup><mml:mrow><mml:mi>T</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>23</mml:mn></mml:mrow><mml:mrow><mml:mo>&#x2217;</mml:mo></mml:mrow></mml:msubsup><mml:mo>&#x003E;</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable><mml:mo fence="true" stretchy="true" symmetric="true"></mml:mo></mml:mrow></mml:math></inline-formula>.</p></list-item>
</list></p>
<p>The transition probability from the first state to the <italic>k</italic>th (<inline-formula id="ieqn-97"><mml:math id="mml-ieqn-97"><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mn>3</mml:mn><mml:mo>}</mml:mo></mml:mrow></mml:math></inline-formula>) state was calculated using <inline-formula id="ieqn-98"><mml:math id="mml-ieqn-98"><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>1</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>T</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mfrac><mml:mrow><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>1</mml:mn><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>T</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mrow><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>12</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>T</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mn>13</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>T</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:math></inline-formula>, and patients were eventually transferred to each of these states with a greater probability.</p>
<p>Software Source codes of this paper are available on Github:</p> <p><uri>https://github.com/BehnazAlafchi/ Variable-importance-for-multi-state-data</uri>.</p>
</sec>
<sec id="s3_1_2"><label>3.1.2</label><title>Simulation Result</title>
<p>The performance of our proposed two-step algorithm was evaluated via simulation studies utilizing the combination of different risk indices with the gradient boosting and random forest algorithms. We also used the original generated transition times at the first step and then applied the neural network, gradient boosting, and random forest algorithms at the second step to evaluate whether it could be useful to use the original survival times instead of risk indices for the variable selection. We considered 10 covariates in each possible transition, but only three of them were effective. All of the effective covariates were generated from an uniform distribution <inline-formula id="ieqn-99"><mml:math id="mml-ieqn-99"><mml:mi>U</mml:mi><mml:mo stretchy="false">[</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn><mml:mo stretchy="false">]</mml:mo></mml:math></inline-formula>. We simulated multi-state data with <inline-formula id="ieqn-100"><mml:math id="mml-ieqn-100"><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mn>012</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mn>013</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03BB;</mml:mi><mml:mrow><mml:mn>023</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:mn>0.15</mml:mn><mml:mo>,</mml:mo><mml:mn>0.1</mml:mn><mml:mo>,</mml:mo><mml:mn>0.1</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. The maximum time of follow-up was 3 and 12 months in different simulation studies. The results were based on two sample sizes, n&#x2009;&#x003D;&#x2009;500 and n&#x2009;&#x003D;&#x2009;1000, and each simulation run was based on 1000 iterations.</p>
<p><xref ref-type="table" rid="table-1">Tables 1</xref> and <xref ref-type="table" rid="table-2">2</xref> show the values of the loss function provided by different methods. <xref ref-type="table" rid="table-1">Table 1</xref> provides the results for a 1-year study, and <xref ref-type="table" rid="table-2">Table 2</xref> provides the results for a 3-month study. In these tables, the first column indicates the type of transition, the second column gives the values of the three effective parameters, and the subsequent columns give the mean value of the loss function provided by different methods. Columns 3 and 5 give the results for D, columns 6 and 8 show the results for DR, and columns 9 and 11 show the results for MR. As the simulation results show, the gradient boosting method on MR and DR provided the smallest loss values in almost all simulations for all transitions, respectively. The results also revealed that the proposed two-stage method has better performance for larger sample sizes. However, its performance was almost similar across different lengths of studies. The last column also shows the performance of the traditional multi-state model fitted to the generated data in simulations. According to the results, machine learning models outperformed the traditional model based on the Cox proportional hazards model.</p>
<table-wrap id="table-1"><label>Table 1</label><caption><title>The values of loss function for different classification methods and different residuals/time-to-event with follow-up time 365 days</title></caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th align="left"/>
<th align="left">Parameters</th>
<th align="left">D. ANN</th>
<th align="left">D. RF</th>
<th align="left">D. GBM</th>
<th align="left">DR. ANN</th>
<th align="left">DR. RF</th>
<th align="left">DR. GBM</th>
<th align="left">MR. ANN</th>
<th align="left">MR. RF</th>
<th align="left">MR. GBM</th>
<th align="left">SMSM</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" colspan="12">n&#x2009;&#x003D;&#x2009;500</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 2</td>
<td align="left">2/1.7/0.8</td>
<td align="left">10.54</td>
<td align="left">11.15</td>
<td align="left">12.04</td>
<td align="left">27.72</td>
<td align="left">3.18</td>
<td align="left"><bold>0.34</bold></td>
<td align="left">28.26</td>
<td align="left">2.19</td>
<td align="left"><bold>0.18</bold></td>
<td align="left">29.33</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 3</td>
<td align="left">2/1.9/0.1</td>
<td align="left">9.50</td>
<td align="left">10.49</td>
<td align="left">12.12</td>
<td align="left">7.41</td>
<td align="left">4.07</td>
<td align="left"><bold>0.48</bold></td>
<td align="left">24.91</td>
<td align="left">1.87</td>
<td align="left"><bold>0.26</bold></td>
<td align="left">27.45</td>
</tr>
<tr>
<td align="left">Transition 2 &#x2192; 3</td>
<td align="left">1/0.9/0.1</td>
<td align="left">6.28</td>
<td align="left">5.87</td>
<td align="left">6.54</td>
<td align="left">3.75</td>
<td align="left">2.26</td>
<td align="left"><bold>0.52</bold></td>
<td align="left">12.42</td>
<td align="left">2.89</td>
<td align="left"><bold>0.39</bold></td>
<td align="left">13.50</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 2</td>
<td align="left">2/1/0.1</td>
<td align="left">9.09</td>
<td align="left">8.63</td>
<td align="left">9.05</td>
<td align="left">20.90</td>
<td align="left">2.54</td>
<td align="left"><bold>0.67</bold></td>
<td align="left">21.18</td>
<td align="left">1.61</td>
<td align="left"><bold>0.30</bold></td>
<td align="left">22.14</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 3</td>
<td align="left">2/0.8/0.2</td>
<td align="left">9.46</td>
<td align="left">8.29</td>
<td align="left">8.56</td>
<td align="left">6.11</td>
<td align="left">4.06</td>
<td align="left"><bold>2.19</bold></td>
<td align="left">18.62</td>
<td align="left">3.06</td>
<td align="left"><bold>1.13</bold></td>
<td align="left">20.97</td>
</tr>
<tr>
<td align="left">Transition 2 &#x2192; 3</td>
<td align="left">1.5/0.5/0.3</td>
<td align="left">7.88</td>
<td align="left">7.15</td>
<td align="left">8.17</td>
<td align="left">5.71</td>
<td align="left">3.32</td>
<td align="left"><bold>2.07</bold></td>
<td align="left">15.79</td>
<td align="left">3.54</td>
<td align="left"><bold>1.60</bold></td>
<td align="left">15.45</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 2</td>
<td align="left">1/0.9/0.1</td>
<td align="left">6.31</td>
<td align="left">6.34</td>
<td align="left">3.37</td>
<td align="left">12.37</td>
<td align="left"><bold>0.74</bold></td>
<td align="left">3.12</td>
<td align="left">12.34</td>
<td align="left">2.60</td>
<td align="left"><bold>0.36</bold></td>
<td align="left">13.47</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 3</td>
<td align="left">1/0.6/0.1</td>
<td align="left">6.13</td>
<td align="left">5.77</td>
<td align="left">5.65</td>
<td align="left"><bold>2.79</bold></td>
<td align="left">4.45</td>
<td align="left">3.09</td>
<td align="left">8.80</td>
<td align="left">3.43</td>
<td align="left"><bold>1.20</bold></td>
<td align="left">11.21</td>
</tr>
<tr>
<td align="left">Transition 2 &#x2192; 3</td>
<td align="left">1/0.9/0.3</td>
<td align="left">6.30</td>
<td align="left">6.57</td>
<td align="left">7.10</td>
<td align="left">3.52</td>
<td align="left">3.08</td>
<td align="left"><bold>0.82</bold></td>
<td align="left">12.92</td>
<td align="left">3.39</td>
<td align="left"><bold>0.67</bold></td>
<td align="left">14.16</td>
</tr>
<tr>
<td align="left" colspan="12">n&#x2009;&#x003D;&#x2009;1000</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 2</td>
<td align="left">2/1.7/0.8</td>
<td align="left">8.51</td>
<td align="left">9.01</td>
<td align="left">10.14</td>
<td align="left">27.08</td>
<td align="left">1.05</td>
<td align="left"><bold>0.08</bold></td>
<td align="left">28.79</td>
<td align="left">0.74</td>
<td align="left"><bold>0.03</bold></td>
<td align="left">29.38</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 3</td>
<td align="left">2/1.9/0.1</td>
<td align="left">8.57</td>
<td align="left">8.12</td>
<td align="left">9.23</td>
<td align="left">6.46</td>
<td align="left">1.62</td>
<td align="left"><bold>0.26</bold></td>
<td align="left">25.54</td>
<td align="left">0.65</td>
<td align="left"><bold>0.08</bold></td>
<td align="left">27.44</td>
</tr>
<tr>
<td align="left">Transition 2 &#x2192; 3</td>
<td align="left">1/0.9/0.1</td>
<td align="left">6.04</td>
<td align="left">5.63</td>
<td align="left">5.49</td>
<td align="left">3.31</td>
<td align="left">1.03</td>
<td align="left"><bold>0.32</bold></td>
<td align="left">13.22</td>
<td align="left">1.31</td>
<td align="left"><bold>0.16</bold></td>
<td align="left">13.52</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 2</td>
<td align="left">2/1/0.1</td>
<td align="left">6.99</td>
<td align="left">7.94</td>
<td align="left">8.30</td>
<td align="left">21.26</td>
<td align="left">1.52</td>
<td align="left"><bold>0.35</bold></td>
<td align="left">21.74</td>
<td align="left">1.02</td>
<td align="left"><bold>0.04</bold></td>
<td align="left">22.13</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 3</td>
<td align="left">2/0.8/0.2</td>
<td align="left">8.23</td>
<td align="left">6.74</td>
<td align="left">7.62</td>
<td align="left">6.65</td>
<td align="left">2.52</td>
<td align="left"><bold>1.28</bold></td>
<td align="left">19.74</td>
<td align="left"><bold>0.41</bold></td>
<td align="left">1.66</td>
<td align="left">21.18</td>
</tr>
<tr>
<td align="left">Transition 2 &#x2192; 3</td>
<td align="left">1.5/0.5/0.3</td>
<td align="left">6.07</td>
<td align="left">6.34</td>
<td align="left">6.13</td>
<td align="left">5.07</td>
<td align="left"><bold>0.72</bold></td>
<td align="left">1.86</td>
<td align="left">15.14</td>
<td align="left">1.88</td>
<td align="left"><bold>0.43</bold></td>
<td align="left">15.89</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 2</td>
<td align="left">1/0.9/0.1</td>
<td align="left">6.56</td>
<td align="left">6.03</td>
<td align="left">5.89</td>
<td align="left">13.05</td>
<td align="left">1.48</td>
<td align="left"><bold>0.41</bold></td>
<td align="left">12.85</td>
<td align="left">0.99</td>
<td align="left"><bold>0.18</bold></td>
<td align="left">13.52</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 3</td>
<td align="left">1/0.6/0.1</td>
<td align="left">6.19</td>
<td align="left">5.21</td>
<td align="left">5.41</td>
<td align="left">2.97</td>
<td align="left">3.35</td>
<td align="left"><bold>1.93</bold></td>
<td align="left">10.05</td>
<td align="left">2.18</td>
<td align="left"><bold>0.55</bold></td>
<td align="left">11.58</td>
</tr>
<tr>
<td align="left">Transition 2 &#x2192; 3</td>
<td align="left">1/0.9/0.3</td>
<td align="left">5.64</td>
<td align="left">6.37</td>
<td align="left">6.43</td>
<td align="left">3.74</td>
<td align="left">1.47</td>
<td align="left"><bold>0.55</bold></td>
<td align="left">13.69</td>
<td align="left">1.63</td>
<td align="left"><bold>0.38</bold></td>
<td align="left">14.36</td>
</tr>
</tbody>
</table>
<table-wrap-foot><fn id="tfn1_1"><p>Note: MR.ANN: Artificial neural network based on martingale residual</p></fn>
<fn id="tfn1_2"><p>MR.RF: Random forest based on martingale residual</p></fn>
<fn id="tfn1_3"><p>MR.GBM: Gradient boosting machine based on martingale residual</p></fn>
<fn id="tfn1_4"><p>D.ANN: Artificial neural network based on deviance</p></fn>
<fn id="tfn1_5"><p>D.RF: Random forest based on deviance</p></fn>
<fn id="tfn1_6"><p>D.GBM: Gradient boosting machine based on deviance</p></fn>
<fn id="tfn1_7"><p>DR.ANN: Artificial neural network based on deviance residual</p></fn>
<fn id="tfn1_8"><p>DR.RF: Random forest based on deviance residual</p></fn>
<fn id="tfn1_9"><p>DR.GBM: Gradient boosting machine based on deviance residual</p></fn>
<fn id="tfn1_10"><p>SMSM: Stepwise transition-specific proportional intensities multi-state model</p></fn>
<fn id="tfn1_11"><p>For each row, two least values are indicated in bold.</p></fn>
</table-wrap-foot>
</table-wrap><table-wrap id="table-2"><label>Table 2</label><caption><title>The values of loss function for different classification methods and different residuals/time-to-event with follow-up time 90 days</title></caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th align="left"/>
<th align="left">Parameters</th>
<th align="left">D. ANN</th>
<th align="left">D. RF</th>
<th align="left">D. GBM</th>
<th align="left">DR. ANN</th>
<th align="left">DR. RF</th>
<th align="left">DR. GBM</th>
<th align="left">MR. ANN</th>
<th align="left">MR. RF</th>
<th align="left">MR. GBM</th>
<th align="left">SMSM</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" colspan="12">n&#x2009;&#x003D;&#x2009;500</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 2</td>
<td align="left">2/1.7/0.8</td>
<td align="left">9.62</td>
<td align="left">11.02</td>
<td align="left">1.77</td>
<td align="left">26.47</td>
<td align="left">3.02</td>
<td align="left"><bold>0.41</bold></td>
<td align="left">27.73</td>
<td align="left">2.46</td>
<td align="left"><bold>0.24</bold></td>
<td align="left">29.37</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 3</td>
<td align="left">2/1.9/0.1</td>
<td align="left">10.28</td>
<td align="left">9.37</td>
<td align="left">10.92</td>
<td align="left">7.87</td>
<td align="left">4.14</td>
<td align="left"><bold>0.42</bold></td>
<td align="left">24.67</td>
<td align="left">1.78</td>
<td align="left"><bold>0.25</bold></td>
<td align="left">27.49</td>
</tr>
<tr>
<td align="left">Transition 2 &#x2192; 3</td>
<td align="left">1/0.9/0.1</td>
<td align="left">6.07</td>
<td align="left">6.10</td>
<td align="left">6.78</td>
<td align="left">3.96</td>
<td align="left">2.72</td>
<td align="left"><bold>0.61</bold></td>
<td align="left">12.44</td>
<td align="left">3.09</td>
<td align="left"><bold>0.50</bold></td>
<td align="left">13.51</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 2</td>
<td align="left">2/1/0.1</td>
<td align="left">8.77</td>
<td align="left">8.44</td>
<td align="left">9.59</td>
<td align="left">20.51</td>
<td align="left">2.04</td>
<td align="left"><bold>0.62</bold></td>
<td align="left">20.99</td>
<td align="left">1.76</td>
<td align="left"><bold>0.19</bold></td>
<td align="left">22.11</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 3</td>
<td align="left">2/0.8/0.2</td>
<td align="left">9.09</td>
<td align="left">8.38</td>
<td align="left">9.44</td>
<td align="left">5.33</td>
<td align="left">4.17</td>
<td align="left"><bold>2.09</bold></td>
<td align="left">18.58</td>
<td align="left">2.79</td>
<td align="left"><bold>1.12</bold></td>
<td align="left">21.01</td>
</tr>
<tr>
<td align="left">Transition 2 &#x2192; 3</td>
<td align="left">1.5/0.5/0.3</td>
<td align="left">7.32</td>
<td align="left">6.30</td>
<td align="left">6.82</td>
<td align="left">4.05</td>
<td align="left">2.76</td>
<td align="left"><bold>1.16</bold></td>
<td align="left">14.37</td>
<td align="left">2.95</td>
<td align="left"><bold>1.10</bold></td>
<td align="left">15.56</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 2</td>
<td align="left">1/0.9/0.1</td>
<td align="left">6.15</td>
<td align="left">5.88</td>
<td align="left">6.53</td>
<td align="left">12.50</td>
<td align="left">3.06</td>
<td align="left"><bold>0.87</bold></td>
<td align="left">12.25</td>
<td align="left">2.63</td>
<td align="left"><bold>0.36</bold></td>
<td align="left">13.49</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 3</td>
<td align="left">1/0.6/0.1</td>
<td align="left">6.36</td>
<td align="left">5.43</td>
<td align="left">5.77</td>
<td align="left">3.49</td>
<td align="left">4.15</td>
<td align="left"><bold>2.61</bold></td>
<td align="left">8.78</td>
<td align="left">3.04</td>
<td align="left"><bold>1.26</bold></td>
<td align="left">11.02</td>
</tr>
<tr>
<td align="left">Transition 2 &#x2192; 3</td>
<td align="left">1/0.9/0.3</td>
<td align="left">6.85</td>
<td align="left">6.36</td>
<td align="left">6.38</td>
<td align="left">4.39</td>
<td align="left">3.36</td>
<td align="left"><bold>1.01</bold></td>
<td align="left">12.75</td>
<td align="left">3.80</td>
<td align="left"><bold>0.72</bold></td>
<td align="left">14.02</td>
</tr>
<tr>
<td align="left" colspan="12">n&#x2009;&#x003D;&#x2009;1000</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 2</td>
<td align="left">2/1.7/0.8</td>
<td align="left">8.06</td>
<td align="left">10.45</td>
<td align="left">10.42</td>
<td align="left">27.97</td>
<td align="left">1.07</td>
<td align="left"><bold>0.06</bold></td>
<td align="left">28.83</td>
<td align="left">1.01</td>
<td align="left"><bold>0.02</bold></td>
<td align="left">29.38</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 3</td>
<td align="left">2/1.9/0.1</td>
<td align="left">7.89</td>
<td align="left">8.61</td>
<td align="left">10.24</td>
<td align="left">9.72</td>
<td align="left">1.08</td>
<td align="left"><bold>0.32</bold></td>
<td align="left">26.37</td>
<td align="left">0.51</td>
<td align="left"><bold>0.08</bold></td>
<td align="left">27.45</td>
</tr>
<tr>
<td align="left">Transition 2 &#x2192; 3</td>
<td align="left">1/0.9/0.1</td>
<td align="left">6.26</td>
<td align="left">5.93</td>
<td align="left">6.54</td>
<td align="left">4.85</td>
<td align="left">1.24</td>
<td align="left"><bold>0.33</bold></td>
<td align="left">13.15</td>
<td align="left">1.57</td>
<td align="left"><bold>0.19</bold></td>
<td align="left">13.53</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 2</td>
<td align="left">2/1/0.1</td>
<td align="left">6.02</td>
<td align="left">7.45</td>
<td align="left">8.42</td>
<td align="left">21.32</td>
<td align="left">1.35</td>
<td align="left"><bold>0.28</bold></td>
<td align="left">21.75</td>
<td align="left">0.87</td>
<td align="left"><bold>0.03</bold></td>
<td align="left">22.17</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 3</td>
<td align="left">2/0.8/0.2</td>
<td align="left">8.74</td>
<td align="left">6.99</td>
<td align="left">8.21</td>
<td align="left">6.27</td>
<td align="left">2.77</td>
<td align="left"><bold>1.28</bold></td>
<td align="left">19.97</td>
<td align="left">1.95</td>
<td align="left"><bold>0.47</bold></td>
<td align="left">21.18</td>
</tr>
<tr>
<td align="left">Transition 2 &#x2192; 3</td>
<td align="left">1.5/0.5/0.3</td>
<td align="left">6.10</td>
<td align="left">6.31</td>
<td align="left">6.32</td>
<td align="left">4.72</td>
<td align="left">1.68</td>
<td align="left"><bold>0.85</bold></td>
<td align="left">14.98</td>
<td align="left">1.94</td>
<td align="left"><bold>0.58</bold></td>
<td align="left">15.89</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 2</td>
<td align="left">1/0.9/0.1</td>
<td align="left">6.31</td>
<td align="left">5.84</td>
<td align="left">6.29</td>
<td align="left">12.97</td>
<td align="left">1.36</td>
<td align="left"><bold>0.44</bold></td>
<td align="left">12.97</td>
<td align="left">1.14</td>
<td align="left"><bold>0.19</bold></td>
<td align="left">13.49</td>
</tr>
<tr>
<td align="left">Transition 1 &#x2192; 3</td>
<td align="left">1/0.6/0.1</td>
<td align="left">6.32</td>
<td align="left">5.51</td>
<td align="left">5.70</td>
<td align="left">3.84</td>
<td align="left">3.72</td>
<td align="left"><bold>1.48</bold></td>
<td align="left">10.12</td>
<td align="left">2.49</td>
<td align="left"><bold>0.56</bold></td>
<td align="left">11.58</td>
</tr>
<tr>
<td align="left">Transition 2 &#x2192; 3</td>
<td align="left">1/0.9/0.3</td>
<td align="left">5.81</td>
<td align="left">6.22</td>
<td align="left">6.41</td>
<td align="left">4.87</td>
<td align="left">1.85</td>
<td align="left"><bold>0.60</bold></td>
<td align="left">13.96</td>
<td align="left">2.25</td>
<td align="left"><bold>0.43</bold></td>
<td align="left">14.31</td>
</tr>
</tbody>
</table>
<table-wrap-foot><fn id="tfn2_1"><p>Note: MR.ANN: Artificial neural network based on martingale residual</p></fn>
<fn id="tfn2_2"><p>MR.RF: Random forest based on martingale residual</p></fn>
<fn id="tfn2_3"><p>MR.GBM: Gradient boosting machine based on martingale residual</p></fn>
<fn id="tfn2_4"><p>D.ANN: Artificial neural network based on deviance</p></fn>
<fn id="tfn2_5"><p>D.RF: Random forest based on deviance</p></fn>
<fn id="tfn2_6"><p>D.GBM: Gradient boosting machine based on deviance</p></fn>
<fn id="tfn2_7"><p>DR.ANN: Artificial neural network based on deviance residual</p></fn>
<fn id="tfn2_8"><p>DR.RF: Random forest based on deviance residual</p></fn>
<fn id="tfn2_9"><p>DR.GBM: Gradient boosting machine based on deviance residual</p></fn>
<fn id="tfn2_10"><p>SMSM: Stepwise transition-specific proportional intensities multi-state model</p></fn>
<fn id="tfn2_11"><p>For each row, two least values are indicated in bold.</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
</sec>
<sec id="s4"><label>4</label><title>Application</title>
<p>We apply our proposed two-step variable selection algorithm to a dataset of COVID-19 hospitalized patients. Each run took 30 s using an HP i5-laptop, RAM 8. The information of 2943 hospitalized patients with COVID-19 from February 20, 2020, to June 02, 2021 in Farshchian Medical Center and Shahid Beheshti Medical Center in Hamadan province, the west of Iran, was enrolled (<xref ref-type="table" rid="table-3">Table A</xref> in the <xref ref-type="sec" rid="app1">Appendix A</xref>). All of the patients were admitted to the ward (state 1). Then a patient may be transferred to the ICU (state 2) or die (state 3). The outcomes of interest were time to transfer to the ICU, time to death, and time to death after admission to the ICU. All patients who were alive at the end of the study were censored for death, and those who did not need to transfer to the ICU during the study were censored for admission to the ICU. The multi-state structure of the data is depicted in <xref ref-type="fig" rid="fig-1">Fig. 1</xref>. The matrix below shows the number of observed transitions between different health states:
<disp-formula id="ueqn-18"><mml:math id="mml-ueqn-18" display="block"><mml:mtable columnalign='left'><mml:mtr><mml:mtd><mml:mtext>&#x2003;&#x2003;&#x2003;&#x2003;&#x2003;&#x2003;</mml:mtext><mml:mn>1</mml:mn><mml:mtext>&#x2003;&#x2003;</mml:mtext><mml:mn>2</mml:mn><mml:mtext>&#x2003;&#x2003;</mml:mtext><mml:mn>3</mml:mn><mml:mtext>&#x2003;&#x2003;</mml:mtext></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mi>&#x03A9;</mml:mi><mml:mo>=</mml:mo><mml:mtable><mml:mtr><mml:mtd><mml:mn>1</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>2</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>3</mml:mn></mml:mtd></mml:mtr></mml:mtable><mml:mtext>&#x2003;</mml:mtext><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mtable><mml:mtr><mml:mtd><mml:mrow><mml:mn>2016</mml:mn></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mn>852</mml:mn></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mn>75</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mn>438</mml:mn></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mn>414</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mn>0</mml:mn></mml:mtd><mml:mtd><mml:mrow><mml:mn>489</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula></p>
<p>This matrix gives the number of direct transitions between health states. In total, 852 patients were transferred to the ICU. A total of 489 patients died during the follow-up; among them, 414 patients died in the ICU and 75 patients died in the ward. In addition, 2454 patients were recovered and were considered as censored. Among them, a number of 2016 patients were censored for both transfer to ICU and death events, and a number of 438 patients were censored for death.</p>
<p>The mean (SD) and median age of patients were 60.2 (17.11) and 61.0 years, respectively. The clinical and demographic information of the patients are given in <xref ref-type="sec" rid="app1">Appendix A</xref>. Here, we applied our proposed two-step algorithm to detect associated covariates with the risk of transition between different health states. We have used MR at the first step and gradient boosting at the second step. <xref ref-type="fig" rid="fig-2">Figs. 2a</xref>&#x2013;<xref ref-type="fig" rid="fig-2">2c</xref> depicts the variable importance associated with Admission in ward &#x2192; ICU, Admission in ward &#x2192; Death, and ICU &#x2192; Death, respectively. In this figure, the variables are ranked according to their importance. Based on <xref ref-type="fig" rid="fig-2">Fig. 2a</xref>, the most important variables for the time from admission in the ward to transfer to the ICU were saturation of peripheral oxygen (SPO2), age, lymphocyte count (LYM), lactate dehydrogenase (LDH), hemoglobin (Hb), blood sugar (BS), initial heart rate, body mass index (BMI), erythrocyte sedimentation rate (ESR), and Oseltamivire, respectively. The most important variables for the transition from ward to death were age, SPO2, LYM, LDH, Hb, initial heart rate, ESR, heart disease, BMI, and cancer, respectively (<xref ref-type="fig" rid="fig-2">Fig. 2b</xref>). Moreover, as given in <xref ref-type="fig" rid="fig-2">Fig. 2c</xref>, SPO2, age, Hb, hydroxychloroquine, LDH, LYM, BS, ESR, initial heart rate, BMI, and Koltra were the most important variables at the time of transition from ICU to death. Among many others, these variables were chosen as the most important (presented in <xref ref-type="sec" rid="app1">Appendix A</xref>).</p>
<fig id="fig-2"><label>Figure 2</label><caption><title>The results of variable importance based on martingale residual and gradient boosting algorithm for transitions from (a) Admission in ward to ICU, (b) Admission in ward to death, and (c) ICU to death, in hospitalized COVID-19 patients</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMES_22647-fig-2.png"/></fig>
<p>Our case study revealed that SPO2 was an important predictor of the time of transition to the ICU and the time of death (among patients who were transferred to the ICU or not). Other studies have revealed that SPO2 affects the survival time and the length of hospital stay in COVID-19 patients [<xref ref-type="bibr" rid="ref-28">28</xref>]. According to Zhao&#x00A0;et&#x00A0;al.&#x00A0;[<xref ref-type="bibr" rid="ref-29">29</xref>] a lower level of oxygen saturation at the time of admission is associated with a longer length of stay in the hospital.</p>
<p>Consistent with other studies, age was one of the most important predictors of the time of transition to ICU and the time of death among patients who were either transferred to ICU or not [<xref ref-type="bibr" rid="ref-28">28</xref>&#x2013;<xref ref-type="bibr" rid="ref-31">31</xref>]. Several studies revealed that older patients were more likely to be transferred to the ICU and die [<xref ref-type="bibr" rid="ref-28">28</xref>,<xref ref-type="bibr" rid="ref-29">29</xref>]. Moreover, it has been shown that the age of the patients can influence the effects of other risk factors such as LYM on COVID-19 outcomes [<xref ref-type="bibr" rid="ref-32">32</xref>], which is an important risk factor for the disease progression [<xref ref-type="bibr" rid="ref-31">31</xref>,<xref ref-type="bibr" rid="ref-33">33</xref>]. These findings are consistent with the results of previous studies proposing that have shown that lower levels of LYM are related to the higher risk of admission to the ICU [<xref ref-type="bibr" rid="ref-32">32</xref>,<xref ref-type="bibr" rid="ref-34">34</xref>], such that ICU admitted patients had a decreased fraction of LYM compared to the other patients who were admitted to the ward [<xref ref-type="bibr" rid="ref-29">29</xref>]. Moreover, lower levels of LYM in peripheral blood were observed among patients who died [<xref ref-type="bibr" rid="ref-29">29</xref>].</p>
<p>The results also showed that the level of Hb is another important predictor for all three transitions, especially for mortality among ICU admitted patients. Other researchers have conducted different studies to evaluate the association between the severity of the disease or mortality and anemia among COVID-19 patients and they have received controversial results [<xref ref-type="bibr" rid="ref-30">30</xref>,<xref ref-type="bibr" rid="ref-35">35</xref>&#x2013;<xref ref-type="bibr" rid="ref-39">39</xref>]. A meta-analysis study has shown that the levels of Hb were considerably lower than the normal level in patients with severe disease [<xref ref-type="bibr" rid="ref-40">40</xref>]. Several studies, mostly conducted in China, have shown an association between anemia and poor outcomes in hospitalized patients, and this may be because of its impact on immunity [<xref ref-type="bibr" rid="ref-38">38</xref>,<xref ref-type="bibr" rid="ref-39">39</xref>].</p>
</sec>
<sec id="s5"><label>5</label><title>Discussions and Conclusions</title>
<p>Estimating variable importance in a multi-state process can be a challenging issue due to the complex relationships between variables. Classic variable selection methods like stepwise proportional hazards regression or penalized methods fail to provide an estimation of variable importance or considering complex/non-linear relationships between the inputs and outputs as well as the interaction between covariates. Therneau&#x00A0;et&#x00A0;al.&#x00A0;[<xref ref-type="bibr" rid="ref-19">19</xref>] suggested that the martingale residual obtained from a null Cox model can be used as the outcome variable, so that usual regression analysis or classification methods like machine learning techniques can be applied to the new outcome variable (martingale residuals), and they provide very good results for survival data. This approach has also been applied for analyzing multivariate survival data like recurrent events. Nevertheless, few attempts have been made in relation to the multi-state data. In this paper, we proposed a two-step algorithm to evaluate the variable&#x2019;s importance for MSMs. We applied neural network, random forest, and gradient boosting algorithms to the martingale residuals, deviance, and deviance residuals made from an MSM and compared their results. The simulation studies revealed that using gradient boosting on the martingale residuals and deviance residuals outperforms other algorithms. This may be due to the fact that the gradient boosting is trained sequentially, so that in the training process the errors in the previous steps are corrected. This is in contrast to methods like random forest, where the trees are parallel and are made independently. Gradient boosting is also able to capture complex patterns in the data. Moreover, the individual predictions, obtained from several independent trees (that are determined in any order) in the random forest, are aggregated (by the principal of the majority vote or the average value), while the sequence of gradient boosting does not change (it runs in a fixed order). Boosted trees are prone to overfitting and begin modeling the noise in the presence of noisy data, despite the benefits of gradient boosting.</p>
<p>In this study, we assumed a continuous and Markov multi-state process. Nevertheless, in other contexts, a semi-Markov or non-Markov process could be defined as well. In addition to considering ensemble methods, other model selection methods like support vector machines and deep learning methods can be introduced into the proposed residuals, which can be considered as a future work. Also, optimization of tuning parameters using heuristic algorithms like genetic algorithms or Bayesian optimization methods is worth investigating in future studies.</p>
<p>Here, we used our proposed method to identify the important variables at the time of transition from the ward to the ICU and death among COVID-19 patients who were admitted to the hospital. It should be noted that our goal in this case was simply to identify the most important variables influencing the risk of transitioning between different health states. To assess the direction and impact of the selected variables on the risk of different transitions, the use of classical multi-state models can be used as a complementary method. It is noteworthy that while there are too many studies that have utilized machine learning methods in COVID-19 data sets, no study has hybridized machine learning and multi-state methods, especially in analyzing COVID-19 data sets. This highlights the novelty aspect of this study.</p>
<p><bold>Limitations</bold></p>
<p>There were some limitations to the present study. In the data used in this study, information on only three states was available, including admission to the ward, ICU, and death. Although, there were patients who were retransferred to the ward from the ICU, their information was not available. So, our example had the illness-death multi-state structure. It is suggested to analyze more complex multi-state data sets with the provided model in this study. Another limitation was that dealing with time-dependent covariates is only possible by using martingale residual and adjusting martingale residual. Despite these limitations, the proposed method can be easily applied to the context of high-dimensional data like genome-wide association studies and medical image data to detect the most important genes or brain regions associated with survival outcomes more accurately than classic statistical methods.</p>
</sec>
</body>
<back>
<ack>
<p>We would like to appreciate the Vice-chancellor of Education of the Hamadan University of Medical Science for technical support for their approval and support of this work.</p>
</ack>
<fn-group>
<fn fn-type="other"><p><bold>Ethics Approval and Consent to Participate:</bold> The data were collected from the patients&#x2019; medical recodes that have already been discharged and were not accessible for giving informed consent. A waiver of informed consent was awarded for the analysis conducted in this study by the Ethical Committee of the Hamadan University of Medical Sciences. All methods were carried out in accordance with relevant guidelines and regulations, and the study was approved by the Ethical Committee of the Hamadan University of Medical Sciences (IR.UMSHA.REC.1401.251; No. 140104072334).</p></fn>
<fn fn-type="other"><p><bold>Funding Statement:</bold> The authors received no specific funding for this study.</p></fn>
<fn fn-type="conflict"><p><bold>Conflicts of Interest:</bold> The authors declare that they have no conflicts of interest to report regarding the present study.</p></fn>
</fn-group>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>1.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ferrer</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Rondeau</surname>, <given-names>V.</given-names></string-name>, <string-name><surname>Dignam</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Pickles</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Jacqmin-Gadda</surname>, <given-names>H.</given-names></string-name> <etal>et al.</etal></person-group> (<year>2016</year>). <article-title>Joint modelling of longitudinal and multi-state processes: Application to clinical progressions in prostate cancer</article-title>. <source>Statistics in Medicine</source><italic>,</italic> <volume>35</volume><issue>(22)</issue><italic>,</italic> <fpage>3933</fpage>&#x2013;<lpage>3948</lpage>. DOI <pub-id pub-id-type="doi">10.1002/sim.6972</pub-id>.</mixed-citation></ref>
<ref id="ref-2"><label>2.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Mirzaee</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Mohammad</surname>, <given-names>K.</given-names></string-name>, <string-name><surname>Mahmoodi</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Zeraati</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Ebadzadeh</surname>, <given-names>M. R.</given-names></string-name> <etal>et al.</etal></person-group> (<year>2014</year>). <article-title>Multi-state survival analysis in renal transplantation recipients</article-title>. <source>Iranian Journal of Public Health</source><italic>,</italic> <volume>43</volume><issue>(3)</issue><italic>,</italic> <fpage>316</fpage>&#x2013;<lpage>322</lpage>.</mixed-citation></ref>
<ref id="ref-3"><label>3.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ursino</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Dupuis</surname>, <given-names>C.</given-names></string-name>, <string-name><surname>Buetti</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>de Montmollin</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Bouadma</surname>, <given-names>L.</given-names></string-name> <etal>et al.</etal></person-group> (<year>2021</year>). <article-title>Multistate modeling of COVID-19 patients using a large multicentric prospective cohort of critically ill patients</article-title>. <source>Journal of Clinical Medicine</source><italic>,</italic> <volume>10</volume><issue>(3)</issue><italic>,</italic> <fpage>544</fpage>. DOI <pub-id pub-id-type="doi">10.3390/jcm10030544</pub-id>.</mixed-citation></ref>
<ref id="ref-4"><label>4.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Beesley</surname>, <given-names>L. J.</given-names></string-name>, <string-name><surname>Morgan</surname>, <given-names>T. M.</given-names></string-name>, <string-name><surname>Spratt</surname>, <given-names>D. E.</given-names></string-name>, <string-name><surname>Singhal</surname>, <given-names>U.</given-names></string-name>, <string-name><surname>Feng</surname>, <given-names>F. Y.</given-names></string-name> <etal>et al.</etal></person-group> (<year>2019</year>). <article-title>Individual and population comparisons of surgery and radiotherapy outcomes in prostate cancer using Bayesian multistate models</article-title>. <source>JAMA Network Open</source><italic>,</italic> <volume>2</volume><issue>(2)</issue><italic>,</italic> <fpage>e187765</fpage>. DOI <pub-id pub-id-type="doi">10.1001/jamanetworkopen.2018.7765</pub-id>.</mixed-citation></ref>
<ref id="ref-5"><label>5.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Dang</surname>, <given-names>X.</given-names></string-name>, <string-name><surname>Huang</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Qian</surname>, <given-names>X.</given-names></string-name></person-group> (<year>2021</year>). <article-title>Risk factor identification in heterogeneous disease progression with L1-regularized multi-state models</article-title>. <source>Journal of Healthcare Informatics Research</source><italic>,</italic> <volume>5</volume><issue>(1)</issue><italic>,</italic> <fpage>20</fpage>&#x2013;<lpage>53</lpage>. DOI <pub-id pub-id-type="doi">10.1007/s41666-020-00085-1</pub-id>.</mixed-citation></ref>
<ref id="ref-6"><label>6.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Duan</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Fu</surname>, <given-names>H.</given-names></string-name></person-group> (<year>2015</year>). <article-title>Estimate variable importance for recurrent event outcomes with an application to identify hypoglycemia risk factors</article-title>. <source>Statistics in Medicine</source><italic>,</italic> <volume>34</volume><issue>(19)</issue><italic>,</italic> <fpage>2743</fpage>&#x2013;<lpage>2754</lpage>. DOI <pub-id pub-id-type="doi">10.1002/sim.6516</pub-id>.</mixed-citation></ref>
<ref id="ref-7"><label>7.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Sennhenn-Reulen</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Kneib</surname>, <given-names>T.</given-names></string-name></person-group> (<year>2016</year>). <article-title>Structured fusion lasso penalized multi-state models</article-title>. <source>Statistics in Medicine</source><italic>,</italic> <volume>35</volume><issue>(25)</issue><italic>,</italic> <fpage>4637</fpage>&#x2013;<lpage>4659</lpage>. DOI <pub-id pub-id-type="doi">10.1002/sim.7017</pub-id>.</mixed-citation></ref>
<ref id="ref-8"><label>8.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>van der Laan</surname>, <given-names>M. J.</given-names></string-name></person-group> (<year>2006</year>). <article-title>Statistical inference for variable importance</article-title>. <source>The International Journal of Biostatistics</source><italic>,</italic> <volume>2</volume><issue>(1)</issue><italic>,</italic> <fpage>1</fpage>&#x2013;<lpage>31</lpage>. DOI <pub-id pub-id-type="doi">10.2202/1557-4679.1008</pub-id>.</mixed-citation></ref>
<ref id="ref-9"><label>9.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Stachel</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Daniel</surname>, <given-names>K.</given-names></string-name>, <string-name><surname>Ding</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Francois</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Phillips</surname>, <given-names>M.</given-names></string-name> <etal>et al.</etal></person-group> (<year>2021</year>). <article-title>Development and validation of a machine learning model to predict mortality risk in patients with COVID-19</article-title>. <source>BMJ Health &#x0026; Care Informatics</source><italic>,</italic> <volume>28</volume><issue>(1)</issue><italic>,</italic> <fpage>e100235</fpage>. DOI <pub-id pub-id-type="doi">10.1136/bmjhci-2020-100235</pub-id>.</mixed-citation></ref>
<ref id="ref-10"><label>10.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Snider</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>McBean</surname>, <given-names>E. A.</given-names></string-name>, <string-name><surname>Yawney</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Gadsden</surname>, <given-names>S. A.</given-names></string-name>, <string-name><surname>Patel</surname>, <given-names>B.</given-names></string-name></person-group> (<year>2021</year>). <article-title>Identification of variable importance for predictions of mortality from COVID-19 using AI models for Ontario, Canada</article-title>. <source>Frontiers in Public Health</source><italic>,</italic> <volume>9</volume><italic>,</italic> <fpage>6757</fpage>&#x2013;<lpage>6766</lpage>.</mixed-citation></ref>
<ref id="ref-11"><label>11.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Hazard</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Kaier</surname>, <given-names>K.</given-names></string-name>, <string-name><surname>von Cube</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Grodd</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Bugiera</surname>, <given-names>L.</given-names></string-name> <etal>et al.</etal></person-group> (<year>2020</year>). <article-title>Joint analysis of duration of ventilation, length of intensive care, and mortality of COVID-19 patients: A multistate approach</article-title>. <source>BMC Medical Research Methodology</source><italic>,</italic> <volume>20</volume><issue>(1)</issue><italic>,</italic> <fpage>1</fpage>&#x2013;<lpage>9</lpage>. DOI <pub-id pub-id-type="doi">10.1186/s12874-020-01082-z</pub-id>.</mixed-citation></ref>
<ref id="ref-12"><label>12.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Breiman</surname>, <given-names>L.</given-names></string-name></person-group> (<year>2001</year>). <article-title>Random forests</article-title>. <source>Machine Learning</source><italic>,</italic> <volume>45</volume><issue>(1)</issue><italic>,</italic> <fpage>5</fpage>&#x2013;<lpage>32</lpage>. DOI <pub-id pub-id-type="doi">10.1023/A:1010933404324</pub-id>.</mixed-citation></ref>
<ref id="ref-13"><label>13.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Breiman</surname>, <given-names>L.</given-names></string-name></person-group> (<year>1996</year>). <article-title>Bagging predictors</article-title>. <source>Machine Learning</source><italic>,</italic> <volume>24</volume><issue>(2)</issue><italic>,</italic> <fpage>123</fpage>&#x2013;<lpage>140</lpage>. DOI <pub-id pub-id-type="doi">10.1007/BF00058655</pub-id>.</mixed-citation></ref>
<ref id="ref-14"><label>14.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Negassa</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Ciampi</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Abrahamowicz</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Shapiro</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Boivin</surname>, <given-names>J. F.</given-names></string-name></person-group> (<year>2005</year>). <article-title>Tree-structured subgroup analysis for censored survival data: Validation of computationally inexpensive model selection criteria</article-title>. <source>Statistics and Computing</source><italic>,</italic> <volume>15</volume><issue>(3)</issue><italic>,</italic> <fpage>231</fpage>&#x2013;<lpage>239</lpage>. DOI <pub-id pub-id-type="doi">10.1007/s11222-005-1311-z</pub-id>.</mixed-citation></ref>
<ref id="ref-15"><label>15.</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><surname>Andersen</surname>, <given-names>P. K.</given-names></string-name>, <string-name><surname>Borgan</surname>, <given-names>O.</given-names></string-name>, <string-name><surname>Gill</surname>, <given-names>R. D.</given-names></string-name>, <string-name><surname>Keiding</surname>, <given-names>N.</given-names></string-name></person-group> (<year>1993</year>). <source>Statistical models based on counting processes</source>. <publisher-loc>New York</publisher-loc>: <publisher-name>Springer-Verlag</publisher-name>.</mixed-citation></ref>
<ref id="ref-16"><label>16.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Andersen</surname>, <given-names>P. K.</given-names></string-name>, <string-name><surname>Gill</surname>, <given-names>R. D.</given-names></string-name></person-group> (<year>1982</year>). <article-title>Cox&#x2019;s regression model for counting processes: A large sample study</article-title>. <source>The Annals of Statistics</source><italic>,</italic> <volume>10</volume><issue>(4)</issue><italic>,</italic> <fpage>1100</fpage>&#x2013;<lpage>1120</lpage>. DOI <pub-id pub-id-type="doi">10.1214/aos/1176345976</pub-id>.</mixed-citation></ref>
<ref id="ref-17"><label>17.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Cox</surname>, <given-names>D. R.</given-names></string-name></person-group> (<year>1972</year>). <article-title>Regression models and life tables</article-title>. <source>Journal of the Royal Statistical Society: Series B (Methodological)</source><italic>,</italic> <volume>34</volume><issue>(2)</issue><italic>,</italic> <fpage>187</fpage>&#x2013;<lpage>202</lpage>.</mixed-citation></ref>
<ref id="ref-18"><label>18.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Cox</surname>, <given-names>D. R.</given-names></string-name></person-group> (<year>1972</year>). <article-title>Breslow&#x2019;s commons on regression models and life-tables</article-title>. <source>Journal of the Royal Statistical Society: Series B (Methodological)</source><italic>,</italic> <volume>23</volume><italic>,</italic> <fpage>187</fpage>&#x2013;<lpage>220</lpage>.</mixed-citation></ref>
<ref id="ref-19"><label>19.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Therneau</surname>, <given-names>T. M.</given-names></string-name>, <string-name><surname>Grambsch</surname>, <given-names>P. M.</given-names></string-name>, <string-name><surname>Fleming</surname>, <given-names>T. R.</given-names></string-name></person-group> (<year>1990</year>). <article-title>Martingale-based residuals for survival models</article-title>. <source>Biometrika</source><italic>,</italic> <volume>77</volume><issue>(1)</issue><italic>,</italic> <fpage>147</fpage>&#x2013;<lpage>160</lpage>. DOI <pub-id pub-id-type="doi">10.1093/biomet/77.1.147</pub-id>.</mixed-citation></ref>
<ref id="ref-20"><label>20.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Natekin</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Knoll</surname>, <given-names>A.</given-names></string-name></person-group> (<year>2013</year>). <article-title>Gradient boosting machines, a tutorial</article-title>. <source>Frontiers in Neurorobotics</source><italic>,</italic> <volume>7</volume><italic>,</italic> <fpage>21</fpage>. DOI <pub-id pub-id-type="doi">10.3389/fnbot.2013.00021</pub-id>.</mixed-citation></ref>
<ref id="ref-21"><label>21.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Freund</surname>, <given-names>Y.</given-names></string-name>, <string-name><surname>Schapire</surname>, <given-names>R. E.</given-names></string-name></person-group> (<year>1997</year>). <article-title>A decision-theoretic generalization of on-line learning and an application to boosting</article-title>. <source>Journal of Computer and System Sciences</source><italic>,</italic> <volume>55</volume><issue>(1)</issue><italic>,</italic> <fpage>119</fpage>&#x2013;<lpage>139</lpage>. DOI <pub-id pub-id-type="doi">10.1006/jcss.1997.1504</pub-id>.</mixed-citation></ref>
<ref id="ref-22"><label>22.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Friedman</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Hastie</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Tibshirani</surname>, <given-names>R.</given-names></string-name></person-group> (<year>2000</year>). <article-title>Additive logistic regression: A statistical view of boosting (with discussion and a rejoinder by the authors)</article-title>. <source>The Annals of Statistics</source><italic>,</italic> <volume>28</volume><issue>(2)</issue><italic>,</italic> <fpage>337</fpage>&#x2013;<lpage>407</lpage>. DOI <pub-id pub-id-type="doi">10.1214/aos/1016218223</pub-id>.</mixed-citation></ref>
<ref id="ref-23"><label>23.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Friedman</surname>, <given-names>J. H.</given-names></string-name></person-group> (<year>2001</year>). <article-title>Greedy function approximation: A gradient boosting machine</article-title>. <source>Annals of Statistics</source><italic>,</italic> <volume>29</volume><issue>(5)</issue><italic>,</italic> <fpage>1189</fpage>&#x2013;<lpage>1232</lpage>. DOI <pub-id pub-id-type="doi">10.1214/aos/1013203451</pub-id>.</mixed-citation></ref>
<ref id="ref-24"><label>24.</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><surname>Raj</surname>, <given-names>P.</given-names></string-name>, <string-name><surname>Evangeline</surname>, <given-names>P.</given-names></string-name></person-group> (<year>2020</year>). <source>The digital twin paradigm for smarter systems and environments: The industry use cases</source>. <publisher-loc>London, UK</publisher-loc>: <publisher-name>Academic Press</publisher-name>.</mixed-citation></ref>
<ref id="ref-25"><label>25.</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><surname>Beyersmann</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Allignol</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Schumacher</surname>, <given-names>M.</given-names></string-name></person-group> (<year>2012</year>). <source>Competing risks and multistate models with R</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer Science &#x0026; Business Media</publisher-name>.</mixed-citation></ref>
<ref id="ref-26"><label>26.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Crowther</surname>, <given-names>M. J.</given-names></string-name>, <string-name><surname>Lambert</surname>, <given-names>P. C.</given-names></string-name></person-group> (<year>2013</year>). <article-title>Simulating biologically plausible complex survival data</article-title>. <source>Statistics in Medicine</source><italic>,</italic> <volume>32</volume><issue>(23)</issue><italic>,</italic> <fpage>4118</fpage>&#x2013;<lpage>4134</lpage>. DOI <pub-id pub-id-type="doi">10.1002/sim.5823</pub-id>.</mixed-citation></ref>
<ref id="ref-27"><label>27.</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><surname>Brent</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Richard</surname>, <given-names>P.</given-names></string-name></person-group> (<year>1973</year>). <source>Algorithms for minimization without derivatives</source>. <publisher-loc>New York</publisher-loc>: <publisher-name>Dover Publications</publisher-name>.</mixed-citation></ref>
<ref id="ref-28"><label>28.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Nasir</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Perveen</surname>, <given-names>R. A.</given-names></string-name>, <string-name><surname>Ahmad</surname>, <given-names>S. N.</given-names></string-name>, <string-name><surname>Nazneen</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Ahmed</surname>, <given-names>S. M. P.</given-names></string-name></person-group> (<year>2021</year>). <article-title>Outcome of instrumental oxygen therapy in COVID-19: Survivors versus non-survivors in Bangladeshi cohort</article-title>. <source>American Journal of Internal Medicine</source><italic>,</italic> <volume>9</volume><issue>(1)</issue><italic>,</italic> <fpage>52</fpage>&#x2013;<lpage>57</lpage>. DOI <pub-id pub-id-type="doi">10.11648/j.ajim.20210901.18</pub-id>.</mixed-citation></ref>
<ref id="ref-29"><label>29.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Zhao</surname>, <given-names>Z.</given-names></string-name>, <string-name><surname>Chen</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Hou</surname>, <given-names>W.</given-names></string-name>, <string-name><surname>Graham</surname>, <given-names>J. M.</given-names></string-name>, <string-name><surname>Li</surname>, <given-names>H.</given-names></string-name> <etal>et al.</etal></person-group> (<year>2020</year>). <article-title>Prediction model and risk scores of ICU admission and mortality in COVID-19</article-title>. <source>PLoS One</source><italic>,</italic> <volume>15</volume><issue>(7)</issue><italic>,</italic> <fpage>e0236618</fpage>. DOI <pub-id pub-id-type="doi">10.1371/journal.pone.0236618</pub-id>.</mixed-citation></ref>
<ref id="ref-30"><label>30.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Guan</surname>, <given-names>W. J.</given-names></string-name>, <string-name><surname>Ni</surname>, <given-names>Z. Y.</given-names></string-name>, <string-name><surname>Hu</surname>, <given-names>Y.</given-names></string-name>, <string-name><surname>Liang</surname>, <given-names>W. H.</given-names></string-name>, <string-name><surname>Ou</surname>, <given-names>C. Q.</given-names></string-name> <etal>et al.</etal></person-group> (<year>2020</year>). <article-title>Clinical characteristics of coronavirus disease 2019 in China</article-title>. <source>New England Journal of Medicine</source><italic>,</italic> <volume>382</volume><issue>(18)</issue><italic>,</italic> <fpage>1708</fpage>&#x2013;<lpage>1720</lpage>. DOI <pub-id pub-id-type="doi">10.1056/NEJMoa2002032</pub-id>.</mixed-citation></ref>
<ref id="ref-31"><label>31.</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Lu</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Hu</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Fan</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Liu</surname>, <given-names>Z.</given-names></string-name>, <string-name><surname>Yin</surname>, <given-names>X.</given-names></string-name> <etal>et al.</etal></person-group> (<year>2020</year>). <article-title>ACP risk grade: A simple mortality index for patients with confirmed or suspected severe acute respiratory syndrome coronavirus 2 disease (COVID-19) during the early stage of outbreak in Wuhan, China</article-title>. DOI <pub-id pub-id-type="doi">10.1101/2020.02.20.20025510</pub-id>.</mixed-citation></ref>
<ref id="ref-32"><label>32.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Huang</surname>, <given-names>C.</given-names></string-name>, <string-name><surname>Wang</surname>, <given-names>Y.</given-names></string-name>, <string-name><surname>Li</surname>, <given-names>X.</given-names></string-name>, <string-name><surname>Ren</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Zhao</surname>, <given-names>J.</given-names></string-name> <etal>et al.</etal></person-group> (<year>2020</year>). <article-title>Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China</article-title>. <source>The Lancet</source><italic>,</italic> <volume>395</volume><issue>(10223)</issue><italic>,</italic> <fpage>497</fpage>&#x2013;<lpage>506</lpage>. DOI <pub-id pub-id-type="doi">10.1016/S0140-6736(20)30183-5</pub-id>.</mixed-citation></ref>
<ref id="ref-33"><label>33.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Hou</surname>, <given-names>W.</given-names></string-name>, <string-name><surname>Zhang</surname>, <given-names>W.</given-names></string-name>, <string-name><surname>Jin</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Liang</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Xu</surname>, <given-names>B.</given-names></string-name> <etal>et al.</etal></person-group> (<year>2020</year>). <article-title>Risk factors for disease progression in hospitalized patients with COVID-19: A retrospective cohort study</article-title>. <source>Infectious Diseases</source><italic>,</italic> <volume>52</volume><issue>(7)</issue><italic>,</italic> <fpage>498</fpage>&#x2013;<lpage>505</lpage>. DOI <pub-id pub-id-type="doi">10.1080/23744235.2020.1759817</pub-id>.</mixed-citation></ref>
<ref id="ref-34"><label>34.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ng</surname>, <given-names>D. H.</given-names></string-name>, <string-name><surname>Choy</surname>, <given-names>C. Y.</given-names></string-name>, <string-name><surname>Chan</surname>, <given-names>Y. H.</given-names></string-name>, <string-name><surname>Young</surname>, <given-names>B. E.</given-names></string-name>, <string-name><surname>Fong</surname>, <given-names>S. W.</given-names></string-name> <etal>et al.</etal></person-group> (<year>2020</year>). <article-title>Fever patterns, cytokine profiles, and outcomes in COVID-19</article-title>. <source>Open Forum Infectious Diseases</source><italic>,</italic> <volume>7</volume><issue>(9)</issue><italic>,</italic> <fpage>ofaa375</fpage>. DOI <pub-id pub-id-type="doi">10.1093/ofid/ofaa375</pub-id>.</mixed-citation></ref>
<ref id="ref-35"><label>35.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Bellmann-Weiler</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Lanser</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Barket</surname>, <given-names>R.</given-names></string-name>, <string-name><surname>Rangger</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Schapfl</surname>, <given-names>A.</given-names></string-name> <etal>et al.</etal></person-group> (<year>2020</year>). <article-title>Prevalence and predictive value of anemia and dysregulated iron homeostasis in patients with COVID-19 infection</article-title>. <source>Journal of Clinical Medicine</source><italic>,</italic> <volume>9</volume><issue>(8)</issue><italic>,</italic> <fpage>2429</fpage>. DOI <pub-id pub-id-type="doi">10.3390/jcm9082429</pub-id>.</mixed-citation></ref>
<ref id="ref-36"><label>36.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Tao</surname>, <given-names>Z.</given-names></string-name>, <string-name><surname>Xu</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Chen</surname>, <given-names>W.</given-names></string-name>, <string-name><surname>Yang</surname>, <given-names>Z.</given-names></string-name>, <string-name><surname>Xu</surname>, <given-names>X.</given-names></string-name> <etal>et al.</etal></person-group> (<year>2021</year>). <article-title>Anemia is associated with severe illness in COVID-19: A retrospective cohort study</article-title>. <source>Journal of Medical Virology</source><italic>,</italic> <volume>93</volume><issue>(3)</issue><italic>,</italic> <fpage>1478</fpage>&#x2013;<lpage>1488</lpage>. DOI <pub-id pub-id-type="doi">10.1002/jmv.26444</pub-id>.</mixed-citation></ref>
<ref id="ref-37"><label>37.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Young</surname>, <given-names>B. E.</given-names></string-name>, <string-name><surname>Ong</surname>, <given-names>S. W. X.</given-names></string-name>, <string-name><surname>Kalimuddin</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Low</surname>, <given-names>J. G.</given-names></string-name>, <string-name><surname>Tan</surname>, <given-names>S. Y.</given-names></string-name> <etal>et al.</etal></person-group> (<year>2020</year>). <article-title>Epidemiologic features and clinical course of patients infected with SARS-CoV-2 in Singapore</article-title>. <source>JAMA</source><italic>,</italic> <volume>323</volume><issue>(15)</issue><italic>,</italic> <fpage>1488</fpage>&#x2013;<lpage>1494</lpage>. DOI <pub-id pub-id-type="doi">10.1001/jama.2020.3204</pub-id>.</mixed-citation></ref>
<ref id="ref-38"><label>38.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ryan</surname>, <given-names>A. S.</given-names></string-name></person-group> (<year>1997</year>). <article-title>Iron-deficiency anemia in infant development: Implications for growth, cognitive development, resistance to infection, and iron supplementation</article-title>. <source>American Journal of Biological Anthropology</source><italic>,</italic> <volume>104</volume><issue>(S25)</issue><italic>,</italic> <fpage>25</fpage>&#x2013;<lpage>62</lpage>. DOI <pub-id pub-id-type="doi">10.1002/(ISSN)1096-8644</pub-id>.</mixed-citation></ref>
<ref id="ref-39"><label>39.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Dinevari</surname>, <given-names>M. F.</given-names></string-name>, <string-name><surname>Somi</surname>, <given-names>M. H.</given-names></string-name>, <string-name><surname>Majd</surname>, <given-names>E. S.</given-names></string-name>, <string-name><surname>Farhangi</surname>, <given-names>M. A.</given-names></string-name>, <string-name><surname>Nikniaz</surname>, <given-names>Z.</given-names></string-name></person-group> (<year>2021</year>). <article-title>Anemia predicts poor outcomes of COVID-19 in hospitalized patients: A prospective study in Iran</article-title>. <source>BMC Infectious Diseases</source><italic>,</italic> <volume>21</volume><issue>(1)</issue><italic>,</italic> <fpage>1</fpage>&#x2013;<lpage>7</lpage>.</mixed-citation></ref>
<ref id="ref-40"><label>40.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Lippi</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Mattiuzzi</surname>, <given-names>C.</given-names></string-name></person-group> (<year>2020</year>). <article-title>Hemoglobin value may be decreased in patients with severe coronavirus disease 2019</article-title>. <source>Hematology, Transfusion and Cell Therapy</source><italic>,</italic> <volume>42</volume><italic>,</italic> <fpage>116</fpage>&#x2013;<lpage>117</lpage>.</mixed-citation></ref>
</ref-list>
<app-group id="appg1">
<app id="app1"><label></label>
<title><bold>Appendix A.</bold> Characteristics of COVID-19 patients</title>
<table-wrap id="table-3"><label>Table A</label><caption><title>Characteristics of the study population infected with COVID-19 (n&#x2009;&#x003D;&#x2009;2943)</title></caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th align="left">Variable</th>
<th align="left">Category</th>
<th align="left">Number</th>
<th align="left">Percent</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" rowspan="2">Sex</td>
<td align="left">Male</td>
<td align="left">1565</td>
<td align="left">53.2</td>
</tr>
<tr>
<td align="left">Female</td>
<td align="left">1378</td>
<td align="left">46.8</td>
</tr>
<tr>
<td align="left" rowspan="2">Marital status</td>
<td align="left">Married</td>
<td align="left">2560</td>
<td align="left">87.4</td>
</tr>
<tr>
<td align="left">Single</td>
<td align="left">368</td>
<td align="left">12.6</td>
</tr>
<tr>
<td align="left" rowspan="2">Resident</td>
<td align="left">Urban</td>
<td align="left">2452</td>
<td align="left">83.3</td>
</tr>
<tr>
<td align="left">Ruler</td>
<td align="left">490</td>
<td align="left">16.7</td>
</tr>
<tr>
<td align="left" rowspan="2">Smoking</td>
<td align="left">No</td>
<td align="left">2727</td>
<td align="left">92.7</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">216</td>
<td align="left">7.3</td>
</tr>
<tr>
<td align="left" rowspan="2">Substance abuse</td>
<td align="left">No</td>
<td align="left">2740</td>
<td align="left">93.1</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">203</td>
<td align="left">6.9</td>
</tr>
<tr>
<td align="left" rowspan="2">Lung disease</td>
<td align="left">No</td>
<td align="left">2641</td>
<td align="left">89.7</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">302</td>
<td align="left">10.3</td>
</tr>
<tr>
<td align="left" rowspan="2">Heart disease</td>
<td align="left">No</td>
<td align="left">2487</td>
<td align="left">84.5</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">456</td>
<td align="left">15.5</td>
</tr>
<tr>
<td align="left" rowspan="2">Diabetes</td>
<td align="left">No</td>
<td align="left">2384</td>
<td align="left">81.0</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">559</td>
<td align="left">19.0</td>
</tr>
<tr>
<td align="left" rowspan="2">Kidney disease</td>
<td align="left">No</td>
<td align="left">2821</td>
<td align="left">95.9</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">122</td>
<td align="left">4.1</td>
</tr>
<tr>
<td align="left" rowspan="2">Hypertension</td>
<td align="left">No</td>
<td align="left">1942</td>
<td align="left">66.0</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">1001</td>
<td align="left">34.0</td>
</tr>
<tr>
<td align="left" rowspan="2">Liver disease</td>
<td align="left">No</td>
<td align="left">2919</td>
<td align="left">99.2</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">24</td>
<td align="left">0.80</td>
</tr>
<tr>
<td align="left" rowspan="2">Cancer</td>
<td align="left">No</td>
<td align="left">2884</td>
<td align="left">98.0</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">59</td>
<td align="left">2.0</td>
</tr>
<tr>
<td align="left" rowspan="2">Weakness, lethargy, and fatigue</td>
<td align="left">No</td>
<td align="left">2620</td>
<td align="left">89.0</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">323</td>
<td align="left">11.0</td>
</tr>
<tr>
<td align="left" rowspan="2">Lack of smell</td>
<td align="left">No</td>
<td align="left">2884</td>
<td align="left">98.0</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">59</td>
<td align="left">2.0</td>
</tr>
<tr>
<td align="left" rowspan="2">Vomiting</td>
<td align="left">No</td>
<td align="left">2219</td>
<td align="left">75.4</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">724</td>
<td align="left">24.6</td>
</tr>
<tr>
<td align="left" rowspan="2">Diarrhea</td>
<td align="left">No</td>
<td align="left">2654</td>
<td align="left">90.2</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">289</td>
<td align="left">9.8</td>
</tr>
<tr>
<td align="left" rowspan="2">Fever</td>
<td align="left">No</td>
<td align="left">1417</td>
<td align="left">48.1</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">1526</td>
<td align="left">51.9</td>
</tr>
<tr>
<td align="left" rowspan="2">Shortness of breath</td>
<td align="left">No</td>
<td align="left">1220</td>
<td align="left">41.5</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">1723</td>
<td align="left">58.5</td>
</tr>
<tr>
<td align="left" rowspan="2">Muscle pain</td>
<td align="left">No</td>
<td align="left">1610</td>
<td align="left">54.7</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">1333</td>
<td align="left">45.3</td>
</tr>
<tr>
<td align="left" rowspan="2">Stomachache</td>
<td align="left">No</td>
<td align="left">2926</td>
<td align="left">99.4</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">17</td>
<td align="left">0.6</td>
</tr>
<tr>
<td align="left" rowspan="2">Chest pain</td>
<td align="left">No</td>
<td align="left">2881</td>
<td align="left">97.9</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">62</td>
<td align="left">2.1</td>
</tr>
<tr>
<td align="left" rowspan="2">Loss of consciousness</td>
<td align="left">No</td>
<td align="left">2919</td>
<td align="left">99.2</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">24</td>
<td align="left">0.8</td>
</tr>
<tr>
<td align="left" rowspan="2">Sore throat</td>
<td align="left">No</td>
<td align="left">2837</td>
<td align="left">96.4</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">106</td>
<td align="left">3.6</td>
</tr>
<tr>
<td align="left" rowspan="2">Cough</td>
<td align="left">No</td>
<td align="left">1173</td>
<td align="left">39.9</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">1770</td>
<td align="left">60.1</td>
</tr>
<tr>
<td align="left" rowspan="2">Headache</td>
<td align="left">No</td>
<td align="left">2425</td>
<td align="left">82.4</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">518</td>
<td align="left">17.6</td>
</tr>
<tr>
<td align="left" rowspan="2">Blood pressure</td>
<td align="left">Normal</td>
<td align="left">2776</td>
<td align="left">94.9</td>
</tr>
<tr>
<td align="left">Abnormal</td>
<td align="left">149</td>
<td align="left">5.1</td>
</tr>
<tr>
<td align="left" rowspan="2">Oxygen therapy</td>
<td align="left">No</td>
<td align="left">123</td>
<td align="left">4.2</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">2820</td>
<td align="left">95.8</td>
</tr>
<tr>
<td align="left" rowspan="2">Mechanical ventilation</td>
<td align="left">No</td>
<td align="left">2365</td>
<td align="left">80.4</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">578</td>
<td align="left">19.6</td>
</tr>
<tr>
<td align="left" rowspan="2">Oseltamivir</td>
<td align="left">No</td>
<td align="left">2680</td>
<td align="left">91.1</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">263</td>
<td align="left">8.9</td>
</tr>
<tr>
<td align="left" rowspan="2">Azithromycin</td>
<td align="left">No</td>
<td align="left">1725</td>
<td align="left">58.6</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">1218</td>
<td align="left">41.4</td>
</tr>
<tr>
<td align="left" rowspan="2">Kaltura</td>
<td align="left">No</td>
<td align="left">1217</td>
<td align="left">41.4</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">1726</td>
<td align="left">58.6</td>
</tr>
<tr>
<td align="left" rowspan="2">Hydroxychloroquine</td>
<td align="left">No</td>
<td align="left">1490</td>
<td align="left">50.6</td>
</tr>
<tr>
<td align="left">Yes</td>
<td align="left">1453</td>
<td align="left">49.4</td>
</tr>
<tr>
<td align="left" colspan="2">Variable</td>
<td align="left">Mean</td>
<td align="left">SD</td>
</tr>
<tr>
<td align="left" colspan="2">Age</td>
<td align="left">60.2</td>
<td align="left">17.11</td>
</tr>
<tr>
<td align="left" colspan="2">BMI</td>
<td align="left">26.64</td>
<td align="left">4.49</td>
</tr>
<tr>
<td align="left" colspan="2">Initial body temperature</td>
<td align="left">36.91</td>
<td align="left">1.72</td>
</tr>
<tr>
<td align="left" colspan="2">Initial heart beat</td>
<td align="left">92.17</td>
<td align="left">15.44</td>
</tr>
<tr>
<td align="left" colspan="2">SPO2</td>
<td align="left">84.41</td>
<td align="left">10.89</td>
</tr>
<tr>
<td align="left" colspan="2">ESR</td>
<td align="left">43.17</td>
<td align="left">29.20</td>
</tr>
<tr>
<td align="left" colspan="2">LDH</td>
<td align="left">597.00</td>
<td align="left">276.13</td>
</tr>
<tr>
<td align="left" colspan="2">BS</td>
<td align="left">146.87</td>
<td align="left">74.10</td>
</tr>
<tr>
<td align="left" colspan="2">Hb</td>
<td align="left">13.86</td>
<td align="left">2.13</td>
</tr>
<tr>
<td align="left" colspan="2">LYM</td>
<td align="left">22.52</td>
<td align="left">11.68</td>
</tr>
</tbody>
</table>
<table-wrap-foot><fn id="tfn3_1"><p>Note: BMI: body mass index; SPO2: saturation of peripheral oxygen; ESR: erythrocyte sedimentation rate; LDH: lactate dehydrogenase; BS: blood sugar; Hb: hemoglobin; LYM: lymphocyte count.</p></fn></table-wrap-foot></table-wrap>
</app>
</app-group>
</back>
</article>




