<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xml:lang="en" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">CSSE</journal-id>
<journal-id journal-id-type="nlm-ta">CSSE</journal-id>
<journal-id journal-id-type="publisher-id">CSSE</journal-id>
<journal-title-group>
<journal-title>Computer Systems Science &#x0026; Engineering</journal-title>
</journal-title-group>
<issn pub-type="ppub">0267-6192</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">34910</article-id>
<article-id pub-id-type="doi">10.32604/csse.2023.034910</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>BS-SC Model: A Novel Method for Predicting Child Abuse Using Borderline-SMOTE Enabled Stacking Classifier</article-title><alt-title alt-title-type="left-running-head">BS-SC Model: A Novel Method for Predicting Child Abuse Using Borderline-SMOTE Enabled Stacking Classifier</alt-title><alt-title alt-title-type="right-running-head">BS-SC Model: A Novel Method for Predicting Child Abuse Using Borderline-SMOTE Enabled Stacking Classifier</alt-title>
</title-group>
<contrib-group>
<contrib id="author-1" contrib-type="author">
<name name-style="western"><surname>Parthasarathy</surname><given-names>Saravanan</given-names></name>
</contrib>
<contrib id="author-2" contrib-type="author" corresp="yes">
<name name-style="western"><surname>Lakshminarayanan</surname><given-names>Arun Raj</given-names></name><email>arunraj@crescent.education</email>
</contrib>
<aff id="aff-1"><institution>B. S. Abdur Rahman Crescent Institute of Science and Technology</institution>, <addr-line>GST Road, Vandalur, Chennai, 600048, Tamil Nadu</addr-line>, <country>India</country></aff>
</contrib-group><author-notes><corresp id="cor1"><label>&#x002A;</label>Corresponding Author: Arun Raj Lakshminarayanan. Email: <email>arunraj@crescent.education</email></corresp></author-notes>
<pub-date date-type="collection" publication-format="electronic">
<year>2023</year></pub-date>
<pub-date date-type="pub" publication-format="electronic"><day>6</day>
<month>2</month>
<year>2023</year></pub-date>
<volume>46</volume>
<issue>2</issue>
<fpage>1311</fpage>
<lpage>1336</lpage>
<history>
<date date-type="received"><day>01</day><month>8</month><year>2022</year></date>
<date date-type="accepted"><day>22</day><month>11</month><year>2022</year></date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2023 Parthasarathy and Lakshminarayanan</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Parthasarathy and Lakshminarayanan</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_CSSE_34910.pdf"></self-uri>
<abstract>
<p>For a long time, legal entities have developed and used crime prediction methodologies. The techniques are frequently updated based on crime evaluations and responses from scientific communities. There is a need to develop type-based crime prediction methodologies that can be used to address issues at the subgroup level. Child maltreatment is not adequately addressed because children are voiceless. As a result, the possibility of developing a model for predicting child abuse was investigated in this study. Various exploratory analysis methods were used to examine the city of Chicago&#x2019;s child abuse events. The data set was balanced using the Borderline-SMOTE technique, and then a stacking classifier was employed to ensemble multiple algorithms to predict various types of child abuse. The proposed approach successfully predicted crime types with 93&#x0025; of accuracy, precision, recall, and <italic>F1</italic>-Score. The AUC value of the same was 0.989. However, when compared to the Extra Trees model (17.55), which is the second best, the proposed model&#x2019;s execution time was significantly longer (476.63). We discovered that Machine Learning methods effectively evaluate the demographic and spatial-temporal characteristics of the crimes and predict the occurrences of various subtypes of child abuse. The results indicated that the proposed Borderline-SMOTE enabled Stacking Classifier model (BS-SC Model) would be effective in the real-time child abuse prediction and prevention process.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>Child abuse</kwd>
<kwd>sexual offending</kwd>
<kwd>decision-making</kwd>
<kwd>machine learning</kwd>
<kwd>stacking classifier</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<label>1</label>
<title>Introduction</title>
<p>Humanity is pondering the potential of expanding its presence throughout the universe. Humans are being developed as the first multiplanetary species, according to scientists and technologists. Children are the only ones who can keep humanity alive no matter where we go. As a result, it is the obligation of society to safeguard their safety and well-being. According to the WHO, children are the most vulnerable group when it comes to domestic and sexual abuse [<xref ref-type="bibr" rid="ref-1">1</xref>]. Child abuse includes a variety of behaviors such as corporal punishment, physical misconduct, emotional violations, and sexual exploitation. Child abuse can be perpetrated on the victim by a known or unknown individual. According to a report by the Children&#x2019;s Bureau of the United States, 618000 children would be subjected to child abuse by 2020 [<xref ref-type="bibr" rid="ref-2">2</xref>]. In comparison to boys (7.9 per 1,000), girl children are more prone to abuse (8.9 per 1,000). Boys, on the other hand, have a greater mortality rate (2.99 per 100,000) than females who have been abused or neglected (2.05 per 100,000). In many situations (77.2&#x0025;), parents are the major perpetrators, and astonishingly, females are more aggressive (52&#x0025;) toward children than males (47.1&#x0025;). It suggested that living at home might not be the safest option.</p>
<p>For the rest of their lives, child abuse victims will struggle with physical and/or mental trauma. Child abuse is a traumatic occurrence that can lead to eating disorders, unwanted pregnancies, HIV infection, heart illness, physical disabilities, learning difficulties, anxiety, despair, and relationship problems in the victim [<xref ref-type="bibr" rid="ref-3">3</xref>&#x2013;<xref ref-type="bibr" rid="ref-5">5</xref>]. Adverse childhood experiences make people more sensitive to drug usage, severe menopausal symptoms, and cognitive problems later in life [<xref ref-type="bibr" rid="ref-6">6</xref>&#x2013;<xref ref-type="bibr" rid="ref-8">8</xref>]. The victim&#x2019;s sexual behavior in middle age was hampered by childhood mistreatment. Childhood hardship may also influence the pain and pleasure experienced during sexual intercourse [<xref ref-type="bibr" rid="ref-9">9</xref>]. Along with physical and neurological issues, the victims face difficulties with executive processes such as emotional control and decision-making [<xref ref-type="bibr" rid="ref-10">10</xref>]. Furthermore, victims of child sexual abuse are more likely to become perpetrators later in life [<xref ref-type="bibr" rid="ref-11">11</xref>]. As a result, governments and international child welfare groups are working to eliminate crimes against children by implementing effective countermeasures.</p>
<p>In the process of resolving real-world challenges, technical improvements are extremely beneficial. Child abuse is rampant all around the world, regardless of the countries&#x2019; economic, social, or religious backgrounds. Child abuse is linked to socio-demographic factors such as victim and offender details, financial position, geographic information, and legal delineations, much like any other crime. The data management and analysis processes are becoming more sophisticated as the amount of data grows. The challenges of data storage, retrieval, and analysis can all be addressed with the help of emerging big data technologies. The progress paved the way for the creation of a global repository of child abuse materials [<xref ref-type="bibr" rid="ref-12">12</xref>]. Artificial Intelligence (AI) has similarly cleared the door for the development of rigorous predictive policing frameworks. The AI-enabled smart police expert systems link different crime-related attributes and yielded better results [<xref ref-type="bibr" rid="ref-13">13</xref>].</p>
<p>In 2020, a ten-year-old Chicago schoolgirl was abused in the Grand hotel by a middle-aged man [<xref ref-type="bibr" rid="ref-14">14</xref>]. During the inquiry, the authorities discovered she had been sexually assaulted for the past three years. Even though the school and motel had made an official report, the perpetrators were not caught by law enforcement. Instead, the victim was transported to a mental institution and held there for more than a month. As an excuse to keep her in the hospital for a prolonged amount of time, the Illinois Department of Children and Family Services notified the court that there was no place for her to reside. The legal authorities were required to act against the abusers while the victim was being mistreated for the first time. The legal entities&#x2019; prompt steps may have rescued the girl from further abuse. The victim was subjected to retaliate as a result of the government&#x2019;s continuous operational ineptitude. Artificial intelligence-based technologies could have helped predict child abuse and the probability of recidivism in such instances. We were inspired by the above incident and decided to conduct a study on child maltreatment in Chicago.</p>
<p>The remaining sections of the paper are as follows: Section 2 contains the literature review. The research gap and dataset description are discussed in Section 3. Section 4 explains the planned approach as well as the findings of this investigation. Section 5 contains the conclusion and recommendations for future research.</p>
</sec>
<sec id="s2">
<label>2</label>
<title>Related Work</title>
<p>The term &#x201C;child maltreatment&#x201D; refers to any inappropriate behavior by an adult toward a child. The most common types of maltreatment are physical abuse, mental abuse, sexual abuse, neglect, induced disease, and social abuse [<xref ref-type="bibr" rid="ref-15">15</xref>]. The activities typically create bodily, mental, and sexual suffering in the youngster. Child abuse can occur in both the real and virtual worlds. On the internet, there are also adult-child sex advocacy websites that defend this perversion as a normal activity [<xref ref-type="bibr" rid="ref-16">16</xref>]. The breeding grounds for child sexual abuse are child sex pornographic websites, printed magazines, and adult-child sex advocacy forums. Individuals&#x2019; socioeconomic and psychological backgrounds also play a significant effect in child abuse. Crime classification approaches would be effective in developing countermeasures since the severity of child maltreatment varies from case to case [<xref ref-type="bibr" rid="ref-17">17</xref>].</p>
<p>Russell [<xref ref-type="bibr" rid="ref-18">18</xref>] discussed the limitations and opportunities of employing technological breakthroughs to predict child maltreatment. When a child is abused, child welfare groups ask who, how, and why. Though predictive algorithms are useful in predicting child abuse, they necessitate a sufficient amount of high-quality data. An effective analytics tool, according to the author, could be beneficial in social services. Gillingham praised the use of block box machine learning algorithms in social issues [<xref ref-type="bibr" rid="ref-19">19</xref>].</p>
<p>In New Zealand, Vaithianathan et al. developed a Predictive Risk Model (PRM) to forecast the likelihood of child abuse [<xref ref-type="bibr" rid="ref-20">20</xref>]. The model identified the children who may have been harmed as a result of the abuse. Those who were identified as being at risk were eventually enrolled in the public assistance system. It demonstrated the PRM&#x2019;s effectiveness. Cherian et al. investigated the crimes that occurred in San Francisco and classified them [<xref ref-type="bibr" rid="ref-21">21</xref>]. The Random Forest model produced 84.68&#x0025; and 29.31&#x0025; accuracy in training and testing data, respectively. There was no doubt that it was an issue of overfitting. As a response, the demographic characteristics of the incident were included, and the models were revalued. The proposed approach achieved 31.84&#x0025; in testing data and the training accuracy was reduced. Wilson et al. [<xref ref-type="bibr" rid="ref-22">22</xref>] proposed a predictive risk modelling strategy for predicting child abuse and neglect in New Zealand. They analyzed thirteen years of data gathered by the family protection agency. Partial Least Square and Multi-Level Model were identified as the best performers after comparing the results of twelve state-of-the-art algorithms. Both have an AUR rating of 88&#x0025;, indicating that they might be utilized to make real-time predictions. The authors concluded, however, that the PRMs could only be employed as support units for traditional human judgement. Vaithianathan et al. suggested a Random Forest-based PRM model based on data from Allegheny County&#x2019;s Child Protective Services (CPS) and General Protective Services (GPS) [<xref ref-type="bibr" rid="ref-23">23</xref>].</p>
<p>Horikawa et al. used the age of the kid, the age of the perpetrator, the history of maltreatment, and the financial background to predict the recurrence of child abuse. It assured that demographic data played a significant effect in the occurrence of crimes [<xref ref-type="bibr" rid="ref-24">24</xref>]. Injuries are one of the signs that a child has been abused. To assess the risk of child maltreatment, data from child protection and hospitals were matched [<xref ref-type="bibr" rid="ref-25">25</xref>]. The technique projected a 2 out of 100 chances of abuse-related harm among high-risk children. The same goes for the low-risk youngsters, who are predicted to have a risk factor of 0.2 out of 100. The suggested approach would be obligated to detect the risk associated with foster care families. Su et al. used hospital data to conduct a cross-functional analysis to predict the suicide risk among children and adolescents [<xref ref-type="bibr" rid="ref-26">26</xref>]. With an AUC of 0.84, the L1 Logistic regression models predicted both short and long-term suicidal tendencies. It confirmed the importance of hospital records in the development of predictive models. Walsh et al. developed a logistic regression model to estimate Adverse Childhood Experiences [<xref ref-type="bibr" rid="ref-27">27</xref>]. The relationship between the mother and her spouse, financial standing, community, and the health of the parents were all considered while making the prediction. With an AUC of 0.76, the model predicted the children who would be affected by adverse events.</p>
<p>Child abuse is common among children aged one to ten, according to data collected by Wongcharoenwatana et al. [<xref ref-type="bibr" rid="ref-28">28</xref>]. In most cases, the biological parents are the perpetrators, which is an unacceptable truth. Recurrent child maltreatment or recidivism should be eliminated since the child would lose hope in humanity. Using the demographic information of the children, the suggested logistic regression model successfully predicted child abuse. By studying hospital records, birth and death records, and child protection details, Putnam-hornstein et al. was able to discover child maltreatment [<xref ref-type="bibr" rid="ref-29">29</xref>]. According to the findings, data from related domains might improve the accuracy of child maltreatment prediction. Intentional injuries signify that a child has been maltreated or has a suicidal tendency. Yin et al. evaluated various Machine Learning models by utilizing data from the Chinese National Injury Surveillance System (NISS). Deep Neural Networks and AdaBoost models were more successful than the others in classifying injuries [<xref ref-type="bibr" rid="ref-30">30</xref>].</p>
<p>Child sexual abuse is more difficult to identify and analyze because there are no outward traces in many cases. By examining self-figure drawings, Kissos et al. suggested a novel method for performing the task [<xref ref-type="bibr" rid="ref-31">31</xref>]. The abuses were predicted with 70&#x0025; accuracy using a CNN-based image classification model. Tsai et al. used image classification techniques to identify classic metaphyseal lesion (CML) injuries [<xref ref-type="bibr" rid="ref-32">32</xref>]. The residual Neural Network algorithm was used to evaluate the radiographic images of the newborns, and it categorized the injuries with 93&#x0025;. The proposed methodology could be useful in identifying infants who have been vulnerable to physical abuse. Kim et al. examined data from the Korean Child Protection Service and compared victim, offender, and family background characteristics. Aside from severe repercussions for perpetrators, evidence-based intervention would be the best strategy for reducing future abuses [<xref ref-type="bibr" rid="ref-33">33</xref>]. Problematic childhood experiences were caused by certain societal characteristics of the families. Authorities and organizations need to use data analysis to identify those families. Early action could reduce the number of unlawful incidents [<xref ref-type="bibr" rid="ref-34">34</xref>].</p>
<p>One of the issues in child maltreatment research is data availability. Despite the establishment of venues such as the ISPCAN Working Group, data quantity, quality, and availability remain a problem [<xref ref-type="bibr" rid="ref-35">35</xref>]. Governments throughout the world should have a policy in place for collecting and sharing information about child maltreatment. Creating a centralized data pool, standardizing rules, cross-validating data, and evaluating the proposed model with multiple datasets could help to assure the efficiency of predictive risk models. The researchers should always be cognizant of the negative consequences that false positive predictions could have [<xref ref-type="bibr" rid="ref-36">36</xref>]. Hence, the model should be appraised with balanced training data to reduce the predictive bios [<xref ref-type="bibr" rid="ref-37">37</xref>]. PRM, like every progressive initiative, is confronted with ethical challenges [<xref ref-type="bibr" rid="ref-38">38</xref>]. PRM, on the other hand, has been shown to be effective in investigations and to produce superior results than existing approaches. The PRM models thrive in social corrective activities because of their practicality [<xref ref-type="bibr" rid="ref-39">39</xref>]. The PRM&#x2019;s ethical challenges would be reduced if birth matching and algorithmic decision-making were transparent. The automated system for detecting child maltreatment should be held accountable for the stated decision [<xref ref-type="bibr" rid="ref-40">40</xref>]. Machine Learning models could only be acknowledged as a practice if they were human centrically designed. Researchers should guarantee that the strategy is transparent and that the solutions are accountable. Though the ultimate result is essential, the tools should never have unintentional or intended negative consequences [<xref ref-type="bibr" rid="ref-41">41</xref>].</p>
<p>Crimes are linked to multiple community nodes, which generate terabytes of data every day. Due to the dimensionality of crime data, the prediction process is more difficult. As a result, Machine Learning models were used to predict different sorts of crimes [<xref ref-type="bibr" rid="ref-42">42</xref>,<xref ref-type="bibr" rid="ref-43">43</xref>]. Since child abuse is considered a crime, legal entities are constantly monitoring society and attempting to address the problem through countermeasures. However, different approaches must be taken depending on the nature of the offence. We should view child abuse as a distinct class of crimes because of the victims&#x2019; lack of voice, the impact of the crimes on individuals and society, and the lesser likelihood of reporting. Addition of analyzing demographic information, medical records, and financial information, there is a need for scrutinizing the crime reports. By constructing a dedicated Machine Learning model for child abuse prediction, we could overcome the challenges that arise with it. We attempted to build a model that could predict child abuse by evaluating criminal data in this study.</p>
</sec>
<sec id="s3">
<label>3</label>
<title>Data Collection and Preprocessing</title>
<p><xref ref-type="fig" rid="fig-1">Fig. 1</xref> illustrates the process flow for the data set preparation. The crime statistics were obtained from Chicago&#x2019;s open data portal [<xref ref-type="bibr" rid="ref-44">44</xref>]. It includes information on crimes committed between 2001 to the present. ID, Case Number, Date, Block, IUCR, Primary Type, Description, Location Description, Arrest, Domestic, Beat, District, Ward, Community Area, FBI Code, X Coordinate, Y Coordinate, Year, Updated On, Latitude, Longitude, and Location information were all plotted for each occurrence. The dataset comprises 74,76,832 offences over the chosen time period. The incidents that occurred between January 2001 and December 2021 were then filtered. We chose the data under the primary type of &#x2018;offence involving children&#x2019;. We considered the crimes labelled as &#x2018;criminal sexual abuse by a family member&#x2019;, &#x2018;sexual assault of a child by a family member&#x2019;, &#x2018;aggravated sexual assault of a child by a family member&#x2019;, &#x2018;aggravated criminal sexual abuse by a family member&#x2019;, &#x2018;child abuse&#x2019;, and &#x2018;child pornography&#x2019; which were very severe in nature. Illinois law defines the aforementioned categories as those including physical, emotional, and sexual abuse of children by known or unknown offenders. <xref ref-type="table" rid="table-1">Table 1</xref> depicts the specifics of the offence, offender, and type.</p>
<fig id="fig-1">
<label>Figure 1</label>
<caption>
<title>Phase I&#x2013;data set preparation and exploratory analysis&#x2013;flow diagram</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-1.tif"/>
</fig><table-wrap id="table-1"><label>Table 1</label>
<caption>
<title>Crime&#x2013;subtypes <italic>vs.</italic> perpetrator <italic>vs.</italic> type</title></caption>
<table><colgroup><col align="left"/><col align="left"/><col align="left"/>
</colgroup>
<thead>
<tr>
<th align="left">Crime&#x2013;subtypes</th>
<th align="left">Perpetrator</th>
<th align="left">Type</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">1751&#x2013;Criminal sex abuse by family member</td>
<td align="left">Family member</td>
<td align="left">Sexual</td>
</tr>
<tr>
<td align="left">1753&#x2013;Sex assault of child by family member</td>
<td align="left">Family member</td>
<td align="left">Sexual</td>
</tr>
<tr>
<td align="left">1754&#x2013;Aggravated sex assault of child family member</td>
<td align="left">Family member</td>
<td align="left">Sexual</td>
</tr>
<tr>
<td align="left">1752&#x2013;Aggravated criminal sex abuse family member</td>
<td align="left">Family member</td>
<td align="left">Sexual</td>
</tr>
<tr>
<td align="left">1750&#x2013;Child abuse</td>
<td align="left">Known/unknown person</td>
<td align="left">Physical, emotional, sexual</td>
</tr>
<tr>
<td align="left">1582&#x2013;Child pornography</td>
<td align="left">Known/unknown person</td>
<td align="left">Sexual</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The following columns were determined as less important and were removed: ID, Case Number, Primary Type, X Coordinate, Y Coordinate, Ward, Community Area, Updated On, Description, and Location. The FBI Code was also deleted from the data set because it represents the sort of crime in a tangential way. The crime incidences were only identified with block level information to protect the victims&#x2019; privacy. To generalize the data, the numerical portion of the block was eliminated. The description of the location was renamed to &#x2018;Location,&#x2019; and the null value was replaced with the mode value. The Date column was separated into four columns: Year, Month, Day, and Time. The &#x2018;Geopy&#x2019; package was used to replace the 3095 null values in the Latitude and Longitude columns by referencing the block name. The dataset had 1759 Blocks and 93 unique Locations that were replaced with unique numeric values. Each of the six main crime subtypes in the IUCR Code has been replaced by a numerical value ranging from one to six.</p>
<p>&#x2018;Abuses Against Children&#x2014;Data set for Exploratory Analysis&#x2019; (AAC-DEA) is the processed dataset, which contains 29010 rows of information on Location, Block, Beat, District, Arrest, Domestic, Latitude, Longitude, Year, Month, Day, Time, and IUCR (Refer to <xref ref-type="table" rid="table-2">Table 2</xref>). This data set was used for the exploratory analysis. Because Arrest and Domestic are post-occurrence attributes, they were deleted from the data set in order to continue. The finalized &#x2018;Abuses Against Children&#x2014;Data set for Prediction (AAC-DP)&#x2019; with Location, Block, Beat, District, Latitude, Longitude, Year, Month, Day, Time, and IUCR information was ready at the end of phase one.</p>
<table-wrap id="table-2"><label>Table 2</label>
<caption>
<title>AAC data set description</title></caption>
<table><colgroup><col align="left"/><col align="left"/><col align="left"/>
</colgroup>
<thead>
<tr>
<th align="left">Column name</th>
<th align="left">Description of column</th>
<th align="left">Data type</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Location</td>
<td align="left">Place of crime occurrence</td>
<td align="left">Integer</td>
</tr>
<tr>
<td align="left">Block</td>
<td align="left">Block of crime occurrence</td>
<td align="left">Integer</td>
</tr>
<tr>
<td align="left">Beat</td>
<td align="left">Beat of crime occurrence</td>
<td align="left">Integer</td>
</tr>
<tr>
<td align="left">District</td>
<td align="left">District of crime occurrence</td>
<td align="left">Integer</td>
</tr>
<tr>
<td align="left">Domestic&#x002A;</td>
<td align="left">Whether the crime is under domestic abuse or not</td>
<td align="left">Binary</td>
</tr>
<tr>
<td align="left">Arrest&#x002A;</td>
<td align="left">Whether the abuser got arrested or not</td>
<td align="left">Binary</td>
</tr>
<tr>
<td align="left">Latitude</td>
<td align="left">Latitude of crime occurrence</td>
<td align="left">Float</td>
</tr>
<tr>
<td align="left">Longitude</td>
<td align="left">Longitude of crime occurrence</td>
<td align="left">Float</td>
</tr>
<tr>
<td align="left">Year</td>
<td align="left">Year of crime occurrence</td>
<td align="left">Integer</td>
</tr>
<tr>
<td align="left">Month</td>
<td align="left">Month of crime occurrence</td>
<td align="left">Integer</td>
</tr>
<tr>
<td align="left">Day</td>
<td align="left">Day of crime occurrence</td>
<td align="left">Integer</td>
</tr>
<tr>
<td align="left">Time</td>
<td align="left">Time of crime occurrence</td>
<td align="left">Integer</td>
</tr>
<tr>
<td align="left">IUCR</td>
<td align="left"/>
<td align="left"/>
</tr>
<tr>
<td align="left">(crime sub-types)</td>
<td align="left">Illinois Uniform Crime Reporting (IUCR) codes 1751&#x2013;Criminal sex abuse by family member (1) 1753&#x2013;Sex assault of child by family member (2) 1754&#x2013;Aggravated sex assault of child family member (3) 1752&#x2013;Aggravated criminal sex abuse family member (4) 1750&#x2013;Child abuse (5) 1582&#x2013;Child pornography (6)</td>
<td align="left">Integer (Range 1 to 6)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn2_1">
<p>Note: &#x002A;&#x2013;Removed from AAC-DP.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="s4">
<label>4</label>
<title>Exploratory Analysis</title>
<p>According to <xref ref-type="fig" rid="fig-2">Fig. 2</xref>, the number of Abuses Against Children (AAC) peaked between 2003 and 2005. Then it started to fall until 2012 when it started to rise again from 2013 to 2019. The number of crimes against children has decreased in the last two years. The lowest number of incidents was recorded in 2021. According to <xref ref-type="fig" rid="fig-3">Fig. 3</xref>, the highest number of offences against children were committed in January, with the lowest in July and August. <xref ref-type="fig" rid="fig-4">Fig. 4</xref> depicts the AAC day-by-day trend, which shows that AAC incidents peaked on the first, tenth, fifteenth, and twentieth days of each month. The month ended on a low note, with only a few occurrences recorded on the 31st. The AAC events that occurred at various times are depicted in <xref ref-type="fig" rid="fig-5">Fig. 5</xref>. The majority of the AAC activities took place during the night. The frequency of occurrences increased again in the middle of the day and later in the afternoon. There was a major crest of events at eight o&#x2019;clock in the morning.</p>
<fig id="fig-2">
<label>Figure 2</label>
<caption>
<title>AAC&#x2013;year wise</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-2.tif"/>
</fig><fig id="fig-3">
<label>Figure 3</label>
<caption>
<title>AAC&#x2013;month wise</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-3.tif"/>
</fig><fig id="fig-4">
<label>Figure 4</label>
<caption>
<title>AAC&#x2013;day wise</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-4.tif"/>
</fig><fig id="fig-5">
<label>Figure 5</label>
<caption>
<title>AAC&#x2013;time wise</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-5.tif"/>
</fig>
<p>As shown in <xref ref-type="fig" rid="fig-6">Fig. 6</xref>, child abuse was the most common occurrence among AAC events. Aggravated sexual assaults by family members were the second most common incident from 2001 to 2007. Following that, aggravated criminal sexual abuse by family members received the second highest ranking. Despite a decrease in child abuse in 2020, it increased in 2021. AAC events and their locations should be correlated in order to identify vulnerabilities. Most of these assaults occurred in homes, apartments, and schools (Refer to <xref ref-type="fig" rid="fig-7">Fig. 7</xref>).</p>
<fig id="fig-6">
<label>Figure 6</label>
<caption>
<title>AAC&#x2013;year <italic>vs.</italic> crime subtypes</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-6.tif"/>
</fig><fig id="fig-7">
<label>Figure 7</label>
<caption>
<title>AAC&#x2013;top 10 locations</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-7.tif"/>
</fig>
<p>According to <xref ref-type="fig" rid="fig-8">Figs. 8</xref> and <xref ref-type="fig" rid="fig-9">9</xref>, children are especially vulnerable in police districts 25, 8, and 4, as well as beats 423, 421, and 823. Along South Damen Avenue, South DR Martin Luther King Jr Drive, and South Michigan Avenue, the AAC has been designated as a vulnerable location (Refer to <xref ref-type="fig" rid="fig-10">Fig. 10</xref>). <xref ref-type="fig" rid="fig-11">Fig. 11</xref> depicts the intensity of crime in Chicago. The incident map shows the number of crimes committed at the block level (Refer to <xref ref-type="fig" rid="fig-12">Fig. 12</xref>).</p>
<fig id="fig-8">
<label>Figure 8</label>
<caption>
<title>AAC&#x2013;districts</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-8.tif"/>
</fig><fig id="fig-9">
<label>Figure 9</label>
<caption>
<title>AAC&#x2013;top 25 beats</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-9.tif"/>
</fig><fig id="fig-10">
<label>Figure 10</label>
<caption>
<title>AAC&#x2013;top 25 blocks</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-10.tif"/>
</fig><fig id="fig-11">
<label>Figure 11</label>
<caption>
<title>Heat map&#x2013;AAC in Chicago</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-11.tif"/>
</fig><fig id="fig-12">
<label>Figure 12</label>
<caption>
<title>Incident map&#x2013;AAC in Chicago</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-12.tif"/>
</fig>
<p>The most common types of crimes against children in Chicago are child pornography (4.7&#x0025;), sexual assault of a child by a family member (6&#x0025;), criminal sexual abuse by a family member (8.4&#x0025;), aggravated sexual assault of a child by a family member (12.7&#x0025;), aggravated criminal sexual abuse by a family member (14.9&#x0025;), and child abuse (53.3&#x0025;) (Refer to <xref ref-type="fig" rid="fig-13">Fig. 13</xref>). According to <xref ref-type="fig" rid="fig-14">Figs. 14</xref> and <xref ref-type="fig" rid="fig-15">15</xref>, domestic violence (54&#x0025;) is more common than non-domestic offences (46&#x0025;), and such heinous crimes resulted in far fewer arrests (22.4&#x0025;). Child abuse could have been committed by an unknown individual, resulting in fewer arrests (Refer to <xref ref-type="fig" rid="fig-16">Fig. 16</xref>). However, the proportion of family members arrested for sexual crimes against children is also low.</p>
<fig id="fig-13">
<label>Figure 13</label>
<caption>
<title>AAC&#x2013;types of crimes</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-13.tif"/>
</fig><fig id="fig-14">
<label>Figure 14</label>
<caption>
<title>AAC&#x2013;domestic <italic>vs.</italic> non-domestic</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-14.tif"/>
</fig><fig id="fig-15">
<label>Figure 15</label>
<caption>
<title>AAC&#x2013;arrest status</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-15.tif"/>
</fig><fig id="fig-16">
<label>Figure 16</label>
<caption>
<title>AAC&#x2013;types of crimes <italic>vs.</italic> arrest</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-16.tif"/>
</fig>
</sec>
<sec id="s5">
<label>5</label>
<title>Proposed Approach and Results</title>
<p><xref ref-type="fig" rid="fig-17">Fig. 17</xref> depicted the visual representation of the proposed methodology. The AAC-DP Crime Data set contains the following features: location, block, beat, district, latitude, longitude, year, month, day, time, and IUCR. The categorized IUCR column was designated as a dependent variable, while the first ten attributes were classified as independent variables. Because the features contain a wide range of information, they were standardized by the Pandas library. In this method, each data point was subtracted from the mean and then divided by the standard deviation, yielding the standardized data set (Refer to <xref ref-type="disp-formula" rid="eqn-1">Eq. (1)</xref>). By standardizing the data set, the features are transformed into a similar scale which reduced the prediction bios.</p>
<p><disp-formula id="eqn-1"><label>(1)</label>
<mml:math id="mml-eqn-1" display="block"><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>n</mml:mi><mml:mi>e</mml:mi><mml:mi>w</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>x</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mrow><mml:mrow><mml:mi mathvariant="normal">&#x03BC;</mml:mi></mml:mrow></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">&#x03C3;</mml:mi></mml:mrow></mml:mrow></mml:mfrac></mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mstyle></mml:math>
</disp-formula></p>
<p>x<sub>new</sub>&#x2013;&#x003E; Standardized value</p>
<p>x&#x2013;&#x003E; observed value</p>
<p>&#x03BC;&#x2013;&#x003E; mean of the sample</p>
<p>&#x03C3;&#x2013;&#x003E; standard deviation of the sample</p>
<fig id="fig-17">
<label>Figure 17</label>
<caption>
<title>Phase II&#x2013;flow diagram of proposed approach</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-17.tif"/>
</fig>
<p>The data set was then split into two sections: training and testing, with the former accounting for 80&#x0025; of the total. At the end, it had 23208 rows of training data and 5802 rows of testing data. In this paper, well-known state-of-the-art models were used to predict AAC subtypes [<xref ref-type="bibr" rid="ref-45">45</xref>,<xref ref-type="bibr" rid="ref-46">46</xref>]. These models include Naive Bayes, Logistic regression, Random Forest, KNN, Decision Tree, Adaboost, XGBoost, and Extra Trees. Particularly, tree-based models were utilized because they could accommodate multidimensional attributes and observations. They also perform exceptionally well when the data contains both quantitative and categorical variables [<xref ref-type="bibr" rid="ref-47">47</xref>,<xref ref-type="bibr" rid="ref-48">48</xref>].</p>
<p>Accuracy (ACC), Precision, Recall, <italic>F1</italic>-score, and Area Under the Curve (AUC) are commonly used metrics for classification problems [<xref ref-type="bibr" rid="ref-49">49</xref>]. The accuracy of the model depicts its ability to perform the entire classification task. Precision is defined as the proportion of correctly predicted positive observations to all predicted positive observations. The ratio of accurately predicted positive observations to all observations in the actual class is referred to as recall. The <italic>F1</italic>-Score is calculated as the weighted average of Precision and Recall. The Area Under the Curve (AUC) is a measure of a classifier&#x2019;s ability to distinguish between classes. Elapsed time was also considered to determine the model&#x2019;s responsiveness. These metrics are listed in <xref ref-type="disp-formula" rid="eqn-2">Eqs. (2)</xref> to <xref ref-type="disp-formula" rid="eqn-7">(7)</xref>.</p>
<p><disp-formula id="eqn-2"><label>(2)</label>
<mml:math id="mml-eqn-2" display="block"><mml:mi>A</mml:mi><mml:mi>c</mml:mi><mml:mi>c</mml:mi><mml:mi>u</mml:mi><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>c</mml:mi><mml:mi>y</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mi>u</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>P</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi><mml:mo>+</mml:mo><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mi>u</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>N</mml:mi><mml:mi>e</mml:mi><mml:mi>g</mml:mi><mml:mi>e</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mi>u</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>P</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi><mml:mi>s</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>P</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi><mml:mo>+</mml:mo><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mi>u</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>N</mml:mi><mml:mi>e</mml:mi><mml:mi>g</mml:mi><mml:mi>e</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi><mml:mi>s</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>N</mml:mi><mml:mi>e</mml:mi><mml:mi>g</mml:mi><mml:mi>e</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:mfrac></mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mstyle></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-3"><label>(3)</label>
<mml:math id="mml-eqn-3" display="block"><mml:mtable columnalign="right left" rowspacing=".5em" columnspacing="thickmathspace" displaystyle="true"><mml:mtr><mml:mtd><mml:mspace width="1em" /><mml:mi>P</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>i</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mi>u</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>P</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mi>u</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>P</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi><mml:mi>s</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>P</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:mfrac></mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mstyle></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mspace width="1em" /></mml:mtd></mml:mtr></mml:mtable></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-4"><label>(4)</label>
<mml:math id="mml-eqn-4" display="block"><mml:mi>R</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi><mml:mi>l</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mi>u</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>P</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mi>u</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>P</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi><mml:mi>s</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>N</mml:mi><mml:mi>e</mml:mi><mml:mi>g</mml:mi><mml:mi>e</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:mfrac></mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mstyle></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-5"><label>(5)</label>
<mml:math id="mml-eqn-5" display="block"><mml:mi>F</mml:mi><mml:mn>1</mml:mn><mml:mo>&#x2212;</mml:mo><mml:mi>S</mml:mi><mml:mi>c</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mo>=</mml:mo><mml:mn>2</mml:mn><mml:mo>&#x2217;</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>P</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>i</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mo>&#x2217;</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mi>R</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi><mml:mi>l</mml:mi></mml:mrow><mml:mrow><mml:mi>P</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>i</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mi>R</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:mfrac></mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mstyle></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-6"><label>(6)</label>
<mml:math id="mml-eqn-6" display="block"><mml:mi>A</mml:mi><mml:mi>U</mml:mi><mml:mi>C</mml:mi><mml:mo>=</mml:mo><mml:munderover><mml:mrow><mml:mo>&#x222B;</mml:mo></mml:mrow><mml:mn>0</mml:mn><mml:mn>1</mml:mn></mml:munderover><mml:mo>&#x2061;</mml:mo><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mi>R</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:msup><mml:mi>R</mml:mi><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mtext>&#x00A0;</mml:mtext><mml:mrow><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mtext>&#x00A0;</mml:mtext><mml:mi>d</mml:mi><mml:mi>x</mml:mi><mml:mtext>&#x00A0;</mml:mtext></mml:math>
</disp-formula></p>
<p>Where<inline-formula id="ieqn-1">
<mml:math id="mml-ieqn-1"><mml:mtext>&#x00A0;</mml:mtext><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mi>R</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mi>u</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>P</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mi>u</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>P</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi><mml:mi>s</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>N</mml:mi><mml:mi>e</mml:mi><mml:mi>g</mml:mi><mml:mi>e</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:mfrac></mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mstyle></mml:math>
</inline-formula>and <inline-formula id="ieqn-2">
<mml:math id="mml-ieqn-2"><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:mi>R</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true" scriptlevel="0"><mml:mrow><mml:mfrac><mml:mrow><mml:mi>F</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi><mml:mi>s</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>P</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>r</mml:mi><mml:mi>u</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>N</mml:mi><mml:mi>e</mml:mi><mml:mi>g</mml:mi><mml:mi>e</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi><mml:mo>+</mml:mo><mml:mi>F</mml:mi><mml:mi>a</mml:mi><mml:mi>l</mml:mi><mml:mi>s</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>P</mml:mi><mml:mi>o</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>e</mml:mi></mml:mrow></mml:mfrac></mml:mrow></mml:mstyle></mml:math>
</inline-formula><disp-formula id="eqn-7"><label>(7)</label>
<mml:math id="mml-eqn-7" display="block"><mml:mi>E</mml:mi><mml:mi>l</mml:mi><mml:mi>a</mml:mi><mml:mi>p</mml:mi><mml:mi>s</mml:mi><mml:mi>e</mml:mi><mml:mi>d</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>m</mml:mi><mml:mi>e</mml:mi><mml:mo>=</mml:mo><mml:mi>E</mml:mi><mml:mi>n</mml:mi><mml:mi>d</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>m</mml:mi><mml:mi>e</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mo>&#x2212;</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mi>S</mml:mi><mml:mi>t</mml:mi><mml:mi>a</mml:mi><mml:mi>r</mml:mi><mml:mi>t</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>m</mml:mi><mml:mi>e</mml:mi><mml:mo>.</mml:mo><mml:mtext>&#x00A0;</mml:mtext></mml:math>
</disp-formula></p>
<p><xref ref-type="table" rid="table-3">Table 3</xref> depicts the outcomes of the appraised models. XGBoost model outperformed the others, by achieving 59&#x0025; accuracy, 41&#x0025; of precision, 31&#x0025; of recall, and 31&#x0025; of <italic>F1</italic>-Score. It also resulted in a higher AUC (0.6986) and a longer run time (10.43&#x2005;s) when compared to others. Even though the Extra Trees and Random Forest models produced similar results, Extra Trees is much faster than Random Forest. Adaboost, Logistic Regression, GaussianNB, and KNN models predicted crime subtypes moderately. The Decision Tree model underperformed, with a lower accuracy (44&#x0025;). GaussianNB was the fastest model in terms of runtime (0.01&#x2005;s). When compared to the performance of prediction models proposed by other researchers [<xref ref-type="bibr" rid="ref-50">50</xref>,<xref ref-type="bibr" rid="ref-51">51</xref>], the XGBoost model performed mediocrely. The data set is referred to as an imbalanced data set since the number of observations in the target classes varies significantly. The algorithms would assign biased weightage to the same because one class outnumbers the other in terms of quantity. It would influence the algorithms&#x2019; performance, resulting in poor outcomes [<xref ref-type="bibr" rid="ref-52">52</xref>,<xref ref-type="bibr" rid="ref-53">53</xref>].</p>
<table-wrap id="table-3"><label>Table 3</label>
<caption>
<title>Metrics of algorithms</title></caption>
<table><colgroup><col align="left"/><col align="left"/><col align="left"/><col align="left"/><col align="left"/><col align="left"/><col align="left"/>
</colgroup>
<thead>
<tr>
<th align="left">Classifiers</th>
<th align="left">Accuracy</th>
<th align="left">Precision</th>
<th align="left">Recall</th>
<th align="left"><italic>F1</italic>-Score</th>
<th align="left">AUC</th>
<th align="left">Elapsed time</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Decision tree</td>
<td align="left">44</td>
<td align="left">27</td>
<td align="left">28</td>
<td align="left">27</td>
<td align="left">0.5653</td>
<td align="left">0.51</td>
</tr>
<tr>
<td align="left">KNN</td>
<td align="left">51</td>
<td align="left">33</td>
<td align="left">28</td>
<td align="left">29</td>
<td align="left">0.6178</td>
<td align="left">0.25</td>
</tr>
<tr>
<td align="left">Gaussian NB</td>
<td align="left">55</td>
<td align="left">26</td>
<td align="left">25</td>
<td align="left">24</td>
<td align="left">0.6491</td>
<td align="left">0.01</td>
</tr>
<tr>
<td align="left">Logistic regression</td>
<td align="left">56</td>
<td align="left">27</td>
<td align="left">22</td>
<td align="left">20</td>
<td align="left">0.6437</td>
<td align="left">1.66</td>
</tr>
<tr>
<td align="left">AdaBoost</td>
<td align="left">57</td>
<td align="left">35</td>
<td align="left">26</td>
<td align="left">25</td>
<td align="left">0.6835</td>
<td align="left">0.91</td>
</tr>
<tr>
<td align="left">Random forest</td>
<td align="left">58</td>
<td align="left">41</td>
<td align="left">29</td>
<td align="left">31</td>
<td align="left">0.6842</td>
<td align="left">10.41</td>
</tr>
<tr>
<td align="left">Extra trees</td>
<td align="left">58</td>
<td align="left">42</td>
<td align="left">28</td>
<td align="left">30</td>
<td align="left">0.6785</td>
<td align="left">5.5</td>
</tr>
<tr>
<td align="left">XGBoost</td>
<td align="left">59</td>
<td align="left">41</td>
<td align="left">31</td>
<td align="left">31</td>
<td align="left">0.6986</td>
<td align="left">10.43</td>
</tr>
</tbody>
</table>
</table-wrap>
<p><xref ref-type="table" rid="table-4">Table 4</xref> depicts a stark disparity between the target variable classes, indicating that the incidence of child abuse is disproportionately high in comparison to other categories. <xref ref-type="table" rid="table-5">Table 5</xref> shows that the XGBoost model accurately predicted the majority class, but it underperformed when predicting the minority classes. The use of sampling techniques could be a way to solve this problem. Under-sampling would remove a large portion of the original data, which could never be a good solution for sensitive issues like crime prediction. As a result, we addressed the issue using the Borderline Synthetic Minority Oversampling Technique (Borderline SMOTE).</p>
<table-wrap id="table-4"><label>Table 4</label>
<caption>
<title>Classes of target variable and count</title></caption>
<table><colgroup><col align="left"/><col align="left"/><col align="left"/>
</colgroup>
<thead>
<tr>
<th align="left">Offence</th>
<th align="left">Class</th>
<th align="left">Count</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">1751-Criminal sex abuse by family member</td>
<td align="left">1</td>
<td align="left">2442</td>
</tr>
<tr>
<td align="left">1753-Sex assault of child by family member</td>
<td align="left">2</td>
<td align="left">1734</td>
</tr>
<tr>
<td align="left">1754-Aggravated sex assault of child family member</td>
<td align="left">3</td>
<td align="left">3680</td>
</tr>
<tr>
<td align="left">1752-Aggravated criminal sex abuse family member</td>
<td align="left">4</td>
<td align="left">4320</td>
</tr>
<tr>
<td align="left">1750-Child abuse</td>
<td align="left">5</td>
<td align="left">15461</td>
</tr>
<tr>
<td align="left">1582-Child pornography</td>
<td align="left">6</td>
<td align="left">1373</td>
</tr>
</tbody>
</table>
</table-wrap><table-wrap id="table-5"><label>Table 5</label>
<caption>
<title>Class wise prediction by the best performing model&#x2013;XGBoost</title></caption>
<table><colgroup><col align="left"/><col align="left"/><col align="left"/><col align="left"/><col align="left"/>
</colgroup>
<thead>
<tr>
<th align="left">Class</th>
<th align="left">Precision</th>
<th align="left">Recall</th>
<th align="left"><italic>F1</italic>-score</th>
<th align="left">Support</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">1</td>
<td align="left">0.16</td>
<td align="left">0.02</td>
<td align="left">0.04</td>
<td align="left">513</td>
</tr>
<tr>
<td align="left">2</td>
<td align="left">0.21</td>
<td align="left">0.02</td>
<td align="left">0.03</td>
<td align="left">350</td>
</tr>
<tr>
<td align="left">3</td>
<td align="left">0.33</td>
<td align="left">0.18</td>
<td align="left">0.24</td>
<td align="left">737</td>
</tr>
<tr>
<td align="left">4</td>
<td align="left">0.38</td>
<td align="left">0.36</td>
<td align="left">0.37</td>
<td align="left">872</td>
</tr>
<tr>
<td align="left">5</td>
<td align="left">0.65</td>
<td align="left">0.93</td>
<td align="left">0.77</td>
<td align="left">3068</td>
</tr>
<tr>
<td align="left">6</td>
<td align="left">0.73</td>
<td align="left">0.32</td>
<td align="left">0.44</td>
<td align="left">262</td>
</tr>
<tr>
<td align="left" colspan="5"/>
</tr>
<tr>
<td align="left">Accuracy</td>
<td align="left"/>
<td align="left"/>
<td align="left">0.59</td>
<td align="left">5802</td>
</tr>
<tr>
<td align="left">Macro average</td>
<td align="left">0.41</td>
<td align="left">0.31</td>
<td align="left">0.31</td>
<td align="left">5802</td>
</tr>
<tr>
<td align="left">Weighted average</td>
<td align="left">0.51</td>
<td align="left">0.59</td>
<td align="left">0.52</td>
<td align="left">5802</td>
</tr>
</tbody>
</table>
</table-wrap>
<sec id="s5_1">
<label>5.1</label>
<title>Borderline Synthetic Minority Oversampling Technique (Borderline SMOTE)</title>
<p>SMOTE is used to balance the data set by upsampling minority classes. It separates data points into minority classes and identifies their nearest neighbors. Then a random number called &#x2018;<italic>R</italic>&#x2019; was generated, ranging from 0 to 1. The distance between the nearest points is multiplied by the &#x2018;<italic>R</italic>&#x2019; value and added to the primary point. The newly calculated point would be added as a synthetic observation to the existing data set [<xref ref-type="bibr" rid="ref-54">54</xref>].</p>
<p>Example:</p>
<p>(x<sub>1obs</sub>, y<sub>1obs</sub>) and (x<sub>2obs</sub>, y<sub>2obs</sub>) are the two observations from minority class. The value of <italic>R</italic> ranges from 0 to 1. The synthetic values would be:<disp-formula id="ueqn-1">
<mml:math id="mml-ueqn-1" display="block"><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mi>s</mml:mi><mml:mi>y</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mtext>&#x00A0;&#x00A0;</mml:mtext><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mi>s</mml:mi><mml:mi>y</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mi>o</mml:mi><mml:mi>b</mml:mi><mml:mi>s</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mi>R</mml:mi><mml:mo>&#x2217;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mi>o</mml:mi><mml:mi>b</mml:mi><mml:mi>s</mml:mi></mml:mrow></mml:msub><mml:mtext>&#x00A0;</mml:mtext><mml:mo>&#x2212;</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mi>o</mml:mi><mml:mi>b</mml:mi><mml:mi>s</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mtext>&#x00A0;&#x00A0;</mml:mtext><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mi>o</mml:mi><mml:mi>b</mml:mi><mml:mi>s</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mi>R</mml:mi><mml:mo>&#x2217;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mi>o</mml:mi><mml:mi>b</mml:mi><mml:mi>s</mml:mi></mml:mrow></mml:msub><mml:mtext>&#x00A0;</mml:mtext><mml:mo>&#x2212;</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mi>o</mml:mi><mml:mi>b</mml:mi><mml:mi>s</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>Outliers in minority classes may result in overlapping with the majority class in this approach. Borderline-SMOTE, on the other hand, ignores outliers and normal data points and only considers the border points of majority and minority classes. It would generate synthetic observations using those points as a reference. At the end of the process, the number of minority data points matched the number of majority data points [<xref ref-type="bibr" rid="ref-55">55</xref>]. It would address the issue of data imbalance and enhance the performance of Machine Learning models [<xref ref-type="bibr" rid="ref-56">56</xref>,<xref ref-type="bibr" rid="ref-57">57</xref>]. In this study, Borderline SMOTE increased the quantity of minority classes and matched them with the count of the majority class. As a result, the data set size has been increased to 92766, which is three times larger than the original data set.</p>
</sec>
<sec id="s5_2">
<label>5.2</label>
<title>Stacking Classifier</title>
<p>Stacking, also known as stacked generalization, is an ensemble algorithm with two prediction levels. A bunch of classification models at the base level, or Level 0, and Level 1 contains a meta classifier. At the basic level, the models predict the target classes in the test data set. The output of the base classifiers would be fed into the meta classifier as an input. The meta classifier considers the bios of the base models and addresses them while predicting. It obviously reduces the error rate and improves prediction accuracy [<xref ref-type="bibr" rid="ref-58">58</xref>,<xref ref-type="bibr" rid="ref-59">59</xref>].<disp-formula id="ueqn-2">
<mml:math id="mml-ueqn-2" display="block"><mml:mi>D</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi><mml:mi>a</mml:mi><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:mi>s</mml:mi><mml:mi>e</mml:mi><mml:mi>t</mml:mi><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:mi>D</mml:mi><mml:mi>S</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>.</mml:mo><mml:msub><mml:mi>X</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mtext>&#x00A0;</mml:mtext><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>.</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>In the data set, <italic>X</italic> represents the independent variables and <italic>Y</italic> represents the target class. The data set was divided into two parts: training (DSTraining) and testing (DSTesting). The target classes were predicted by &#x2018;<italic>N</italic>&#x2019; number of classifiers at Level &#x2018;0.&#x2019; Each model&#x2019;s prediction cluster would be:<disp-formula id="ueqn-3">
<mml:math id="mml-ueqn-3" display="block"><mml:mtable columnalign="left" rowspacing=".5em" columnspacing="thickmathspace" displaystyle="true"><mml:mtr><mml:mtd><mml:mspace width="1em" /><mml:mrow><mml:mi mathvariant="normal">A</mml:mi><mml:mi mathvariant="normal">l</mml:mi><mml:mi mathvariant="normal">g</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">r</mml:mi><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">t</mml:mi><mml:mi mathvariant="normal">h</mml:mi><mml:mi mathvariant="normal">m</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>1</mml:mn><mml:mo>&#x003A;</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>1</mml:mn><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mtext>&#x00A0;</mml:mtext><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>1</mml:mn><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>1</mml:mn><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>1</mml:mn><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>.</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>1</mml:mn><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mi>n</mml:mi></mml:msub></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mspace width="1em" /><mml:mrow><mml:mi mathvariant="normal">A</mml:mi><mml:mi mathvariant="normal">l</mml:mi><mml:mi mathvariant="normal">g</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">r</mml:mi><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">t</mml:mi><mml:mi mathvariant="normal">h</mml:mi><mml:mi mathvariant="normal">m</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>2</mml:mn><mml:mo>&#x003A;</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>2</mml:mn><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mtext>&#x00A0;</mml:mtext><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>2</mml:mn><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>2</mml:mn><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>2</mml:mn><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>.</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>2</mml:mn><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mi>n</mml:mi></mml:msub></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mspace width="1em" /><mml:mrow><mml:mi mathvariant="normal">A</mml:mi><mml:mi mathvariant="normal">l</mml:mi><mml:mi mathvariant="normal">g</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">r</mml:mi><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">t</mml:mi><mml:mi mathvariant="normal">h</mml:mi><mml:mi mathvariant="normal">m</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>3</mml:mn><mml:mo>&#x003A;</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>3</mml:mn><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mtext>&#x00A0;</mml:mtext><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>3</mml:mn><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>3</mml:mn><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>3</mml:mn><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>.</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi></mml:mrow><mml:mn>3</mml:mn><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mi>n</mml:mi></mml:msub></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mspace width="1em" /><mml:mo>&#x2026;</mml:mo><mml:mspace width="thinmathspace" /><mml:mo>&#x2026;</mml:mo><mml:mspace width="thinmathspace" /><mml:mo>&#x2026;</mml:mo><mml:mspace width="thinmathspace" /><mml:mo>&#x2026;</mml:mo><mml:mspace width="thinmathspace" /><mml:mo>&#x2026;</mml:mo><mml:mspace width="thinmathspace" /><mml:mo>&#x2026;</mml:mo><mml:mspace width="thinmathspace" /><mml:mo>&#x2026;</mml:mo><mml:mspace width="thinmathspace" /><mml:mo>&#x2026;</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mspace width="1em" /><mml:mrow><mml:mi mathvariant="normal">A</mml:mi><mml:mi mathvariant="normal">l</mml:mi><mml:mi mathvariant="normal">g</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">r</mml:mi><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">t</mml:mi><mml:mi mathvariant="normal">h</mml:mi><mml:mi mathvariant="normal">m</mml:mi><mml:mtext>&#x00A0;</mml:mtext><mml:mi mathvariant="normal">A</mml:mi><mml:mi mathvariant="normal">N</mml:mi></mml:mrow><mml:mo>&#x003A;</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi><mml:mi mathvariant="normal">N</mml:mi></mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mtext>&#x00A0;</mml:mtext><mml:mi mathvariant="normal">A</mml:mi><mml:mi mathvariant="normal">N</mml:mi></mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi><mml:mi mathvariant="normal">N</mml:mi></mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi><mml:mi mathvariant="normal">N</mml:mi></mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>.</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mrow><mml:mrow><mml:mi mathvariant="normal">A</mml:mi><mml:mi mathvariant="normal">N</mml:mi></mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mo>&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mi>n</mml:mi></mml:msub></mml:mtd></mml:mtr></mml:mtable></mml:math>
</disp-formula></p>
<p>As input, the above predictions would be fed into a Level &#x2018;1&#x2019; meta classifier (AMETA). The meta classifier predicts the target classes and provides the following results:<disp-formula id="ueqn-4">
<mml:math id="mml-ueqn-4" display="block"><mml:mrow><mml:mi mathvariant="normal">A</mml:mi><mml:mi mathvariant="normal">l</mml:mi><mml:mi mathvariant="normal">g</mml:mi><mml:mi mathvariant="normal">o</mml:mi><mml:mi mathvariant="normal">r</mml:mi><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">t</mml:mi><mml:mi mathvariant="normal">h</mml:mi><mml:mi mathvariant="normal">m</mml:mi><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="normal">M</mml:mi><mml:mi mathvariant="normal">E</mml:mi><mml:mi mathvariant="normal">T</mml:mi><mml:mi mathvariant="normal">A</mml:mi></mml:mrow></mml:mrow></mml:msub><mml:mo>&#x003A;</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mi>y</mml:mi><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mi>y</mml:mi><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mi>y</mml:mi><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mi>y</mml:mi><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mrow><mml:mtext>&#x00A0;</mml:mtext></mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mi>y</mml:mi><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mi>n</mml:mi></mml:msub></mml:math>
</disp-formula></p>
<p>The Borderline-SMOTE balanced data set was divided into training and testing blocks in an 80:20 ratio for this study. The training set had 74212 samples, while the testing set had 18554 instances. Then, as Level &#x2018;0&#x2019; classifiers, KNN, Decision Tree, AdaBoost, Extra Trees, and XGBoost were allocated, and Random Forest was assigned as the meta classifier. Because the prediction needed to be more reliable, the cross-validation score was set to five, and the model was evaluated. In addition, the other cutting-edge algorithms were evaluated with the same to make comparisons. <xref ref-type="table" rid="table-6">Table 6</xref> displays the results of the models that were evaluated, while <xref ref-type="table" rid="table-7">Table 7</xref> depicts the performance of the proposed model.</p>
<table-wrap id="table-6"><label>Table 6</label>
<caption>
<title>Outcome of stacked classifier <italic>vs.</italic> other state of art models</title></caption>
<table><colgroup><col align="left"/><col align="left"/><col align="left"/><col align="left"/><col align="left"/><col align="left"/><col align="left"/>
</colgroup>
<thead>
<tr>
<th align="left">Classifiers</th>
<th align="left">Accuracy</th>
<th align="left">Precision</th>
<th align="left">Recall</th>
<th align="left">F1-score</th>
<th align="left">AUC</th>
<th align="left">Elapsed time</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Gaussian NB</td>
<td align="left">28</td>
<td align="left">30</td>
<td align="left">28</td>
<td align="left">24</td>
<td align="left">0.6805</td>
<td align="left">0.05</td>
</tr>
<tr>
<td align="left">Logistic Regression</td>
<td align="left">33</td>
<td align="left">31</td>
<td align="left">33</td>
<td align="left">31</td>
<td align="left">0.6908</td>
<td align="left">3.73</td>
</tr>
<tr>
<td align="left">AdaBoost</td>
<td align="left">38</td>
<td align="left">36</td>
<td align="left">39</td>
<td align="left">36</td>
<td align="left">0.7233</td>
<td align="left">10.69</td>
</tr>
<tr>
<td align="left">XGBoost</td>
<td align="left">68</td>
<td align="left">68</td>
<td align="left">69</td>
<td align="left">68</td>
<td align="left">0.909</td>
<td align="left">39.23</td>
</tr>
<tr>
<td align="left">Decision tree</td>
<td align="left">71</td>
<td align="left">71</td>
<td align="left">71</td>
<td align="left">71</td>
<td align="left">0.8254</td>
<td align="left">2.01</td>
</tr>
<tr>
<td align="left">KNN</td>
<td align="left">81</td>
<td align="left">81</td>
<td align="left">81</td>
<td align="left">80</td>
<td align="left">0.9509</td>
<td align="left">5.2</td>
</tr>
<tr>
<td align="left">Random forest</td>
<td align="left">87</td>
<td align="left">87</td>
<td align="left">87</td>
<td align="left">87</td>
<td align="left">0.974</td>
<td align="left">45.21</td>
</tr>
<tr>
<td align="left">Extra trees</td>
<td align="left">90</td>
<td align="left">90</td>
<td align="left">90</td>
<td align="left">90</td>
<td align="left">0.983</td>
<td align="left">17.55</td>
</tr>
<tr>
<td align="left">Stacking classifier</td>
<td align="left">93</td>
<td align="left">93</td>
<td align="left">93</td>
<td align="left">93</td>
<td align="left">0.989</td>
<td align="left">476.63</td>
</tr>
</tbody>
</table>
</table-wrap><table-wrap id="table-7"><label>Table 7</label>
<caption>
<title>Class wise prediction by stacking classifier</title></caption>
<table><colgroup><col align="left"/><col align="left"/><col align="left"/><col align="left"/><col align="left"/>
</colgroup>
<thead>
<tr>
<th align="left">Class</th>
<th align="left">Precision</th>
<th align="left">Recall</th>
<th align="left"><italic>F1</italic>-score</th>
<th align="left">Support</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">1</td>
<td align="left">0.96</td>
<td align="left">0.93</td>
<td align="left">0.95</td>
<td align="left">3151</td>
</tr>
<tr>
<td align="left">2</td>
<td align="left">0.97</td>
<td align="left">0.95</td>
<td align="left">0.96</td>
<td align="left">3140</td>
</tr>
<tr>
<td align="left">3</td>
<td align="left">0.94</td>
<td align="left">0.9</td>
<td align="left">0.92</td>
<td align="left">3122</td>
</tr>
<tr>
<td align="left">4</td>
<td align="left">0.93</td>
<td align="left">0.9</td>
<td align="left">0.91</td>
<td align="left">3083</td>
</tr>
<tr>
<td align="left">5</td>
<td align="left">0.82</td>
<td align="left">0.93</td>
<td align="left">0.87</td>
<td align="left">3012</td>
</tr>
<tr>
<td align="left">6</td>
<td align="left">0.99</td>
<td align="left">0.98</td>
<td align="left">0.98</td>
<td align="left">3046</td>
</tr>
<tr>
<td align="left" colspan="5"/>
</tr>
<tr>
<td align="left">Accuracy</td>
<td align="left"/>
<td align="left"/>
<td align="left">0.93</td>
<td align="left">18554</td>
</tr>
<tr>
<td align="left">Macro average</td>
<td align="left">0.93</td>
<td align="left">0.93</td>
<td align="left">0.93</td>
<td align="left">18554</td>
</tr>
<tr>
<td align="left">Weighted average</td>
<td align="left">0.93</td>
<td align="left">0.93</td>
<td align="left">0.93</td>
<td align="left">18554</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The data balancing had a negative impact on the Gaussian NB, Logistic Regression, and AdaBoost models, as shown in <xref ref-type="fig" rid="fig-18">Fig. 18</xref>. When the accuracy of the Gaussian NB, Logistic Regression, and AdaBoost models was compared to the performance on the imbalanced data set, a significant droppage was observed. Although the precision, recall, and AUC of the Gaussian NB model improved, the <italic>F1</italic>-Score remained unchanged. Though the Logistic Regression model&#x2019;s recall and <italic>F1</italic>-Score were significantly improved, precision was only slightly improved. The AdaBoost model&#x2019;s recall was increased, although there was only a slight improvement in precision, which influenced the <italic>F1</italic>-Score. The remaining models&#x2019; metrics revealed that the over-sampling approach had a positive impact. Poor performers of the imbalanced data set, such as Decision Tree and KNN, performed better, with the accuracy of 71&#x0025; and 81&#x0025;, respectively. Along with Precision, Recall, and <italic>F1</italic>-Score, the models&#x2019; AUC values were significantly increased (Refer to <xref ref-type="fig" rid="fig-19">Fig. 19</xref>). XGBoost, the best performer in the imbalanced data set, delivered moderate accuracy (68&#x0025;). However, the AUC value (0.909) of the same was higher than the Decision Tree (0.8254). The Random Forest outperforms the other models in terms of Accuracy, Precision, Recall, <italic>F1</italic>-Score, and AUC (0.974). The Extra Trees model delivered 90&#x0025; Accuracy, Precision, Recall, <italic>F1</italic>-Score, and AUC value (0.983), making it the second-best model among all.</p>
<fig id="fig-18">
<label>Figure 18</label>
<caption>
<title>Accuracy, precision, recall, <italic>F1</italic>-score of state of art models</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-18.tif"/>
</fig><fig id="fig-19">
<label>Figure 19</label>
<caption>
<title>AUC of state of art models</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-19.tif"/>
</fig>
<p>To detect Child Abuses by over-sampling minority classes with Borderline-SMOTE, the maximum detection performance was attained by the stacking model. It outperformed the others by yielding 93&#x0025; of Accuracy, Precision, Recall, <italic>F1</italic>-Score, and a better AUC value (0.989). However, the elapsed time was on the longer side (476.63), which would be a challenge in real-time as we appraised a large data set (Refer to <xref ref-type="fig" rid="fig-20">Fig. 20</xref>).</p>
<fig id="fig-20">
<label>Figure 20</label>
<caption>
<title>Elapsed time of state of art models</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CSSE_34910-fig-20.tif"/>
</fig>
</sec>
</sec>
<sec id="s6">
<label>6</label>
<title>Conclusion</title>
<p>Artificial intelligence-based crime prediction is an effective instrument for forecasting child maltreatment. Because child abuse is a major issue around the world, researchers are working to develop effective techniques to support detection performance. We extracted child maltreatment incidents from the Chicago crime database for this study. To predict child abuse, we used machine learning algorithms such as Decision Tree, KNN, Gaussian NB, Logistic Regression, AdaBoost, Random Forest, Extra Trees, and XGBoost. The results revealed that the algorithms performed mediocrely due to unbalanced data. As a result, we used Borderline-SMOTE to balance the data set, and the findings demonstrate that the performance of the XGBoost, Decision Tree, KNN, Random Forest, and Extra Trees classifiers improved. Furthermore, the stacking model was developed using AdaBoost, XGBoost, Decision Tree, KNN, and Extra Trees in Level &#x2018;0&#x2019;, as well as Random Forest as the meta classifier. It outscored the competition in terms of accuracy (93&#x0025;), precision (93&#x0025;), recall (93&#x0025;), <italic>F1</italic>-Score (93&#x0025;), and AUC value (0.983). The proposed Borderline-SMOTE enabled Stacking Classifier model (BS-SC Model) could be used to effectively predict child maltreatment. In the future, we endeavor to use feature selection methodologies and dimensionality reduction techniques to minimize the model fitting time.</p>
</sec>
</body>
<back>
<sec>
<title>Funding Statement</title>
<p>We have received no specific funding for this study.</p>
</sec>
<sec sec-type="COI-statement">
<title>Conflicts of Interest</title>
<p>The authors declare that they have no conflicts of interest to report regarding the present study.</p>
</sec>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>[1]</label><mixed-citation publication-type="web"><person-group person-group-type="author"><collab>WHO</collab></person-group>, <source>Child maltreatment</source>, <year>2022</year>. [Online]. Available: <ext-link ext-link-type="uri" xlink:href="https://www.who.int/news-room/fact-sheets/detail/child-maltreatment">https://www.who.int/news-room/fact-sheets/detail/child-maltreatment</ext-link>.</mixed-citation></ref>
<ref id="ref-2"><label>[2]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><collab>ACF</collab></person-group>, <source>Child maltreatment report 2020</source>, Administration for Children and Families, <year>2022</year>. [Online]. Available: <ext-link ext-link-type="uri" xlink:href="https://www.acf.hhs.gov/cb/data-research/child-maltreatment">https://www.acf.hhs.gov/cb/data-research/child-maltreatment</ext-link>.</mixed-citation></ref>
<ref id="ref-3"><label>[3]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J. P.</given-names> <surname>Ryan</surname></string-name>, <string-name><given-names>B. A.</given-names> <surname>Jacob</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Gross</surname></string-name>, <string-name><given-names>B. E.</given-names> <surname>Perron</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Moore</surname></string-name> <etal>et al.,</etal></person-group> &#x201C;<article-title>Early exposure to child maltreatment and academic outcomes</article-title>,&#x201D; <source>Child Maltreatment</source>, vol. <volume>23</volume>, no. <issue>4</issue>, pp. <fpage>365</fpage>&#x2013;<lpage>375</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-4"><label>[4]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>K. A.</given-names> <surname>Davis</surname></string-name> and <string-name><given-names>R. A.</given-names> <surname>Knight</surname></string-name></person-group>, &#x201C;<article-title>The relation of childhood abuse experiences to problematic sexual behaviors in male youths who have sexually offended</article-title>,&#x201D; <source>Archives of Sexual Behavior</source>, vol. <volume>48</volume>, no. <issue>7</issue>, pp. <fpage>2149</fpage>&#x2013;<lpage>2169</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-5"><label>[5]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>K.</given-names> <surname>Orrigio</surname></string-name>, <string-name><given-names>R. B.</given-names> <surname>Pierre</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Gordon-Harrison</surname></string-name>, <string-name><given-names>K.</given-names> <surname>Lewis-O&#x2019;Connor</surname></string-name>, <string-name><given-names>G.</given-names> <surname>Gordon-Strachan</surname></string-name> <etal>et al.,</etal></person-group> &#x201C;<article-title>Sexual abuse and sexually-transmitted HIV/AIDS in Jamaican children and adolescents aged 6&#x2013;19 years</article-title>,&#x201D; <source>The Journal of Infection in Developing Countries</source>, vol. <volume>15</volume>, no. <issue>7</issue>, pp. <fpage>989</fpage>&#x2013;<lpage>996</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-6"><label>[6]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A. E.</given-names> <surname>Austin</surname></string-name>, <string-name><given-names>M. E.</given-names> <surname>Shanahan</surname></string-name> and <string-name><given-names>B. J.</given-names> <surname>Zvara</surname></string-name></person-group>, &#x201C;<article-title>Association of childhood abuse and prescription opioid use in early adulthood</article-title>,&#x201D; <source>Addictive Behaviors</source>, vol. <volume>76</volume>, pp. <fpage>265</fpage>&#x2013;<lpage>269</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-7"><label>[7]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>E.</given-names> <surname>Kapoor</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Okuno</surname></string-name>, <string-name><given-names>V. M.</given-names> <surname>Miller</surname></string-name>, <string-name><given-names>L. G.</given-names> <surname>Rocca</surname></string-name>, <string-name><given-names>W. A.</given-names> <surname>Rocca</surname></string-name> <etal>et al.,</etal></person-group> &#x201C;<article-title>Association of adverse childhood experiences with menopausal symptoms: Results from the data registry on experiences of aging, menopause and sexuality (DREAMS)</article-title>,&#x201D; <source>Maturitas</source>, vol. <volume>143</volume>, pp. <fpage>209</fpage>&#x2013;<lpage>215</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-8"><label>[8]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A. B.</given-names> <surname>Halpin</surname></string-name>, <string-name><given-names>R. K.</given-names> <surname>MacAulay</surname></string-name>, <string-name><given-names>A. R.</given-names> <surname>Boeve</surname></string-name>, <string-name><given-names>L. M.</given-names> <surname>D&#x2019;Errico</surname></string-name> and <string-name><given-names>S.</given-names> <surname>Michaud</surname></string-name></person-group>, &#x201C;<article-title>Are adverse childhood experiences associated with worse cognitive function in older adults?</article-title>&#x201D; <source>Journal of the International Neuropsychological Society</source>, vol. <volume>28</volume>, no. <issue>10</issue>, pp. <fpage>1029</fpage>&#x2013;<lpage>1038</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-9"><label>[9]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A.</given-names> <surname>Talmon</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Uysal</surname></string-name> and <string-name><given-names>J. J.</given-names> <surname>Gross</surname></string-name></person-group>, &#x201C;<article-title>Childhood maltreatment and mid-life adult sexuality: A 10-year longitudinal study</article-title>,&#x201D; <source>Archives of Sexual Behavior</source>, vol. <volume>51</volume>, no. <issue>2</issue>, pp. <fpage>781</fpage>&#x2013;<lpage>795</lpage>, <year>2022</year>.</mixed-citation></ref>
<ref id="ref-10"><label>[10]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J. M.</given-names> <surname>Moreno-Manso</surname></string-name>, <string-name><given-names>M. E.</given-names> <surname>Garc&#x00ED;a-Baamonde</surname></string-name>, <string-name><given-names>M.</given-names> <surname>de la Rosa Murillo</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Bl&#x00E1;zquez-Alonso</surname></string-name>, <string-name><given-names>E.</given-names> <surname>Guerrero-Barona</surname></string-name> <etal>et al.,</etal></person-group> &#x201C;<article-title>Differences in executive functions in minors suffering physical abuse and neglect</article-title>,&#x201D; <source>Journal of Interpersonal Violence</source>, vol. <volume>37</volume>, no. <issue>5</issue>, pp. <fpage>NP2588</fpage>&#x2013;<lpage>NP2604</lpage>, <year>2022</year>.</mixed-citation></ref>
<ref id="ref-11"><label>[11]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A. J.</given-names> <surname>Drury</surname></string-name>, <string-name><given-names>M. J.</given-names> <surname>Elbert</surname></string-name> and <string-name><given-names>M.</given-names> <surname>DeLisi</surname></string-name></person-group>, &#x201C;<article-title>Childhood sexual abuse is significantly associated with subsequent sexual offending: New evidence among federal correctional clients</article-title>,&#x201D; <source>Child Abuse &#x0026; Neglect</source>, vol. <volume>95</volume>, pp. <fpage>104035</fpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-12"><label>[12]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>K. V.</given-names> <surname>A&#x00E7;ar</surname></string-name></person-group>, &#x201C;<article-title>Framework for a single global repository of child abuse materials</article-title>,&#x201D; <source>Global Policy</source>, vol. <volume>11</volume>, no. <issue>1</issue>, pp. <fpage>178</fpage>&#x2013;<lpage>190</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-13"><label>[13]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><given-names>T.</given-names> <surname>Gash</surname></string-name> and <string-name><given-names>R.</given-names> <surname>Hobbs</surname></string-name></person-group>, <source>Policing 4.0&#x2014;Deciding the future of policing in the UK</source>, Deloitte, <year>2018</year>. [Online]. Available: <ext-link ext-link-type="uri" xlink:href="https://www2.deloitte.com/content/dam/Deloitte/ie/Documents/PublicSector/deloitte-uk-future-of-policing.pdf">https://www2.deloitte.com/content/dam/Deloitte/ie/Documents/PublicSector/deloitte-uk-future-of-policing.pdf</ext-link>.</mixed-citation></ref>
<ref id="ref-14"><label>[14]</label><mixed-citation publication-type="web"><person-group person-group-type="author"><collab>CBSNEWS</collab></person-group>, <source>Why are Suspects in Repeated Sexual Abuse of 10-Year-Old Girl Free, While She was Locked Up in A Psychiatric Facility?</source> <publisher-loc>Chicago, IL, USA</publisher-loc>: <publisher-name>CBS News Chicago</publisher-name>, <year>2021</year>. [Online]. Available: <ext-link ext-link-type="uri" xlink:href="https://www.cbsnews.com/chicago/news/10-year-old-girl-sexual-abuse-dcfs-psychiatric-2-investigators/">https://www.cbsnews.com/chicago/news/10-year-old-girl-sexual-abuse-dcfs-psychiatric-2-investigators/</ext-link>.</mixed-citation></ref>
<ref id="ref-15"><label>[15]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>V.</given-names> <surname>Paul</surname></string-name>, <string-name><given-names>V. K.</given-names> <surname>Rathaur</surname></string-name>, <string-name><given-names>N. K.</given-names> <surname>Bhat</surname></string-name>, <string-name><given-names>R.</given-names> <surname>Sananganba</surname></string-name>, <string-name><given-names>A. L.</given-names> <surname>Ittoop</surname></string-name> <etal>et al.,</etal></person-group> &#x201C;<article-title>Child abuse: A social evil in Indian perspective</article-title>,&#x201D; <source>Journal of Family Medicine and Primary Care</source>, vol. <volume>10</volume>, no. <issue>1</issue>, pp. <fpage>110</fpage>&#x2013;<lpage>115</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-16"><label>[16]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><given-names>R.</given-names> <surname>D&#x2019;OVIDIO</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Mitman</surname></string-name>, <string-name><given-names>I. J.</given-names> <surname>El-Burki</surname></string-name> and <string-name><given-names>W.</given-names> <surname>Shumar</surname></string-name></person-group>, &#x201C;<chapter-title>Adult&#x2013;child sex advocacy websites as learning environments for crime</chapter-title>,&#x201D; in <source>Cyber Criminology: Exploring Internet Crimes and Criminal Behavior</source>, <edition>1st ed.</edition>, vol. <volume>1</volume>. <publisher-loc>Boca Raton, FL, USA</publisher-loc>: <publisher-name>CRC Press</publisher-name>, pp. <fpage>103</fpage>&#x2013;<lpage>126</lpage>, <year>2011</year>.</mixed-citation></ref>
<ref id="ref-17"><label>[17]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>P.</given-names> <surname>Gillingham</surname></string-name></person-group>, &#x201C;<article-title>Can predictive algorithms assist decision-making in social work with children and families?</article-title>,&#x201D; <source>Child Abuse Review</source>, vol. <volume>28</volume>, no. <issue>2</issue>, pp. <fpage>114</fpage>&#x2013;<lpage>126</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-18"><label>[18]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Russell</surname></string-name></person-group>, &#x201C;<article-title>Predictive analytics and child protection: Constraints and opportunities</article-title>,&#x201D; <source>Child Abuse &#x0026; Neglect</source>, vol. <volume>46</volume>, pp. <fpage>182</fpage>&#x2013;<lpage>189</lpage>, <year>2015</year>.</mixed-citation></ref>
<ref id="ref-19"><label>[19]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>P.</given-names> <surname>Gillingham</surname></string-name></person-group>, &#x201C;<article-title>Predictive risk modelling to prevent child maltreatment and other adverse outcomes for service users: Inside the &#x2018;black box&#x2019; of machine learning</article-title>,&#x201D; <source>The British Journal of Social Work</source>, vol. <volume>46</volume>, no. <issue>4</issue>, pp. <fpage>1044</fpage>&#x2013;<lpage>1058</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-20"><label>[20]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>R.</given-names> <surname>Vaithianathan</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Maloney</surname></string-name>, <string-name><given-names>E.</given-names> <surname>Putnam-Hornstein</surname></string-name> and <string-name><given-names>N.</given-names> <surname>Jiang</surname></string-name></person-group>, &#x201C;<article-title>Children in the public benefit system at risk of maltreatment: Identification via predictive modeling</article-title>,&#x201D; <source>American Journal of Preventive Medicine</source>, vol. <volume>45</volume>, no. <issue>3</issue>, pp. <fpage>354</fpage>&#x2013;<lpage>359</lpage>, <year>2013</year>.</mixed-citation></ref>
<ref id="ref-21"><label>[21]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Cherian</surname></string-name> and <string-name><given-names>M.</given-names> <surname>Dawson</surname></string-name></person-group>, &#x201C;<article-title>RoboCop: Crime classification and prediction in San Francisco</article-title>,&#x201D; <source>Forest</source>, vol. <volume>15</volume>, pp. <fpage>70</fpage>&#x2013;69, <year>2015</year>.</mixed-citation></ref>
<ref id="ref-22"><label>[22]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M. L.</given-names> <surname>Wilson</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Tumen</surname></string-name>, <string-name><given-names>R.</given-names> <surname>Ota</surname></string-name> and <string-name><given-names>A. G.</given-names> <surname>Simmers</surname></string-name></person-group>, &#x201C;<article-title>Predictive modeling: Potential application in prevention services</article-title>,&#x201D; <source>American Journal of Preventive Medicine</source>, vol. <volume>48</volume>, no. <issue>5</issue>, pp. <fpage>509</fpage>&#x2013;<lpage>519</lpage>, <year>2015</year>.</mixed-citation></ref>
<ref id="ref-23"><label>[23]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><given-names>R.</given-names> <surname>Vaithianathan</surname></string-name>, <string-name><given-names>E.</given-names> <surname>Putnam-Hornstein</surname></string-name>, <string-name><given-names>N.</given-names> <surname>Jiang</surname></string-name>, <string-name><given-names>P.</given-names> <surname>Nand</surname></string-name> and <string-name><given-names>T.</given-names> <surname>Maloney</surname></string-name></person-group>, <source>Developing predictive models to support child maltreatment hotline screening decisions: Allegheny County methodology and implementation</source>, Center for Social data Analytics, <year>2017</year>. [Online]. Available: <ext-link ext-link-type="uri" xlink:href="https://www.alleghenycountyanalytics.us/wp-content/uploads/2019/05/Methodology-V1-from-16-ACDHS-26_PredictiveRisk_Package_050119_FINAL.pdf">https://www.alleghenycountyanalytics.us/wp-content/uploads/2019/05/Methodology-V1-from-16-ACDHS-26_PredictiveRisk_Package_050119_FINAL.pdf</ext-link>.</mixed-citation></ref>
<ref id="ref-24"><label>[24]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>H.</given-names> <surname>Horikawa</surname></string-name>, <string-name><given-names>S. P.</given-names> <surname>Suguimoto</surname></string-name>, <string-name><given-names>P. M.</given-names> <surname>Musumari</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Techasrivichien</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Ono-Kihara</surname></string-name> <etal>et al.,</etal></person-group> &#x201C;<article-title>Development of a prediction model for child maltreatment recurrence in Japan: A historical cohort study using data from a child guidance center</article-title>,&#x201D; <source>Child Abuse &#x0026; Neglect</source>, vol. <volume>59</volume>, pp. <fpage>55</fpage>&#x2013;<lpage>65</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-25"><label>[25]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>R.</given-names> <surname>Vaithianathan</surname></string-name>, <string-name><given-names>E.</given-names> <surname>Putnam-Hornstein</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Chouldechova</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Benavides-Prado</surname></string-name> and <string-name><given-names>R.</given-names> <surname>Berger</surname></string-name></person-group>, &#x201C;<article-title>Hospital injury encounters of children identified by a predictive risk model for screening child maltreatment referrals: Evidence from the allegheny family screening tool</article-title>,&#x201D; <source>JAMA Pediatrics</source>, vol. <volume>174</volume>, no. <issue>11</issue>, pp. <fpage>e202770</fpage>&#x2013;<lpage>e202770</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-26"><label>[26]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>C.</given-names> <surname>Su</surname></string-name>, <string-name><given-names>R.</given-names> <surname>Aseltine</surname></string-name>, <string-name><given-names>R.</given-names> <surname>Doshi</surname></string-name>, <string-name><given-names>K.</given-names> <surname>Chen</surname></string-name>, <string-name><given-names>S. C.</given-names> <surname>Rogers</surname></string-name> <etal>et al.,</etal></person-group> &#x201C;<article-title>Machine learning for suicide risk prediction in children and adolescents with electronic health records</article-title>,&#x201D; <source>Translational Psychiatry</source>, vol. <volume>10</volume>, no. <issue>1</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>10</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-27"><label>[27]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M. C.</given-names> <surname>Walsh</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Joyce</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Maloney</surname></string-name> and <string-name><given-names>R.</given-names> <surname>Vaithianathan</surname></string-name></person-group>, &#x201C;<article-title>Exploring the protective factors of children and families identified at highest risk of adverse childhood experiences by a predictive risk model: An analysis of the growing up in New Zealand cohort</article-title>,&#x201D; <source>Children and Youth Services Review</source>, vol. <volume>108</volume>, pp. <fpage>104556</fpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-28"><label>[28]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Wongcharoenwatana</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Tarugsa</surname></string-name>, <string-name><given-names>K.</given-names> <surname>Kaewpornsawan</surname></string-name>, <string-name><given-names>P.</given-names> <surname>Eamsobhana</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Chotigavanichaya</surname></string-name> <etal>et al.,</etal></person-group> &#x201C;<article-title>Identifying children at high risk for recurrence child abuse</article-title>,&#x201D; <source>Journal of Orthopaedic Surgery</source>, vol. <volume>29</volume>, no. <issue>1</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>7</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-29"><label>[29]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><given-names>E.</given-names> <surname>Putnam-Hornstein</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Prindle</surname></string-name>, <string-name><given-names>R.</given-names> <surname>Rebbe</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Huang</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Kuelbs</surname></string-name> <etal>et al.,</etal></person-group> <source>Using hospital data to predict child maltreatment risk</source>, Administration for Children and Families, <year>2021</year>. [Online]. Available: <ext-link ext-link-type="uri" xlink:href="https://www.acf.hhs.gov/sites/default/files/documents/opre/OPRE-Using-Hospital-Data-Predict-Child-Maltreatment-Risk-Dec2021.pdf">https://www.acf.hhs.gov/sites/default/files/documents/opre/OPRE-Using-Hospital-Data-Predict-Child-Maltreatment-Risk-Dec2021.pdf</ext-link>.</mixed-citation></ref>
<ref id="ref-30"><label>[30]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>X.</given-names> <surname>Yin</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Ma</surname></string-name>, <string-name><given-names>K.</given-names> <surname>Zhu</surname></string-name> and <string-name><given-names>D.</given-names> <surname>Li</surname></string-name></person-group>, &#x201C;<article-title>Identifying intentional injuries among children and adolescents based on machine learning</article-title>,&#x201D; <source>PLoS One</source>, vol. <volume>16</volume>, no. <issue>1</issue>, pp. <fpage>e0245437</fpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-31"><label>[31]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>L.</given-names> <surname>Kissos</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Goldner</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Butman</surname></string-name>, <string-name><given-names>N.</given-names> <surname>Eliyahu</surname></string-name> and <string-name><given-names>R.</given-names> <surname>Lev-Wiesel</surname></string-name></person-group>, &#x201C;<article-title>Can artificial intelligence achieve human-level performance? A pilot study of childhood sexual abuse detection in self-figure drawings</article-title>,&#x201D; <source>Child Abuse &#x0026; Neglect</source>, vol. <volume>109</volume>, pp. <fpage>104755</fpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-32"><label>[32]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A.</given-names> <surname>Tsai</surname></string-name> and <string-name><given-names>P. K.</given-names> <surname>Kleinman</surname></string-name></person-group>, &#x201C;<article-title>Machine learning to identify distal tibial classic metaphyseal lesions of infant abuse: A pilot study</article-title>,&#x201D; <source>Pediatric Radiology</source>, vol. <volume>52</volume>, no. <issue>6</issue>, pp. <fpage>1095</fpage>&#x2013;<lpage>1103</lpage>, <year>2022</year>.</mixed-citation></ref>
<ref id="ref-33"><label>[33]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>K.</given-names> <surname>Kim</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Choi</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Jang</surname></string-name>, <string-name><given-names>H. J.</given-names> <surname>Lee</surname></string-name> and <string-name><given-names>H.</given-names> <surname>Jang</surname></string-name></person-group>, &#x201C;<article-title>Predictive model for intra-familial child maltreatment re-reports and recurrence in South Korea: Analysis of national child protection services case records</article-title>,&#x201D; <source>Child Abuse &#x0026; Neglect</source>, vol. <volume>125</volume>, pp. <fpage>105487</fpage>, <year>2022</year>.</mixed-citation></ref>
<ref id="ref-34"><label>[34]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>R.</given-names> <surname>Edwards</surname></string-name>, <string-name><given-names>V.</given-names> <surname>Gillies</surname></string-name> and <string-name><given-names>S.</given-names> <surname>Gorin</surname></string-name></person-group>, &#x201C;<article-title>Problem-solving for problem-solving: Data analytics to identify families for service intervention</article-title>,&#x201D; <source>Critical Social Policy</source>, vol. <volume>42</volume>, no. <issue>2</issue>, pp. <fpage>265</fpage>&#x2013;<lpage>284</lpage>, <year>2022</year>.</mixed-citation></ref>
<ref id="ref-35"><label>[35]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J. D.</given-names> <surname>Fluke</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Tonmyr</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Gray</surname></string-name>, <string-name><given-names>L. B.</given-names> <surname>Rodrigues</surname></string-name>, <string-name><given-names>F.</given-names> <surname>Bolter</surname></string-name> <etal>et al.,</etal></person-group> &#x201C;<article-title>Child maltreatment data: A summary of progress, prospects and challenges</article-title>,&#x201D; <source>Child Abuse &#x0026; Neglect</source>, vol. <volume>119</volume>, pp. <fpage>104650</fpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-36"><label>[36]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A. Y.</given-names> <surname>Landau</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Ferrarello</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Blanchard</surname></string-name>, <string-name><given-names>K.</given-names> <surname>Cato</surname></string-name>, <string-name><given-names>N.</given-names> <surname>Atkins</surname></string-name> <etal>et al.,</etal></person-group> &#x201C;<article-title>Developing machine learning-based models to help identify child abuse and neglect: Key ethical challenges and recommended solutions</article-title>,&#x201D; <source>Journal of the American Medical Informatics Association</source>, vol. <volume>29</volume>, no. <issue>3</issue>, pp. <fpage>576</fpage>&#x2013;<lpage>580</lpage>, <year>2022</year>.</mixed-citation></ref>
<ref id="ref-37"><label>[37]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>A.</given-names> <surname>Chouldechova</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Benavides-Prado</surname></string-name>, <string-name><given-names>O.</given-names> <surname>Fialko</surname></string-name> and <string-name><given-names>R.</given-names> <surname>Vaithianathan</surname></string-name></person-group>, &#x201C;<article-title>A case study of algorithm-assisted decision making in child maltreatment hotline screening decisions</article-title>,&#x201D; in <conf-name>Proc. of 1st Conf. on Fairness, Accountability and Transparency</conf-name>, <conf-loc>New York, NY, USA</conf-loc>, pp. <fpage>134</fpage>&#x2013;<lpage>148</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-38"><label>[38]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M.</given-names> <surname>Waller</surname></string-name> and <string-name><given-names>P.</given-names> <surname>Waller</surname></string-name></person-group>, &#x201C;<article-title>Why predictive algorithms are so risky for public sector bodies</article-title>,&#x201D; <source>Social Science Research Network</source>, SSRN 3716166, <year>2020</year>. <ext-link ext-link-type="uri" xlink:href="https://dx.doi.org/10.2139/ssrn.3716166">https://dx.doi.org/10.2139/ssrn.3716166</ext-link>.</mixed-citation></ref>
<ref id="ref-39"><label>[39]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>B.</given-names> <surname>Drake</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Jonson-Reid</surname></string-name>, <string-name><given-names>M. G.</given-names> <surname>Ocampo</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Morrison</surname></string-name> and <string-name><given-names>D.</given-names> <surname>Dvalishvili</surname></string-name></person-group>, &#x201C;<article-title>A practical framework for considering the use of predictive risk modeling in child welfare</article-title>,&#x201D; <source>The ANNALS of the American Academy of Political and Social Science</source>, vol. <volume>692</volume>, no. <issue>1</issue>, pp. <fpage>162</fpage>&#x2013;<lpage>181</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-40"><label>[40]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>P.</given-names> <surname>Lanier</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Rodriguez</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Verbiest</surname></string-name>, <string-name><given-names>K.</given-names> <surname>Bryant</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Guan</surname></string-name> <etal>et al.,</etal></person-group> &#x201C;<article-title>Preventing infant maltreatment with predictive analytics: Applying ethical principles to evidence-based child welfare policy</article-title>,&#x201D; <source>Journal of Family Violence</source>, vol. <volume>35</volume>, no. <issue>1</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>13</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-41"><label>[41]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>R.</given-names> <surname>Vaithianathan</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Benavides-Prado</surname></string-name>, <string-name><given-names>E.</given-names> <surname>Dalton</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Chouldechova</surname></string-name> and <string-name><given-names>E.</given-names> <surname>Putnam-Hornstein</surname></string-name></person-group>, &#x201C;<article-title>Using a machine learning tool to support high-stakes decisions in child protection</article-title>,&#x201D; <source>AI Magazine</source>, vol. <volume>42</volume>, no. <issue>1</issue>, pp. <fpage>53</fpage>&#x2013;<lpage>60</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-42"><label>[42]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>W.</given-names> <surname>Safat</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Asghar</surname></string-name> and <string-name><given-names>S. A.</given-names> <surname>Gillani</surname></string-name></person-group>, &#x201C;<article-title>Empirical analysis for crime prediction and forecasting using machine learning and deep learning techniques</article-title>,&#x201D; <source>IEEE Access</source>, vol. <volume>9</volume>, pp. <fpage>70080</fpage>&#x2013;<lpage>70094</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-43"><label>[43]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M. S.</given-names> <surname>Baek</surname></string-name>, <string-name><given-names>W.</given-names> <surname>Park</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Park</surname></string-name>, <string-name><given-names>K. H.</given-names> <surname>Jang</surname></string-name> and <string-name><given-names>Y. T.</given-names> <surname>Lee</surname></string-name></person-group>, &#x201C;<article-title>Smart policing technique with crime type and risk score prediction based on machine learning for early awareness of risk situation</article-title>,&#x201D; <source>IEEE Access</source>, vol. <volume>9</volume>, pp. <fpage>131906</fpage>&#x2013;<lpage>131915</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-44"><label>[44]</label><mixed-citation publication-type="web"><person-group person-group-type="author"><collab>Chicago Data Portal</collab></person-group>, <source>Crimes&#x2014;2001 to Present</source>, <publisher-loc>Chicago, IL, USA</publisher-loc>: <publisher-name>Chicago Data Portal</publisher-name>, <year>2022</year>. [Online]. Available: <ext-link ext-link-type="uri" xlink:href="https://data.cityofchicago.org/Public-Safety/Crimes-2001-to-Present/ijzp-q8t2">https://data.cityofchicago.org/Public-Safety/Crimes-2001-to-Present/ijzp-q8t2</ext-link>, Date: 28/01/2022.</mixed-citation></ref>
<ref id="ref-45"><label>[45]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>C. C.</given-names> <surname>Sun</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Yao</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Li</surname></string-name> and <string-name><given-names>K.</given-names> <surname>Lee</surname></string-name></person-group>, &#x201C;<article-title>Detecting crime types using classification algorithms</article-title>,&#x201D; <source>Journal of Digital Information Management</source>, vol. <volume>12</volume>, no. <issue>5</issue>, pp. <fpage>321</fpage>&#x2013;<lpage>327</lpage>, <year>2014</year>.</mixed-citation></ref>
<ref id="ref-46"><label>[46]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A. F. M.</given-names> <surname>Nasir</surname></string-name> and <string-name><given-names>K. A. M.</given-names> <surname>Sukri</surname></string-name></person-group>, &#x201C;<article-title>Machine learning approach on cyberstalking detection in social media using naive Bayes and decision tree</article-title>,&#x201D; <source>Journal of Soft Computing and Data Mining</source>, vol. <volume>3</volume>, no. <issue>1</issue>, pp. <fpage>19</fpage>&#x2013;<lpage>27</lpage>, <year>2022</year>.</mixed-citation></ref>
<ref id="ref-47"><label>[47]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>G.</given-names> <surname>Hajela</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Chawla</surname></string-name> and <string-name><given-names>A.</given-names> <surname>Rasool</surname></string-name></person-group>, &#x201C;<article-title>A multi-dimensional crime spatial pattern analysis and prediction model based on classification</article-title>,&#x201D; <source>ETRI Journal</source>, vol. <volume>43</volume>, no. <issue>2</issue>, pp. <fpage>272</fpage>&#x2013;<lpage>287</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-48"><label>[48]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>S. S.</given-names> <surname>May</surname></string-name>, <string-name><given-names>O. E.</given-names> <surname>Isafiade</surname></string-name> and <string-name><given-names>O. O.</given-names> <surname>Ajayi</surname></string-name></person-group>, &#x201C;<article-title>Hybridizing extremely randomized trees with bootstrap aggregation for crime prediction</article-title>,&#x201D; in <conf-name>Proc. of 4th Int. Conf. on Artificial Intelligence and Pattern Recognition</conf-name>, <conf-loc>Xiamen, China</conf-loc>, pp. <fpage>536</fpage>&#x2013;<lpage>541</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-49"><label>[49]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>W.</given-names> <surname>Jiang</surname></string-name>, <string-name><given-names>Z.</given-names> <surname>Chen</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Xiang</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Shao</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Ma</surname></string-name> <etal>et al.,</etal></person-group> &#x201C;<article-title>SSEM: A novel self-adaptive stacking ensemble model for classification</article-title>,&#x201D; <source>IEEE Access</source>, vol. <volume>7</volume>, pp. <fpage>120337</fpage>&#x2013;<lpage>120349</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-50"><label>[50]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J. A.</given-names> <surname>Reid</surname></string-name> and <string-name><given-names>E.</given-names> <surname>Beauregard</surname></string-name></person-group>, &#x201C;<article-title>Exploring a machine learning approach: Predicting death in sexual assault</article-title>,&#x201D; <source>Journal of Criminal Justice</source>, vol. <volume>71</volume>, pp. <fpage>101741</fpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-51"><label>[51]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y.</given-names> <surname>Han</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Modaresnezhad</surname></string-name> and <string-name><given-names>H.</given-names> <surname>Nemati</surname></string-name></person-group>, &#x201C;<article-title>An adaptive machine learning system for predicting recurrence of child maltreatment: A routine activity theory perspective</article-title>,&#x201D; <source>Knowledge-Based Systems</source>, vol. <volume>227</volume>, pp. <fpage>107164</fpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-52"><label>[52]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A.</given-names> <surname>Luque</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Carrasco</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Mart&#x00ED;n</surname></string-name> and <string-name><given-names>A.</given-names> <surname>de Las Heras</surname></string-name></person-group>, &#x201C;<article-title>The impact of class imbalance in classification performance metrics based on the binary confusion matrix</article-title>,&#x201D; <source>Pattern Recognition</source>, vol. <volume>91</volume>, pp. <fpage>216</fpage>&#x2013;<lpage>231</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-53"><label>[53]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>P.</given-names> <surname>Vuttipittayamongkol</surname></string-name>, <string-name><given-names>E.</given-names> <surname>Elyan</surname></string-name> and <string-name><given-names>A.</given-names> <surname>Petrovski</surname></string-name></person-group>, &#x201C;<article-title>On the class overlap problem in imbalanced data classification</article-title>,&#x201D; <source>Knowledge-Based Systems</source>, vol. <volume>212</volume>, pp. <fpage>106631</fpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-54"><label>[54]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A.</given-names> <surname>Fern&#x00E1;ndez</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Garcia</surname></string-name>, <string-name><given-names>F.</given-names> <surname>Herrera</surname></string-name> and <string-name><given-names>N. V.</given-names> <surname>Chawla</surname></string-name></person-group>, &#x201C;<article-title>SMOTE for learning from imbalanced data: Progress and challenges, marking the 15-year anniversary</article-title>,&#x201D; <source>Journal of Artificial Intelligence Research</source>, vol. <volume>61</volume>, pp. <fpage>863</fpage>&#x2013;<lpage>905</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-55"><label>[55]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>H.</given-names> <surname>Han</surname></string-name>, <string-name><given-names>W. Y.</given-names> <surname>Wang</surname></string-name> and <string-name><given-names>B. H.</given-names> <surname>Mao</surname></string-name></person-group>, &#x201C;<article-title>Borderline-SMOTE: A new over-sampling method in imbalanced data sets learning</article-title>,&#x201D; in <conf-name>Proc. of Int. Conf. on Intelligent Computing</conf-name>, <conf-loc>Berlin, Heidelberg</conf-loc>, <publisher-name>Springer</publisher-name>, pp. <fpage>878</fpage>&#x2013;<lpage>887</lpage>, <year>2005</year>.</mixed-citation></ref>
<ref id="ref-56"><label>[56]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y.</given-names> <surname>Chen</surname></string-name>, <string-name><given-names>R.</given-names> <surname>Chang</surname></string-name> and <string-name><given-names>J.</given-names> <surname>Guo</surname></string-name></person-group>, &#x201C;<article-title>Effects of data augmentation method borderline-SMOTE on emotion recognition of EEG signals based on convolutional neural network</article-title>,&#x201D; <source>IEEE Access</source>, vol. <volume>9</volume>, pp. <fpage>47491</fpage>&#x2013;<lpage>47502</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-57"><label>[57]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>K. L.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>B.</given-names> <surname>Ren</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Guan</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Yu</surname></string-name> <etal>et al.,</etal></person-group> &#x201C;<article-title>A hybrid cluster-borderline SMOTE method for imbalanced data of rock groutability classification</article-title>,&#x201D; <source>Bulletin of Engineering Geology and the Environment</source>, vol. <volume>81</volume>, no. <issue>1</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>15</lpage>, <year>2022</year>.</mixed-citation></ref>
<ref id="ref-58"><label>[58]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D. H.</given-names> <surname>Wolpert</surname></string-name></person-group>, &#x201C;<article-title>Stacked generalization</article-title>,&#x201D; <source>Neural Networks</source>, vol. <volume>5</volume>, no. <issue>2</issue>, pp. <fpage>241</fpage>&#x2013;<lpage>259</lpage>, <year>1992</year>.</mixed-citation></ref>
<ref id="ref-59"><label>[59]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Yan</surname></string-name> and <string-name><given-names>S.</given-names> <surname>Han</surname></string-name></person-group>, &#x201C;<article-title>Classifying imbalanced data sets by a novel re-sample and cost-sensitive stacked generalization method</article-title>,&#x201D; <source>Mathematical Problems in Engineering</source>, vol. <volume>2018</volume>, Article ID. 5036710, 2018. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1155/2018/5036710">https://doi.org/10.1155/2018/5036710</ext-link>.</mixed-citation></ref>
</ref-list>
</back>
</article>


























