<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">CMC</journal-id>
<journal-id journal-id-type="nlm-ta">CMC</journal-id>
<journal-id journal-id-type="publisher-id">CMC</journal-id>
<journal-title-group>
<journal-title>Computers, Materials &#x0026; Continua</journal-title>
</journal-title-group>
<issn pub-type="epub">1546-2226</issn>
<issn pub-type="ppub">1546-2218</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">14774</article-id>
<article-id pub-id-type="doi">10.32604/cmc.2021.014774</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>M-IDM: A Multi-Classification Based Intrusion Detection Model in Healthcare IoT</article-title>
<alt-title alt-title-type="left-running-head">M-IDM: A Multi-Classification Based Intrusion Detection Model in Healthcare IoT</alt-title>
<alt-title alt-title-type="right-running-head">M-IDM: A Multi-Classification Based Intrusion Detection Model in Healthcare IoT</alt-title>
</title-group>
<contrib-group content-type="authors">
<contrib id="author-1" contrib-type="author">
<name name-style="western">
<surname>Lee</surname>
<given-names>Jae Dong</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref><xref ref-type="aff" rid="aff-2">2</xref>
</contrib>
<contrib id="author-2" contrib-type="author">
<name name-style="western">
<surname>Cha</surname>
<given-names>Hyo Soung</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref></contrib>
<contrib id="author-3" contrib-type="author">
<name name-style="western">
<surname>Rathore</surname>
<given-names>Shailendra</given-names>
</name>
<xref ref-type="aff" rid="aff-2">2</xref></contrib>
<contrib id="author-4" contrib-type="author" corresp="yes">
<name name-style="western">
<surname>Park</surname>
<given-names>Jong Hyuk</given-names>
</name>
<xref ref-type="aff" rid="aff-2">2</xref>
<email>jhpark1@seoultech.ac.kr</email></contrib>
<aff id="aff-1"><label>1</label><institution>Healthcare AI Team, National Cancer Center</institution>, <addr-line>Goyang-si, 10408</addr-line>, <country>Korea</country></aff>
<aff id="aff-2"><label>2</label><institution>Department of Computer Science and Engineering, Seoul National University of Science and Technology, (SeoulTech)</institution>, <addr-line>Seoul, 01811</addr-line>, <country>Korea</country></aff>
</contrib-group>
<author-notes><corresp id="cor1">&#x002A;Corresponding Author: Jong Hyuk Park. Email: <email>jhpark1@seoultech.ac.kr</email></corresp></author-notes>
<pub-date pub-type="epub" date-type="pub" iso-8601-date="2020-12-18">
<day>18</day>
<month>12</month>
<year>2020</year>
</pub-date>
<volume>67</volume>
<issue>2</issue>
<fpage>1537</fpage>
<lpage>1553</lpage>
<history>
<date date-type="received">
<day>15</day>
<month>10</month>
<year>2020</year>
</date>
<date date-type="accepted">
<day>12</day>
<month>12</month>
<year>2020</year>
</date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2021 Lee et al.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Lee et al.</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_CMC_14774.pdf"></self-uri>
<abstract>
<p>In recent years, the application of a smart city in the healthcare sector via loT systems has continued to grow exponentially and various advanced network intrusions have emerged since these loT devices are being connected. Previous studies focused on security threat detection and blocking technologies that rely on testbed data obtained from a single medical IoT device or simulation using a well-known dataset, such as the NSL-KDD dataset. However, such approaches do not reflect the features that exist in real medical scenarios, leading to failure in potential threat detection. To address this problem, we proposed a novel intrusion classification architecture known as a Multi-class Classification based Intrusion Detection Model (M-IDM), which typically relies on data collected by real devices and the use of convolutional neural networks (i.e., it exhibits better performance compared with conventional machine learning algorithms, such as na&#x00EF;ve Bayes, support vector machine (SVM)). Unlike existing studies, the proposed architecture employs the actual healthcare IoT environment of National Cancer Center in South Korea and actual network data from real medical devices, such as a patient&#x2019;s monitors (i.e., electrocardiogram and thermometers). The proposed architecture classifies the data into multiple classes: Critical, informal, major, and minor, for intrusion detection. Further, we experimentally evaluated and compared its performance with those of other conventional machine learning algorithms, including na&#x00EF;ve Bayes, SVM, and logistic regression, using neural networks.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>Smart city healthcare IoT</kwd>
<kwd>neural network</kwd>
<kwd>intrusion classification</kwd>
<kwd>machine learning</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<label>1</label>
<title>Introduction</title>
<p>Nowadays, information and communication technology is increasingly applied to the the healthcare sector in smart city infrastructure, the foundation of which is network technology for data transmission and reception. Network flows in such infrastructure are also increasing in complexity owing to advanced technologies such as Internet of Things (IoT), cloud computing, big data, mobile, artificial intelligence, and blockchain technologies [<xref ref-type="bibr" rid="ref-1">1</xref>].</p>
<p>Before the advent of IoT, interactions between patients and medical staff were limited to visits and telephone calls. As such, it was impossible to monitor patients continuously. The application of IoT has enhanced the connectivity of devices related to healthcare and has redefined the interaction space of devices and people when medical services are provided, significantly improving the medical sector. With the emergence of IoT-applied medical services, all members in a city, including healthy people, patients, medical staff, hospitals, and health insurance companies, can now remotely monitor a person&#x2019;s health status with no distinction between inside and outside a medical institution. This capability has increased the ease and efficiency of interacting with medical staff. It not only shortens hospital stays and prevents re-hospitalization, but also substantially reduces medical costs and improves treatment outcomes [<xref ref-type="bibr" rid="ref-2">2</xref>&#x2013;<xref ref-type="bibr" rid="ref-4">4</xref>].</p>
<p>A vast amount of data in smart city healthcare field has been actively trasferred between people through devices based on edge nodes or edge cloud. There are also various types of connectivity-based equipment. Such an environment, however, contains either directly or indirectly sensitive information, which potentially exposes personal information to attacks. Unlike other fields, healthcare cyberattacks in smart cities can cause physical and logical confusion to individuals and society. Therefore, it should be able to defend against interrupting service requests on the network [<xref ref-type="bibr" rid="ref-2">2</xref>&#x2013;<xref ref-type="bibr" rid="ref-4">4</xref>].</p>
<p>Previous studies mostly focused on security threat detection and blocking technology (based on testbed data composed of a single medical IoT device or simulator) [<xref ref-type="bibr" rid="ref-5">5</xref>&#x2013;<xref ref-type="bibr" rid="ref-12">12</xref>]. However, such approaches lack reflection of features that exist in the real world.</p>
<p>Therefore, in this study, machine learning technology was applied to classify network events into four different classes (critical, informal, major, and minor) using data collected by real devices in order to sufficiently reflect the complex network flow and characteristics of the actual healthcare IoT environment. We built real world data-based models using a neural network-based multi-class intrusion classification algorithm for these classes.</p>
<p>To address the above problems in healthcare IoT, we proposed a Multi-class classification based Intrusion Detection Model (M-IDM) for healthcare IoT in a smart city that relies on machine learning techniques. The contributions of this paper are as follows:
<list list-type="bullet">
<list-item><p>We proposed a novel intrusion classification architecture based on machine learning techniques to overcome problems related to the detection of unknown attacks in healthcare IoT.</p></list-item>
<list-item><p>A service scenario is presented to classify the security event in the network as &#x201C;normal&#x201D; or &#x201C;anomaly (critical, major, minor)&#x201D; based on various features.</p></list-item>
<list-item><p>We experimentally evaluated and analyzed the proposed model architecture using a large amount of data to demonstrate its practicability and feasibility.</p></list-item>
</list></p>
<p>The structure of the rest of this paper is as follows. Section 2 discusses related works on intrusion detection and machine learning. Section 3 proposes a prediction model using machine learning algorithms for intelligent network intrusion detection. Section 4 provides analysis and comparison of the existing and proposed models for network intrusion detection. Finally, Section 5 summarizes the main findings of this study and the concluding remarks.</p>
</sec>
<sec id="s2">
<label>2</label>
<title>Related Work</title>
<sec id="s2_1">
<label>2.1</label>
<title>Intrusion Classification</title>
<p>This procedure is divided into the network intrusion detection system (NIDS) and host-based intrusion detection system (HIDS) according to the detection location. The NIDS analyzes the network traffic, and the result is combined with other technologies to increase the performance of the detection and prediction speed. In particular, artificial neural network-based intrusion detection systems can recognize intrusion patterns more efficiently, which helps them analyze large amounts of data. Meanwhile, the HIDS monitors important operating system files and the inbound and outbound packets of the device and also sends alerts in cases of a suspicious activity.</p>
<p>Classification techniques can be divided into signature-based and anomaly-based methods. Signature-based methods search for specific patterns, such as byte sequences of network traffic or sequences of known malicious instructions using malware. In contrast, anomaly-based methods can easily detect known attacks but show poor detection performance in the case of new attacks in which patterns cannot be used. Anomaly-based methods are primarily used to classify unknown attacks due to the rapid development of a malicious code. Essentially, the machine learning algorithm is used to create a reliable model, then, its operations are compared. Although unknown attacks can be detected, this method may also result in false positives. An efficient feature selection algorithm must be used to enhance the reliability of classification [<xref ref-type="bibr" rid="ref-13">13</xref>&#x2013;<xref ref-type="bibr" rid="ref-19">19</xref>].</p>
</sec>
<sec id="s2_2">
<label>2.2</label>
<title>Machine Learning</title>
<p>In theoretical terms, machine learning is a field of artificial intelligence in which algorithms are developed that enable machines to learn and execute operations that are not specified in codes. Representation and generalization are the key elements among the many features that are involved in machine learning. Representation refers to the evaluation of given data, whereas generalization refers to processing of unknown data. In practice, the three key elements of machine learning are the training set, model, and inference. The training set refers to data used for learning, the model is the output obtained through the training set, and the inference is the training output prediction based on input values through actual data [<xref ref-type="bibr" rid="ref-20">20</xref>,<xref ref-type="bibr" rid="ref-21">21</xref>]. <xref ref-type="fig" rid="fig-1">Fig. 1</xref> summarizes the above descriptions.</p>
<fig id="fig-1">
<label>Figure 1</label>
<caption>
<title>Workflow of a machine learning-based model</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="fig-1.png"/>
</fig>
<p>In a conventional program, data are input and the program presents the results of processing the input data. However, when machine learning processes the data, the model (algorithm) developed from the training dataset provides the prediction results of the input values in the test dataset. Hence, machine learning algorithms are suitable for solving problems where it is difficult to explain the sequence or reasoning clearly [<xref ref-type="bibr" rid="ref-20">20</xref>].</p>
<p>The machine learning model was selected based on whether the data were labeled or not; if the data are labeled, supervised learning models are used to perform classification and prediction, whereas if the data are unlabeled, unsupervised learning models are used to perform clustering. The two models are different, but when applying actual data to the model, a harmonized methodology is used because labeled data are rare [<xref ref-type="bibr" rid="ref-20">20</xref>&#x2013;<xref ref-type="bibr" rid="ref-22">22</xref>].</p>
</sec>
<sec id="s2_3">
<label>2.3</label>
<title>Existing Research</title>
<p>Kabir et al. [<xref ref-type="bibr" rid="ref-23">23</xref>] proposed an algorithm that selects representative samples from sub-groups so that the samples faithfully reflect the entire dataset. In the optimal allocation technique, least-squares support vector machine (SVM) is applied to the extracted sample to detect intrusion after generation based on the diversity of observations in the subgroup.</p>
<p>Wang et al. [<xref ref-type="bibr" rid="ref-24">24</xref>] proposed an effective intrusion detection framework with improved functionality based on SVMs, emphasizing that high-quality training data are important for enhancing detection performance. In this framework, log marginal density ratio conversion is implemented to achieve high-quality SVM detection.</p>
<p>Farnaaz et al. [<xref ref-type="bibr" rid="ref-25">25</xref>] constructed a model for an intrusion detection system using a random forest (RF)-based classifier. The RF algorithm is used to detect four types of attacks: denial of service (DoS), probe, U2R, and R2L attacks. Cross-validation is adopted to achieve accurate classification, and a feature selection algorithm is applied to the dataset to remove redundant or irrelevant features.</p>
<p>Swarnkar et al. [<xref ref-type="bibr" rid="ref-26">26</xref>] proposed a novel and efficient data structure called the probability tree structure. If not found in the database in the test phase, or if the probability of packet occurrence is not found in the training phase, then the short sequence is treated as an anomaly. The possibility of an abnormal short sequence is used to generate the class label for the test packet. Some intelligence algorithms are utilized to optimize the parameters of machine techniques for feature selection or feature weighting in network intrusions. In this regard, Yang et al. [<xref ref-type="bibr" rid="ref-27">27</xref>] presented a modified na&#x00EF;ve Bayes algorithm based on the artificial bee population algorithm.</p>
<p>For the search strategy, Khammassi et al. [<xref ref-type="bibr" rid="ref-28">28</xref>] applied a wrapper method based on a genetic algorithm, whereas for the learning algorithm for network intrusion detection, they used a method that selects the best subset of functions by applying logistic regression.</p>
<p>Caminero et al. [<xref ref-type="bibr" rid="ref-29">29</xref>] first applied hostile reinforcement learning to intrusion detection and proposed a novel technique that integrates the behavior of the environment into the learning process of the modified reinforcement learning algorithm. The researchers demonstrated that the proposed algorithm is appropriate for supervised learning based on labeled datasets and verified its performance through comparisons with other well-known machine learning models for two datasets.</p>
<p>To identify a variety of unauthorized use, misuse, and abuse of computer systems, Liu et al. [<xref ref-type="bibr" rid="ref-30">30</xref>] proposed an adaptive network intrusion detection technique based on the selective ensemble of a kernel extreme learning machine with random functions.</p>
<p>Handling redundant or irrelevant features in high-dimensional datasets has been a long-term challenge in network anomaly detection. Removing these features through spectral information not only speeds up the classification process but also helps classifiers make accurate decisions during instances of attack recognition.</p>
<p>Salo et al. [<xref ref-type="bibr" rid="ref-31">31</xref>] proposed a new hybrid dimension reduction technique, namely the principal component analysis&#x2013;ensemble technique, using an ensemble classifier based on information gain, an SVM, an instance-based learning algorithm, and a multi-layer perceptron.</p>
<p>Divyasree et al. [<xref ref-type="bibr" rid="ref-32">32</xref>] proposed an efficient intrusion detection system using the ensemble core vector machine (CVM) method. The CVM algorithm, which is based on the minimum enclosing ball concept, detects attacks such as U2R, R2L, probe, and DoS attacks. CVM classifiers are modeled for each type of attack; chi-square tests are used to select the relevant function for each attack, and the functions are weighted for dimension reduction.</p>
<p>Al-Jarrah et al. [<xref ref-type="bibr" rid="ref-33">33</xref>] presented a semi-supervised multi-layer clustering (SMLC) model for network intrusion detection and prevention. SMLC, which achieves a detection performance similar to that of the supervised ML-based intrusion detection system (IDS) intrusion prevention system (IPS), performs learning using partially classified data. SMLC&#x2019;s performance is identical to those of algorithms that make up the layers of the well-known semi-supervised model (tri-training) and the supervised RF, bagging, and AdaboostM1 machine learning models.</p>
<p>Hady et al. [<xref ref-type="bibr" rid="ref-34">34</xref>] built a real-time testbed to monitor patient biometrics and collect network flow metrics. They combined network flow data with a patient&#x2019;s biometric data to improve system performance and used it as a training dataset. The proposed system improved the area under curve (AUC) by up to 25%. The aforementioned system used four machine learning methods: RF, K-nearest neighbors, SVM, and artificial neural network.</p>
<p>Gao et al. [<xref ref-type="bibr" rid="ref-35">35</xref>] developed a feature set specifically for implanted medical devices and conducted experiments to test the performance of different learning algorithms including decision tree, SVM, and K-means algorithms. The study showed that decision-tree based algorithms achieved the highest detection accuracy, low false-positive rate, and fast training and prediction speed compared with other algorithms. In addition, several other researchers discussed intrusion detection from different perspectives, including distributed DoS attacks, deep packet inspection, emotion classification, and network sub-slicing [<xref ref-type="bibr" rid="ref-36">36</xref>&#x2013;<xref ref-type="bibr" rid="ref-39">39</xref>].</p>
<p>In this paper, we demonstrated that a model created using machine learning based on extracting actual data from the hospital environment can respond to the security threats of IoT medical devices, which are otherwise difficult to manage. Moreover, it is useful to classify detailed risks to enable greater focus on serious events in an IoT medical device mass produced from heterogeneous medical devices, as it shows that it is possible to classify threats of four labels beyond simple binary classification with high accuracy.</p>
<p>In summary, existing studies demonstrated that machine learning is a good approach to support network intrusion detection in communication and distributed infrastructure. Thus, this paper presents an M-IDM to respond to the security threats of IoT medical devices, which are difficult to manage, through a model trained by extracting actual data from the hospital environment. The proposed model shows that it is possible to classify threats of four labels beyond simple binary classification with high accuracy.</p>
</sec>
</sec>
<sec id="s3">
<label>3</label>
<title>Multi-Class Intrusion Classification Model (M-IDM)</title>
<p>The proposed security model M-IDM relies on the concept of intrusion classification in which a machine learning model is trained over the baseline dataset to classify the anomaly behaviors from legitimate ones. Unlike existing studies, the proposed M-IDM uses the actual healthcare IoT environment of the National Cancer Center, South Korea, and actual network data from real medical devices, such as a patient&#x2019;s monitor, including electrocardiogram and thermometers. Moreover, it employs convolutional neural network (CNN), which exhibits better performance compared with conventional machine learning algorithms such as na&#x00EF;ve Bayes and SVM, to classify the data into multiple classes (critical, informal, major, and minor) for intrusion detection. This section describes the architectural design overview of the M-IDM, including major module data description, data preprocessing, and service scenario.</p>
<sec id="s3_1">
<label>3.1</label>
<title>Proposed Model Architecture</title>
<p>The architectural design overview of the proposed M-IDM is shown in <xref ref-type="fig" rid="fig-2">Fig. 2</xref>. It consists of five stages: Input data, preprocessing, feature extraction, classification, and output.</p>
<fig id="fig-2">
<label>Figure 2</label>
<caption>
<title>Overview of proposed model architecture</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="fig-2.png"/>
</fig>
<p>During the input stage, raw data is accumulated, which includes network traffic, logs, scan from internal medical sources, vulnerability database, threat feeds from technical sources, social media, forums, and dark web from human sources. Preprocessing eliminates some inappropriate, multifunctional, or noisy data that might be present in subsequent raw data. The feature extraction component provides extraction and specification of the relevant features, including network security event data such as the IP, port, protocol, and severity from heterogeneous medical devices to support security threat classification in the healthcare IoT environment. The classification module is responsible for creating a trained model with relevant features from the preprocessed data. It uses various machine learning algorithms for classification purposes.</p>
<p>Here, the processed data is divided into training and test data. The classification model is trained using only the training data. The trained model is then repeatedly validated using the validation data. The process either proceeds to the next stage or corrects the parameters, learning method, etc., based on the validation results, and training is repeated. The model is completed through this process. In the output stage, the actual values are input into the model completed in the previous stage to confirm the classification. The classes are normal and anomaly (critical, major, minor).</p>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>Data Description</title>
<p>The proposed M-IDM architecture uses the actual healthcare IoT environment of National Cancer Center, South Korea, and actual network data from real medical devices, unlike previous studies. The dataset was collected from a total of six medical devices with the same IP band, such as a patient&#x2019;s monitor, an electrocardiogram, a thermometer, a sphygmomanometer, a hygrometer, and a fall prevention bed with an alarm watch, which is used in an isolated internal-medical-device-only wireless network. There is a network tab device configured using the mirror method for transmitting and receiving all traffic between the medical IoT device and gateway. We obtained monthly logs of all traffic going through this firewall to the gateway. Out of the 300,000 cases collected (12 months), 100,000 cases (4 months, approximately 833/day) were selected in an even distribution. For the data label, four risk labels defined in the firewall were used: Normal, critical, major, and minor.</p>
<p>The network event data consists of 11 features: one target variable and ten explanatory variables for machine learning, as listed in <xref ref-type="table" rid="table-1">Tab. 1</xref>. The target variable is the severity classification value of each event, that is, normal, critical, major, and minor. The type of source/destination IP refers to the type of IP that attempts to access or receive access from the device, that is, private-internal, public-external. The date variable was recorded as year, month, day, hour, and second based on when the event was created. Flag is used in the TCP flag, that is, URG, ACK, PSH, RST, SYN, and FIN.</p>
<table-wrap id="table-1">
<label>Table 1</label>
<caption>
<title>Variable descriptions</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Type</th>
<th>Variable type</th>
<th>Attributes</th>
<th>Data type</th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>Severity</td>
<td>Target</td>
<td>Normal, critical, major, minor</td>
<td>Nominal</td>
<td/>
</tr>
<tr>
<td>Working hour</td>
<td>Explanatory</td>
<td>Day: 09:00&#x2013;18:00<break/> Night: 18:00&#x2013;09:00</td>
<td>Binary</td>
<td/>
</tr>
<tr>
<td>Date</td>
<td/>
<td>2017-01-01 00:00:00</td>
<td>Redefine to working hour</td>
<td/>
</tr>
<tr>
<td>Type of source/destination IP</td>
<td/>
<td>Private, public</td>
<td>Binary</td>
<td/>
</tr>
<tr>
<td>Source/destination IP</td>
<td/>
<td>000.000.000.000</td>
<td>Redefine to type of source/destination IP</td>
<td/>
</tr>
<tr>
<td>Source/destination port</td>
<td/>
<td>1&#x2013;65535</td>
<td>Numeric</td>
<td/>
</tr>
<tr>
<td>Protocol</td>
<td/>
<td>dns, kerberos, http, https, ssh, telnet, imap, smtp, pop3, tftp, ftp, smb, smb2, icmp, ntp, tcp, udp</td>
<td>Nominal</td>
<td/>
</tr>
<tr>
<td>Flag</td>
<td/>
<td>URG, ACK, PSH, RST, SYN, FIN, N/A</td>
<td>Nominal</td>
<td/>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s3_3">
<label>3.3</label>
<title>Data Preprocessing</title>
<p>There are two types of dataset attributes in the proposed M-IDM: Symbolic and numeric. The data set attribute is numeric. However, the data of symbolic properties cannot be directly processed. Thus, it is necessary to convert symbolic data to numeric data. <xref ref-type="table" rid="table-2">Tab. 2</xref> lists symbolic attributes and their associated values. In the table, the two redefined attributes &#x201C;Working hour&#x201D; and &#x201C;Type of source/destination IP&#x201D; have a value of 0 or 1; in this case, these can be processed in the same way as the numeric attribute. Furthermore, &#x201C;Protocol&#x201D; and &#x201C;Flag&#x201D; attributes were converted as a sequence of integers, that is, Protocol: 1&#x2013;17, Flag: 1&#x2013;6, after being represented as on-hot vector.</p>
<table-wrap id="table-2">
<label>Table 2</label>
<caption>
<title>Symbolic attributes and their associated values</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Symbolic attributes</th>
<th>Symbolic values</th>
<th>Number of distinct values</th>
</tr>
</thead>
<tbody>
<tr>
<td>Working hour</td>
<td>1 and 0</td>
<td>2</td>
</tr>
<tr>
<td>Type of source/destination IP</td>
<td>1 and 0</td>
<td>2</td>
</tr>
<tr>
<td>Protocol</td>
<td>dns, kerberos, http, https, ssh, telnet, imap, smtp, pop3, tftp, ftp, smb, smb2, icmp, ntp, tcp, udp</td>
<td>17</td>
</tr>
<tr>
<td>Flag</td>
<td>URG, ACK, PSH, RST, SYN, FIN, N/A</td>
<td>6</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The protocol attribute has 17 unique values; similarly, the flag attribute is defined with 6 unique values. Many approaches have been proposed for handling symbolic attributes. In an experiment conducted as part of this study, we employed a method that uses conditional probability and dummy indicator variables to process protocol and flag properties [<xref ref-type="bibr" rid="ref-40">40</xref>,<xref ref-type="bibr" rid="ref-41">41</xref>]. However, using only this method increases the dimension of the dataset; thus, we clustered similar types in symbolic attributes. In <xref ref-type="table" rid="table-2">Tabs. 2</xref> and <xref ref-type="table" rid="table-3">3</xref>, it can be observed that the dimension is reduced by clustering into different classes for different protocol properties. The study in [<xref ref-type="bibr" rid="ref-42">42</xref>] also performed clustering into a similar type of symbolic attribute. Then, we converted these classes into indicator variables as presented in <xref ref-type="table" rid="table-3">Tabs. 3</xref> and <xref ref-type="table" rid="table-4">4</xref>. Data scaling was performed because normalized data is required to perform classification.</p>
<table-wrap id="table-3">
<label>Table 3</label>
<caption>
<title>Clustering of protocol attribute</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Symbolic attributes</th>
<th>Protocol type</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>PR1</td>
<td>dns</td>
<td>Service belongs to names server</td>
</tr>
<tr>
<td>PR2</td>
<td>kerberos</td>
<td>Service belongs to authentication</td>
</tr>
<tr>
<td>PR3</td>
<td>http, https, <inline-formula id="ieqn-1"><alternatives><inline-graphic xlink:href="ieqn-1.png"/><tex-math id="tex-ieqn-1"><![CDATA[$\ldots$]]></tex-math><mml:math id="mml-ieqn-1"><mml:mo>&#x2026;</mml:mo></mml:math></alternatives></inline-formula></td>
<td>Service belongs to web applications</td>
</tr>
<tr>
<td>PR4</td>
<td>ssh, telnet, <inline-formula id="ieqn-2"><alternatives><inline-graphic xlink:href="ieqn-2.png"/><tex-math id="tex-ieqn-2"><![CDATA[$\ldots$]]></tex-math><mml:math id="mml-ieqn-2"><mml:mo>&#x2026;</mml:mo></mml:math></alternatives></inline-formula></td>
<td>Service for remote access to other machines</td>
</tr>
<tr>
<td>PR5</td>
<td>imap, smtp</td>
<td>Service for mail transfer</td>
</tr>
<tr>
<td>PR6</td>
<td>tftp, ftp, smb</td>
<td>Service for file transfer</td>
</tr>
<tr>
<td>PR7</td>
<td>Remaining protocols</td>
<td>All other services</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="table-4">
<label>Table 4</label>
<caption>
<title>Clustering of flag attribute</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Symbolic attributes</th>
<th>Flag type</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>FL1</td>
<td>SYN</td>
<td>Connection request and in TCP</td>
</tr>
<tr>
<td>FL2</td>
<td>ACK</td>
<td>Response in TCP</td>
</tr>
<tr>
<td>FL3</td>
<td>RST</td>
<td>Connection reset in TCP</td>
</tr>
<tr>
<td>FL4</td>
<td>PSH</td>
<td>Message push in TCP</td>
</tr>
<tr>
<td>FL5</td>
<td>URG</td>
<td>Urgent message in TCP</td>
</tr>
<tr>
<td>FL6</td>
<td>FIN</td>
<td>Connection termination in TCP</td>
</tr>
<tr>
<td>FL7</td>
<td>N/A</td>
<td>All other flags or blank</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In this experimental evaluation of the proposed M-IDM architecture, the selected data (i.e., 100,000 cases or instances) were randomly sampled and divided into training or labeled data and testing or unlabeled data. The ratio of training and testing dataset was 90:10, where 90% (i.e., 90,000 instances) is training data and the remaining 10% (i.e., 10,000) is testing data.</p>
</sec>
<sec id="s3_4">
<label>3.4</label>
<title>Service Scenario</title>
<p>This section describes the service scenario of the proposed M-IDM, which classifies the security event data into classes of &#x201C;normal&#x201D; or &#x201C;anomaly (critical, major, minor).&#x201D; <xref ref-type="fig" rid="fig-3">Fig. 3</xref> shows a schematic diagram of the service scenario for the proposed model. This section describes details of the procedures for each stage from &#x2780; to &#x2783;.</p>
<fig id="fig-3">
<label>Figure 3</label>
<caption>
<title>Service scenario of proposed M-IDM model</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="fig-3.png"/>
</fig>
<p>The details of the service scenario are as follows:
<list list-type="simple">
<list-item><label>&#x2780;</label><p>Data separation: All security event data collected on the healthcare network are randomly sampled and divided into training and test data. The separated data are used to generate the model through learning and to validate the reliability of the model.</p></list-item>
<list-item><label>&#x2781;</label><p>Model training: The learning algorithm is selected considering various conditions; then, the parameters are adjusted according to the algorithm and learning is performed using only the training data from the data separated in &#x2780;. After assessing the precision of the learning model using the test data, this process is repeated by applying different parameters and algorithms and other methods until the desired result is obtained. The processes in &#x2780; and &#x2781; are performed in batch form.</p></list-item>
<list-item><label>&#x2782;</label><p>Real-time classification 1: The model generated in &#x2781;  is applied to the classifier; then, the real IoT medical devices network security event data (the real data do not overlap with the data in &#x2780;) are input in real-time. The input data are first classified as &#x201C;normal&#x201D; or &#x201C;anomaly&#x201D; using a trained model that is not based on rules.</p></list-item>
<list-item><label>&#x2783;</label><p>Real-time classification 2: The IoT medical devices security event data classified as &#x201C;anomaly&#x201D; in &#x2782; are further classified as &#x201C;critical,&#x201D; &#x201C;major,&#x201D; or &#x201C;minor.&#x201D; The processes in &#x2782; and &#x2783; are performed in real-time.</p></list-item>
</list></p>
</sec>
</sec>
<sec id="s4">
<label>4</label>
<title>Experiment and Performance Evaluation</title>
<p>In this study, we experimentally evaluated the performance of the proposed M-IDM, which was developed by employing CNN algorithms in Python 3.7.0 environment with orange. We selected a CNN by validating its classification performance and those of conventional machine learning algorithms such as na&#x00EF;ve Bayes and SVM. The CNN has the structure: <inline-formula id="ieqn-3"><alternatives><inline-graphic xlink:href="ieqn-3.png"/><tex-math id="tex-ieqn-3"><![CDATA[$\text{I}\text{n}\text{p}\text{u}\text{t}\rightarrow \text{C}\text{o}\text{n}\text{v}\rightarrow \text{M}\text{a}\text{x}\text{p}\text{o}\text{ol}\rightarrow \text{Fully Connected}\rightarrow \text{O}\text{u}\text{t}\text{p}\text{u}\text{t}$]]></tex-math><mml:math id="mml-ieqn-3"><mml:mstyle class="text"><mml:mtext>I</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>n</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>p</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>u</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>t</mml:mtext></mml:mstyle><mml:mo>&#x2192;</mml:mo><mml:mstyle class="text"><mml:mtext>C</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>o</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>n</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>v</mml:mtext></mml:mstyle><mml:mo>&#x2192;</mml:mo><mml:mstyle class="text"><mml:mtext>M</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>a</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>x</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>p</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>o</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>ol</mml:mtext></mml:mstyle><mml:mo>&#x2192;</mml:mo><mml:mstyle class="text"><mml:mtext>Fully&#x00A0;Connected</mml:mtext></mml:mstyle><mml:mo>&#x2192;</mml:mo><mml:mstyle class="text"><mml:mtext>O</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>u</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>t</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>p</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>u</mml:mtext></mml:mstyle><mml:mstyle class="text"><mml:mtext>t</mml:mtext></mml:mstyle></mml:math></alternatives></inline-formula>, where the weights and bias parameters are <inline-formula id="ieqn-4"><alternatives><inline-graphic xlink:href="ieqn-4.png"/><tex-math id="tex-ieqn-4"><![CDATA[$\mathrm{w}_{1}= \left(10,  1,  3,  3\right)\text{ and }\mathrm{b}_{1}=(10,  1)$]]></tex-math><mml:math id="mml-ieqn-4"><mml:msub><mml:mrow><mml:mstyle mathvariant="normal"><mml:mi>w</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>10</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mn>3</mml:mn><mml:mo>,</mml:mo><mml:mn>3</mml:mn></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mstyle class="text"><mml:mtext>&#x00A0;and&#x00A0;</mml:mtext></mml:mstyle><mml:msub><mml:mrow><mml:mstyle mathvariant="normal"><mml:mi>b</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>10</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math></alternatives></inline-formula> for the Conv layer, <inline-formula id="ieqn-5"><alternatives><inline-graphic xlink:href="ieqn-5.png"/><tex-math id="tex-ieqn-5"><![CDATA[$\mathrm{w}_{2}= \left(1960,  128\right)\text{ and }\mathrm{b}_{2}=(1,  128)$]]></tex-math><mml:math id="mml-ieqn-5"><mml:msub><mml:mrow><mml:mstyle mathvariant="normal"><mml:mi>w</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>1960</mml:mn><mml:mo>,</mml:mo><mml:mn>128</mml:mn></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mstyle class="text"><mml:mtext>&#x00A0;and&#x00A0;</mml:mtext></mml:mstyle><mml:msub><mml:mrow><mml:mstyle mathvariant="normal"><mml:mi>b</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mn>128</mml:mn></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math></alternatives></inline-formula> for the fully connected layer, and <inline-formula id="ieqn-6"><alternatives><inline-graphic xlink:href="ieqn-6.png"/><tex-math id="tex-ieqn-6"><![CDATA[$\mathrm{w}_{2}= \left(128,  10\right)\text{ and }\mathrm{b}_{2}=(1,  10)$]]></tex-math><mml:math id="mml-ieqn-6"><mml:msub><mml:mrow><mml:mstyle mathvariant="normal"><mml:mi>w</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>128</mml:mn><mml:mo>,</mml:mo><mml:mn>10</mml:mn></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mstyle class="text"><mml:mtext>&#x00A0;and&#x00A0;</mml:mtext></mml:mstyle><mml:msub><mml:mrow><mml:mstyle mathvariant="normal"><mml:mi>b</mml:mi></mml:mstyle></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mn>10</mml:mn></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math></alternatives></inline-formula> for the output layer. We set other training parameters (i.e., learning rate, no. of epochs, number of iterations) as 0.5, 1, and 1500, respectively.</p>
<p>The specifications of the PC used for the experimental setup are as follows: CPU i7-8700 3.2 GHz, memory 8 GB, and graphic card RTX 2060 4 GB. Several standard measures, such as precision, recall, area under the receiver operating characteristic curve (AUC), and F1-score were used.</p>
<sec id="s4_1">
<label>4.1</label>
<title>Effect of Number of Instances</title>
<p>To achieve an objective comparison of the proposed algorithm against existing conventional algorithms, the precision, recall, AUC, and F1-score [<xref ref-type="bibr" rid="ref-43">43</xref>&#x2013;<xref ref-type="bibr" rid="ref-45">45</xref>] were investigated for different number of instances. When using the same data (sampling type: 10-fold cross-validation, target class: average over classes), the CNN exhibited the best performance in all items for all the number of instances, as presented in <xref ref-type="table" rid="table-5">Tab. 5</xref>.</p>
<table-wrap id="table-5">
<label>Table 5</label>
<caption>
<title>Performance of machine learning methods for different number of instances</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Number of instances</th>
<th>Method</th>
<th>AUC</th>
<th>F1</th>
<th>Precision</th>
<th>Recall</th>
</tr>
</thead>
<tbody>
<tr>
<td><italic>N</italic> = 10,000</td>
<td><bold>M-IDM</bold></td>
<td><bold>0.965</bold></td>
<td><bold>0.890</bold></td>
<td><bold>0.911</bold></td>
<td><bold>0.844</bold></td>
</tr>
<tr>
<td/>
<td>Na&#x00EF;ve Bayes</td>
<td>0.957</td>
<td>0.881</td>
<td>0.906</td>
<td>0.886</td>
</tr>
<tr>
<td/>
<td>Logistic regression</td>
<td>0.947</td>
<td>0.871</td>
<td>0.900</td>
<td>0.875</td>
</tr>
<tr>
<td><italic>N</italic> = 50,000</td>
<td><bold>M-IDM</bold></td>
<td><bold>0.967</bold></td>
<td><bold>0.937</bold></td>
<td><bold>0.947</bold></td>
<td><bold>0.946</bold></td>
</tr>
<tr>
<td/>
<td>Na&#x00EF;ve Bayes</td>
<td>0.957</td>
<td>0.863</td>
<td>0.940</td>
<td>0.815</td>
</tr>
<tr>
<td/>
<td>Logistic regression</td>
<td>0.929</td>
<td>0.865</td>
<td>0.894</td>
<td>0.901</td>
</tr>
<tr>
<td><italic>N</italic> = 100,000</td>
<td><bold>M-IDM</bold></td>
<td><bold>0.967</bold></td>
<td><bold>0.937</bold></td>
<td><bold>0.947</bold></td>
<td><bold>0.946</bold></td>
</tr>
<tr>
<td/>
<td>Na&#x00EF;ve Bayes</td>
<td>0.957</td>
<td>0.869</td>
<td>0.939</td>
<td>0.827</td>
</tr>
<tr>
<td/>
<td>Logistic regression</td>
<td>0.932</td>
<td>0.897</td>
<td>0.915</td>
<td>0.923</td>
</tr>
</tbody>
</table><table-wrap-foot><p><sup>*</sup>Constraints.</p></table-wrap-foot></table-wrap>
<list list-type="simple">
<list-item><label>&#x2014;</label><p>M-IDM (activation: ReLu, hidden layer: 100, maximal number of iterations: 200, regulation <inline-formula id="ieqn-7"><alternatives><inline-graphic xlink:href="ieqn-7.png"/><tex-math id="tex-ieqn-7"><![CDATA[$\alpha$]]></tex-math><mml:math id="mml-ieqn-7"><mml:mi>&#x03B1;</mml:mi></mml:math></alternatives></inline-formula>: 0.0001, optimizer: Adam)</p></list-item>
<list-item><label>&#x2014;</label><p>Logistic regression (regulation type: ridge, strength: <inline-formula id="ieqn-8"><alternatives><inline-graphic xlink:href="ieqn-8.png"/><tex-math id="tex-ieqn-8"><![CDATA[$\mathrm{C}= 1$]]></tex-math><mml:math id="mml-ieqn-8"><mml:mstyle mathvariant="normal"><mml:mi>C</mml:mi></mml:mstyle><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></alternatives></inline-formula>)</p></list-item>
<list-item><label>&#x2014;</label><p>SVM (cost: 1.0, regression loss epsilon: 0.1, iteration limit: 100).</p></list-item>
</list>
<p>Excluding the SVM in which the precision was significantly reduced, the na&#x00EF;ve Bayes and logistic regression approaches (<inline-formula id="ieqn-9"><alternatives><inline-graphic xlink:href="ieqn-9.png"/><tex-math id="tex-ieqn-9"><![CDATA[$\mathrm{N}= 100$]]></tex-math><mml:math id="mml-ieqn-9"><mml:mstyle mathvariant="normal"><mml:mi>N</mml:mi></mml:mstyle><mml:mo>=</mml:mo><mml:mn>100</mml:mn></mml:math></alternatives></inline-formula>,000) both yielded AUCs of at least 0.932. <xref ref-type="table" rid="table-6">Tab. 6</xref> presents the detailed classification results for each class. The prediction rate for each class of the M-IDM, which exhibited the best performance (<inline-formula id="ieqn-10"><alternatives><inline-graphic xlink:href="ieqn-10.png"/><tex-math id="tex-ieqn-10"><![CDATA[$\mathrm{N}= 100$]]></tex-math><mml:math id="mml-ieqn-10"><mml:mstyle mathvariant="normal"><mml:mi>N</mml:mi></mml:mstyle><mml:mo>=</mml:mo><mml:mn>100</mml:mn></mml:math></alternatives></inline-formula>,000), was confirmed using a confusion matrix. According to the results, the &#x201C;major&#x201D; class had a relatively low prediction rate of 87.7% compared with the other classes (with prediction rates of 94.3%&#x2013;98.8%).</p>
<table-wrap id="table-6">
<label>Table 6</label>
<caption>
<title>Proportions of predictions obtained using M-IDM</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Type of classes</th>
<th colspan="4">Predicted</th>
</tr>
<tr>
<th>
</th>
<th>Critical (%)</th>
<th>Informal (%)</th>
<th>Major (%)</th>
<th>Minor (%)</th>
</tr>
</thead>
<tbody>
<tr>
<td>Actual</td>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td>Critical</td>
<td>98.8</td>
<td>2.4</td>
<td>0.0</td>
<td>1.4</td>
</tr>
<tr>
<td>Informal</td>
<td>0.3</td>
<td>94.3</td>
<td>5.3</td>
<td>1.2</td>
</tr>
<tr>
<td>Major</td>
<td>0.2</td>
<td>0.3</td>
<td>87.7</td>
<td>0.1</td>
</tr>
<tr>
<td>Minor</td>
<td>0.6</td>
<td>3.0</td>
<td>7.0</td>
<td>97.4</td>
</tr>
</tbody>
</table>
</table-wrap>
<p><xref ref-type="fig" rid="fig-4">Figs. 4</xref> and <xref ref-type="fig" rid="fig-5">5</xref> also show the AUC for each machine learning method for the same data (<inline-formula id="ieqn-11"><alternatives><inline-graphic xlink:href="ieqn-11.png"/><tex-math id="tex-ieqn-11"><![CDATA[$\mathrm{N}=100$]]></tex-math><mml:math id="mml-ieqn-11"><mml:mstyle mathvariant="normal"><mml:mi>N</mml:mi></mml:mstyle><mml:mo>=</mml:mo><mml:mn>100</mml:mn></mml:math></alternatives></inline-formula>,000). Compared with other ML methods, the proposed method showed better performance for each class. In particular, the SVM showed a large deviation of 0.5 (more or less) for each label. The threshold for each method and label was set to 0.5.</p>
<fig id="fig-4">
<label>Figure 4</label>
<caption>
<title>Machine learning method-specific AUC for four labels (normal, critical)</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="fig-4.png"/>
</fig>
<fig id="fig-5">
<label>Figure 5</label>
<caption>
<title>Machine learning method-specific AUC for four labels (major, minor) 
 </title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="fig-5.png"/>
</fig>
</sec>
<sec id="s4_2">
<label>4.2</label>
<title>Impact of Class</title>
<p>This section describes the effect of the number of labels on the prediction. The same data and conditions were used in these tests as those used for the M-IDM algorithm (<inline-formula id="ieqn-12"><alternatives><inline-graphic xlink:href="ieqn-12.png"/><tex-math id="tex-ieqn-12"><![CDATA[$\mathrm{N}= 100{, }000$]]></tex-math><mml:math id="mml-ieqn-12"><mml:mstyle mathvariant="normal"><mml:mi>N</mml:mi></mml:mstyle><mml:mo>=</mml:mo><mml:mn>100</mml:mn><mml:mo>,</mml:mo><mml:mn>000</mml:mn></mml:math></alternatives></inline-formula>), which exhibited the best performance, as presented in Section 4.1. The accuracy for each label was confirmed as the number of labels was increased from two to four. <xref ref-type="fig" rid="fig-6">Fig. 6</xref> shows the accuracy of each algorithm in terms of predicting a certain label based on the number of classes.</p>
<fig id="fig-6">
<label>Figure 6</label>
<caption>
<title>Prediction rate by label</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="fig-6.png"/>
</fig>
<p>The following rates were observed: Anomaly 99.3% and normal 94.4% at two classes; critical 93.5%, <inline-formula id="ieqn-13"><alternatives><inline-graphic xlink:href="ieqn-13.png"/><tex-math id="tex-ieqn-13"><![CDATA[$\mathrm{major}+ \mathrm{minor}$]]></tex-math><mml:math id="mml-ieqn-13"><mml:mstyle mathvariant="normal"><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:mi>j</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi></mml:mstyle><mml:mo>+</mml:mo><mml:mstyle mathvariant="normal"><mml:mi>m</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi></mml:mstyle></mml:math></alternatives></inline-formula> 85.3%, and informal 95.9% at three classes; and critical 98.6%, major 87.7%, minor 97.7%, and informal 94.3% at four classes.</p>
<p>All the algorithms showed good accuracy of 85.3%&#x2013;99.3%. At four classes, the accuracy by class ranged from 87.7% to 98.6%, where &#x201C;major&#x201D; had a relatively low accuracy of 87.7% compared with the other classes.</p>
</sec>
<sec id="s4_3">
<label>4.3</label>
<title>Analysis of M-IDM</title>
<p>We compared the findings of this study with those obtained in existing studies based on various aspects. <xref ref-type="table" rid="table-7">Tab. 7</xref> summarizes the result of the comparison based on 10 aspects of methodology, number of feature/record/class/hidden layers, minimum/maximum AUC, verification, data source, number of device types, and detection range. In <xref ref-type="table" rid="table-7">Tab. 7</xref>, &#x201C;Methodology&#x201D; indicates the main method used in this study, and &#x201C;Number of feature/record/class/hidden layers&#x201D; indicates basic information of data or learning model. Furthermore, &#x201C;Min/Max AUC&#x201D; denotes the method used to perform model learning. &#x201C;Validation&#x201D; is a job confirming that the result of a learning model have enough fidelity. &#x201C;Data source&#x201D; is the environment from which the data was extracted, and &#x201C;Number of device types&#x201D; is the number of devices used to generate training data. &#x201C;Detection range&#x201D; indicates the range of detection from sensor to server.</p>
<table-wrap id="table-7">
<label>Table 7</label>
<caption>
<title>Method comparison</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Type</th>
<th>Hady et al. [<xref ref-type="bibr" rid="ref-34">34</xref>]</th>
<th>Gao et al. [<xref ref-type="bibr" rid="ref-35">35</xref>]</th>
<th>Alrashdi et al. [<xref ref-type="bibr" rid="ref-36">36</xref>]</th>
<th colspan="2">This work</th>
</tr>
</thead>
<tbody>
<tr>
<td>Methodology</td>
<td>NN</td>
<td>Decision tree</td>
<td>Random forest</td>
<td colspan="2">NN</td>
</tr>
<tr>
<td>Number of</td>
<td colspan="5"/>
</tr>
<tr>
<td>Features</td>
<td>34</td>
<td>7</td>
<td>12</td>
<td colspan="2">10</td>
</tr>
<tr>
<td>Records</td>
<td>16,000</td>
<td>7,000</td>
<td>257,673</td>
<td colspan="2">100,000</td>
</tr>
<tr>
<td>Classes</td>
<td>2</td>
<td>2</td>
<td>2</td>
<td>2</td>
<td>4</td>
</tr>
<tr>
<td>Hidden layer</td>
<td>100</td>
<td>&#x2013;</td>
<td>&#x2013;</td>
<td>100</td>
<td>100</td>
</tr>
<tr>
<td>Min/max AUC</td>
<td>91.45&#x2013;93.42</td>
<td>87.7&#x2013;90.37</td>
<td>98</td>
<td>94.3&#x2013;99.4</td>
<td>87.7&#x2013;98.6</td>
</tr>
<tr>
<td>Validation</td>
<td>10-fold</td>
<td>&#x2013;</td>
<td>&#x2013;</td>
<td colspan="2">10-fold</td>
</tr>
<tr>
<td>Data source</td>
<td>Testbed data</td>
<td>Testbed data</td>
<td>UNSW-NB15</td>
<td colspan="2">Real-world data</td>
</tr>
<tr>
<td>Number of device types</td>
<td>1</td>
<td>1</td>
<td>&#x2013;</td>
<td colspan="2">6</td>
</tr>
<tr>
<td>Detection range (sensor&#x2013;gateway&#x2013;server)</td>
<td>Gateway&#x2013;server</td>
<td>Gateway&#x2013;server</td>
<td>Gateway&#x2013;server</td>
<td colspan="2">Sensor&#x2013;gateway (edge node)</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In existing studies, binary classification is mainly used and only simple classification is possible. Moreover, because the number of devices used for data acquisition and generation is from a testbed, it is difficult to reflect the characteristics that occur in a mixed environment of heterogeneous devices. However, this study classifies various classes while considering the constraints of the IoT environment by acquiring traffic logs that multiple actual IoT medical devices communicate with and learning from the data an environment in which heterogeneous IoT medical devices are mixed.</p>
</sec>
<sec id="s4_4">
<label>4.4</label>
<title>Computational Complexity</title>
<p>We evaluated the complexity across the proposed model. As shown in <xref ref-type="fig" rid="fig-7">Fig. 7</xref>, we observed the average of the calculation resources (CPU and memory) for each data size. As the data volume increased, more average calculation resources were required. Additionally, the ratio of data growth and computational resource use was compared. When the volume of data doubled, the average computational resource usage increased by up to a factor of 1.3, and when the data volume increased by five times, it increased by up to a factor of 1.8. Therefore, it can be observed that the computational overhead of the proposed model is not a significant problem.</p>
<fig id="fig-7">
<label>Figure 7</label>
<caption>
<title>Computational complexity of the proposed model</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="fig-7.png"/>
</fig>
</sec>
</sec>
<sec id="s5">
<label>5</label>
<title>Conclusion</title>
<p>In this study, we proposed a multi-class security event classification model based on machine learning. The proposed model was built using real-world data and neural network-based multi-class intrusion classification algorithm for four classes. This work sufficiently reflects the complex network flow and characteristics of a real healthcare IoT environment, and machine learning technology was applied using data from real devices to classify network events into four different classes. In future work, more meaningful features should be found in security event data before refining to enhance the performance of the proposed approach, and methods should be developed to improve the somewhat low accuracy for rare classes to address the problem of data imbalance between the classes.</p></sec>
</body>
<back>
<fn-group><fn fn-type="other"><p><bold>Funding Statement:</bold> This research was supported by a grant of the Korea Health Technology R&#x0026;D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health &#x0026; Welfare, Republic of Korea (Grant No. HI19C0839).</p></fn>
<fn fn-type="conflict"><p><bold>Conflicts of Interest:</bold> The authors declare that there is no conflict of interests to report regarding the present study.</p></fn></fn-group>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>[1]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J. H.</given-names> <surname>Park</surname></string-name>, <string-name><given-names>M. M.</given-names> <surname>Salim</surname></string-name>, <string-name><given-names>J. H.</given-names> <surname>Jo</surname></string-name>, <string-name><given-names>J. C. S.</given-names> <surname>Sicato</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Rathore</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Ciot-net: A scalable cognitive iot based smart city network architecture</article-title>,&#x201D; <source>Human-centric Computing and Information Sciences</source>, vol. <volume>9</volume>, no. <issue>29</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>20</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-2"><label>[2]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Sengan</surname></string-name>, <string-name><given-names>V.</given-names> <surname>Subramaniyaswamy</surname></string-name>, <string-name><given-names>S. K.</given-names> <surname>Nair</surname></string-name>, <string-name><given-names>V.</given-names> <surname>Indragandhi</surname></string-name>, <string-name><given-names>J.</given-names> <surname> Manikandan</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Enhancing cyber-physical systems with hybrid smart city cyber security architecture for secure public data-smart network</article-title>,&#x201D; <source>Future Generation Computer Systems</source>, vol. <volume>112</volume>, pp. <fpage>724</fpage>&#x2013;<lpage>737</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-3"><label>[3]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D. T.</given-names> <surname>Ramotsoela</surname></string-name>, <string-name><given-names>G. P.</given-names> <surname>Hancke</surname></string-name> and <string-name><given-names>A. M.</given-names> <surname>Abu-Mahfouz</surname></string-name></person-group>, &#x201C;<article-title>Attack detection in water distribution systems using machine learning</article-title>,&#x201D; <source>Human-Centric Computing and Information Sciences</source>, vol. <volume>9</volume>, no. <issue>13</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>22</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-4"><label>[4]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A.</given-names> <surname>Nieto</surname></string-name> and <string-name><given-names>R.</given-names> <surname>Rios</surname></string-name></person-group>, &#x201C;<article-title>Cybersecurity profiles based on human-centric IoT devices</article-title>,&#x201D; <source>Human-Centric Computing and Information Sciences</source>, vol. <volume>9</volume>, no. <issue>39</issue>, pp. <fpage>1</fpage>&#x2013; <lpage>23</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-5"><label>[5]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y.</given-names> <surname>Sun</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Yuan</surname></string-name>, <string-name><given-names>Q.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>E.</given-names> <surname>Li</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Research on the signal reconstruction of the phased array structural health monitoring based using the basis pursuit algorithm</article-title>,&#x201D; <source>Computers, Materials &#x0026; Continua</source>, vol. <volume>58</volume>, no. <issue>2</issue>, pp. <fpage>409</fpage>&#x2013;<lpage>420</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-6"><label>[6]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Liu</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Chen</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Zou</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Cheng</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>A robust zero-watermarking based on sift-dct for medical images in the encrypted domain</article-title>,&#x201D; <source>Computers, Materials &#x0026; Continua</source>, vol. <volume>61</volume>, no. <issue>1</issue>, pp. <fpage>363</fpage>&#x2013;<lpage>378</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-7"><label>[7]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M.</given-names> <surname>Zou</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Hu</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Wu</surname></string-name>, <string-name><given-names>J.</given-names> <surname>He</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Rigid medical image registration using learning-based interest points and features</article-title>,&#x201D; <source>Computers, Materials &#x0026; Continua</source>, vol. <volume>60</volume>, no. <issue>2</issue>, pp. <fpage>511</fpage>&#x2013;<lpage>525</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-8"><label>[8]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>K. M.</given-names> <surname>Awan</surname></string-name>, <string-name><given-names>N.</given-names> <surname>Ashraf</surname></string-name>, <string-name><given-names>M. Q.</given-names> <surname>Saleem</surname></string-name>, <string-name><given-names>O. E.</given-names> <surname>Sheta</surname></string-name>, <string-name><given-names>K. N.</given-names> <surname> Qureshi</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>A priority-based congestion-avoidance routing protocol using IoT-based heterogeneous medical sensors for energy efficiency in healthcare wireless body area networks</article-title>,&#x201D; <source>International Journal of Distributed Sensor Networks</source>, vol. <volume>15</volume>, no. <issue>6</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>16</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-9"><label>[9]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M. A.</given-names> <surname>Habib</surname></string-name>, <string-name><given-names>C. N.</given-names> <surname>Faisal</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Sarwar</surname></string-name>, <string-name><given-names>M. A.</given-names> <surname>Latif</surname></string-name>, <string-name><given-names>F.</given-names> <surname>Aadil</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Privacy-based medical data protection against internal security threats in heterogeneous internet of medical things</article-title>,&#x201D; <source>International Journal of Distributed Sensor Networks</source>, vol. <volume>15</volume>, no. <issue>9</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>12</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-10"><label>[10]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>C.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>F.</given-names> <surname>Chen</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Yang</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Zhu</surname></string-name>, <string-name><given-names>Z.</given-names> <surname>Zhang</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Heterogeneous component model for architecture of community medical internet of things</article-title>,&#x201D; <source>Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering</source>, vol. <volume>202</volume>, pp. <fpage>140</fpage>&#x2013;<lpage>151</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-11"><label>[11]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>X.</given-names> <surname>Cheng</surname></string-name>, <string-name><given-names>Z.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>F.</given-names> <surname>Chen</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Zhao</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Wang</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Secure identity authentication of community medical Internet of Things</article-title>,&#x201D; <source>IEEE Access</source>, vol. <volume>7</volume>, pp. <fpage>115966</fpage>&#x2013;<lpage>115977</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-12"><label>[12]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Rathore</surname></string-name> and <string-name><given-names>J. H.</given-names> <surname>Park</surname></string-name></person-group>, &#x201C;<article-title>Semi-supervised learning based distributed attack detection framework for IoT</article-title>,&#x201D; <source>Applied Soft Computing</source>, vol. <volume>72</volume>, pp. <fpage>79</fpage>&#x2013;<lpage>89</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-13"><label>[13]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y. H.</given-names> <surname>Lee</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Ahn</surname></string-name>, <string-name><given-names>H. B.</given-names> <surname>Ahn</surname></string-name> and <string-name><given-names>S. Y.</given-names> <surname>Lee</surname></string-name></person-group>, &#x201C;<article-title>Visual object detection and tracking using analytical learning approach of validity level</article-title>,&#x201D; <source>Intelligent Automation and Soft Computing</source>, vol. <volume>25</volume>, no. <issue>1</issue>, pp. <fpage>205</fpage>&#x2013;<lpage>215</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-14"><label>[14]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Rathore</surname></string-name>, <string-name><given-names>B. W.</given-names> <surname>Kwon</surname></string-name> and <string-name><given-names>J. H.</given-names> <surname>Park</surname></string-name></person-group>, &#x201C;<article-title>Blockseciotnet: Blockchain-based decentralized security architecture for IoT network</article-title>,&#x201D; <source>Journal of Network and Computer Applications</source>, vol. <volume>143</volume>, pp. <fpage>167</fpage>&#x2013;<lpage>177</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-15"><label>[15]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>H. C.</given-names> <surname>Chen</surname></string-name> and <string-name><given-names>S. S.</given-names> <surname>Kuo</surname></string-name></person-group>, &#x201C;<article-title>Active detecting ddos attack approach based on entropy measurement for the next generation instant messaging app on smartphones</article-title>,&#x201D; <source>Intelligent Automation and Soft Computing</source>, vol. <volume>25</volume>, no. <issue>1</issue>, pp. <fpage>217</fpage>&#x2013;<lpage>228</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-16"><label>[16]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>N. Y.</given-names> <surname>Kim</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Rathore</surname></string-name>, <string-name><given-names>J. H.</given-names> <surname>Ryu</surname></string-name>, <string-name><given-names>J. H.</given-names> <surname>Park</surname></string-name> and <string-name><given-names>J. H.</given-names> <surname>Park</surname></string-name></person-group>, &#x201C;<article-title>A survey on cyber physical system security for IoT: Issues, challenges, threats, solutions</article-title>,&#x201D; <source>Journal of Information Processing Systems</source>, vol. <volume>14</volume>, no. <issue>6</issue>, pp. <fpage>1361</fpage>&#x2013;<lpage>1384</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-17"><label>[17]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M.</given-names> <surname>Liu</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Cheng</surname></string-name>, <string-name><given-names>K.</given-names> <surname>Qian</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>J.</given-names> <surname> Wang</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Indoor acoustic localization: A survey</article-title>,&#x201D; <source>Human-Centric Computing and Information Sciences</source>, vol. <volume>10</volume>, no. <issue>2</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>24</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-18"><label>[18]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Gu</surname></string-name>, <string-name><given-names>W.</given-names> <surname>Liu</surname></string-name>, <string-name><given-names>A. K.</given-names> <surname>Sangaiah</surname></string-name> and <string-name><given-names>H. J.</given-names> <surname>Kim</surname></string-name></person-group>, &#x201C;<article-title>An empower hamilton loop based data collection algorithm with mobile agent for WSNs</article-title>,&#x201D; <source>Human-Centric Computing and Information Sciences</source>, vol. <volume>9</volume>, no. <issue>1</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>14</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-19"><label>[19]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Rathore</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Pan</surname></string-name> and <string-name><given-names>J. H.</given-names> <surname>Park</surname></string-name></person-group>, &#x201C;<article-title>Blockdeepnet: A blockchain-based secure deep learning for IoT network</article-title>,&#x201D; <source>Sustainability</source>, vol. <volume>11</volume>, no. <issue>14</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>15</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-20"><label>[20]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A. L.</given-names> <surname>Buczak</surname></string-name> and <string-name><given-names>E.</given-names> <surname>Guven</surname></string-name></person-group>, &#x201C;<article-title>A survey of data mining and machine learning methods for cyber security intrusion detection</article-title>,&#x201D; <source>IEEE Communications Surveys &#x0026; Tutorials</source>, vol. <volume>18</volume>, no. <issue>2</issue>, pp. <fpage>1153</fpage>&#x2013;<lpage>1176</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-21"><label>[21]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>U.</given-names> <surname>Tariq</surname></string-name></person-group>, &#x201C;<article-title>Intrusion detection and anticipation system (idas) for ieee 802.15.4 devices</article-title>,&#x201D; <source>Intelligent Automation and Soft Computing</source>, vol. <volume>25</volume>, no. <issue>2</issue>, pp. <fpage>231</fpage>&#x2013;<lpage>242</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-22"><label>[22]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>M.</given-names> <surname>Fredrikson</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Somesh</surname></string-name> and <string-name><given-names>R.</given-names> <surname>Thomas</surname></string-name></person-group>, &#x201C;<article-title>Model inversion attacks that exploit confidence information and basic countermeasures</article-title>,&#x201D; in <conf-name>Proc. SIGSAC</conf-name>, Denver, Colorado, USA, pp. <fpage>1322</fpage>&#x2013;<lpage>1333</lpage>, <year>2015</year>.</mixed-citation></ref>
<ref id="ref-23"><label>[23]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>E.</given-names> <surname>Kabir</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Hu</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Wang</surname></string-name> and <string-name><given-names>G.</given-names> <surname>Zhuo</surname></string-name></person-group>, &#x201C;<article-title>A novel statistical technique for intrusion detection systems</article-title>,&#x201D; <source>Future Generation Computer Systems</source>, vol. <volume>79</volume>, pp. <fpage>303</fpage>&#x2013;<lpage>318</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-24"><label>[24]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>H.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>G.</given-names> <surname>Jie.</surname></string-name> and <string-name><given-names>W.</given-names> <surname>Shanshan</surname></string-name></person-group>, &#x201C;<article-title>An effective intrusion detection framework based on SVM with feature augmentation</article-title>,&#x201D; <source>Knowledge-Based Systems</source>, vol. <volume>136</volume>, pp. <fpage>130</fpage>&#x2013;<lpage>139</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-25"><label>[25]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>N.</given-names> <surname>Farnaaz</surname></string-name> and <string-name><given-names>M. A.</given-names> <surname>Jabbar</surname></string-name></person-group>, &#x201C;<article-title>Random forest modeling for network intrusion detection system</article-title>,&#x201D; <source>Computer Science</source>, vol. <volume>89</volume>, no. <issue>1</issue>, pp. <fpage>213</fpage>&#x2013;<lpage>217</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-26"><label>[26]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M.</given-names> <surname>Swarnkar</surname></string-name> and <string-name><given-names>N.</given-names> <surname>Hubballi</surname></string-name></person-group>, &#x201C;<article-title>OCPAD: One class naive bayes classifier for payload based anomaly detection</article-title>,&#x201D; <source>Expert Systems with Applications</source>, vol. <volume>64</volume>, pp. <fpage>330</fpage>&#x2013;<lpage>339</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-27"><label>[27]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>L.</given-names> <surname>Yang</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Ding</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Wu</surname></string-name> and <string-name><given-names>K.</given-names> <surname>Wang</surname></string-name></person-group>, &#x201C;<article-title>Robust detection of false data injection attacks for data aggregation in an internet of things-based environmental surveillance</article-title>,&#x201D; <source>Computer Networks</source>, vol. <volume>129</volume>, pp. <fpage>410</fpage>&#x2013;<lpage>428</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-28"><label>[28]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>C.</given-names> <surname>Khammassi</surname></string-name> and <string-name><given-names>S.</given-names> <surname>Krichen</surname></string-name></person-group>, &#x201C;<article-title>A GA-LR wrapper approach for feature selection in network intrusion detection</article-title>,&#x201D; <source>Computers &#x0026; Security</source>, vol. <volume>70</volume>, pp. <fpage>255</fpage>&#x2013;<lpage>277</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-29"><label>[29]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>G.</given-names> <surname>Caminero</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Lopez-Martin</surname></string-name> and <string-name><given-names>B.</given-names> <surname>Carro</surname></string-name></person-group>, &#x201C;<article-title>Adversarial environment reinforcement learning algorithm for intrusion detection</article-title>,&#x201D; <source>Computer Networks</source>, vol. <volume>159</volume>, pp. <fpage>96</fpage>&#x2013;<lpage>109</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-30"><label>[30]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Liu</surname></string-name>, <string-name><given-names>J.</given-names> <surname>He</surname></string-name>, <string-name><given-names>W.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Ma</surname></string-name>, <string-name><given-names>Z.</given-names> <surname>Tang</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Anid-SEoKELM: Adaptive network intrusion detection based on selective ensemble of kernel elms with random features</article-title>,&#x201D; <source>Knowledge-Based Systems</source>, vol. <volume>177</volume>, pp. <fpage>104</fpage>&#x2013;<lpage>116</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-31"><label>[31]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>F.</given-names> <surname>Salo</surname></string-name>, <string-name><given-names>A. B.</given-names> <surname>Nassif</surname></string-name> and <string-name><given-names>A.</given-names> <surname>Essex</surname></string-name></person-group>, &#x201C;<article-title>Dimensionality reduction with IG-PCA and ensemble classifier for network intrusion detection</article-title>,&#x201D; <source>Computer Networks</source>, vol. <volume>148</volume>, pp. <fpage>164</fpage>&#x2013;<lpage>175</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-32"><label>[32]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>T. H.</given-names> <surname>Divyasree</surname></string-name> and <string-name><given-names>K. K.</given-names> <surname>Sherly</surname></string-name></person-group>, &#x201C;<article-title>A network intrusion detection system based on ensemble CVM using efficient feature selection approach</article-title>,&#x201D; <source>Computer Science</source>, vol. <volume>143</volume>, pp. <fpage>442</fpage>&#x2013;<lpage>449</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-33"><label>[33]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>O. Y.</given-names> <surname>Al-Jarrah</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Al-Hammdi</surname></string-name>, <string-name><given-names>P. D.</given-names> <surname>Yoo</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Muhaidat</surname></string-name> and <string-name><given-names>M.</given-names> <surname>Al-Qutayri</surname></string-name></person-group>, &#x201C;<article-title>Semi-supervised multi-layered clustering model for intrusion detection</article-title>,&#x201D; <source>Digital Communications and Networks</source>, vol. <volume>4</volume>, no. <issue>4</issue>, pp. <fpage>277</fpage>&#x2013;<lpage>286</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-34"><label>[34]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A. A.</given-names> <surname>Hady</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Ghubaish</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Salman</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Unal</surname></string-name> and <string-name><given-names>R.</given-names> <surname>Jain</surname></string-name></person-group>, &#x201C;<article-title>Intrusion detection system for healthcare systems using medical and network data: A comparison study</article-title>,&#x201D; <source>IEEE Access</source>, vol. <volume>8</volume>, pp. <fpage>106576</fpage>&#x2013;<lpage>106584</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-35"><label>[35]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Gao</surname></string-name> and <string-name><given-names>G.</given-names> <surname>Thamilarasu</surname></string-name></person-group>, &#x201C;<article-title>Machine-learning classifiers for security in connected medical evices</article-title>,&#x201D; in <conf-name>Proc. ICCCN</conf-name>, Vancouver, Canada, pp. <fpage>1</fpage>&#x2013;<lpage>5</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-36"><label>[36]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>I.</given-names> <surname>Alrashdi</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Alqazzaz</surname></string-name>, <string-name><given-names>E.</given-names> <surname>Aloufi</surname></string-name>, <string-name><given-names>R.</given-names> <surname>Alharthi</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Zohdy</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Ad-iot: Anomaly detection of iot cyberattacks in smart city using machine learning</article-title>,&#x201D; in <conf-name>Proc. CCWC</conf-name>, <publisher-name>University of Nevada</publisher-name>, Las Vegas, NV, USA, pp. <fpage>305</fpage>&#x2013;<lpage>310</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-37"><label>[37]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A. F.</given-names> <surname>Oliva</surname></string-name>, <string-name><given-names>F. M.</given-names> <surname>Perez</surname></string-name>, <string-name><given-names>J. V.</given-names> <surname>Berna-Martinez</surname></string-name> and <string-name><given-names>M. A.</given-names> <surname>Ortega</surname></string-name></person-group>, &#x201C;<article-title>Non-deterministic outlier detection method based on the variable precision rough set model</article-title>,&#x201D; <source>Computer Systems Science and Engineering</source>, vol. <volume>34</volume>, no. <issue>3</issue>, pp. <fpage>131</fpage>&#x2013;<lpage>144</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-38"><label>[38]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M. B.</given-names> <surname>Nejad</surname></string-name> and <string-name><given-names>M. E.</given-names> <surname>Shiri</surname></string-name></person-group>, &#x201C;<article-title>A new enhanced learning approach to automatic image classification based on salp swarm algorithm</article-title>,&#x201D; <source>Computer Systems Science and Engineering</source>, vol. <volume>34</volume>, no. <issue>2</issue>, pp. <fpage>91</fpage>&#x2013;<lpage>100</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-39"><label>[39]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S. K.</given-names> <surname>Singh</surname></string-name>, <string-name><given-names>M. M.</given-names> <surname>Salim</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Cha</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Pan</surname></string-name> and <string-name><given-names>J. H.</given-names> <surname>Park</surname></string-name></person-group>, &#x201C;<article-title>Machine learning-based network sub-slicing framework in a sustainable 5G environment</article-title>,&#x201D; <source>Sustainability</source>, vol. <volume>12</volume>, no. <issue>15</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>22</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-40"><label>[40]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D. W.</given-names> <surname>Aha</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Kibler</surname></string-name> and <string-name><given-names>M. K.</given-names> <surname>Albert</surname></string-name></person-group>, &#x201C;<article-title>Instance-based learning algorithms</article-title>,&#x201D; <source>Machine Learning</source>, vol. <volume>6</volume>, no. <issue>1</issue>, pp. <fpage>37</fpage>&#x2013;<lpage>66</lpage>, <year>1991</year>.</mixed-citation></ref>
<ref id="ref-41"><label>[41]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><given-names>M. H.</given-names> <surname>Kutner</surname></string-name>, <string-name><given-names>C. J.</given-names> <surname>Nachtsheim</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Neter</surname></string-name> and <string-name><given-names>W.</given-names> <surname>Li</surname></string-name></person-group>, <chapter-title>Regression models for quantitative and qualitative predictors</chapter-title>. in <source>Applied Linear Statistical Models</source>, 5<inline-formula id="ieqn-14"><alternatives><inline-graphic xlink:href="ieqn-14.png"/><tex-math id="tex-ieqn-14"><![CDATA[$^{\rm th}$]]></tex-math><mml:math id="mml-ieqn-14"><mml:msup><mml:mrow></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mi>h</mml:mi></mml:mrow></mml:msup></mml:math></alternatives></inline-formula> ed. <publisher-loc>New York, USA</publisher-loc>: <publisher-name>McGraw-Hill Irwin</publisher-name>, pp. <fpage>313</fpage>&#x2013;<lpage>324</lpage>, <year>2005</year>.</mixed-citation></ref>
<ref id="ref-42"><label>[42]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>E.</given-names> <surname>Hern&#x00E1;ndez-Pereira</surname></string-name>, <string-name><given-names>J. A.</given-names> <surname>Su&#x00E1;rez-Romero</surname></string-name>, <string-name><given-names>O.</given-names> <surname>Fontenla-Romero</surname></string-name> and <string-name><given-names>A.</given-names> <surname>Alonso-Betanzos</surname></string-name></person-group>, &#x201C;<article-title>Conversion methods for symbolic features: A comparison applied to an intrusion detection problem</article-title>,&#x201D; <source>Expert Systems with Applications</source>, vol. <volume>36</volume>, no. <issue>7</issue>, pp. <fpage>10612</fpage>&#x2013;<lpage>10617</lpage>, <year>2009</year>.</mixed-citation></ref>
<ref id="ref-43"><label>[43]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>C.</given-names> <surname>Yin</surname></string-name>, <string-name><given-names>B.</given-names> <surname>Zhou</surname></string-name>, <string-name><given-names>Z.</given-names> <surname>Yin</surname></string-name> and <string-name><given-names>J.</given-names> <surname>Wang</surname></string-name></person-group>, &#x201C;<article-title>Local privacy protection classification based on human-centric computing</article-title>,&#x201D; <source>Human-Centric Computing and Information Sciences</source>, vol. <volume>9</volume>, no. <issue>1</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>14</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-44"><label>[44]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y. S.</given-names> <surname>Jeong</surname></string-name> and <string-name><given-names>J. H.</given-names> <surname>Park</surname></string-name></person-group>, &#x201C;<article-title>Learning algorithms in ai system and services</article-title>,&#x201D; <source>Journal of Information Processing Systems</source>, vol. <volume>15</volume>, no. <issue>5</issue>, pp. <fpage>1029</fpage>&#x2013;<lpage>1035</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-45"><label>[45]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>B. V.</given-names> <surname> Santhosh Krishna</surname></string-name> and <string-name><given-names>T.</given-names> <surname>Gnanasekaran</surname></string-name></person-group>, &#x201C;<article-title>Retinal vessel extraction framework using modified adaboost extreme learning machine</article-title>,&#x201D; <source>Computers, Materials &#x0026; Continua</source>, vol. <volume>60</volume>, no. <issue>3</issue>, pp. <fpage>855</fpage>&#x2013;<lpage>869</lpage>, <year>2019</year>.</mixed-citation></ref>
</ref-list>
</back>
</article>