<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">CSSE</journal-id>
<journal-id journal-id-type="nlm-ta">CSSE</journal-id>
<journal-id journal-id-type="publisher-id">CSSE</journal-id>
<journal-title-group>
<journal-title>Computer Systems Science &#x0026; Engineering</journal-title>
</journal-title-group>
<issn pub-type="ppub">0267-6192</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">23882</article-id>
<article-id pub-id-type="doi">10.32604/csse.2022.023882</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Multi-Site Air Pollutant Prediction Using Long Short Term Memory</article-title><alt-title alt-title-type="left-running-head">Multi-Site Air Pollutant Prediction using Long Short Term Memory</alt-title><alt-title alt-title-type="right-running-head">Multi-Site Air Pollutant Prediction using Long Short Term Memory</alt-title>
</title-group>
<contrib-group content-type="authors">
<contrib id="author-1" contrib-type="author" corresp="yes">
<name name-style="western"><surname>Paulpandi</surname><given-names>Chitra</given-names></name><email>chitrapaulpandi09@gmail.com</email>
</contrib>
<contrib id="author-2" contrib-type="author">
<name name-style="western"><surname>Chinnasamy</surname><given-names>Murukesh</given-names></name>
</contrib>
<contrib id="author-3" contrib-type="author">
<name name-style="western"><surname>Rajendiran</surname><given-names>Shanker Nagalingam</given-names></name>
</contrib><aff><institution>Anna University</institution>, <addr-line>Chennai, 600066, Tamil Nadu</addr-line>, <country>India</country></aff>
</contrib-group><author-notes><corresp id="cor1"><label>&#x002A;</label>Corresponding Author: Chitra Paulpandi. Email: <email>chitrapaulpandi09@gmail.com</email></corresp></author-notes>
<pub-date pub-type="epub" date-type="pub" iso-8601-date="2022-05-06"><day>06</day>
<month>05</month>
<year>2022</year></pub-date>
<volume>43</volume>
<issue>3</issue>
<fpage>1341</fpage>
<lpage>1355</lpage>
<history>
<date date-type="received"><day>25</day><month>9</month><year>2021</year></date>
<date date-type="accepted"><day>08</day><month>11</month><year>2021</year></date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2022 Paulpandi, Chinnasamy and Rajendiran</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Paulpandi, Chinnasamy and Rajendiran</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_CSSE_23882.pdf"></self-uri>
<abstract>
<p>The current pandemic highlights the significance and impact of air pollution on individuals. When it comes to climate sustainability, air pollution is a major challenge. Because of the distinctive nature, unpredictability, and great changeability in the reality of toxins and particulates, detecting air quality is a puzzling task. Simultaneously, the ability to predict or classify and monitor air quality is becoming increasingly important, particularly in urban areas, due to the well documented negative impact of air pollution on resident&#x2019;s health and the environment. To better comprehend the current condition of air quality, this research proposes predicting air pollution levels from real-time data. This study proposes the use of deep learning techniques to forecast air pollution levels. Layers, activation functions, and a number of epochs were used to create the suggested Long Short-Term Memory (LSTM) network based neural layer design. The use of proposed Deep Learning as a structure for high-accuracy air quality prediction is investigated in this research and obtained better accuracy of nearly 82% compared to earlier records. Determining the Air Quality Index (AQI) and danger levels would assist the government in finding appropriate ways to authorize approaches to reduce pollutants and keep inhabitants informed about the findings.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>LSTM</kwd>
<kwd>epochs</kwd>
<kwd>deep learning</kwd>
<kwd>air quality index</kwd>
<kwd>particulates</kwd>
<kwd>neural networks</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<label>1</label>
<title>Introduction</title>
<p>It is due of air that we are living today. Every month, we breathe roughly 1 million times without realizing the consequences of the air pollution we inhale. Over 93 percent of the world&#x2019;s population is exposed to dangerous air pollution chemicals such as Nitrogen Oxides (NOx), Carbon Oxides (COx), Sulphur Oxides (SOx), Particulate Matter (PM), Ozone (O<sub>3</sub>), and Ammonia (NH<sub>3</sub>) on a daily basis. Indoor air pollution is also much worse than outdoor pollution. Everyday products contain toxic compounds.</p>
<p>Noise, land, water, and air pollution are all major pollutants that influence humans and other living things. Among the several types of pollution, air pollution is the most serious. Natural disasters, automobiles, industries, crop fires, dust storms, man-made smokes such as burning of wood, plastics, natural gas, and coal, deforestation, population, and other factors all contribute to air pollution in India and is typically lower in summer than in the winter. Air pollution increases the risk of a variety of health problems, including arrhythmia, ischemia, heart failure, and stroke and so understanding and monitoring air pollution is critical for our well-being. The government employs the Air Quality Index (AQI) concept to forecast air pollutant levels and inform citizens.</p>
<p>AQI is a tool that displays the current state of air quality in six categories based on ambient concentration levels of air contaminants. Good, satisfactory, moderate, poor, very poor, and severe are the six classifications. An increase in the AQI level implies that there is a chance of breathing polluted air, which can have serious health consequences. The AQI is calculated using eight primary pollutants: Particulate Matter less than 2.5 microns (PM<sub>2.5</sub>), Particulate Matter less than 10 microns (PM<sub>10</sub>), Nitrogen Dioxide (NO<sub>2</sub>), Sulfur Dioxide (SO<sub>2</sub>), Carbon Monoxide (CO), O<sub>3</sub>, NH<sub>3</sub>, and Lead (Pb). &#x201C;When we have high moisture then the aerosols in the air starts to absorb water vapors and swell thereby leads to low visibility and that is how the smog are created&#x201D;, said by Sachin Ghude, Scientist, Indian Institute of Tropical Meteorology (IITM), which operates System of Air Quality Weather Forecasting and Research (SAFAR), so it is very important to forecast air pollutants for better life.</p>
<p>Many air pollutant studies involve knowledge of environmental and computer technology, which is time consuming, and many statistical methods such as multiple linear regression [<xref ref-type="bibr" rid="ref-1">1</xref>], auto regressive moving average method and generalized line regression [<xref ref-type="bibr" rid="ref-2">2</xref>] are used for air quality predictions [<xref ref-type="bibr" rid="ref-3">3</xref>]. When compared to traditional methods such as support vector machine [<xref ref-type="bibr" rid="ref-4">4</xref>] and random forest, a commonly used air pollution prediction method in environmental or atmospheric research performed better [<xref ref-type="bibr" rid="ref-5">5</xref>&#x2013;<xref ref-type="bibr" rid="ref-7">7</xref>]. In making atmospheric decisions, accurate forecasting in air quality measurement is critical [<xref ref-type="bibr" rid="ref-8">8</xref>]. Air pollutants are also highly dependent on regional and seasonal fluctuations, making it difficult to anticipate Air Quality (AQ) and necessitating simultaneous monitoring of time and space.</p>
<p>Currently, the rising technology Artificial Intelligence (AI) is being employed in air pollution prediction, with advanced artificial intelligence approaches achieving improved results. Also AI founds to be the future promising technology that serves faster with more accuracy in short span of time without human intervention. Advanced AI creates great impact in several applications and improves people&#x2019;s lives by performing most typical tasks. Deep Recurrent Neural Network (DRNN) is utilized in predicting fine PM2.5 [<xref ref-type="bibr" rid="ref-9">9</xref>]. Hybrid model spatiotemporal forecasting of PM2.5 is employed by long term prediction [<xref ref-type="bibr" rid="ref-10">10</xref>] and air pollutant concentration is predicted by combining other traditional methods [<xref ref-type="bibr" rid="ref-11">11</xref>,<xref ref-type="bibr" rid="ref-12">12</xref>]. Extraction of spatiotemporal characteristics improves the air pollution prediction model [<xref ref-type="bibr" rid="ref-13">13</xref>&#x2013;<xref ref-type="bibr" rid="ref-16">16</xref>]. Aggregated Long Short Term Memory (LSTM) is also employed for air quality prediction [<xref ref-type="bibr" rid="ref-17">17</xref>]. Some methods provide average air pollutant concentration and to overcome the issue LSTM with Recurrent Neural Network (RNN) and Wireless Sensor Network (WSN) is employed [<xref ref-type="bibr" rid="ref-18">18</xref>]. Bayesian model [<xref ref-type="bibr" rid="ref-19">19</xref>] and bi-directional LSTM model [<xref ref-type="bibr" rid="ref-20">20</xref>] also helps to predict air quality and found to be better compared to traditional methods.</p>
<p>To forecast air pollution concentrations, this research proposes a deep learning model based on LSTM. Meteorological observations are obtained from a multi-site network of monitoring stations, and missing values are rebuilt and forecast values fine-tuned to make considerable improvements. The proposed model&#x2019;s accuracy was improved in an experimental situation by using a real-time air pollution dataset. In addition, the suggested Deep Learning (DL) model provides accurate assessment of AQI when compared to existing methodologies, and a greater number of features were compared for air quality forecasts and accuracy in the proposed DL method, so the public is warned.</p>
</sec>
<sec id="s2">
<label>2</label>
<title>Methodology</title>
<p>The suggested method begins with the selection of a data gathering region from local and near stations, collection of data from National Air Quality Index (NAQI), Central Pollution Control Board (CPCB), Tamil Nadu Pollution Control Board (TNPCB) and KAGGLE followed by pre-processing of data such as data division, manipulating missing data and normalization. The pre-processed data is classified using LSTM to anticipate air pollution with pinpoint accuracy. The methodology&#x2019;s flow is depicted in <xref ref-type="fig" rid="fig-1">Fig. 1</xref>:</p>
<fig id="fig-1">
<label>Figure 1</label>
<caption>
<title>Process of methodology</title></caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CSSE_23882-fig-1.png"/>
</fig>
<sec id="s2_1">
<label>2.1</label>
<title>Selection of Area</title>
<p>The research&#x2019;s study area was gathered from Air Quality Monitoring Stations (AQMS) in the states listed in <xref ref-type="table" rid="table-1">Tab. 1</xref>. For data gathering, active stations were segregated. These study sites were chosen based on the availability of CPCB air quality data, satellite images, and the fact that the areas chosen were the most polluted, trafficked, and prone to industrial development activities. Some states have inactive AQMS and so they are identified first before selecting the sites. States having active air monitoring stations details are isolated. For the planned job, the network was trained using AQMS from various states across the country. The proposed paper focus on overall air pollution prediction of the country which can further be narrowed to particular state or city area. In our study nearly 21 states are selected for experiment.</p>
<table-wrap id="table-1"><label>Table 1</label>
<caption>
<title>List of air quality monitoring stations taken for study</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>S. NO.</th>
<th>Station ID</th>
<th>State</th>
<th>TOTAL number of AQMS</th>
<th>Active AQMS</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>AP001</td>
<td>Andhra</td>
<td>05</td>
<td>01</td>
</tr>
<tr>
<td>2</td>
<td>AS001</td>
<td>Assam</td>
<td>01</td>
<td>01</td>
</tr>
<tr>
<td>3</td>
<td>BR001</td>
<td>Bihar</td>
<td>10</td>
<td>06</td>
</tr>
<tr>
<td>4</td>
<td>CH001</td>
<td>Chandigarh</td>
<td>01</td>
<td>01</td>
</tr>
<tr>
<td>5</td>
<td>DL001</td>
<td>Delhi</td>
<td>38</td>
<td>37</td>
</tr>
<tr>
<td>6</td>
<td>GJ001</td>
<td>Gujarat</td>
<td>06</td>
<td>01</td>
</tr>
<tr>
<td>7</td>
<td>HR001</td>
<td>Haryana</td>
<td>29</td>
<td>29</td>
</tr>
<tr>
<td>8</td>
<td>JH001</td>
<td>Jharkhand</td>
<td>01</td>
<td>01</td>
</tr>
<tr>
<td>9</td>
<td>KA001</td>
<td>Karnataka</td>
<td>20</td>
<td>10</td>
</tr>
<tr>
<td>10</td>
<td>KL001</td>
<td>Kerala</td>
<td>08</td>
<td>02</td>
</tr>
<tr>
<td>11</td>
<td>MP001</td>
<td>Madhya Pradesh</td>
<td>16</td>
<td>01</td>
</tr>
<tr>
<td>12</td>
<td>MH001</td>
<td>Maharashtra</td>
<td>22</td>
<td>10</td>
</tr>
<tr>
<td>13</td>
<td>ML001</td>
<td>Meghalaya</td>
<td>01</td>
<td>01</td>
</tr>
<tr>
<td>14</td>
<td>MZ001</td>
<td>Mizoram</td>
<td>01</td>
<td>01</td>
</tr>
<tr>
<td>15</td>
<td>OD001</td>
<td>Odisha</td>
<td>02</td>
<td>02</td>
</tr>
<tr>
<td>16</td>
<td>PB001</td>
<td>Punjab</td>
<td>08</td>
<td>01</td>
</tr>
<tr>
<td>17</td>
<td>RJ001</td>
<td>Rajasthan</td>
<td>10</td>
<td>03</td>
</tr>
<tr>
<td>18</td>
<td>TN001</td>
<td>Tamil Nadu</td>
<td>05</td>
<td>05</td>
</tr>
<tr>
<td>19</td>
<td>TG001</td>
<td>Telangana</td>
<td>06</td>
<td>06</td>
</tr>
<tr>
<td>20</td>
<td>UP001</td>
<td>Uttar Pradesh</td>
<td>26</td>
<td>04</td>
</tr>
<tr>
<td>21</td>
<td>WB001</td>
<td>West Bengal</td>
<td>14</td>
<td>07</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s2_2">
<label>2.2</label>
<title>Data Collection</title>
<p>The features to be collected from the specific site are processed once the study area has been established. It is critical to comprehend the data in order to recognize the features. As a result, self-reviewing data is required, and it is assessed for all of the chosen states or cities. <xref ref-type="fig" rid="fig-2">Fig. 2</xref> shows a flow diagram of the data selection process.</p>
<fig id="fig-2">
<label>Figure 2</label>
<caption>
<title>Process of data selection</title></caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CSSE_23882-fig-2.png"/>
</fig>
<p>For the available number of daily Air Quality Index data per city, about 37000 records for each station are taken on an hourly basis for the specified study areas from 2016 to 2020. The data was collected for three seasons: summer, rainy season, and winter. Before preprocessing, data collected from the KAGGLE website is rigorously scrutinized. <xref ref-type="table" rid="table-2">Tab. 2</xref> lists the features that have been identified for the proposed work. It is vital to comprehend the government-mandated averaging monitoring hours and minimal ambient concentration of air pollution levels.</p>
<table-wrap id="table-2"><label>Table 2</label>
<caption>
<title>List of features taken for study</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th rowspan="2">S.No.</th>
<th rowspan="2">Name of the air pollutant<break/>(features)</th>
<th rowspan="2">Symbol</th>
<th rowspan="2">Unit</th>
<th colspan="2">Ambient concentration level of air pollutant</th>
<th rowspan="2">Monitoringtime</th>
</tr>
<tr>
<th>Industrial, residential, rural &#x0026; other area</th>
<th>Sensitive area</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>Particulate matter less than 2.5</td>
<td>PM<sub>2.5</sub></td>
<td>&#x00B5;g/m<sup>3</sup></td>
<td>60</td>
<td>60</td>
<td>24 h</td>
</tr>
<tr>
<td>2</td>
<td>Particulate matter less than 10</td>
<td>PM<sub>10</sub></td>
<td>&#x00B5;g/m<sup>3</sup></td>
<td>100</td>
<td>100</td>
<td>24 h</td>
</tr>
<tr>
<td>3</td>
<td>Nitrogen oxide</td>
<td>NO</td>
<td>&#x00B5;g/m<sup>3</sup></td>
<td>80</td>
<td>80</td>
<td>24 h</td>
</tr>
<tr>
<td>4</td>
<td>Nitrogen dioxide</td>
<td>NO<sub>2</sub></td>
<td>&#x00B5;g/m<sup>3</sup></td>
<td>80</td>
<td>80</td>
<td>24 h</td>
</tr>
<tr>
<td>5</td>
<td>Nitrogen oxides</td>
<td>NO<sub>x</sub></td>
<td>&#x00B5;g/m<sup>3</sup></td>
<td>80</td>
<td>80</td>
<td>24 h</td>
</tr>
<tr>
<td>6</td>
<td>Ammonia</td>
<td>NH<sub>3</sub></td>
<td>&#x00B5;g/m<sup>3</sup></td>
<td>400</td>
<td>400</td>
<td>24 h</td>
</tr>
<tr>
<td>7</td>
<td>Carbon monoxide</td>
<td>CO</td>
<td>mg/m<sup>3</sup></td>
<td>4<break/>2</td>
<td>4<break/>2</td>
<td>01 h<break/>08 h</td>
</tr>
<tr>
<td>8</td>
<td>Sulphur dioxide</td>
<td>SO<sub>2</sub></td>
<td>&#x00B5;g/m<sup>3</sup></td>
<td>80</td>
<td>80</td>
<td>24 h</td>
</tr>
<tr>
<td>9</td>
<td>Ozone</td>
<td>O<sub>3</sub></td>
<td>&#x00B5;g/m<sup>3</sup></td>
<td>180<break/>100</td>
<td>180<break/>100</td>
<td>01 h<break/>08 h</td>
</tr>
<tr>
<td>10</td>
<td>Benzene</td>
<td>C<sub>6</sub>H<sub>6</sub></td>
<td>ng/m<sup>3</sup></td>
<td>5</td>
<td>5</td>
<td>08 h</td>
</tr>
<tr>
<td>11</td>
<td>Toluene</td>
<td>C<sub>7</sub>H<sub>8</sub></td>
<td>ng/m<sup>3</sup></td>
<td>5</td>
<td>5</td>
<td>08 h</td>
</tr>
<tr>
<td>12</td>
<td>Xylene</td>
<td>C<sub>8</sub>H<sub>10</sub></td>
<td>ng/m<sup>3</sup></td>
<td>5</td>
<td>5</td>
<td>08 h</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="table-2fn1" fn-type="other">
<p>Note: &#x002A;AQI is measured by following units, 1. micrograms per cubic meter (&#x00B5;g/m<sup>3</sup>), 2. parts per million (ppm) or parts per billion (ppb), 3. microns or micrometer.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="s2_3">
<label>2.3</label>
<title>Data Preprocessing</title>
<p>Once the necessary data has been gathered, it is standardized to eliminate the effects of missing numbers. <xref ref-type="fig" rid="fig-3">Fig. 3</xref> shows the stages involved in normalizing. Missing data is critical in preprocessing and has a significant influence on its own, thus diagnosing missing values with adequate data is critical. For these reasons, unknown values other than numbers are deleted from input data before transformation for complex numbers with a special number called Not a Number (NaN).</p>
<fig id="fig-3">
<label>Figure 3</label>
<caption>
<title>Input data preprocessing</title></caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CSSE_23882-fig-3.png"/>
</fig>
</sec>
<sec id="s2_4">
<label>2.4</label>
<title>Feature Classification</title>
<p>For training and testing purposes, we divided the input data into two portions. Nearly 70% of the 37000 records gathered are used for training, and 30% are used for testing. Ground truth parameters are collected during training, and the network is trained using the Stochastic Gradient Descent with Momentum (SGDM) optimizer. In comparison to other current algorithms, this best approach finds the model parameters that best fit the expected and actual outputs, calculates faster, and converges better with longer training time. Before training LSTM categorization, soft max is employed for activation layer during input data testing.</p>
</sec>
<sec id="s2_5">
<label>2.5</label>
<title>Forecasting Air Pollution and Analysis</title>
<p>Finally, the survey data is analyzed using methods from the Statistical Package for Social Sciences (SPSS). This SPSS software suite was used to conduct a detailed analysis of the data collected. The measurements done often includes mean, median, Standard Deviation (SD), Mean Absolute Percentage Error (MAPE), Mean Square Error (MSE), Mean Absolute Error (MAE), Root Mean Squared Error (RMSE) and Mean Squared Error (MSE) helps to predict the performance level of classifier which enables to conclude AQI.</p>
</sec>
</sec>
<sec id="s3">
<label>3</label>
<title>AQI Prediction Model Based on LSTM</title>
<p>Internal memory is used by the basic RNN to process the future variable sequence of inputs. <xref ref-type="fig" rid="fig-4">Fig. 4</xref> depicts the basic architecture of a basic RNN. Because the original RNN in our proposed model for training the dataset may not perform well for long-term reliance because it includes simple tanh in every repeating module, we employ LSTM, which is an expanded version of RNN, to overcome this issue.</p>
<fig id="fig-4">
<label>Figure 4</label>
<caption>
<title>Basic RNN architecture</title></caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CSSE_23882-fig-4.png"/>
</fig>
<sec id="s3_1">
<label>3.1</label>
<title>LSTM Networks</title>
<p>In comparison to simple RNN, the LSTM network is capable of performing long-term dependencies, which was first described by Hochreiter and Schmidhuber in 1997. It allows avoiding the long-term reliance problem. The core idea behind LSTM is as follows:</p>
<p>The key feature that goes horizontally through the diagram at the top is cell state. It&#x2019;s similar to a conveyor belt, but with a few more interactions. This cell state can be added or withdrawn based on the information and is regulated accordingly using a three-gate structure. As shown in <xref ref-type="fig" rid="fig-5">Fig. 5</xref> this regulation consists of a <inline-formula id="ieqn-1">
<mml:math id="mml-ieqn-1"><mml:mrow><mml:mo>(</mml:mo><mml:mi>&#x03C3;</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mspace width="thickmathspace" /></mml:math>
</inline-formula>sigmoid neural net layer and a (x) point-wise multiplication operation. The main purpose of this sigmoid layer is to output values that are either zero (to signal &#x201C;allow nothing through&#x201D;) or one (to indicate &#x201C;let everything through&#x201D;).</p>
<fig id="fig-5">
<label>Figure 5</label>
<caption>
<title>LSTM concept</title></caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CSSE_23882-fig-5.png"/>
</fig>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>LSTM Step by Step Process</title>
<p>Initially, the input data, as well as the input data concentration sequence before and after transformation, are defined. For sequence to label classification, layer array is created which includes sequence input layer, LSTM layer, fully connected layer, soft max layer and classification output layer. Sequence input layer represents total number of input features taken for study and the classes required for algorithm as decided is specified by fully connected layer. The basic block diagram of LSTM classification and regression is shown in <xref ref-type="fig" rid="fig-6">Fig. 6</xref>:</p>
<fig id="fig-6">
<label>Figure 6</label>
<caption>
<title>LSTM classification and regression</title></caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CSSE_23882-fig-6.png"/>
</fig>
<p>First the gender of the subject is analyzed for the given input x<sub>t</sub> and the output value h<sub>t</sub>. The sigmoid layer checks h<sub>t&#x2212;1</sub> and x<sub>t</sub> and accordingly gives the output of number between 0 and 1 for each numbers in the cell state C<sub>t&#x2212;1</sub> as per <xref ref-type="disp-formula" rid="eqn-1">Eq. (1)</xref>. The new candidate value vector is created by tanh.</p>
<p><disp-formula id="eqn-1"><label>(1)</label>
<mml:math id="mml-eqn-1" display="block"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow><mml:mo>=</mml:mo><mml:mi>&#x03C3;</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mi>W</mml:mi><mml:mi>f</mml:mi></mml:msub></mml:mrow><mml:mo>.</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mi>f</mml:mi></mml:msub></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mspace width="thickmathspace" /><mml:mspace width="thickmathspace" /><mml:mspace width="thickmathspace" /><mml:mspace width="thickmathspace" /><mml:mspace width="thickmathspace" /></mml:math>
</disp-formula></p>
<p>Next C<sub>t</sub> is added to the new state followed by adding gender of the subject to the cell state as given in <xref ref-type="disp-formula" rid="eqn-2">Eq. (2)</xref>:</p>
<p><disp-formula id="eqn-2"><label>(2)</label>
<mml:math id="mml-eqn-2" display="block"><mml:mrow><mml:msub><mml:mrow><mml:mover><mml:mi>C</mml:mi><mml:mo stretchy="false">&#x007E;</mml:mo></mml:mover></mml:mrow><mml:mi>t</mml:mi></mml:msub></mml:mrow><mml:mo>=</mml:mo><mml:mi>tanh</mml:mi><mml:mo>&#x2061;</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mi>W</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow><mml:mo>.</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo>,</mml:mo><mml:mrow><mml:msub><mml:mi>X</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mspace width="thickmathspace" /><mml:mspace width="thickmathspace" /></mml:math>
</disp-formula></p>
<p>Now old cell state is updated C<sub>t&#x2212;1</sub> into cell state C<sub>t</sub>. Later forgetting of previous information is performed by multiplying f<sub>t</sub> with old state and adding it with C<sub>t</sub> as shown in <xref ref-type="disp-formula" rid="eqn-3">Eq. (3)</xref>:</p>
<p><disp-formula id="eqn-3"><label>(3)</label>
<mml:math id="mml-eqn-3" display="block"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow><mml:mo>=</mml:mo><mml:mi>&#x03C3;</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow><mml:mo>&#x2217;</mml:mo><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>t</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:msub><mml:mi>i</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow><mml:mo>&#x2217;</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mover><mml:mi>C</mml:mi><mml:mo stretchy="false">&#x007E;</mml:mo></mml:mover></mml:mrow></mml:mrow><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>Finally the output is decided from the cell state C<sub>t</sub>.</p>
<p>The LSTM starts with the details of the input data that will be given to the network, and this choice is made by a sigmoid layer dubbed the &#x201C;forget gate layer&#x201D; (h<sub>t&#x2212;1</sub> and x<sub>t</sub>), which produces a number between 00 and 11 for each cell state (C<sub>t&#x2212;1</sub>). The &#x2018;input gate layer&#x2019; analyses the new information that needs to be stored in the cell state and determines which values need to be changed. The tanh layer follows the input gate, creating a vector of new added values C<sub>t&#x2212;1</sub>, which is then concatenated to provide an update to the cell state. We usually set the input values to tanh between &#x2212;11 and 11 and multiply with the output sigmoid gate to only consider a certain section of the state [<xref ref-type="bibr" rid="ref-21">21</xref>].</p>
</sec>
</sec>
<sec id="s4">
<label>4</label>
<title>Results and Discussion</title>
<sec id="s4_1">
<label>4.1</label>
<title>Data Preprocessing</title>
<p>Data collected contains some unusual or missing data and so this impact may create side effects on the whole records and so data cleaning is very important before data preprocessing. There are numerous frequent methods to replace the missing values such as mean-median of previous or next value of current data R interpolation. Data acquisition frequently involves aberrant or missing data, which might have unforeseen repercussions for the full set of records. As a result, prior to data preparation, data cleaning is essential. Missing data is removed and relevant gaps are filled in using command tools. R-interpolation, mean-median of the previous or next value of the current data is all common ways for substituting missing values. The normalized input data is shown in <xref ref-type="table" rid="table-3">Tabs. 3</xref> and <xref ref-type="table" rid="table-4">4</xref>.</p>
<table-wrap id="table-3"><label>Table 3</label>
<caption>
<title>Data normalization of input data (for first 10 records) for 6 input features</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Records</th>
<th>PM<sub>2.5</sub></th>
<th>PM<sub>10</sub></th>
<th>NO</th>
<th>NO<sub>2</sub></th>
<th>NO<sub>x</sub></th>
<th>NH<sub>3</sub></th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>0.3554</td>
<td>0.1725</td>
<td>&#x2212;0.6568</td>
<td>&#x2212;0.3862</td>
<td>&#x2212;0.6529</td>
<td>&#x2212;0.5783</td>
</tr>
<tr>
<td>2</td>
<td>0.2999</td>
<td>0.2324</td>
<td>&#x2212;0.6650</td>
<td>&#x2212;0.1468</td>
<td>&#x2212;0.5655</td>
<td>&#x2212;0.6045</td>
</tr>
<tr>
<td>3</td>
<td>0.4879</td>
<td>0.3147</td>
<td>&#x2212;0.4204</td>
<td>0.0644</td>
<td>&#x2212;0.3470</td>
<td>&#x2212;0.4479</td>
</tr>
<tr>
<td>4</td>
<td>0.0454</td>
<td>&#x2212;0.0959</td>
<td>&#x2212;0.6055</td>
<td>&#x2212;0.0566</td>
<td>&#x2212;0.4973</td>
<td>&#x2212;0.5367</td>
</tr>
<tr>
<td>5</td>
<td>0.1946</td>
<td>0.0455</td>
<td>&#x2212;0.4832</td>
<td>&#x2212;0.2687</td>
<td>&#x2212;0.5105</td>
<td>&#x2212;0.4872</td>
</tr>
<tr>
<td>6</td>
<td>0.1466</td>
<td>0.0457</td>
<td>&#x2212;0.5079</td>
<td>&#x2212;0.4006</td>
<td>&#x2212;0.5752</td>
<td>&#x2212;0.5646</td>
</tr>
<tr>
<td>7</td>
<td>0.2215</td>
<td>0.0286</td>
<td>&#x2212;0.5129</td>
<td>&#x2212;0.4389</td>
<td>&#x2212;0.5932</td>
<td>&#x2212;0.5646</td>
</tr>
<tr>
<td>8</td>
<td>0.5084</td>
<td>0.3789</td>
<td>&#x2212;0.3696</td>
<td>&#x2212;0.1385</td>
<td>&#x2212;0.4070</td>
<td>&#x2212;0.4354</td>
</tr>
<tr>
<td>9</td>
<td>0.4587</td>
<td>0.2516</td>
<td>&#x2212;0.6783</td>
<td>&#x2212;0.3510</td>
<td>&#x2212;0.6516</td>
<td>&#x2212;0.3615</td>
</tr>
<tr>
<td>10</td>
<td>0.4137</td>
<td>0.1790</td>
<td>&#x2212;0.5386</td>
<td>&#x2212;0.1041</td>
<td>&#x2212;0.4793</td>
<td>&#x2212;0.3586</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="table-4"><label>Table 4</label>
<caption>
<title>Data normalization of input data (for first 10 records) for next 6 input features</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Records</th>
<th>CO</th>
<th>SO<sub>2</sub></th>
<th>O<sub>3</sub></th>
<th>C<sub>6</sub>H<sub>6</sub></th>
<th>C<sub>7</sub>H<sub>8</sub></th>
<th>C<sub>8</sub>H<sub>10</sub></th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>&#x2212;0.5471</td>
<td>0.5741</td>
<td>4.5372</td>
<td>&#x2212;0.2235</td>
<td>&#x2212;0.1189</td>
<td>&#x2212;0.4130</td>
</tr>
<tr>
<td>2</td>
<td>&#x2212;0.5352</td>
<td>1.8856</td>
<td>4.0719</td>
<td>&#x2212;0.2223</td>
<td>&#x2212;0.0495</td>
<td>&#x2212;0.4098</td>
</tr>
<tr>
<td>3</td>
<td>&#x2212;0.5531</td>
<td>2.6276</td>
<td>3.8005</td>
<td>&#x2212;0.2178</td>
<td>&#x2212;0.0648</td>
<td>&#x2212;0.4036</td>
</tr>
<tr>
<td>4</td>
<td>&#x2212;0.5650</td>
<td>0.9949</td>
<td>5.0719</td>
<td>&#x2212;0.2255</td>
<td>&#x2212;0.1897</td>
<td>&#x2212;0.4114</td>
</tr>
<tr>
<td>5</td>
<td>&#x2212;0.5233</td>
<td>0.0492</td>
<td>3.7007</td>
<td>&#x2212;0.2229</td>
<td>&#x2212;0.2046</td>
<td>&#x2212;0.4098</td>
</tr>
<tr>
<td>6</td>
<td>&#x2212;0.5471</td>
<td>0.4432</td>
<td>4.1033</td>
<td>&#x2212;0.2261</td>
<td>&#x2212;0.2615</td>
<td>&#x2212;0.4130</td>
</tr>
<tr>
<td>7</td>
<td>&#x2212;0.5590</td>
<td>0.4241</td>
<td>4.3786</td>
<td>&#x2212;0.2255</td>
<td>&#x2212;0.2936</td>
<td>&#x2212;0.4161</td>
</tr>
<tr>
<td>8</td>
<td>&#x2212;0.5590</td>
<td>1.0363</td>
<td>4.6163</td>
<td>&#x2212;0.2203</td>
<td>&#x2212;0.2965</td>
<td>&#x2212;0.4114</td>
</tr>
<tr>
<td>9</td>
<td>&#x2212;0.5293</td>
<td>0.1455</td>
<td>3.9447</td>
<td>&#x2212;0.2216</td>
<td>&#x2212;0.2472</td>
<td>&#x2212;0.4161</td>
</tr>
<tr>
<td>10</td>
<td>&#x2212;0.5114</td>
<td>&#x2212;0.0302</td>
<td>3.8294</td>
<td>&#x2212;0.2165</td>
<td>&#x2212;0.2611</td>
<td>&#x2212;0.4083</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s4_2">
<label>4.2</label>
<title>Feature Validation</title>
<p>Statistical Validation of Extracted Features is done before classification, each piece of data that is used as an input must be evaluated for its importance. <xref ref-type="table" rid="table-5">Tab. 5</xref> shows the proposed characteristics and their accompanying metrics following validation. Number of samples (N), Standard Deviation (SD), Standard Error (SE), degree of freedom (df), Mean Square (MS), (measure of test accuracy) F1 Score, and significant are among the evaluation measures. The data was tested for the normality using <bold>Shapiro Walik Test</bold> and it was found that all the data was normally distributed and its significance of air pollutants is less than 0.05.</p>
<table-wrap id="table-5"><label>Table 5</label>
<caption>
<title>Comparison of proposed features <italic>vs</italic>. metrics for evaluation</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Features</th>
<th>N</th>
<th>Mean</th>
<th>SD</th>
<th>SE</th>
<th>df</th>
<th>MS</th>
<th>F1 Score</th>
<th>Sig</th>
</tr>
</thead>
<tbody>
<tr>
<td>PM<sub>2.5</sub></td>
<td>36587</td>
<td>63.3840</td>
<td>58.0911</td>
<td>0.3037</td>
<td>36586</td>
<td>640.922</td>
<td>3.121E4</td>
<td><bold>0.0</bold></td>
</tr>
<tr>
<td>PM<sub>10</sub></td>
<td>29191</td>
<td>111.084</td>
<td>75.4643</td>
<td>0.4417</td>
<td>29190</td>
<td>1419.749</td>
<td>1.758E4</td>
<td><bold>0.0</bold></td>
</tr>
<tr>
<td>NO</td>
<td>37935</td>
<td>16.2983</td>
<td>22.2567</td>
<td>0.1143</td>
<td>37934</td>
<td>397.838</td>
<td>1.861E3</td>
<td><bold>0.0</bold></td>
</tr>
<tr>
<td>NO<sub>2</sub></td>
<td>37935</td>
<td>29.7610</td>
<td>23.2389</td>
<td>0.1193</td>
<td>37934</td>
<td>417.193</td>
<td>2.235E3</td>
<td><bold>0.0</bold></td>
</tr>
<tr>
<td>NO<sub>X</sub></td>
<td>36754</td>
<td>33.0347</td>
<td>31.8394</td>
<td>0.1661</td>
<td>36753</td>
<td>786.841</td>
<td>2.121E3</td>
<td><bold>0.0</bold></td>
</tr>
<tr>
<td>NH<sub>3</sub></td>
<td>27967</td>
<td>19.9223</td>
<td>16.5008</td>
<td>0.0987</td>
<td>27966</td>
<td>236.746</td>
<td>840.481</td>
<td><bold>0.0</bold></td>
</tr>
<tr>
<td>CO</td>
<td>37726</td>
<td>01.0531</td>
<td>01.6292</td>
<td>0.0084</td>
<td>37725</td>
<td>002.395</td>
<td>816.846</td>
<td><bold>0.0</bold></td>
</tr>
<tr>
<td>SO<sub>2</sub></td>
<td>37877</td>
<td>10.7421</td>
<td>09.9434</td>
<td>0.0510</td>
<td>37876</td>
<td>093.935</td>
<td>398.950</td>
<td><bold>0.0</bold></td>
</tr>
<tr>
<td>O<sub>3</sub></td>
<td>36879</td>
<td>33.3362</td>
<td>21.3029</td>
<td>0.1109</td>
<td>63878</td>
<td>411.616</td>
<td>757.173</td>
<td><bold>0.0</bold></td>
</tr>
<tr>
<td>C<sub>6</sub>H<sub>6</sub></td>
<td>36059</td>
<td>03.9360</td>
<td>18.3308</td>
<td>0.0965</td>
<td>36058</td>
<td>335.656</td>
<td>008.793</td>
<td><bold>0.0</bold></td>
</tr>
<tr>
<td>C<sub>7</sub>H<sub>8</sub></td>
<td>31554</td>
<td>09.3520</td>
<td>23.0653</td>
<td>0.1298</td>
<td>31553</td>
<td>527.082</td>
<td>060.004</td>
<td><bold>0.0</bold></td>
</tr>
<tr>
<td>C<sub>8</sub>H<sub>10</sub></td>
<td>17765</td>
<td>02.9701</td>
<td>06.6244</td>
<td>0.0497</td>
<td>17764</td>
<td>042.887</td>
<td>083.475</td>
<td><bold>0.0</bold></td>
</tr>
</tbody>
</table>
</table-wrap>
<p>One-way Analysis of Variant (ANOVA) was used to validate the input features. It can be used for further processing if the significant value is less than 0.05. Following validation, it was determined that all of the input features used in the study were significant, implying that all of the input characteristics used in the proposed study can be used for further classification using machine learning and deep learning algorithms.</p>
</sec>
<sec id="s4_3">
<label>4.3</label>
<title>Feature Classification</title>
<p>All of the significant features that have been validated using SPSS tools are used for classification. The corresponding sequence of air pollutant PM<sub>2.5</sub> for a given set of time T is defined as X, and these values are filled with record means to get PM<sub>2.5</sub> concentration sequence <inline-formula id="ieqn-2"><mml:math id="mml-ieqn-2"><mml:mover><mml:mi>X</mml:mi><mml:mo>&#x00AF;</mml:mo></mml:mover></mml:math></inline-formula>, and afterwards these data sequences are translated into supervised learning format. Because of its lengthy temporal dependency problem, the simple RNN cannot cope with large amounts of data. To overcome this, we use LSTM, which takes a lagging observation t&#x2212;1 as an input variable and uses it to forecast the current time step T. The modified data set is represented by a <inline-formula id="ieqn-3"><mml:math id="mml-ieqn-3"><mml:mover><mml:mi>X</mml:mi><mml:mo>&#x00AF;</mml:mo></mml:mover></mml:math></inline-formula>, while the output variable is represented by a <inline-formula id="ieqn-4"><mml:math id="mml-ieqn-4"><mml:mover><mml:mi>Y</mml:mi><mml:mo>&#x00AF;</mml:mo></mml:mover></mml:math></inline-formula>. These sequences are then used to forecast individual PM<sub>2.5</sub> series, and the process is repeated for all of the other features in the proposed study. Finally, SPSS tools are used to compare the prediction outcomes, and the performance of the classifier is evaluated using various attributes such as root mean squared error, mean, median, standard deviation, and so on. Using these assessment markers, LSTM is found to be superior to other models in processing time series data, indicating that the current model is useful in AQI prediction. <xref ref-type="fig" rid="fig-7">Fig. 7</xref> depicts the steps involved in defining the LSTM algorithm prior to network training.</p>
<fig id="fig-7">
<label>Figure 7</label>
<caption>
<title>LSTM processing steps before network training</title></caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CSSE_23882-fig-7.png"/>
</fig>
<p>LSTM begins with initialization of sequence of input layers needed, fully connected layer, soft-max layer and classification layer. Then after training options are given which includes initial learning rate, (Ridge Regression) L2 regularization, drop periods, drop factors, epochs needed, batch size and SGDM. Once relevant initialization is completed then the input data&#x2019;s are converted to array format and later on input and ground truth are compared in activation layer. Finally the output is predicted based on the metrics such as accuracy, precision, error rate, sensitivity, specificity, F1score.</p>
<p>With proper initialization of training options the network is trained for classification. Defining LSTM layers includes input sequence (fully connected layer), LSTM 120 (soft-max) and LSTM 60 (classification layer). Initialization of learning rate (0.1), L2 regularization (0.0001), schedule (piecewise), drop factor (0.1), drop period (100), maximum epochs (500), mini batch size (128), and shuffling for every epoch plots are some of the learning rate of training options.</p>
<p>Each epoch is trained using 200 iterations, and it was discovered that the mini-batch loss and iterations are inversely proportional, with the batch loss reducing as the number of iterations grows. <xref ref-type="table" rid="table-6">Tab. 6</xref> shows the network&#x2019;s initial stage of training.</p>
<table-wrap id="table-6"><label>Table 6</label>
<caption>
<title>Network training of epoch 1 to 3</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Epoch</th>
<th>Iteration</th>
<th>Time elapsed<break/>(hh:mm:ss)</th>
<th>Mini-batch accuracy</th>
<th>Mini-batch loss</th>
<th>Base learning rate</th>
</tr>
</thead>
<tbody>
<tr>
<td rowspan="5">1</td>
<td>1</td>
<td>00:00:04</td>
<td>21.09%</td>
<td>1.7951</td>
<td>0.1000</td>
</tr>
<tr>
<td>50</td>
<td>00:00:44</td>
<td>57.03%</td>
<td>0.9743</td>
<td>0.1000</td>
</tr>
<tr>
<td>100</td>
<td>00:01:14</td>
<td>61.72%</td>
<td>0.9100</td>
<td>0.1000</td>
</tr>
<tr>
<td>150</td>
<td>00:01:45</td>
<td>65.63%</td>
<td>0.7632</td>
<td>0.1000</td>
</tr>
<tr>
<td>200</td>
<td>00:02:15</td>
<td>65.63%</td>
<td>0.8067</td>
<td>0.1000</td>
</tr>
<tr>
<td rowspan="5">2</td>
<td>250</td>
<td>00:02:44</td>
<td>71.88%</td>
<td>0.7116</td>
<td>0.1000</td>
</tr>
<tr>
<td>300</td>
<td>00:03:14</td>
<td>64.84%</td>
<td>0.7304</td>
<td>0.1000</td>
</tr>
<tr>
<td>350</td>
<td>00:03:44</td>
<td>64.06%</td>
<td>0.8772</td>
<td>0.1000</td>
</tr>
<tr>
<td>400</td>
<td>00:04:13</td>
<td>64.06%</td>
<td>0.8426</td>
<td>0.1000</td>
</tr>
<tr>
<td>450</td>
<td>00:04:42</td>
<td>67.97%</td>
<td>0.6993</td>
<td>0.1000</td>
</tr>
<tr>
<td rowspan="5">3</td>
<td>500</td>
<td>00:05:17</td>
<td>67.97%</td>
<td>0.7930</td>
<td>0.1000</td>
</tr>
<tr>
<td>550</td>
<td>00:05:58</td>
<td>77.34%</td>
<td>0.6215</td>
<td>0.1000</td>
</tr>
<tr>
<td>600</td>
<td>00:06:29</td>
<td>68.75%</td>
<td>0.7011</td>
<td>0.1000</td>
</tr>
<tr>
<td>650</td>
<td>00:07:01</td>
<td>74.22%</td>
<td>0.6566</td>
<td>0.1000</td>
</tr>
<tr>
<td>655</td>
<td>00:07:04</td>
<td>74.22%</td>
<td>0.6626</td>
<td>0.1000</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The accuracy and other characteristics are assessed for various epochs after the network has been trained for the above configurations constructed according to the suggested LSTM model. <xref ref-type="fig" rid="fig-8">Fig. 8a</xref> through <xref ref-type="fig" rid="fig-8">Fig. 8h</xref> illustrate the relevant network training plots.</p>
<p>Over or under fitting can create to classification issues in a training network, hence regularisation is crucial. In machine learning, regularisation is used to solve this problem, and in deep learning, dropout regularisation is used to prevent over-fitting and under-fitting by removing random neurons from hidden layers. For large data sets hold-out validation works good compared to cross-out validation.</p>
<p>In general, having too many epochs might lead to the model overfitting the training data. It signifies that the model memorises rather than learns the data. The accuracy of validation data is checked for each epoch or iteration to see if it over-fits or not. The number of epoch determines how the network&#x2019;s weights are changed. As the number of epochs grows, so do the number of times the neural network&#x2019;s weights are modified, and the border shifts from underfitting to optimal to overfitting.</p>
<p>For better performance, training data is shuffled for every epochs. As CPU is the available source mini batch size can be implemented that represents short sequences. Once all the desired configuration is inserted the network starts training. For every epoch and iterations the accuracy level and corresponding error rate is plotted. During run time the behaviour of network is analyzed by its accuracy level and error rate.</p>
<fig id="fig-8">
<label>Figure 8</label>
<caption>
<title>(a) Accuracy <italic>vs</italic>. iteration at 45 s (b) Accuracy <italic>vs</italic>. iteration at 3 min (c) Accuracy <italic>vs</italic>. iteration at 22 min (d) Accuracy <italic>vs</italic>. iteration at 85 min (e) Accuracy <italic>vs</italic>. iteration at 281 min (f) Accuracy <italic>vs</italic>. iteration at 573 min 59 (g) Accuracy <italic>vs</italic>. iteration at 875 min (h) Accuracy <italic>vs</italic>. iteration at 2900</title></caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CSSE_23882-fig-8.png"/>
</fig>
</sec>
<sec id="s4_4">
<label>4.4</label>
<title>Algorithm Analysis</title>
<p>A 64-bit operating system AMD A4-5000 APU with Radeon (TM) HD graphics with 1.50 GHz and 8:00 GB RAM is utilized in conjunction with MATLAB 2019a for modeling, processing, comparisons and visualizing the experimental numbers and findings through various deep learning algorithms such as Support Vector Machine (SVM), Neural Network (NN), K-Nearest Neighbor (KNN), Naive Bayes (NB), Ensemble (EN) and LSTM.</p>
<sec id="s4_4_1">
<label>4.4.1</label>
<title>LSTM Performance for Various Input Features</title>
<p>The performance of the LSTM classifier is examined using a variety of methods, one of which is shown in <xref ref-type="table" rid="table-7">Tab. 7</xref>. The 12 input features of the planned study are compared to various computations in this section. When compared to other features, the error rate of PM<sub>10</sub> was determined to be lower.</p>
<table-wrap id="table-7"><label>Table 7</label>
<caption>
<title>Input features <italic>vs</italic>. computations</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Input features</th>
<th>Accuracy</th>
<th>Error rate</th>
<th>Sensitivity</th>
<th>Specificity</th>
<th>Precision</th>
<th>F<sub>1</sub> score</th>
</tr>
</thead>
<tbody>
<tr>
<td>PM<sub>2.5</sub></td>
<td>0.7930</td>
<td>0.2070</td>
<td>0.5283</td>
<td>0.9432</td>
<td>0.5703</td>
<td>0.5300</td>
</tr>
<tr>
<td>PM<sub>10</sub></td>
<td>0.8085</td>
<td>0.1915</td>
<td>0.6772</td>
<td>0.9480</td>
<td>0.6150</td>
<td>0.6370</td>
</tr>
<tr>
<td>NO<sub>x</sub></td>
<td>0.6575</td>
<td>0.3425</td>
<td>0.2430</td>
<td>0.8940</td>
<td>0.2150</td>
<td>0.2280</td>
</tr>
<tr>
<td>NO</td>
<td>0.5220</td>
<td>0.4780</td>
<td>0.2075</td>
<td>0.8646</td>
<td>0.7311</td>
<td>0.1852</td>
</tr>
<tr>
<td>NO<sub>2</sub></td>
<td>0.6400</td>
<td>0.3600</td>
<td>0.2495</td>
<td>0.8875</td>
<td>0.2909</td>
<td>0.2464</td>
</tr>
<tr>
<td>NH<sub>3</sub></td>
<td>0.5600</td>
<td>0.4400</td>
<td>0.1895</td>
<td>0.6854</td>
<td>0.1807</td>
<td>0.1684</td>
</tr>
<tr>
<td>CO</td>
<td>0.6285</td>
<td>0.3715</td>
<td>0.2381</td>
<td>0.8901</td>
<td>0.2078</td>
<td>0.2208</td>
</tr>
<tr>
<td>SO<sub>2</sub></td>
<td>0.6110</td>
<td>0.3890</td>
<td>0.2135</td>
<td>0.8702</td>
<td>0.2061</td>
<td>0.1986</td>
</tr>
<tr>
<td>O<sub>3</sub></td>
<td>0.6055</td>
<td>0.3945</td>
<td>0.2343</td>
<td>0.8870</td>
<td>0.2053</td>
<td>0.2143</td>
</tr>
<tr>
<td>C<sub>6</sub>H<sub>6</sub></td>
<td>0.5485</td>
<td>0.4515</td>
<td>0.2056</td>
<td>0.8650</td>
<td>0.1801</td>
<td>0.1914</td>
</tr>
<tr>
<td>C<sub>7</sub>H<sub>8</sub></td>
<td>0.5230</td>
<td>0.4770</td>
<td>0.1771</td>
<td>0.8445</td>
<td>0.1543</td>
<td>0.1574</td>
</tr>
<tr>
<td>C<sub>8</sub>H<sub>10</sub></td>
<td>0.3660</td>
<td>0.6340</td>
<td>0.1427</td>
<td>0.7340</td>
<td>0.1285</td>
<td>0.1230</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s4_4_2">
<label>4.4.2</label>
<title>LSTM Performance for Various Computations</title>
<p><xref ref-type="fig" rid="fig-9">Fig. 9</xref> shows how the accuracy level of each feature is assessed. For each of the 12 input features, various other metrics like as error rate, sensitivity, specificity, accuracy, and F1score were determined individually. It was found that accuracy is high for PM<sub>10</sub> and low for Xylene.</p>
<fig id="fig-9">
<label>Figure 9</label>
<caption>
<title>Accuracy level comparison of all input features</title></caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CSSE_23882-fig-9.png"/>
</fig>
</sec>
<sec id="s4_4_3">
<label>4.4.3</label>
<title>Algorithm Comparison with Proposed Work</title>
<p><xref ref-type="fig" rid="fig-10">Fig. 10</xref> depicts the accuracy level of several approaches used, with the LSTM method proving to be the most accurate. Six different algorithms were taken for comparison for the same set of inputs.</p>
<fig id="fig-10">
<label>Figure 10</label>
<caption>
<title>Accuracy level for various algorithms</title></caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CSSE_23882-fig-10.png"/>
</fig>
<p>As a result, various measures were examined using the LSTM approach, as shown in <xref ref-type="table" rid="table-8">Tab. 8</xref>. The error rate found is minimum for the proposed LSTM method and subsequently accuracy is better compared to other methods.</p>
<table-wrap id="table-8"><label>Table 8</label>
<caption>
<title>Algorithm comparison of accuracy, sensitivity, specificity, precision and F1 score</title></caption>
<table><colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Algorithm</th>
<th>Accuracy</th>
<th>Error rate</th>
<th>Sensitivity</th>
<th>Specificity</th>
<th>Precision</th>
<th>F1 score</th>
</tr>
</thead>
<tbody>
<tr>
<td>SVM (Cubic)</td>
<td>41.25%</td>
<td>0.5875</td>
<td>0.3997</td>
<td>0.8890</td>
<td>0.7013</td>
<td>0.3983</td>
</tr>
<tr>
<td>NN (Bilayered)</td>
<td>56.20%</td>
<td>0.4380</td>
<td>0.3941</td>
<td>0.8850</td>
<td>0.7330</td>
<td>0.4511</td>
</tr>
<tr>
<td>KNN (Weighted)</td>
<td>57.54%</td>
<td>0.4246</td>
<td>0.3976</td>
<td>0.8880</td>
<td>0.7576</td>
<td>0.4546</td>
</tr>
<tr>
<td>NB (Optimizable)</td>
<td>74.48%</td>
<td>0.2552</td>
<td>0.7103</td>
<td>0.9416</td>
<td>0.6097</td>
<td>0.6264</td>
</tr>
<tr>
<td>EN (Boosted)</td>
<td>76.80%</td>
<td>0.232</td>
<td>0.6800</td>
<td>0.9427</td>
<td>0.7901</td>
<td>0.7232</td>
</tr>
<tr>
<td><bold>LSTM </bold><bold>(Standard)</bold></td>
<td><bold>82.40%</bold></td>
<td><bold>0.1760</bold></td>
<td><bold>0.6137</bold></td>
<td><bold>0.9512</bold></td>
<td><bold>0.5759</bold></td>
<td><bold>0.5843</bold></td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec id="s4_5">
<label>4.5</label>
<title>Limitations of Study and Future Work</title>
<p>The study&#x2019;s shortcoming was that the computation time was prolonged. The proposed study also has the disadvantage of not monitoring air pollutant concentrations in conjunction with other AQMS around or adjacent to it. Normally, both physical and chemical features of aerosols are used to predict air quality, however biological components and qualities are limited in this case. To increase the measurement level, future work can be expanded by including more air contaminants and additional data such as satellite images and industrial emissions into the atmosphere. To further understand the consequences of air pollution and human action, the article can be expanded by looking at specific states in relation to the current pandemic, as well as the situation before and after lockdown. Also, harmful air pollutants can be projected in advance for specific sites such as homes or roads, and the same can be combined with Internet of Things (IoT) and updated in real time in cloud computing for the benefit of people.</p>
</sec>
</sec>
<sec id="s5">
<label>5</label>
<title>Conclusions</title>
<p>Based on historical air pollutant concentration, meteorological and time stamp data, this study provides an LSTM algorithm for predicting air pollutants in various sites. For predicting 12 major air contaminants, fine-grained air quality data is taken from active AQMS in 21 states across the country, India. Using the same dataset, six other models, including the proposed LSTM model, are evaluated, and the trials show that the suggested LSTM outperforms other techniques. By classifying air quality data and calculating dirty pixels using an LSTM classifier, the suggested work assists in obtaining specific information and permits precise knowledge of current pollutant levels in real environments of many sites. The classifier outputs the air pollutant level with higher compilation and efficiency than earlier approaches by comparing ground readings and data obtained from specific areas through private agencies, as well as suitable network training. The proposed approach delivers the best accuracy 82.4 percent of air pollution measurements for approximately 12 major air pollutants, according to the findings. This air quality measurement aids the environmental board in notifying the public and diverting traffic to low-polluting routes or areas, as well as taking appropriate measures such as tree planting, by anticipating air pollutants in advance.</p>
</sec>
</body>
<back>
<ack>
<p>The authors would like to thank Central Pollution Control Board of India (<uri xlink:href="https://app.cpcbccr.com/AQI India/">https://app.cpcbccr.com/AQI India/</uri>), for gathering information.</p>
</ack><fn-group>
<fn fn-type="other">
<p><bold>Funding Statement:</bold> The authors received no specific funding for this study.</p>
</fn>
<fn fn-type="conflict">
<p><bold>Conflicts of Interest:</bold> The authors declare that they have no conflicts of interest to report regarding the present study.</p>
</fn>
</fn-group>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>[1]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>A. P. K.</given-names> <surname>Tai</surname></string-name>, <string-name><given-names>L. J.</given-names> <surname>Mickley</surname></string-name> and <string-name><given-names>D. J.</given-names> <surname>Jacob</surname></string-name></person-group>, &#x201C;<article-title>Correlations between fine Particulate Matter (PM<sub>2.5</sub>) and meteorological variables in the United States: Implications for the sensitivity of PM<sub>2.5</sub> to climate change</article-title>,&#x201D; <source>Atmospheric Environment</source>, vol. <volume>44</volume>, no. <issue>32</issue>, pp. <fpage>3976</fpage>&#x2013;<lpage>3984</lpage>, <year>2010</year>.</mixed-citation></ref>
<ref id="ref-2"><label>[2]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>H.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>P.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Qin</surname></string-name> and <string-name><given-names>H.</given-names> <surname>Wang</surname></string-name></person-group>, &#x201C;<article-title>Forecasting of particulate matter time series using wavelet analysis and wavelet-ARMA/ARIMA model in Taiyuan, China</article-title>,&#x201D; <source>Journal of the Air &#x0026; Waste Management Association</source>, vol. <volume>67</volume>, no. <issue>7</issue>, pp. <fpage>776</fpage>&#x2013;<lpage>788</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-3"><label>[3]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>C. H. M.</given-names> <surname>Tong</surname></string-name>, <string-name><given-names>S. H. L.</given-names> <surname>Yim</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Rothenberg</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>C. Y.</given-names> <surname>Lin</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Assessing the impacts of seasonal and vertical atmospheric conditions on air quality over the Pearl River Delta region</article-title>,&#x201D; <source>Atmospheric Environment</source>, vol. <volume>180</volume>, pp. <fpage>69</fpage>&#x2013;<lpage>78</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-4"><label>[4]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>W.</given-names> <surname>Sun</surname></string-name> and <string-name><given-names>J.</given-names> <surname>Sun</surname></string-name></person-group>, &#x201C;<article-title>Daily PM<sub>2.5</sub> concentration prediction based on principal component analysis and LSSVM optimized by cuckoo search algorithm</article-title>,&#x201D; <source>Journal of Environmental Management</source>, vol. <volume>188</volume>, pp. <fpage>144</fpage>&#x2013;<lpage>152</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-5"><label>[5]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>G.</given-names> <surname>Chen</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>L. D.</given-names> <surname>Knibbs</surname></string-name>, <string-name><given-names>N. A. S.</given-names> <surname>Hamm</surname></string-name>, <string-name><given-names>W.</given-names> <surname>Cao</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>A machine learning method to estimate PM<sub>2.5</sub> concentrations across China with remote sensing, meteorological and land use information</article-title>,&#x201D; <source>Science of the Total Environment</source>, vol. <volume>636</volume>, pp. <fpage>52</fpage>&#x2013;<lpage>60</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-6"><label>[6]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>X.</given-names> <surname>Hu</surname></string-name>, <string-name><given-names>J. H.</given-names> <surname>Belle</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Meng</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Wildani</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Waller</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Estimating PM<sub>2.5</sub> concentrations in the conterminous United States using the random forest approach</article-title>,&#x201D; <source>Environmental Science &#x0026; Technology</source>, vol. <volume>51</volume>, no. <issue>12</issue>, pp. <fpage>6936</fpage>&#x2013;<lpage>6944</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-7"><label>[7]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>K.</given-names> <surname>Huang</surname></string-name>, <string-name><given-names>Q.</given-names> <surname>Xiao</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Meng</surname></string-name>, <string-name><given-names>G.</given-names> <surname>Geng</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Wang</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Predicting monthly high-resolution PM<sub>2.5</sub> concentrations with random forest model in the North China Plain</article-title>,&#x201D; <source>Environmental Pollution</source>, vol. <volume>242</volume>, pp. <fpage>675</fpage>&#x2013;<lpage>683</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-8"><label>[8]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y.</given-names> <surname>Zhang</surname></string-name> and <string-name><given-names>Z.</given-names> <surname>Li</surname></string-name></person-group>, &#x201C;<article-title>Remote sensing of atmospheric fine Particulate Matter (PM<sub>2.5</sub>) mass concentration near the ground from satellite observation</article-title>,&#x201D; <source>Remote Sensing of Environment</source>, vol. <volume>160</volume>, pp. <fpage>252</fpage>&#x2013;<lpage>262</lpage>, <year>2015</year>.</mixed-citation></ref>
<ref id="ref-9"><label>[9]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>B. T.</given-names> <surname>Ong</surname></string-name>, <string-name><given-names>K.</given-names> <surname>Sugiura</surname></string-name> and <string-name><given-names>K.</given-names> <surname>Zettsu</surname></string-name></person-group>, &#x201C;<article-title>Dynamically pre-trained deep recurrent neural networks using environmental monitoring data for predicting PM<sub>2.5</sub></article-title>,&#x201D; <source>Neural Computing and Applications</source>, vol. <volume>27</volume>, no. <issue>6</issue>, pp. <fpage>1553</fpage>&#x2013;<lpage>1566</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-10"><label>[10]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y.</given-names> <surname>Qi</surname></string-name>, <string-name><given-names>Q.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Karimian</surname></string-name> and <string-name><given-names>D.</given-names> <surname>Liu</surname></string-name></person-group>, &#x201C;<article-title>A hybrid model for spatiotemporal forecasting of PM<sub>2.5</sub> based on graph convolutional neural network and long short-term memory</article-title>,&#x201D; <source>Science of the Total Environment</source>, vol. <volume>664</volume>, pp. <fpage>1</fpage>&#x2013;<lpage>10</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-11"><label>[11]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>X.</given-names> <surname>Li</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Peng</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Yao</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Cui</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Hu</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Long short-term memory neural network for air pollutant concentration predictions: method development and evaluation</article-title>,&#x201D; <source>Environmental Pollution</source>, vol. <volume>231</volume>, pp. <fpage>997</fpage>&#x2013;<lpage>1004</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-12"><label>[12]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D.</given-names> <surname>Qin</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Yu</surname></string-name>, <string-name><given-names>G.</given-names> <surname>Zou</surname></string-name>, <string-name><given-names>R.</given-names> <surname>Yong</surname></string-name>, <string-name><given-names>Q.</given-names> <surname>Zhao</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>A novel combined prediction scheme based on CNN and LSTM for urban PM<sub>2.5</sub> concentration</article-title>,&#x201D; <source>IEEE Access</source>, vol. <volume>7</volume>, pp. <fpage>20050</fpage>&#x2013;<lpage>20059</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-13"><label>[13]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>D.</given-names> <surname>Seng</surname></string-name>, <string-name><given-names>Q.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>G.</given-names> <surname>Chen</surname></string-name> and <string-name><given-names>X.</given-names> <surname>Chen</surname></string-name></person-group>, &#x201C;<article-title>Spatiotemporal prediction of air quality based on LSTM neural network</article-title>,&#x201D; <source>Alexandria Engineering Journal</source>, vol. <volume>60</volume>, no. <issue>2</issue>, pp. <fpage>2021</fpage>&#x2013;<lpage>2032</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-14"><label>[14]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>C.</given-names> <surname>Wen</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Liu</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Yao</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Peng</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Li</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>A novel spatiotemporal convolutional long short-term neural network for air pollution prediction</article-title>,&#x201D; <source>Science of the Total Environment</source>, vol. <volume>654</volume>, pp. <fpage>1091</fpage>&#x2013;<lpage>1099</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-15"><label>[15]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Ma</surname></string-name>, <string-name><given-names>Y.</given-names> <surname>Ding</surname></string-name>, <string-name><given-names>V. J. L.</given-names> <surname>Gan</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Lin</surname></string-name> and <string-name><given-names>Z.</given-names> <surname>Wan</surname></string-name></person-group>, &#x201C;<article-title>Spatiotemporal prediction of PM<sub>2.5</sub> concentrations at different time granularities using IDW-BLSTM</article-title>,&#x201D; <source>IEEE Access</source>, vol. <volume>7</volume>, pp. <fpage>107897</fpage>&#x2013;<lpage>107907</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-16"><label>[16]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>P. W.</given-names> <surname>Soh</surname></string-name>, <string-name><given-names>J. W.</given-names> <surname>Chang</surname></string-name> and <string-name><given-names>J. W.</given-names> <surname>Huang</surname></string-name></person-group>, &#x201C;<article-title>Adaptive deep learning-based air quality prediction model using the most relevant spatial-temporal relations</article-title>,&#x201D; <source>IEEE Access</source>, vol. <volume>6</volume>, pp. <fpage>38186</fpage>&#x2013;<lpage>38199</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-17"><label>[17]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y. S.</given-names> <surname>Chang</surname></string-name>, <string-name><given-names>H. T.</given-names> <surname>Chiao</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Abimannan</surname></string-name>, <string-name><given-names>Y. P.</given-names> <surname>Huang</surname></string-name>, <string-name><given-names>Y. T.</given-names> <surname>Tsai</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>An LSTM-based aggregated model for air pollution forecasting</article-title>,&#x201D; <source>Atmospheric Pollution Research</source>, vol. <volume>11</volume>, no. <issue>8</issue>, pp. <fpage>1451</fpage>&#x2013;<lpage>1463</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-18"><label>[18]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S. V.</given-names> <surname>Belavadi</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Rajagopal</surname></string-name>, <string-name><given-names>R.</given-names> <surname>Ranjani</surname></string-name> and <string-name><given-names>R.</given-names> <surname>Mohan</surname></string-name></person-group>, &#x201C;<article-title>Air quality forecasting using LSTM RNN and wireless sensor networks</article-title>,&#x201D; <source>Procedia Computer Science</source>, vol. <volume>170</volume>, pp. <fpage>241</fpage>&#x2013;<lpage>248</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-19"><label>[19]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Y.</given-names> <surname>Han</surname></string-name>, <string-name><given-names>J. C.</given-names> <surname>Lam</surname></string-name>, <string-name><given-names>V. O.</given-names> <surname>Li</surname></string-name> and <string-name><given-names>D.</given-names> <surname>Reiner</surname></string-name></person-group>, &#x201C;<article-title>A Bayesian LSTM model to evaluate the effects of air pollution control regulations in Beijing, China</article-title>,&#x201D; <source>Environmental Science and Policy</source>, vol. <volume>115</volume>, no. <issue>11</issue>, pp. <fpage>26</fpage>&#x2013;<lpage>34</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-20"><label>[20]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>L.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>P.</given-names> <surname>Liu</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Zhao</surname></string-name>, <string-name><given-names>G.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>W.</given-names> <surname>Zhang</surname></string-name> <etal>et al.</etal></person-group><italic>,</italic> &#x201C;<article-title>Air quality predictions with a semi-supervised bidirectional LSTM neural network</article-title>,&#x201D; <source>Atmospheric Pollution Research</source>, vol. <volume>12</volume>, no. <issue>1</issue>, pp. <fpage>328</fpage>&#x2013;<lpage>339</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-21"><label>[21]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><given-names>C.</given-names> <surname>Olah</surname></string-name></person-group>, &#x201C;<article-title>Understanding LSTM Networks</article-title>,&#x201D; <year>2015</year>. [Online]. Available: <uri>https://colah.github.io/posts/2015-08-Understanding-LSTMs/</uri>.</mixed-citation></ref>
</ref-list>
</back>
</article>