<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">CMC</journal-id>
<journal-id journal-id-type="nlm-ta">CMC</journal-id>
<journal-id journal-id-type="publisher-id">CMC</journal-id>
<journal-title-group>
<journal-title>Computers, Materials &#x0026; Continua</journal-title>
</journal-title-group>
<issn pub-type="epub">1546-2226</issn>
<issn pub-type="ppub">1546-2218</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">27974</article-id>
<article-id pub-id-type="doi">10.32604/cmc.2022.027974</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Truncation and Rounding-Based Scalable Approximate Multiplier Design for Computer Imaging Applications</article-title>
<alt-title alt-title-type="left-running-head">Truncation and Rounding-Based Scalable Approximate Multiplier Design for Computer Imaging Applications</alt-title>
<alt-title alt-title-type="right-running-head">Truncation and Rounding-Based Scalable Approximate Multiplier Design for Computer Imaging Applications</alt-title>
</title-group>
<contrib-group content-type="authors">
<contrib id="author-1" contrib-type="author" corresp="yes">
<name name-style="western"><surname>Rooban</surname><given-names>S.</given-names></name><xref ref-type="aff" rid="aff-1">1</xref><email>sroban123@gmail.com</email></contrib>
<contrib id="author-2" contrib-type="author">
<name name-style="western"><surname>Yamini Naga Ratnam</surname><given-names>A.</given-names></name><xref ref-type="aff" rid="aff-1">1</xref></contrib>
<contrib id="author-3" contrib-type="author">
<name name-style="western"><surname>Ramprasad</surname><given-names>M. V. S.</given-names></name><xref ref-type="aff" rid="aff-2">2</xref></contrib>
<contrib id="author-4" contrib-type="author">
<name name-style="western"><surname>Subbulakshmi</surname><given-names>N.</given-names></name><xref ref-type="aff" rid="aff-3">3</xref></contrib>
<contrib id="author-5" contrib-type="author">
<name name-style="western"><surname>Uma Mageswari</surname><given-names>R.</given-names></name><xref ref-type="aff" rid="aff-4">4</xref></contrib>
<aff id="aff-1"><label>1</label><institution>Department of Electronics and Communication Engineering, Koneru Lakshmaiah Education Foundation</institution>, <addr-line>Vaddeswaram, Guntur, 522502, Andhra Pradesh</addr-line>, <country>India</country></aff>
<aff id="aff-2"><label>2</label><institution>Department of EECE, GITAM (Deemed to be University)</institution>, <addr-line>Visakhapatnam, AP</addr-line>, <country>India</country></aff>
<aff id="aff-3"><label>3</label><institution>Department of Electronics and Communication Engineering, Francis Xavier Engineering College</institution>, <addr-line>Tirunelveli, 627003, Tamil Nadu</addr-line>, <country>India</country></aff>
<aff id="aff-4"><label>4</label><institution>Department of Computer Science and Engineering, Vardhaman College of Engineering</institution>, <addr-line>Shamshabad, Hyderabad, 501218, Telangana</addr-line>, <country>India</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>&#x002A;</label>Corresponding Author: S. Rooban. Email: <email>sroban123@gmail.com</email></corresp>
</author-notes>
<pub-date pub-type="epub" date-type="pub" iso-8601-date="2022-07-25"><day>25</day>
<month>07</month>
<year>2022</year></pub-date>
<volume>73</volume>
<issue>3</issue>
<fpage>5169</fpage>
<lpage>5184</lpage>
<history>
<date date-type="received"><day>30</day><month>1</month><year>2022</year></date>
<date date-type="accepted"><day>12</day><month>4</month><year>2022</year></date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2022 Rooban et al.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Rooban et al.</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_CMC_27974.pdf"></self-uri>
<abstract>
<p>Advanced technology used for arithmetic computing application, comprises greater number of approximate multipliers and approximate adders. Truncation and Rounding-based Scalable Approximate Multiplier (TRSAM) distinguish a variety of modes based on height (<italic>h</italic>) and truncation (<italic>t</italic>) as TRSAM (<italic>h, t</italic>) in the architecture. This TRSAM operation produces higher absolute error in Least Significant Bit (LSB) data shift unit. A new scalable approximate multiplier approach that uses truncation and rounding TRSAM (<italic>3, 7</italic>) is proposed to increase the multiplier accuracy. With the help of foremost one bit architecture, the proposed scalable approximate multiplier approach reduces the partial products. The proposed approximate TRSAM multiplier architecture gives better results in terms of area, delay, and power. The accuracy of 95.2&#x0025; and the energy utilization of 24.6 nJ is observed in the proposed multiplier design. The proposed approach shows 0.11&#x0025;, 0.23&#x0025;, and 0.24&#x0025; less Mean Absolute Relative Error (MARE) when compared with the existing approach for the input of 8-bit, 16-bit, and 32-bit respectively. It also shows 0.13&#x0025;, 0.19&#x0025;, and 0.2&#x0025; less Variance of Absolute Relative Error (VARE) when compared with the existing approach for the input of 8-bit, 16-bit, and 32-bit respectively. The proposed approach is implemented with Field-Programmable Gate Array (FPGA) and shows the delay of 3.640, 6.481, 12.505, 22.572, and 36.893&#x2005;ns for the input of 8-bit, 16-bit, 32-bit, 64-bit, and 128-bit respectively. The proposed approach is applied in digital filters design which shows the Peak-Signal-to-Noise Ratio (PSNR) of 25.05&#x2005;dB and Structural Similarity Index Measure (SSIM) of 0.98 with 393 pJ energy consumptions when used in image application. The proposed approach is simulated with Xilinx and MATLAB and implemented with FPGA.</p></abstract>
<kwd-group kwd-group-type="author">
<kwd>Truncation rounding based scalable approximate multiplier</kwd>
<kwd>foremost one detector</kwd>
<kwd>field programmable gate array</kwd>
<kwd>peak-signal-to-noise-ratio</kwd>
<kwd>structural similarity index measure</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1"><label>1</label><title>Introduction</title>
<p>Digital system requires binary arithmetic operations to realize any boolean function. Adders are the key components in binary arithmetic, and they are used in all basic operations such as addition, subtraction, multiplication, and division. Adders are used to implement increment, decrement, and other similar operations in addition to arithmetic operations. As a result, at the micro-architecture level of abstraction, adders are considered as the fundamental building blocks in digital systems, particularly for data-intensive applications design. In computer arithmetic, addition and multiplication are the commonly used operations. For addition, full-adder cells are investigated for approximate computation. Approximate circuits are preferred over exact circuits because of their reliability, in most of the cases approximate circuits are energy efficient also occupying less Look-Up tables (LUT&#x2019;s) [<xref ref-type="bibr" rid="ref-1">1</xref>]. In [<xref ref-type="bibr" rid="ref-2">2</xref>], compares various adders and provides several new measures for assessing approximate and probabilistic adders. When two numbers of equal bit size are multiplied, the size of the result is twice the bit size of the multiplicand or multiplier. Finite Impulse Response (FIR) filtering, Fast Fourier Transform (FFT), and Discrete Cosine Transform (DCT) are used in digital signal processing applications, and they are heavily relying on arithmetic operations. Arithmetic operations like multiplication and division on binary number system are computationally demanding reduced area, latency, and power. The fixed-point number system is suffered with overflow and scaling due to its lack of precision, but the floating-point number system offers superior precision and scalability. In [<xref ref-type="bibr" rid="ref-3">3</xref>], a method is proposed to derive an analytical correction term for reducing the error. A truncated multiplication approach is used in majority of these designs. Power consumption is one of the most significant qualities for any electronic equipment, particularly in battery-powered hand-held gadgets.</p>
<p>In [<xref ref-type="bibr" rid="ref-4">4</xref>], a static truncation-based technique that approximates the outcome by using the higher, medium, or lower portions of the inputs. These techniques are with the drawbacks that they are not easily scale to larger input widths, and their benefits are diminishing as the input size expands. Rounding of operands to the nearest exponent in both signed and unsigned multiplications are supported in [<xref ref-type="bibr" rid="ref-5">5</xref>], but hardware implementation is quite complex. In [<xref ref-type="bibr" rid="ref-6">6</xref>&#x2013;<xref ref-type="bibr" rid="ref-8">8</xref>], to achieve high performance, the core operations of the arithmetic processor, such as addition, and multiplication are critical. For approximation computing, addition is intensively explored for power usage and latency reduction, these techniques can be used in image processing application. In [<xref ref-type="bibr" rid="ref-9">9</xref>], symmetric bit stacking approach multiplication is made easier by reducing the complexity of the wallace tree structure. A 4:2 compressor with a low supply voltage is proposed in [<xref ref-type="bibr" rid="ref-10">10</xref>&#x2013;<xref ref-type="bibr" rid="ref-12">12</xref>], this design achieves lower delay with high energy efficiency, but worse normalized mean error.</p>
<p>In [<xref ref-type="bibr" rid="ref-13">13</xref>], Fin Field Effect Transistor (Fin-FET) is used to implement the ultra-efficient [<xref ref-type="bibr" rid="ref-14">14</xref>], approximate 4:2 compressor at the transistor level. Reduced number of transistors shows low power, low propagation delay and a shorter critical path avoids the capacitance effects. In [<xref ref-type="bibr" rid="ref-15">15</xref>], approximate booth multiplier models with radix-4 modified booth encoding technique is proposed. Compared with the existing approximate booth multipliers, shows significant increase in accuracy and hardware performance error rate when the approximation factor increased. For 8-Bit use, three new approximate 4:2 multiplier compressors are designed in [<xref ref-type="bibr" rid="ref-16">16</xref>]. In addition, an Error Correction Module (ECM) is included to improve the error performance in 4:2 compressors. The number of outputs of the approximate 4:2 compressor is reduced to one, and the energy efficiency is enhanced. In [<xref ref-type="bibr" rid="ref-17">17</xref>], new metrics including error distance (ED), mean error distance (MED) and Normalized Error Distance (NED) are proposed for evaluating the design of approximate adders.</p>
<p>In [<xref ref-type="bibr" rid="ref-18">18</xref>], a configurable architecture for implementing the FIR filter with low complexity is proposed by using Constant Shifts Method (CSM) and Programmable Shift Method (PSM). This approach is the combination of Shift and Add unit, Multiplexer unit, Final shifter unit, and Final adder unit. The proposed methodology in [<xref ref-type="bibr" rid="ref-19">19</xref>], round off the operands to the nearest power of two. This method works for both signed and unsigned augmentations and offer three approximate multiplier executions, one for unsigned tasks and two for marked tasks. The Truncation and Rounding based Scalable Approximate Multiplier (TOSAM) in [<xref ref-type="bibr" rid="ref-20">20</xref>], uses versatile approximated multiplier that reduce the number of partial products by truncating all the operands based on their leading one-bit location. In this structure, fixed-width result shows significant improvements in energy utilization and area when compared to other multipliers. The input operands are modified to the nearest odd number to improve the complete exactness. The method proposed in [<xref ref-type="bibr" rid="ref-21">21</xref>], relies on mathematical reasoning to separate the polynomial of the multiplier.</p>
<p>The above discussed methods determine the correctness by the values of <italic>t</italic> and <italic>h</italic> parameters, where the width of the input operands having no major impact. As a result, the proposed multiplier has a scalability characteristic. The following are the main contributions of the proposed work.
<list list-type="order">
<list-item><p>A new scalable approximate multiplier using truncation and rounding technique is proposed to increase the multiplier accuracy.</p></list-item>
<list-item><p>The combinations of different (<italic>h, t</italic>) parameters are examined in order to find a balance between accuracy, delay, and energy usage.</p></list-item>
<list-item><p>A hardware implementation of the truncation and rounding-based scalable approximate multiplier (TRSAM) for both signed and unsigned operations are presented.</p></list-item>
</list></p>
<p>The remainder of this work is organized as follows. Section 2 introduces the proposed approximate multiplier and its hardware implementation. Section 3 compares the results along with error analysis. The proposed approximate multiplier is used in image [<xref ref-type="bibr" rid="ref-6">6</xref>,<xref ref-type="bibr" rid="ref-7">7</xref>], processing applications, the parameters delay, area, and power are also analyzed in Section 4. Section 5 concludes this article.</p>
</sec>
<sec id="s2"><label>2</label><title>Proposed Multiplier Approach</title>
<p>The proposed approach is an updated version of [<xref ref-type="bibr" rid="ref-20">20</xref>]. The proposed approximate multiplier performs the rounding and truncation at the inputs which reduces the error rate and the variation in MARE and VARE values to improve the overall circuit performance. In the proposed approach the LSB&#x2019;s are modified in such a way, so that the approximate value is nearly equal to the accurate value. TRSAM modes based on height (<italic>h</italic>) and truncation (<italic>t</italic>) such as (<italic>h, t</italic>) in the architecture TRSAM (<italic>0, 2</italic>), TRSAM (<italic>0, 3</italic>), TRSAM (<italic>1, 5</italic>), TRSAM (<italic>2, 6</italic>), TRSAM (<italic>3, 7</italic>), TRSAM (<italic>4, 8</italic>) TRSAM (<italic>5, 9</italic>). <xref ref-type="fig" rid="fig-1">Fig. 1</xref> shows the proposed approximate multiplier for implementing various combinations of (<italic>h, t</italic>) along with bit lengths. There are few changes in the architecture and algorithm for different combinations of (<italic>h,&#x00A0;t</italic>).</p>
<fig id="fig-1"><label>Figure 1</label><caption><title>Block diagram of the proposed approximate multiplier</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMC_27974-fig-1.png"/></fig>
<p>The proposed approximate multiplier consists of Approximate absolute unit (AAU), Foremost one Detector bit (FOD), Truncation Unit (TU), Arithmetic Unit (AU), Shift Unit (SU) and Operand Detector Unit (ODU). The AAU is applied with the inputs of A and B. If the input operand is negative, the results are inverted; if the input operand is positive, the results are unchanged. This AAU can be removed for unsigned multipliers. The operation in AAU is explained in detail through <xref ref-type="fig" rid="fig-2">Fig. 2</xref>. The output of AAU is denoted as (Y<sub>A</sub>)<sub>APX</sub> and (Y<sub>B</sub>)<sub>APX</sub>. FOD unit takes the input (Y<sub>A</sub>)<sub>APX</sub> and (Y<sub>B</sub>)<sub>APX</sub>, values. By using these values, k<sub>A</sub> and k<sub>B</sub> are detected, which detects position of the bit value with &#x2018;1&#x2019; from the MSB. These k<sub>A</sub> and k<sub>B</sub> are responsible for shifting operation. TU inputs are k<sub>A,</sub> k<sub>B</sub>, (Y<sub>A</sub>)<sub>APX</sub>, and (Y<sub>B</sub>)<sub>APX</sub>. The approximation inputs are trimmed and converted to a fixed width operands and they occupy the foremost position of the input operands. The output is obtained from the truncation unit (Y<sub>A</sub>)<sub>t</sub> and (Y<sub>B</sub>)<sub>t</sub> are given as inputs to the arithmetic unit. The terms (Y<sub>A</sub>)<sub>t</sub> and (Y<sub>B</sub>)<sub>t</sub> acquired from the truncated unit which is represented in <xref ref-type="disp-formula" rid="eqn-1">Eq. (1)</xref>.
<disp-formula id="eqn-1"><label>(1)</label><mml:math id="mml-eqn-1" display="block"><mml:mrow><mml:mtext>TU</mml:mtext></mml:mrow><mml:mo>=</mml:mo><mml:mn>10000000000</mml:mn><mml:mo>+</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mtext>Y</mml:mtext></mml:mrow><mml:mrow><mml:mtext>A</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mtext>t</mml:mtext></mml:mrow></mml:msub></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mtext>Y</mml:mtext></mml:mrow><mml:mrow><mml:mtext>B</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mtext>t</mml:mtext></mml:mrow></mml:msub></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mtext>Y</mml:mtext></mml:mrow><mml:mrow><mml:mtext>A</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mtext>APX</mml:mtext></mml:mrow></mml:mrow></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mtext>Y</mml:mtext></mml:mrow><mml:mrow><mml:mtext>B</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mrow><mml:mrow><mml:mtext>APX</mml:mtext></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula></p>
<fig id="fig-2"><label>Figure 2</label><caption><title>Example of 16-bit TRSAM (3, 7) (Case 1: A, B are positive)</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMC_27974-fig-2.png"/></fig>
<p>The arithmetic unit performs addition on the truncated fixed width operands ((Y<sub>A</sub>)<sub>t</sub> &#x002B; (Y<sub>B</sub>)<sub>t</sub>) and multiply the approximation inputs ((Y<sub>A</sub>)<sub>APX</sub> (Y<sub>B</sub>)<sub>APX</sub>), result is enlarged to one bit by concatenating &#x2018;1&#x2019; at the MSB. The MSBs of (Y<sub>A</sub>)<sub>APX</sub> and (Y<sub>B</sub>)<sub>APX</sub> are identical to those of (Y<sub>A</sub>)<sub>t</sub> and (Y<sub>B</sub>)<sub>t</sub>. The adders and logic AND gates in the Arithmetic Unit use power gating, based on the operating mode. In TU, the arithmetic unit&#x2019;s output is left shifted by k<sub>A</sub> &#x002B; k<sub>B</sub> times (k<sub>A</sub> and k<sub>B</sub> are the leading one-bit values of A and B). The term (1000000000 &#x002B; (Y<sub>A</sub>)<sub>t</sub> &#x002B; (Y<sub>B</sub>)<sub>t</sub> &#x002B; (Y<sub>A</sub>)<sub>APX</sub> (Y<sub>B</sub>)<sub>APX</sub>) is obtained by performing the shifting operation. The maximum possible truncation &#x2018;<italic>t&#x2019;</italic> and rounding &#x2018;<italic>h&#x2019;</italic> values are (<italic>5, 9</italic>) where <italic>h&#x2009;</italic>&#x003D;&#x2009;5 and <italic>t&#x2009;</italic>&#x003D;&#x2009;9.The output operand sign is determined by the sign of the input operands, and if at least one of the inputs is zero, the output is set to zero. The AAU is eliminated, by the unsigned input operands if ODU remains constant, in other words for unsigned operands this AAU is not producing any output, for signed operands. There are three cases for verifying the output they are <italic>i)</italic> both the inputs are positive, <italic>ii)</italic> both inputs are negative and <italic>iii)</italic> either one of the inputs is positive.</p>
<p><xref ref-type="fig" rid="fig-2">Fig. 2</xref> is the example of 16-bit TRSAM approximate multiplier for the parameter (<italic>3, 7</italic>), the rounding value &#x2018;<italic>h</italic>&#x2019; is 3 and truncation value &#x2018;<italic>t</italic>&#x2019; is 7. Considering A and B both are positive inputs, assume A as 0010 1101 1111 0001 (binary of 11761) and B as 0000 1001 1011 0010 (binary of 2482). The Foremost one detector unit (FOD) represents K<sub>A</sub> and K<sub>B</sub> values are 13 and 11 respectively. The values of (Y<sub>A</sub>)<sub>APX&#x2009;</sub>&#x003D;&#x2009;0111 and (Y<sub>B</sub>)<sub>APX&#x2009;</sub>&#x003D;&#x2009;0011. These (Y<sub>A</sub>)<sub>APX</sub> and (Y<sub>B</sub>)<sub>APX</sub> are multiplied and gives the product value as 0001 0101. The values of (Y<sub>A</sub>)<sub>t</sub>are seven places from the K<sub>A</sub> and the LSB bit is considered as 0, the (Y<sub>A</sub>)<sub>t</sub> &#x003D;&#x2009;0110 1110 and in similar way (Y<sub>B</sub>)<sub>t</sub> &#x003D;&#x2009;0011 0110. Next the product value of (Y<sub>A</sub>)<sub>APX</sub> and (Y<sub>B</sub>)<sub>APX</sub> are to be added with the terms (Y<sub>A</sub>)<sub>t</sub> and (Y<sub>B</sub>)<sub>t</sub> <xref ref-type="disp-formula" rid="eqn-1">Eq. (1)</xref>, represents the product value. The computed product value is 01 1011 1001. Here the shifting is based upon the addition of K<sub>A</sub> and K<sub>B</sub> values. The final output is 0000 0001 1011 1001 1111 1111 1111 1111 (binary of 28 966 911), so the value is (A &#x00D7; B)<sub>Proposed&#x2009;</sub>&#x003D;&#x2009;28 966 911, where the exact value is (A &#x00D7; B)<sub>Exact&#x2009;</sub>&#x003D;&#x2009;29 190 802, the difference value is 223 891.</p>
<p><xref ref-type="fig" rid="fig-3">Fig. 3</xref> follows the above algorithm, considering A is positive, and B is negative inputs, assume the value of A as 0000 0001 1100 0000 (binary of 448) and B as 1100 0000 0000 0000 (binary of &#x2212;16384). Here the K<sub>A</sub> and K<sub>B</sub> values are 8 and 15 respectively. The values (Y<sub>A</sub>)<sub>APX</sub> &#x003D;&#x2009;1101 and (Y<sub>B</sub>)<sub>APX&#x2009;</sub>&#x003D;&#x2009;1001. These (Y<sub>A</sub>)<sub>APX</sub> and (Y<sub>B</sub>)<sub>APX</sub> are multiplied and gives the product value as 0111 0101. The values of (Y<sub>A</sub>)<sub>t&#x2009;</sub>&#x003D;&#x2009;1100 0000, similarly (Y<sub>B</sub>)<sub>t</sub> &#x003D;&#x2009;1000 0000. The <xref ref-type="disp-formula" rid="eqn-1">Eq. (1)</xref>, represents the product value as 10 1011 0101. Base on shifting the final output is 1111 1111 0101 1010 1111 1111 1111 1111 (binary of &#x2212;10 813 441), so the value is (A &#x00D7; B)<sub>Proposed</sub> &#x003D; &#x2212;10 813 441, where the exact value is (A &#x00D7; B)<sub>Exact</sub> &#x003D; &#x2212;7 340 032, the difference value is 3 473 409.</p>
<fig id="fig-3"><label>Figure 3</label><caption><title>Example of 16-bit TRSAM (3, 7) (Case II: A is positive, and B is negative)</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMC_27974-fig-3.png"/></fig>
<p><xref ref-type="fig" rid="fig-4">Fig. 4</xref> is the example of 16-bit TRSAM approximate multiplier for the parameter (<italic>3, 7</italic>). Considering A and B are negative inputs, assume the value of A as 1111 1111 1100 0110 (binary of &#x2212;58) and B as 1111 1111 1110 1010 (binary of &#x2212;22). Here the K<sub>A</sub> and K<sub>B</sub> values 15 and 15 respectively. The values (Y<sub>A</sub>)<sub>APX&#x2009;</sub>&#x003D;&#x2009;1111 and (Y<sub>B</sub>)<sub>APX&#x2009;</sub>&#x003D;&#x2009;1111. These (Y<sub>A</sub>)<sub>APX</sub> and (Y<sub>B</sub>)<sub>APX</sub> are multiplied and gives the product value as 1110 0001. The values (Y<sub>A</sub>)<sub>t</sub>is considered to be seven places from the K<sub>A</sub> and the LSB bit is considered as 0 in this case, so the (Y<sub>A</sub>)<sub>t&#x2009;</sub>&#x003D;&#x2009;1111 1110 and in similar way (Y<sub>B</sub>)<sub>t&#x2009;</sub>&#x003D;&#x2009;1111 1110. Next the product value of (Y<sub>A</sub>)<sub>APX</sub> and (Y<sub>B</sub>)<sub>APX</sub> are to be added with the terms (Y<sub>A</sub>)<sub>t</sub> and (Y<sub>B</sub>)<sub>t</sub> and the <xref ref-type="disp-formula" rid="eqn-1">Eq. (1)</xref>, represents the final product the final value is 10 0001 1101. The final output is 0000 0000 0000 0000 0000 0010 0001 1101 (binary of 541), so the value is (A &#x00D7; B)<sub>Proposed&#x2009;</sub>&#x003D;&#x2009;541, where the exact value is (A &#x00D7; B)<sub>Exact</sub> &#x003D;&#x2009;1276, the difference value is 735. The existed approach having more difference value also the error rate is more which is explained in [<xref ref-type="bibr" rid="ref-20">20</xref>]. We cannot say that the accuracy value is same or not for all the combinations, it depends on the parameters of (h, t).</p>
<fig id="fig-4"><label>Figure 4</label><caption><title>Example of 16-bit TRSAM (<italic>3, 7</italic>) (Case III: A, B are negative)</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMC_27974-fig-4.png"/></fig>
</sec>
<sec id="s3"><label>3</label><title>Performance Analysis</title>
<p>The proposed approximate multiplier is simulated in Xilinx Vivado. Several error metrics are used in approximate computing to quantify the errors and accuracy. Error Distance (ED) is defined as the arithmetic distance between a generated output and the correct one for every input. By considering the average impact of many inputs and the normalizing of multiple-bit adders, the mean error distance (MED) and the normalized error distance (NED) are determined. Because the NED is nearly invariant with the size of an implementation, it is used to examine the reliability of the design.</p>
<p><bold>Error Rate (ER):</bold> The percentage of approximate outputs among all outputs.</p>
<p><bold>Error Distance (ED):</bold> The arithmetic difference between the exact and approximate result.</p>
<p><bold>Mean Error Distance (MED):</bold> The average of EDs for a set of outputs obtained by applying a set &#x2002;&#x2002;&#x2002;of inputs.</p>
<p><bold>Relative Error Distance (RED):</bold> The ratio of ED to exact result.</p>
<p><bold>Mean Relative Error Distance (MRED):</bold> The average value of all possible relative error distance &#x2002;&#x2002;&#x2002;(RED).</p>
<p><bold>Normalized Mean Error Distance (NMED):</bold> The normalization of mean error distance (MED) by &#x2002;&#x2002;&#x2002;the maximum output of the accurate design. This metric is used for comparing circuit of different &#x2002;&#x2002;&#x2002;sizes.</p>
<p><bold>maxARE</bold> is specified as maximum absolute relative error (considered from relative error RE)</p>
<p><bold>MRE</bold> is specified as mean relative error</p>
<p><bold>MARE</bold> is specified as mean absolute relative error</p>
<p><bold>VARE</bold> is specified as variance of absolute relative error</p>
<p><bold>NED</bold> is specified as normalized error distance</p>
<p><bold>max_NED</bold> is specified as maximum normalized error distance</p>
<p><xref ref-type="table" rid="table-1">Tab. 1</xref> shows the MARE and the variance of ARE (VARE) <italic>vs.</italic> the width of the unsigned multipliers to demonstrate the scalability of the existing and the proposed approach. From the results it is identified that the multiplier&#x2019;s accuracy is greatly influenced by the h value (bit length of the multiplier). It is observed that increasing <italic>h</italic> by one, virtually halves the MARE value. The proposed approximation multiplier is implemented for 8-bit, 16-bit, 32-bit, 64-bit and 128-bits with the rounding and truncation values of (<italic>3, 7</italic>). The simulated results are shown in <xref ref-type="fig" rid="fig-5 fig-6 fig-7 fig-8 fig-9">Figs. 5&#x2013;9</xref>. From the results it is observed that the proposed approach gives better results than the earlier approaches.</p>
<table-wrap id="table-1"><label>Table 1</label><caption><title>MARE and VARE of the approximate multiplier with different widths and different h, t values</title></caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th align="left" rowspan="2">TOSAM architecture (<italic>h, t</italic>)</th>
<th align="center" colspan="2">8-bit [<xref ref-type="bibr" rid="ref-20">20</xref>]</th>
<th align="center" colspan="2">16-bit [<xref ref-type="bibr" rid="ref-20">20</xref>]</th>
<th align="center" colspan="2">32-bit [<xref ref-type="bibr" rid="ref-20">20</xref>]</th>
</tr>
<tr>
<th align="left">MARE (&#x0025;)</th>
<th align="left">VARE (&#x0025;)</th>
<th align="left">MARE (&#x0025;)</th>
<th align="left">VARE (&#x0025;)</th>
<th align="left">MARE (&#x0025;)</th>
<th align="left">VARE (&#x0025;)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">(<italic>0, 2</italic>)</td>
<td align="left">10.12</td>
<td align="left">43.73</td>
<td align="left">10.91</td>
<td align="left">46.55</td>
<td align="left">10.90</td>
<td align="left">46.63</td>
</tr>
<tr>
<td align="left">(<italic>0, 3</italic>)</td>
<td align="left">7.66</td>
<td align="left">28.23</td>
<td align="left">7.60</td>
<td align="left">28.78</td>
<td align="left">7.61</td>
<td align="left">28.81</td>
</tr>
<tr>
<td align="left">(<italic>1, 5</italic>)</td>
<td align="left">4.06</td>
<td align="left">8.38</td>
<td align="left">3.95</td>
<td align="left">7.61</td>
<td align="left">3.95</td>
<td align="left">7.60</td>
</tr>
<tr>
<td align="left">(<italic>2, 6</italic>)</td>
<td align="left">2.11</td>
<td align="left">2.29</td>
<td align="left">2.06</td>
<td align="left">2.00</td>
<td align="left">2.06</td>
<td align="left">2.00</td>
</tr>
<tr>
<td align="left">(<italic>3, 7</italic>)</td>
<td align="left">1.12</td>
<td align="left">0.65</td>
<td align="left">1.05</td>
<td align="left">0.51</td>
<td align="left">1.05</td>
<td align="left">0.52</td>
</tr>
<tr>
<td align="left">Proposed (<italic>3, 7</italic>)</td>
<td align="left">1.04</td>
<td align="left">0.52</td>
<td align="left">0.82</td>
<td align="left">0.32</td>
<td align="left">0.81</td>
<td align="left">0.32</td>
</tr>
<tr>
<td align="left">(<italic>4, 8</italic>)</td>
<td align="left">0.62</td>
<td align="left">0.20</td>
<td align="left">0.53</td>
<td align="left">0.13</td>
<td align="left">0.53</td>
<td align="left">0.13</td>
</tr>
<tr>
<td align="left">(<italic>5, 9</italic>)</td>
<td align="left">0.37</td>
<td align="left">0.06</td>
<td align="left">0.26</td>
<td align="left">0.03</td>
<td align="left">0.27</td>
<td align="left">0.03</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="fig-5"><label>Figure 5</label><caption><title>Results of 8-bit TRSAM (3, 7)</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMC_27974-fig-5.png"/></fig>
<fig id="fig-6"><label>Figure 6</label><caption><title>Results of 16-bit TRSAM (3, 7)</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMC_27974-fig-6.png"/></fig>
<fig id="fig-7"><label>Figure 7</label><caption><title>Results of 32-bit proposed approach TRSAM (3, 7)</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMC_27974-fig-7.png"/></fig>
<fig id="fig-8"><label>Figure 8</label><caption><title>Results of 64-bit proposed approach TRSAM (3, 7) approximate multiplier</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMC_27974-fig-8.png"/></fig>
<fig id="fig-9"><label>Figure 9</label><caption><title>Results of 128-bit proposed approach TRSAM (3, 7) approximate multiplier</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMC_27974-fig-9.png"/></fig>
<p>In <xref ref-type="fig" rid="fig-5">Fig. 5</xref> considering the value of A as 45 and B value as 27 the output is generated in the next cycle. The exact output (A &#x00D7; B)<sub>Exact&#x2009;</sub>&#x003D;&#x2009;1215, the output obtained from the previous approach [<xref ref-type="bibr" rid="ref-20">20</xref>], is (A &#x00D7; B)<sub>Existed&#x2009;</sub>&#x003D;&#x2009;714, and produce the error value of 501. For the same input case, the proposed approach produces the value for (A &#x00D7; B)<sub>Proposed&#x2009;</sub>&#x003D;&#x2009;715, the error value is 500. In other cases, for the same inputs the error value is gradually decreasing by using the proposed approach. Due to delay the first output value is shown in second cycle. In <xref ref-type="fig" rid="fig-6">Fig. 6</xref> considering A value is 11761 and B value is 2482. The exact output (A &#x00D7; B)<sub>Exact&#x2009;</sub>&#x003D;&#x2009;29 190 802, the output obtained from the previous approach [<xref ref-type="bibr" rid="ref-20">20</xref>], (A &#x00D7; B)<sub>Existed&#x2009;</sub>&#x003D;&#x2009;28 901 376, here the error value is 289 426. For the same input case using the proposed approach the value is (A &#x00D7; B)<sub>Proposed&#x2009;</sub>&#x003D;&#x2009;28 966 911 here the error value is 223 891. In other cases, for the same inputs the error value is gradually decreasing by using the proposed approach. Due to delay the first output value is shown in second cycle.</p>
<p>The <xref ref-type="fig" rid="fig-7">Fig. 7</xref> input A value is 75563008 and B value is 1795686 exact outputs (A &#x00D7; B)<sub>Exact</sub> &#x003D; 135 687 435 583 488, the proposed approach value (A &#x00D7; B)<sub>Proposed&#x2009;</sub>&#x003D;&#x2009;138 263 587 192 832 the difference value is 2 576 151 609 344 is to be subtracted from the obtained output to get the exact value. Here in this 32-bit approximate multiplier approach for certain input values it is showing the same output value because we are considering k<sub>A</sub> and k<sub>B</sub> values and using those values in shifting, but the output value which is obtained from the proposed approach is so close to exact value, while the existing approach [<xref ref-type="bibr" rid="ref-20">20</xref>], is far from the exact value.</p>
<p>The results of 64-bit are shown in <xref ref-type="fig" rid="fig-8">Fig. 8</xref>. To calculate the error rate a multiplier is used which shows the difference values in between proposed approach to exact value. For 128-bit the results are shown in <xref ref-type="fig" rid="fig-9">Fig. 9</xref>. Three cases, A and B are positive, A is negative value and B is positive, A and B both are negative are considered for 64-bit and 128-bit simulations. The error rate is the difference value in between the original output and A &#x00D7; B<sub>app</sub>. The bit length goes on increasing, to find the exact output value it becomes so difficult, so for higher bit lengths original output block is used. The accuracy depends on the difference in between exact value and obtained value.</p>
<p><xref ref-type="fig" rid="fig-10">Fig. 10</xref> is the RTL schematic of TRSAM approximate multiplier with parameter (<italic>3, 7</italic>). RTL schematic gives an overview of how the connections between one module to another module also, it explains the data path between the modules. The schematic is same for all the bit lengths and different combinations of (h, t).</p>
<fig id="fig-10"><label>Figure 10</label><caption><title>Schematic of proposed TRSAM (3, 7) for 16-bit approximate multiplier</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMC_27974-fig-10.png"/></fig>
<p>In the <xref ref-type="table" rid="table-2">Tab. 2</xref>, comparison consists of parameters like Slice register, Slice LUT, IOB, Fan-out, Power, Delay of 8-Bit, 16-Bit, 32-Bit, 64-Bit, and 128-Bit for the parameter (<italic>h, t</italic>) in FPGA family [<xref ref-type="bibr" rid="ref-22">22</xref>], of Vertex 7FPGA (XC7VX330T-2FFG1761). <xref ref-type="fig" rid="fig-11">Fig. 11</xref> shows how the parameters are varying against all the bit lengths. The proposed structure is energy efficient because it consumes the power from 12.8 to 84.5 for various bitlengths.</p>
<table-wrap id="table-2"><label>Table 2</label><caption><title>Comparisons for 8-bit, 16-bit, 32-bit, 64-bit, and 128-bit in vertex 7FPGA</title></caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th align="left" rowspan="2">Parameters</th>
<th align="center" colspan="5">TRSAM (<italic>3, 7</italic>) vertex 7FPGA (XC7VX330T-2FFG1761)</th>
</tr>
<tr>
<th align="left">8-BIT</th>
<th align="left">16-BIT</th>
<th align="left">32-BIT</th>
<th align="left">64-BIT</th>
<th align="left">128-BIT</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Slice Register</td>
<td align="left">12</td>
<td align="left">15</td>
<td align="left">17</td>
<td align="left">19</td>
<td align="left">21</td>
</tr>
<tr>
<td align="left">Slice LUT</td>
<td align="left">114</td>
<td align="left">236</td>
<td align="left">524</td>
<td align="left">1071</td>
<td align="left">2619</td>
</tr>
<tr>
<td align="left">IOB</td>
<td align="left">34</td>
<td align="left">66</td>
<td align="left">130</td>
<td align="left">258</td>
<td align="left">514</td>
</tr>
<tr>
<td align="left">Fan-out</td>
<td align="left">3.41</td>
<td align="left">3.63</td>
<td align="left">3.83</td>
<td align="left">3.81</td>
<td align="left">4.59</td>
</tr>
<tr>
<td align="left">Power (W)</td>
<td align="left">12.843</td>
<td align="left">19.933</td>
<td align="left">29.03</td>
<td align="left">47.978</td>
<td align="left">84.594</td>
</tr>
<tr>
<td align="left">Delay (ns)</td>
<td align="left">3.640</td>
<td align="left">6.481</td>
<td align="left">12.505</td>
<td align="left">22.572</td>
<td align="left">36.893</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="fig-11"><label>Figure 11</label><caption><title>Graphical representation of parameters of TRSAM (<italic>3, 7</italic>) approximate multiplier</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMC_27974-fig-11.png"/></fig>
</sec>
<sec id="s4"><label>4</label><title>Image Processing Application</title>
<p>This section discusses the proposed approximation multipliers in image processing application. Gaussian filter in image processing uses the proposed multiplier to decrease the Gaussian noise. Convolution of the original image with the defined Gaussian mask is used for filtering [<xref ref-type="bibr" rid="ref-6">6</xref>]. Image multiplication is one of the most important operations in image processing which is used to evaluate the efficiency of the proposed approximate multipliers through real-world applications. Image processing with the proposed approximate multipliers is simulated with MATLAB. To analyze the precision of the output images, Peak Signal to Noise Ratio (PSNR) is calculated and is used as the metric for image quality assessment [<xref ref-type="bibr" rid="ref-23">23</xref>&#x2013;<xref ref-type="bibr" rid="ref-26">26</xref>]. Convolution is performed through this approximate multiplier for an input picture of size 8 &#x00D7; 8 with 2D 3 &#x00D7; 3 Gaussian mask. Figure of merit PSNR is calculated using the mean-square error between the original and filtered images. The proposed truncated approximation multiplier is compared with various previous [<xref ref-type="bibr" rid="ref-22">22</xref>,<xref ref-type="bibr" rid="ref-27">27</xref>&#x2013;<xref ref-type="bibr" rid="ref-30">30</xref>], multipliers to validate the quality through the errors. Testing of the proposed multipliers is performed with sharpening and JPEG compression image processing applications. The various benchmarks for sharpening and JPEG compression are Baboon and Lena. The PSNR block calculates the peak signal-to-noise ratio between two images in decibels. This ratio is used to compare the quality of the original and the compressed images. Higher PSNR shows the better quality of the compressed or reconstructed image.</p>
<p>The PSNR ratio and Structural Similarity Index Measurement (SSIM) of approximate outcomes are calculated in the sharpening application using MATLAB simulations and compared with the precise output images. Additionally, the Multiplier and Accumulator (MAC) module is included to construct a sharpening unit, and the MAC energy consumption with various multipliers are calculated. Image multiplication is performed through pixel-by-pixel, where the two input images are combined and produces the output of one image. Along with the PSNR, SSIM is also utilized to measure the structural similarity of the approximation and exact images. SSIM is considered as accuracy and consistent with image quality in human perception. It can be used to implement video applications and recent image applications techniques with few architecture changes. With some changes in the algorithm and inserting some new blocks there is possibility for usage in small object detection [<xref ref-type="bibr" rid="ref-31">31</xref>,<xref ref-type="bibr" rid="ref-32">32</xref>].</p>
<p><xref ref-type="table" rid="table-3">Tab. 3</xref> compares the PSNR, SSIM and energy consumed by the various approximate multipliers which are obtained using MATLAB. <xref ref-type="fig" rid="fig-12 fig-13 fig-14 fig-15">Figs. 12&#x2013;15</xref> represents the exact and the output images which are produced by the proposed architecture. There is possibility for some modifications in the structure in case we want to use the Blocked RAM, as each multiplier can be associated with individual BRAM, or we can use alone based on the modifications which has to be done in the project.</p>
<table-wrap id="table-3"><label>Table 3</label><caption><title>Comparison of PSNR (Decibel), SSIM and energy of JPEG encoder for 16-bit</title></caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th align="left" rowspan="2">Benchmark</th>
<th align="center" colspan="2">Baboon</th>
<th align="center" colspan="2">Lena</th>
<th align="left" rowspan="2">Energy (pJ)</th>
</tr>
<tr>
<th align="left">PSNR</th>
<th align="left">SSIM</th>
<th align="left">PSNR</th>
<th align="left">SSIM</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Exact [<xref ref-type="bibr" rid="ref-20">20</xref>]</td>
<td align="left">26.21</td>
<td align="left">0.913</td>
<td align="left">32.27</td>
<td align="left">0.973</td>
<td align="left">836</td>
</tr>
<tr>
<td align="left">TOSAM [<xref ref-type="bibr" rid="ref-20">20</xref>]</td>
<td align="left">25.56</td>
<td align="left">0.901</td>
<td align="left">30.88</td>
<td align="left">0.967</td>
<td align="left">187</td>
</tr>
<tr>
<td align="left">DRUM (4) [<xref ref-type="bibr" rid="ref-23">23</xref>]</td>
<td align="left">25.48</td>
<td align="left">0.904</td>
<td align="left">30.84</td>
<td align="left">0.966</td>
<td align="left">339</td>
</tr>
<tr>
<td align="left">DRUM (5) [<xref ref-type="bibr" rid="ref-23">23</xref>]</td>
<td align="left">26.07</td>
<td align="left">0.912</td>
<td align="left">32.01</td>
<td align="left">0.972</td>
<td align="left">440</td>
</tr>
<tr>
<td align="left">RoBA [<xref ref-type="bibr" rid="ref-19">19</xref>]</td>
<td align="left">26.00</td>
<td align="left">0.910</td>
<td align="left">31.86</td>
<td align="left">0.972</td>
<td align="left">319</td>
</tr>
<tr>
<td align="left">LETAM (3, 4) [<xref ref-type="bibr" rid="ref-24">24</xref>]</td>
<td align="left">26.19</td>
<td align="left">0.912</td>
<td align="left">32.21</td>
<td align="left">0.973</td>
<td align="left">388</td>
</tr>
<tr>
<td align="left">LETAM (4, 5) [<xref ref-type="bibr" rid="ref-24">24</xref>]</td>
<td align="left">26.21</td>
<td align="left">0.912</td>
<td align="left">32.26</td>
<td align="left">0.973</td>
<td align="left">395</td>
</tr>
<tr>
<td align="left">Kulkarni et al. [<xref ref-type="bibr" rid="ref-29">29</xref>]</td>
<td align="left">25.63</td>
<td align="left">0.915</td>
<td align="left">32.24</td>
<td align="left">0.965</td>
<td align="left">398</td>
</tr>
<tr>
<td align="left">Proposed TRSAM</td>
<td align="left">25.05</td>
<td align="left">0.989</td>
<td align="left">28.03</td>
<td align="left">0.998</td>
<td align="left">393</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="fig-12"><label>Figure 12</label><caption><title>Baboon (a) exact image, (b) by using existed TOSAM, (c) by using proposed TRSAM</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMC_27974-fig-12.png"/></fig>
<fig id="fig-13"><label>Figure 13</label><caption><title>Lena (d) exact image, (e) by using existed TOSAM, (f) by using proposed TRSAM</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMC_27974-fig-13.png"/></fig>
<fig id="fig-14"><label>Figure 14</label><caption><title>Eyes (g) input image, (h) image by using the proposed approach</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMC_27974-fig-14.png"/></fig>
<fig id="fig-15"><label>Figure 15</label><caption><title>Voice (i) input image, (j) image by using the proposed approach</title></caption><graphic mimetype="image" mime-subtype="png" xlink:href="CMC_27974-fig-15.png"/></fig>
<p>The PSNR is the expression for the ratio between the maximum power of a signal and the power consumed by the distorting noise which affects the quality. The SSIM ratio is calculated by measuring the quality between the original input image and the output image. Higher PSNR value gives the better quality of the compressed or reconstructed output image. By using SSIM quality of a digital television and pictures can be predicted. This is used for measuring the similarity between the two images. Generally, the SSIM values ranges from 0 to 1, 1 means perfect match of reconstructed image with original one. The values 0.97, 0.98, 0.99 are for a good quality reconstruction image. The PSNR and SSIM of different image values are tabulated in <xref ref-type="table" rid="table-4">Tab. 4</xref>. The Accuracy for the respective architecture is tabulated in <xref ref-type="table" rid="table-5">Tab. 5</xref>.</p>
<table-wrap id="table-4"><label>Table 4</label><caption><title>PSNR and SSIM by using the proposed approach for 32-bit TRSAM</title></caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th align="left">Benchmark</th>
<th align="left">PSNR of output image</th>
<th align="left">SSIM</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Baboon</td>
<td align="left">44.0926</td>
<td align="left">0.999</td>
</tr>
<tr>
<td align="left">Cartoon</td>
<td align="left">44.1494</td>
<td align="left">0.998</td>
</tr>
<tr>
<td align="left">Dog</td>
<td align="left">44.1423</td>
<td align="left">0.998</td>
</tr>
<tr>
<td align="left">Eyes</td>
<td align="left">44.0828</td>
<td align="left">0.999</td>
</tr>
<tr>
<td align="left">Lena</td>
<td align="left">44.0635</td>
<td align="left">0.999</td>
</tr>
<tr>
<td align="left">Penguin</td>
<td align="left">44.3095</td>
<td align="left">0.998</td>
</tr>
<tr>
<td align="left">Voice</td>
<td align="left">46.2662</td>
<td align="left">0.998</td>
</tr>
<tr>
<td align="left">Flowers</td>
<td align="left">44.1552</td>
<td align="left">0.999</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="table-5"><label>Table 5</label><caption><title>Accuracy and energy comparison for 16-bit approximate multipliers</title></caption>
<table frame="hsides">
<colgroup>
<col align="left"/>
<col align="left"/>
<col align="left"/>
</colgroup>
<thead>
<tr>
<th align="left">Architecture</th>
<th align="left">Accuracy (&#x0025;)</th>
<th align="left">Energy (nJ)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">Wallace [<xref ref-type="bibr" rid="ref-17">17</xref>]</td>
<td align="left">94.9</td>
<td align="left">64.3</td>
</tr>
<tr>
<td align="left">TOSAM [<xref ref-type="bibr" rid="ref-17">17</xref>]</td>
<td align="left">93.8</td>
<td align="left">14.9</td>
</tr>
<tr>
<td align="left">DRUM (4) [<xref ref-type="bibr" rid="ref-19">19</xref>]</td>
<td align="left">94.7</td>
<td align="left">29.7</td>
</tr>
<tr>
<td align="left">LETAM (3, 4) [<xref ref-type="bibr" rid="ref-20">20</xref>]</td>
<td align="left">94.5</td>
<td align="left">25.0</td>
</tr>
<tr>
<td align="left">RoBA [<xref ref-type="bibr" rid="ref-16">16</xref>]</td>
<td align="left">94.1</td>
<td align="left">26.2</td>
</tr>
<tr>
<td align="left">Noorbasha et al. [<xref ref-type="bibr" rid="ref-22">22</xref>]</td>
<td align="left">93.2</td>
<td align="left">25.6</td>
</tr>
<tr>
<td align="left">Proposed TRSAM</td>
<td align="left">95.2</td>
<td align="left">24.6</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s5"><label>5</label><title>Conclusions</title>
<p>A novel method to reduce the error in approximation truncation multiplier is proposed. This proposed multiplier is scalable and outperformed when compared with other approximate multipliers in terms of area, delay, and power. The proposed 32-bit TRSAM multiplier improves the average energy utilization by 2&#x0025; when compared to the exact ROBA, Wallace Tree, LETAM, DRUM, DSM, and DQ4:2C4 multipliers. This proposed approximation TRSAM (<italic>3, 7</italic>) multiplier shows an efficient result in three cases of inputs; they are <italic>i)</italic> both the inputs are positive, <italic>ii)</italic> one input is positive and another is negative and <italic>iii)</italic> both the inputs are negative. The accuracy for this approach is 95.2&#x0025; and the energy utilized is 24.6 nJ. The proposed design shows 0.11&#x0025;, 0.23&#x0025;, and 0.24&#x0025; less MARE for the input of 8-bit, 16-bit, and32-bit and also shows 0.13&#x0025;, 0.19&#x0025;, and 0.2&#x0025; less VARE for the input of 8-bit, 16-bit, 32-bit respectively. This proposed method can be used in image processing, digital signal processing, and classification-based applications. Rounding of the patterns are optimized based on the level of precision required and the compression techniques used. The proposed approach in image applications shows better performance than the existing approaches in terms of PSNR and SSIM. The future scope of the proposed work is combining the approximation approach with high efficiency video coding to improve the efficiency. By incorporating few changes in the proposed techniques there is a possible way for acquiring high PSNR values and the SSIM values of closer to &#x2018;1&#x2019;. Modern-day computing with increasing power and sophistication, the concept of computation can be expanded beyond to approximate computation in future to reduce the power consumption and delay.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="other"><p><bold>Funding Statement:</bold> The authors received no specific funding for this study.</p></fn>
<fn fn-type="conflict"><p><bold>Conflicts of Interest:</bold> The authors declare that they have no conflicts of interest to report regarding the present study.</p></fn>
</fn-group>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>[1]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>I.</given-names> <surname>Alouani</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Ahangari</surname></string-name>, <string-name><given-names>O.</given-names> <surname>Ozturk</surname></string-name> and <string-name><given-names>S.</given-names> <surname>Niar</surname></string-name></person-group>, &#x201C;<article-title>A novel heterogeneous approximate multiplier for low power and high performance</article-title>,&#x201D; <source>IEEE Embedded Systems Letters</source>, vol. <volume>10</volume>, no. <issue>2</issue>, pp. <fpage>45</fpage>&#x2013;<lpage>48</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-2"><label>[2]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Q.</given-names> <surname>Xu</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Mytkowicz</surname></string-name> and <string-name><given-names>N. S.</given-names> <surname>Kim</surname></string-name></person-group>, &#x201C;<article-title>Approximate computing: A survey</article-title>,&#x201D; <source>IEEE Design &#x0026; Test</source>, vol. <volume>33</volume>, no. <issue>1</issue>, pp. <fpage>8</fpage>&#x2013;<lpage>22</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-3"><label>[3]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Low</surname></string-name> and <string-name><given-names>C.</given-names> <surname>Jong</surname></string-name></person-group>, &#x201C;<article-title>Unified mitchell-based approximation for efficient logarithmic conversion circuit</article-title>,&#x201D; <source>IEEE Transactions on Computers</source>, vol. <volume>64</volume>, no. <issue>6</issue>, pp. <fpage>1783</fpage>&#x2013;<lpage>1797</lpage>, <year>2015</year>.</mixed-citation></ref>
<ref id="ref-4"><label>[4]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Narayanamoorthy</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Moghaddam</surname></string-name>, <string-name><given-names>Z.</given-names> <surname>Liu</surname></string-name>, <string-name><given-names>T.</given-names> <surname>Park</surname></string-name> and <string-name><given-names>N. S.</given-names> <surname>Kim</surname></string-name></person-group>, &#x201C;<article-title>Energy-efficient approximate multiplication for digital signal processing and classification applications</article-title>,&#x201D; <source>IEEE Transactions on Very Large Scale Integration Systems</source>, vol. <volume>23</volume>, no. <issue>6</issue>, pp. <fpage>1180</fpage>&#x2013;<lpage>1184</lpage>, <year>2015</year>.</mixed-citation></ref>
<ref id="ref-5"><label>[5]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>N. S.</given-names> <surname>Arya</surname></string-name> and <string-name><given-names>R. M.</given-names> <surname>Nair</surname></string-name></person-group>, &#x201C;<article-title>Approximate computing: A new trend in VLSI based multipliers for error resilient DIP applications</article-title>,&#x201D; <source>International Research Journal of Engineering and Technology (IRJET</source><italic>)</italic>, vol. <volume>5</volume>, no. <issue>4</issue>, pp. <fpage>3866</fpage>&#x2013;<lpage>3869</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-6"><label>[6]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>O. S.</given-names> <surname>Faragallah</surname></string-name>, <string-name><given-names>A. I.</given-names> <surname>Sallam</surname></string-name> and <string-name><given-names>H. S.</given-names> <surname>El-Sayed</surname></string-name></person-group>, &#x201C;<article-title>Utilization of HEVC chacha20-based selective encryption for secure telehealth video conferencing</article-title>,&#x201D; <source>Computers, Materials &#x0026; Continua</source>, vol. <volume>70</volume>, no. <issue>1</issue>, pp. <fpage>831</fpage>&#x2013;<lpage>845</lpage>, <year>2022</year>.</mixed-citation></ref>
<ref id="ref-7"><label>[7]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>G. M. S.</given-names> <surname>Latha</surname></string-name> and <string-name><given-names>S.</given-names> <surname>Rooban</surname></string-name></person-group>, &#x201C;<article-title>Quantum-dot cellular automata circuits using an efficient design and performance analysis</article-title>,&#x201D; <source>Sustainable Energy Technologies and Assessments</source>, vol. <volume>48</volume>, no. <issue>3</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>10</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-8"><label>[8]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>W.</given-names> <surname>Liu</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Chen</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>M. O.</given-names> <surname>Neill</surname></string-name> and <string-name><given-names>F.</given-names> <surname>Lombardi</surname></string-name></person-group>, &#x201C;<article-title>Design and analysis of inexact floating-point adders</article-title>,&#x201D; <source>IEEE Transactions on Computers</source>, vol. <volume>65</volume>, no. <issue>1</issue>, pp. <fpage>308</fpage>&#x2013;<lpage>314</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-9"><label>[9]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Suguna</surname></string-name> and <string-name><given-names>R.</given-names> <surname>Kiruthika</surname></string-name></person-group>, &#x201C;<article-title>Low latency and power efficient approximate multipliers using compressors</article-title>,&#x201D; <source>International Journal of Engineering Research &#x0026; Technology (IJERT)</source>, vol. <volume>8</volume>, no. <issue>4</issue>, pp. <fpage>593</fpage>&#x2013;<lpage>596</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-10"><label>[10]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>R.</given-names> <surname>Praveena</surname></string-name></person-group>, &#x201C;<article-title>Enhanced portable LUT multiplier with gated power optimization for biomedical therapeutic devices</article-title>,&#x201D; <source>Computers, Materials &#x0026; Continua</source>, vol. <volume>63</volume>, no. <issue>1</issue>, pp. <fpage>85</fpage>&#x2013;<lpage>95</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-11"><label>[11]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Rooban</surname></string-name>, <string-name><given-names>N.</given-names> <surname>Subbulakshmi</surname></string-name> and <string-name><given-names>Y. P.</given-names> <surname>Vamsi</surname></string-name></person-group>, &#x201C;<article-title>Low power circuit design for dynamic comparator</article-title>,&#x201D; <source>International Journal of Performability Engineering</source>, vol. <volume>17</volume>, no. <issue>5</issue>, pp. <fpage>444</fpage>&#x2013;<lpage>4450</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-12"><label>[12]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>C. H.</given-names> <surname>Chang</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Guand</surname></string-name> and <string-name><given-names>M.</given-names> <surname>Zhang</surname></string-name></person-group>, &#x201C;<article-title>Ultra low-voltage low-power CMOS 4-2 and 5-2 compressors for fast arithmetic circuits</article-title>,&#x201D; <source>IEEE Transactions on Circuits and Systems-I</source>, vol. <volume>51</volume>, no. <issue>10</issue>, pp. <fpage>1985</fpage>&#x2013;<lpage>1997</lpage>, <year>2004</year>.</mixed-citation></ref>
<ref id="ref-13"><label>[13]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>H.</given-names> <surname>Jiang</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Han</surname></string-name>, <string-name><given-names>F.</given-names> <surname>Qiao</surname></string-name> and <string-name><given-names>F.</given-names> <surname>Lombardi</surname></string-name></person-group>, &#x201C;<article-title>Approximate radix-8 booth multipliers for low-power and high-performance operation</article-title>,&#x201D; <source>IEEE Transactions on Computers</source>, vol. <volume>65</volume>, no. <issue>8</issue>, pp. <fpage>2638</fpage>&#x2013;<lpage>2644</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-14"><label>[14]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Nour</surname></string-name>, <string-name><given-names>S. A.</given-names> <surname>Salem</surname></string-name> and <string-name><given-names>S. M.</given-names> <surname>Habashy</surname></string-name></person-group>, &#x201C;<article-title>A hybrid model for reliability aware and energy-efficient in multicore systems</article-title>,&#x201D; <source>Computers, Materials &#x0026; Continua</source>, vol. <volume>70</volume>, no. <issue>3</issue>, pp. <fpage>4447</fpage>&#x2013;<lpage>4466</lpage>, <year>2022</year>.</mixed-citation></ref>
<ref id="ref-15"><label>[15]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Venkatachalam</surname></string-name>, <string-name><given-names>E.</given-names> <surname>Adams</surname></string-name>, <string-name><given-names>H. J.</given-names> <surname>Lee</surname></string-name> and <string-name><given-names>S. B.</given-names> <surname>Ko</surname></string-name></person-group>, &#x201C;<article-title>Design and analysis of area and power efficient approximate booth multipliers</article-title>,&#x201D; <source>IEEE Transactions on Computers</source>, vol. <volume>68</volume>, no. <issue>11</issue>, pp. <fpage>1697</fpage>&#x2013;<lpage>1703</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-16"><label>[16]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>H.</given-names> <surname>Pei</surname></string-name>, <string-name><given-names>X.</given-names> <surname>Yi</surname></string-name>, <string-name><given-names>H.</given-names> <surname>Zhou</surname></string-name> and <string-name><given-names>Y.</given-names> <surname>He</surname></string-name></person-group>, &#x201C;<article-title>Design of ultra-low power consumption approximate 4-2 compressors based on the compensation characteristic</article-title>,&#x201D; <source>IEEE Transactions on Circuits and Systems II: Express Briefs</source>, vol. <volume>68</volume>, no. <issue>1</issue>, pp. <fpage>461</fpage>&#x2013;<lpage>465</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-17"><label>[17]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M. Z.</given-names> <surname>Rahman</surname></string-name>, <string-name><given-names>P.</given-names> <surname>Anupriya</surname></string-name>, <string-name><given-names>C.</given-names> <surname>Kavitha</surname></string-name> and <string-name><given-names>P.</given-names> <surname>Hithendra Sai Kumar</surname></string-name></person-group>, &#x201C;<article-title>Development of decision feedback equalizer using simplified adaptive algorithms</article-title>,&#x201D; <source>Journal of Critical Reviews</source>, vol. <volume>7</volume>, no. <issue>4</issue>, pp. <fpage>305</fpage>&#x2013;<lpage>309</lpage>, <year>2020</year>.</mixed-citation></ref>
<ref id="ref-18"><label>[18]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>G.</given-names> <surname>Thumbur</surname></string-name>, <string-name><given-names>N. B.</given-names> <surname>Gayathri</surname></string-name>, <string-name><given-names>P. V.</given-names> <surname>Reddy</surname></string-name>, <string-name><given-names>M. Z. U.</given-names> <surname>Rahman</surname></string-name> and <string-name><given-names>A.</given-names> <surname>Lay-Ekuakille</surname></string-name></person-group>, &#x201C;<article-title>Efficient pairing-free identity-based ADS-B authentication scheme with batch verification</article-title>,&#x201D; <source>IEEE Transactions on Aerospace and Electronic Systems</source>, vol. <volume>55</volume>, no. <issue>5</issue>, pp. <fpage>2473</fpage>&#x2013;<lpage>2486</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-19"><label>[19]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>R.</given-names> <surname>Zendegani</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Kamal</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Bahadori</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Afzali-Kusha</surname></string-name> and <string-name><given-names>M.</given-names> <surname>Pedram</surname></string-name></person-group>, &#x201C;<article-title>RoBA multiplier: A rounding-based approximate multiplier for high-speed yet energy-efficient digital signal processing</article-title>,&#x201D; <source>IEEE Transactions on Very Large Scale Integration (VLSI) Systems</source>, vol. <volume>25</volume>, no. <issue>2</issue>, pp. <fpage>393</fpage>&#x2013;<lpage>401</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-20"><label>[20]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Vahdat</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Kamal</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Afzali-Kusha</surname></string-name> and <string-name><given-names>M.</given-names> <surname>Pedram</surname></string-name></person-group>, &#x201C;<article-title>TOSAM: An energy-efficient truncation- and rounding based scalable approximate multiplier</article-title>,&#x201D; <source>IEEE Transactions on Very Large Scale Integration (VLSI) Systems</source>, vol. <volume>27</volume>, no. <issue>5</issue>, pp. <fpage>1161</fpage>&#x2013;<lpage>1173</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-21"><label>[21]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Hsiao</surname></string-name>, <string-name><given-names>J. Z.</given-names> <surname>Jian</surname></string-name> and <string-name><given-names>M.</given-names> <surname>Chen</surname></string-name></person-group>, &#x201C;<article-title>Low-cost FIR filter designs based on faithfully rounded truncated multiple constant multiplication/accumulation</article-title>,&#x201D; <source>IEEE Transactions on Circuits and Systems II: Express Briefs Very</source>, vol. <volume>60</volume>, no. <issue>5</issue>, pp. <fpage>287</fpage>&#x2013;<lpage>291</lpage>, <year>2013</year>.</mixed-citation></ref>
<ref id="ref-22"><label>[22]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>F.</given-names> <surname>Noorbasha</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Manasa</surname></string-name>, <string-name><given-names>R.</given-names> <surname>Tulasi Gouthami</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Sruthi</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Hari Priya</surname></string-name> <etal>et al.,</etal></person-group> &#x201C;<article-title>FPGA implementation of cryptographic systems for symmetric encryption</article-title>,&#x201D; <source>Journal of Theoretical and Applied Information Technology</source>, vol. <volume>95</volume>, no. <issue>9</issue>, pp. <fpage>2038</fpage>&#x2013;<lpage>2045</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-23"><label>[23]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Hashemi</surname></string-name>, <string-name><given-names>R. I.</given-names> <surname>Bahar</surname></string-name> and <string-name><given-names>S.</given-names> <surname>Reda</surname></string-name></person-group>, &#x201C;<article-title>DRUM: A dynamic range unbiased multiplier for approximate multipliers</article-title>,&#x201D; in <conf-name>ICCAD&#x2019;15: Proc. of the IEEE/ACM Int. Conf. on Computer-Aided Design</conf-name>, <conf-loc>Austin</conf-loc>, pp. <fpage>418</fpage>&#x2013;<lpage>425</lpage>, <year>2015</year>.</mixed-citation></ref>
<ref id="ref-24"><label>[24]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>S.</given-names> <surname>Vahdat</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Kamal</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Afzali-Kusha</surname></string-name> and <string-name><given-names>M.</given-names> <surname>Pedram</surname></string-name></person-group>, &#x201C;<article-title>LETAM: A low energy truncation-based approximate multiplier</article-title>,&#x201D; <source>Computers &#x0026; Electrical Engineering</source>, vol. <volume>63</volume>, no. <issue>4</issue>, pp. <fpage>1</fpage>&#x2013;<lpage>17</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-25"><label>[25]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>Z.</given-names> <surname>Wang</surname></string-name>, <string-name><given-names>A. C.</given-names> <surname>Bovik</surname></string-name>, <string-name><given-names>H. R.</given-names> <surname>Sheikhand</surname></string-name> and <string-name><given-names>E. P.</given-names> <surname>Simoncelli</surname></string-name></person-group>, &#x201C;<article-title>Image quality assessment: From error visibility to structural similarity</article-title>,&#x201D; <source>IEEE Transactions on Image Processing</source>, vol. <volume>13</volume>, no. <issue>4</issue>, pp. <fpage>600</fpage>&#x2013;<lpage>612</lpage>, <year>2004</year>.</mixed-citation></ref>
<ref id="ref-26"><label>[26]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M. Z. U.</given-names> <surname>Rahman</surname></string-name>, <string-name><given-names>S.</given-names> <surname>Surekha</surname></string-name>, <string-name><given-names>K. P.</given-names> <surname>Satamraju</surname></string-name>, <string-name><given-names>S. S.</given-names> <surname>Mirza</surname></string-name> and <string-name><given-names>A.</given-names> <surname>Lay-Ekuakille</surname></string-name></person-group>, &#x201C;<article-title>A collateral sensor data sharing framework for decentralized healthcare systems</article-title>,&#x201D; <source>IEEE Sensors Journal</source>, vol. <volume>21</volume>, no. <issue>24</issue>, pp. <fpage>27848</fpage>&#x2013;<lpage>27857</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-27"><label>[27]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>G. J.</given-names> <surname>Sullivan</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Ohm</surname></string-name>, <string-name><given-names>W. J.</given-names> <surname>Hanand</surname></string-name> and <string-name><given-names>T.</given-names> <surname>Wiegand</surname></string-name></person-group>, &#x201C;<article-title>Overview of the high efficiency video coding (HEVC) standard</article-title>,&#x201D; <source>IEEE Transactions on Circuits and Systems for Video Technology</source>, vol. <volume>22</volume>, no. <issue>12</issue>, pp. <fpage>1649</fpage>&#x2013;<lpage>1668</lpage>, <year>2012</year>.</mixed-citation></ref>
<ref id="ref-28"><label>[28]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>I.</given-names> <surname>Almomani</surname></string-name>, <string-name><given-names>A.</given-names> <surname>Alkhayer</surname></string-name> and <string-name><given-names>W.</given-names> <surname>El-Shafai</surname></string-name></person-group>, &#x201C;<article-title>Novel ransomware hiding model using HEVC steganography approach</article-title>,&#x201D; <source>Computers, Materials &#x0026; Continua</source>, vol. <volume>70</volume>, no. <issue>1</issue>, pp. <fpage>1209</fpage>&#x2013;<lpage>1228</lpage>, <year>2022</year>.</mixed-citation></ref>
<ref id="ref-29"><label>[29]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>P.</given-names> <surname>Kulkarni</surname></string-name>, <string-name><given-names>P.</given-names> <surname>Gupta</surname></string-name>, and <string-name><given-names>M.</given-names> <surname>Ercegovac</surname></string-name></person-group>, &#x201C;<article-title>Trading accuracy for power with an underdesigned multiplier architecture</article-title>,&#x201D; <source>Journal of low Power Electronics</source>, vol. <volume>7</volume>, no. <issue>4</issue>, pp. <fpage>490</fpage>&#x2013;<lpage>501</lpage>, <year>2011</year>.</mixed-citation></ref>
<ref id="ref-30"><label>[30]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M.</given-names> <surname>Manasa</surname></string-name>, <string-name><given-names>F.</given-names> <surname>Noorbasha</surname></string-name>, <string-name><given-names>C. L.</given-names> <surname>Sudheshna</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Santhosh</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Rahman</surname></string-name> <etal>et al.,</etal></person-group> &#x201C;<article-title>Comparative analysis of cordic algorithm and taylor series expansion</article-title>,&#x201D; <source>Journal of Theoretical &#x0026; Applied Information Technology</source>, vol. <volume>95</volume>, no. <issue>9</issue>, pp. <fpage>2015</fpage>&#x2013;<lpage>2022</lpage>, <year>2017</year>.</mixed-citation></ref>
<ref id="ref-31"><label>[31]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>W.</given-names> <surname>Sun</surname></string-name>, <string-name><given-names>G. Z.</given-names> <surname>Dai</surname></string-name>, <string-name><given-names>X. R.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>X. Z.</given-names> <surname>He</surname></string-name> and <string-name><given-names>X.</given-names> <surname>Chen</surname></string-name></person-group>, &#x201C;<article-title>TBE-net: A three-branch embedding network with part-aware ability and feature complementary learning for vehicle re-identification</article-title>,&#x201D; <source>IEEE Transactions on Intelligent Transportation Systems</source>, vol. <volume>99</volume>, pp. <fpage>1</fpage>&#x2013;<lpage>13</lpage>, <year>2021</year>.</mixed-citation></ref>
<ref id="ref-32"><label>[32]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>W.</given-names> <surname>Sun</surname></string-name>, <string-name><given-names>L.</given-names> <surname>Dai</surname></string-name>, <string-name><given-names>X. R.</given-names> <surname>Zhang</surname></string-name>, <string-name><given-names>P. S.</given-names> <surname>Chang</surname></string-name> and <string-name><given-names>X. Z.</given-names> <surname>He</surname></string-name></person-group>, &#x201C;<article-title>RSOD: Real-time small object detection algorithm in UAV-based traffic monitoring</article-title>,&#x201D; <source>Applied Intelligence</source>, vol. <volume>52</volume>, pp. <fpage>1</fpage>&#x2013;<lpage>16</lpage>, <year>2021</year>.</mixed-citation></ref>
</ref-list>
</back>
</article>