<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">CMC</journal-id>
<journal-id journal-id-type="nlm-ta">CMC</journal-id>
<journal-id journal-id-type="publisher-id">CMC</journal-id>
<journal-title-group>
<journal-title>Computers, Materials &#x0026; Continua</journal-title>
</journal-title-group>
<issn pub-type="epub">1546-2226</issn>
<issn pub-type="ppub">1546-2218</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">19789</article-id>
<article-id pub-id-type="doi">10.32604/cmc.2022.019789</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Efficient Energy Optimized Faithful Adder with Parallel Carry Generation</article-title>
<alt-title alt-title-type="left-running-head">Efficient Energy Optimized Faithful Adder with Parallel Carry Generation</alt-title>
<alt-title alt-title-type="right-running-head">Efficient Energy Optimized Faithful Adder with Parallel Carry Generation</alt-title>
</title-group>
<contrib-group content-type="authors">
<contrib id="author-1" contrib-type="author">
<name name-style="western">
<surname>Vijeyakumar</surname>
<given-names>K. N.</given-names>
</name>
<xref ref-type="aff" rid="aff-1">1</xref>
</contrib>
<contrib id="author-2" contrib-type="author" corresp="yes">
<name name-style="western">
<surname>Maragatharaj</surname>
<given-names>S.</given-names>
</name>
<xref ref-type="aff" rid="aff-2">2</xref><email>maragatharaj.ece@dgct.ac.in</email>
</contrib>
<aff id="aff-1"><label>1</label><institution>Dr. Mahalingam College of Engineering and Technology</institution>, <addr-line>Pollachi, Tamilnadu</addr-line>, <country>India</country></aff>
<aff id="aff-2"><label>2</label><institution>Dhirajlal Gandhi College of Technology</institution>, <addr-line>Salem, Tamilnadu</addr-line>, <country>India</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>&#x002A;</label>Corresponding Author: S. Maragatharaj. Email: <email>maragatharaj.ece@dgct.ac.in</email></corresp>
</author-notes>
<pub-date pub-type="epub" date-type="pub" iso-8601-date="2021-09-13"><day>13</day><month>9</month><year>2021</year></pub-date>
<volume>70</volume>
<issue>2</issue>
<fpage>2543</fpage>
<lpage>2561</lpage>
<history>
<date date-type="received"><day>25</day><month>4</month><year>2021</year>
</date>
<date date-type="accepted"><day>04</day><month>6</month><year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2022 Vijeyakumar and Maragatharaj</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Vijeyakumar and Maragatharaj</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_CMC_19789.pdf"></self-uri>
<abstract>
<p>Approximate computing has received significant attention in the design of portable CMOS hardware for error-tolerant applications. This work proposes an approximate adder that to optimize area delay and achieve energy efficiency using Parallel Carry (PC) generation logic. For &#x2018;<italic>n</italic>&#x2019; bits in input, the proposed algorithm use approximate addition for least <italic>n</italic>/2 significant bits and exact addition for most <italic>n</italic>/2 significant bits. A simple OR logic with no carry propagation is used to implement the approximate part. In the exact part, addition is performed using 4-bit adder blocks that implement PC at block level to reduce node capacitance in the critical path. Evaluations reveal that the maximum error of the proposed adder confines not more than 2<sup>n/2</sup>. As an enhancement of the proposed algorithm, we use the Error Recovery (ER) module to reduce the average error. Synthesis results of Proposed-PC (P-PC) and Proposed-PCER (P-PCER) adders with n-16 in 180nm Application Specific Integrated Circuit (ASIC) PDK technology revealed 44.2% &#x0026; 41.7% PDP reductions and 43.4% &#x0026; 40.7% ADP reductions, respectively compared to the latest best approximate design compared. The functional and driving effectiveness of proposed adders are examined through digital image processing applications.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>Application-specific integrated circuit</kwd>
<kwd>approximate computing</kwd>
<kwd>power dissipation</kwd>
<kwd>energy dissipation</kwd>
<kwd>image processing</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<label>1</label>
<title>Introduction</title>
<p>Full adder (FA) is a vital component of processing elements used in multimedia computing, digital signal processing and digital image processing systems. The bit-width of the arithmetic units depends on the processing capability of digital systems. As the bit width in adders increase, the delay associated with carry propagation also increases linearly-O (n), where &#x2018;<italic>n</italic>&#x2019; is the length of input bits [<xref ref-type="bibr" rid="ref-1">1</xref>]. Diverse topologies of adders such as Carry Select Adder (CSLA) [<xref ref-type="bibr" rid="ref-1">1</xref>], Carry Skip Adder (CSKA) [<xref ref-type="bibr" rid="ref-2">2</xref>], Carry Propagate Adder(CPA) [<xref ref-type="bibr" rid="ref-3">3</xref>], Carry Look Ahead Adder(CLA) [<xref ref-type="bibr" rid="ref-1">1</xref>] are proposed to moderate carry propagation delay at the expense of area overhead. Area efficient CSLA with gate-level Binary to Excess one Conversion (BEC) circuit is proposed in [<xref ref-type="bibr" rid="ref-4">4</xref>]. CLA achieves significantly less delay (O (log(n)) at the expense of gate area (O(n log(n))) [<xref ref-type="bibr" rid="ref-5">5</xref>,<xref ref-type="bibr" rid="ref-6">6</xref>].</p>
<p>Parallel adders offer a reliable alternative to conventional adders for high-speed applications. Aggressive scaling in deep submicron technologies and demand for faster compact ICs have put the onus on adders to be structurally optimized. The nature of multimedia signals and image processing applications involving human perception provides the opportunity to exploit the limited range of resources at the expense of certain errors in processed outputs [<xref ref-type="bibr" rid="ref-7">7</xref>,<xref ref-type="bibr" rid="ref-8">8</xref>]. Approximate computing is a novel technique to design low power, reduced hardware architecture for the above mentioned error-tolerant applications [<xref ref-type="bibr" rid="ref-9">9</xref>&#x2013;<xref ref-type="bibr" rid="ref-13">13</xref>].</p>
<p>Initial works on approximate adders were implemented in the form of speculative adders [<xref ref-type="bibr" rid="ref-14">14</xref>]. Speculative adders use a control signal to decide the choice among input bits in approximate and accurate logic in multi-bit addition. Segmented adders to reduce the propagation of error signals were put forth in [<xref ref-type="bibr" rid="ref-15">15</xref>]. Approximate designs for addition viz., ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], HPETA [<xref ref-type="bibr" rid="ref-17">17</xref>], SARA [<xref ref-type="bibr" rid="ref-18">18</xref>] and area-efficient parallel adder (AEPA) [<xref ref-type="bibr" rid="ref-19">19</xref>] are proposed in literature. ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>] use approximate 1-bit FA cell for all bits in input operands and hence exhibit significant area saving at the cost of high error. For 16-bit addition, SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>] uses conventional CSLA for the most significant 8 bits and ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>] for the least significant 8 bits (inaccurate part). The hybrid design demonstrates better accuracy compared to ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>]. Reference [<xref ref-type="bibr" rid="ref-17">17</xref>] proposed two variants of high-performance error-tolerant adders (HPETAs) for image processing applications. Reference [<xref ref-type="bibr" rid="ref-18">18</xref>] proposed a multi-bit adder that segregates input bits into sub-groups, and approximate/exact carry-in for each sub-block is configured. Though the design in Reference [<xref ref-type="bibr" rid="ref-18">18</xref>] uses a look-ahead based approach for carry generation, it reduces area overhead significantly compared to conventional carry look-ahead adder. Reference [<xref ref-type="bibr" rid="ref-19">19</xref>] proposed Area Efficient Parallel Adder (AEPA) using new error-tolerant FA cells in approximate part that limits the maximal error within bound. Simulations revealed that the area of AEPA is significantly high compared to approximate adders in [<xref ref-type="bibr" rid="ref-18">18</xref>].</p>
<p>In this brief, we propose a high-speed area efficient approximate adder that performs multi-bit addition in block sets, with parallel carry generation logic implemented at block level in the accurate part and simple OR logic on the least significant n/2 bits(approximate part). Evaluations revealed that the maximal error in the proposed adder, due to approximation, confines to 2<sup>n/2</sup>. The rest of the paper is organized as follows. Section 2 presents the area and delay evaluation of various adder algorithms. Section 3 presents the design methodology of proposed adders. Next, Section 4 evaluates the performance of the proposed adders. Furthermore, Section 5 discusses the implementations of the proposed adders in digital image processing applications. Finally, Section 6 gives a brief conclusion on the proposed work.</p>
</sec>
<sec id="s2">
<label>2</label>
<title>Area and Delay Evaluation of Adder Algorithms</title>
<p>The SOP implementation of combinational functions employs AOI (AND-OR-INVERT) logic for CMOS realization. AOI structure for Sum and Carry outputs of a conventional 1-bit Full Adder (FA) cell is shown in <?A3B2 "fig1",5,"anchor"?><xref ref-type="fig" rid="fig-1">Fig. 1</xref>. The gates in same row within dotted lines perform a parallel operation, and hence we consider one gate delay for parallel gates. Area cost is evaluated by the total count of AOI gates used in the logic design. From <xref ref-type="fig" rid="fig-1">Fig. 1</xref>, the logic delay and gate count (GC) of 1 bit FA are given by <xref ref-type="disp-formula" rid="eqn-1">(1)</xref> &#x0026; <xref ref-type="disp-formula" rid="eqn-2">(2)</xref>, respectively.</p>
<p><disp-formula id="eqn-1">
<label>(1)</label>
<mml:math id="mml-eqn-1" display="block"><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>s</mml:mi><mml:mi>u</mml:mi><mml:mi>m</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mrow><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mi>D</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mrow><mml:mo>(</mml:mo><mml:mi>O</mml:mi><mml:mi>R</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mn>3</mml:mn><mml:mo stretchy="false">(</mml:mo><mml:mi>N</mml:mi><mml:mi>O</mml:mi><mml:mi>T</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-2">
<mml:math id="mml-eqn-2" display="block"><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="italic">c</mml:mi><mml:mi mathvariant="italic">a</mml:mi><mml:mi mathvariant="italic">r</mml:mi><mml:mi mathvariant="italic">r</mml:mi><mml:mi mathvariant="italic">y</mml:mi></mml:mrow></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mspace width="thinmathspace" /><mml:mrow><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mi>D</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mn>2</mml:mn><mml:mrow><mml:mo>(</mml:mo><mml:mi>O</mml:mi><mml:mi>R</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mn>2</mml:mn><mml:mrow><mml:mo>(</mml:mo><mml:mi>N</mml:mi><mml:mi>O</mml:mi><mml:mi>T</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-3">
<label>(2)</label>
<mml:math id="mml-eqn-3" display="block"><mml:mi>G</mml:mi><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>F</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>7</mml:mn><mml:mspace width="thinmathspace" /><mml:mrow><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mi>D</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mn>4</mml:mn><mml:mrow><mml:mo>(</mml:mo><mml:mi>O</mml:mi><mml:mi>R</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mn>10</mml:mn><mml:mo stretchy="false">(</mml:mo><mml:mi>N</mml:mi><mml:mi>O</mml:mi><mml:mi>T</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula></p>
<p>where &#x2018;&#x002B;&#x2019; represents arithmetic OR operation. Based on the above methodology, we have evaluated various adders&#x0027; critical delay and area and discussed them in the following subsections.</p>
<sec id="s2_1">
<label>2.1</label>
<title>Ripple Carry Addition</title>
<p>In CMOS realization RCA [<xref ref-type="bibr" rid="ref-1">1</xref>] offers better structural regularity due to the series arrangement of FA cells. An n bit RCA is constructed using n FA cells cascaded together, with the carryout bit of one FA tied to the carry-in bit of the next higher weight FA cell. However, RCA incurs high carry propagation delay due to rippling of carry signal from least significant bit position to the most. Critical delay of n-bit RCA (t<sub>RCA</sub>) is given by <xref ref-type="disp-formula" rid="eqn-3">(3)</xref>.</p>
<p><disp-formula id="eqn-4">
<label>(3)</label>
<mml:math id="mml-eqn-4" display="block"><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>R</mml:mi><mml:mi>C</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>n</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>F</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-5">
<mml:math id="mml-eqn-5" display="block"><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>F</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mspace width="thinmathspace" /><mml:mrow><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mi>D</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mn>2</mml:mn><mml:mrow><mml:mo>(</mml:mo><mml:mi>O</mml:mi><mml:mi>R</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mn>2</mml:mn><mml:mrow><mml:mo>(</mml:mo><mml:mi>N</mml:mi><mml:mi>O</mml:mi><mml:mi>T</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>where <italic>t</italic><sub><italic>FA</italic></sub> is the carry generation delay of 1 bit FA cell. From <xref ref-type="disp-formula" rid="eqn-3">(3)</xref>, it is evident that the delay of <italic>n</italic> bit RCA depends on <italic>t</italic><sub><italic>FA</italic></sub> and it increases in O (n). Hence, minimizing <italic>t</italic><sub><italic>FA</italic></sub> of 1bit FA cell will minimize <italic>t</italic><sub><italic>RCA</italic></sub>. From <xref ref-type="fig" rid="fig-1">Fig. 1</xref>, GC of <italic>n</italic>-bit RCA is given by</p>
<p><disp-formula id="eqn-6">
<label>(4)</label>
<mml:math id="mml-eqn-6" display="block"><mml:mi>G</mml:mi><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>R</mml:mi><mml:mi>C</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>n</mml:mi><mml:mrow><mml:mo>{</mml:mo><mml:mn>7</mml:mn><mml:mspace width="thinmathspace" /><mml:mrow><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mi>D</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mn>4</mml:mn><mml:mrow><mml:mo>(</mml:mo><mml:mi>O</mml:mi><mml:mi>R</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mn>10</mml:mn><mml:mo stretchy="false">(</mml:mo><mml:mi>N</mml:mi><mml:mi>O</mml:mi><mml:mi>T</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>}</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<fig id="fig-1">

<label>Figure 1</label>
<caption>
<title>AOI implementation of a) 1 bit FA sum, b) 1 bit FA carry, c) 2:1 mux</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_19789-fig-1.png"/>
</fig>
</sec>
<sec id="s2_2">
<label>2.2</label>
<title>Carry Select Addition</title>
<p>CSLA [<xref ref-type="bibr" rid="ref-1">1</xref>] performs high-speed addition using an array of RCA pair and 2:1 Muxes. Each RCA block performs addition independently with carry-in as either logic low or logic high. For n &#x003D; 16, with 4 bits in an RCA block, the CSLA design will have 4 RCA pairs. MUX units use the Carryout (C<sub>out</sub>) signal from preceding blocks to produce final output from Sum signals generated by the RCA pair at the corresponding binary weight. <?A3B2 "fig2",5,"anchor"?><xref ref-type="fig" rid="fig-2">Fig. 2</xref> shows the block level architecture for n bit CSLA.</p>
<fig id="fig-2">

<label>Figure 2</label>
<caption>
<title>Block diagram of n bit CSLA</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_19789-fig-2.png"/>
</fig>
<p>If the carry propagation time of 1bit FA and propagation delay of Mux units are represented as t<sub>FA</sub> and t<sub>mux</sub> respectively, then delay of <italic>n</italic> bit CSLA design with <italic>m</italic> Fas in a block is given by <xref ref-type="disp-formula" rid="eqn-5">(5)</xref></p>
<p><disp-formula id="eqn-7">
<mml:math id="mml-eqn-7" display="block"><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>C</mml:mi><mml:mi>S</mml:mi><mml:mi>L</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>m</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>F</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:mfrac><mml:mi>n</mml:mi><mml:mi>m</mml:mi></mml:mfrac><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mi>u</mml:mi><mml:mi>x</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula></p>

<p><disp-formula id="eqn-8">
<label>(5)</label>
<mml:math id="mml-eqn-8" display="block"><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>C</mml:mi><mml:mi>S</mml:mi><mml:mi>L</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mi>m</mml:mi><mml:mo>+</mml:mo><mml:mfrac><mml:mi>n</mml:mi><mml:mi>m</mml:mi></mml:mfrac><mml:mo>)</mml:mo></mml:mrow><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mi>D</mml:mi><mml:mo>}</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mn>2</mml:mn><mml:mi>m</mml:mi><mml:mo>+</mml:mo><mml:mfrac><mml:mi>n</mml:mi><mml:mi>m</mml:mi></mml:mfrac><mml:mo>)</mml:mo></mml:mrow><mml:mi>O</mml:mi><mml:mi>R</mml:mi><mml:mo>}</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:mn>2</mml:mn><mml:mi>m</mml:mi><mml:mo>+</mml:mo><mml:mn>3</mml:mn><mml:mi>n</mml:mi><mml:mrow><mml:mo>/</mml:mo></mml:mrow><mml:mi>m</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mi>N</mml:mi><mml:mi>O</mml:mi><mml:mi>T</mml:mi><mml:mo>}</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>From <xref ref-type="fig" rid="fig-2">Fig. 2</xref>, the gate count of <italic>n</italic>-bit CSLA is given by</p>
<p><disp-formula id="eqn-9">
<label>(6)</label>
<mml:math id="mml-eqn-9" display="block"><mml:mi>G</mml:mi><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>C</mml:mi><mml:mi>S</mml:mi><mml:mi>L</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mn>9</mml:mn><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mfrac><mml:mrow><mml:mn>2</mml:mn><mml:mi>n</mml:mi></mml:mrow><mml:mi>m</mml:mi></mml:mfrac><mml:mo>)</mml:mo></mml:mrow><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mi>D</mml:mi><mml:mo>}</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mn>5</mml:mn><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mfrac><mml:mi>n</mml:mi><mml:mi>m</mml:mi></mml:mfrac><mml:mo>)</mml:mo></mml:mrow><mml:mi>O</mml:mi><mml:mi>R</mml:mi><mml:mo>}</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mn>3</mml:mn><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn>3</mml:mn><mml:mfrac><mml:mi>n</mml:mi><mml:mi>m</mml:mi></mml:mfrac><mml:mo>)</mml:mo></mml:mrow><mml:mi>N</mml:mi><mml:mi>O</mml:mi><mml:mi>T</mml:mi><mml:mo>}</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
</sec>
<sec id="s2_3">
<label>2.3</label>
<title>Carry Look Ahead Addition</title>
<p>CLA [<xref ref-type="bibr" rid="ref-5">5</xref>,<xref ref-type="bibr" rid="ref-6">6</xref>] eliminates the carry-rippling problem encountered in RCA by generating carry signals at corresponding binary weights using C<sub>in</sub> and Propagate (P), Generate (G) signals generated at lower weights. Hence the area cost and hardware complexity of CLA are high for large input bit widths compared to RCA. Following are the functional symbols used to express logic functions viz., |&#x2212;OR; &#x2217;&#x2212;AND; &#x2229;&#x2212;XOR. Based on the above representation, the logic expression for Carry signal (Ci) is given by</p>
<p><disp-formula id="eqn-10">
<label>(7)</label>
<mml:math id="mml-eqn-10" display="block"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>g</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mo fence="true" stretchy="true" symmetric="true"></mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mo stretchy="false">[</mml:mo><mml:msubsup><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">j</mml:mi></mml:mrow><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">i</mml:mi></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup><mml:mspace width="thinmathspace" /><mml:msub><mml:mi>g</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">]</mml:mo><mml:mo stretchy="false">[</mml:mo><mml:msubsup><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup><mml:msub><mml:mi>p</mml:mi><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">]</mml:mo><mml:mo>|</mml:mo></mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant="normal">C</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">i</mml:mi><mml:mi mathvariant="normal">n</mml:mi></mml:mrow></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:msubsup><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:mrow><mml:mrow><mml:mi mathvariant="normal">l</mml:mi></mml:mrow><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">i</mml:mi></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup><mml:msub><mml:mrow><mml:mi mathvariant="normal">p</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant="normal">l</mml:mi></mml:mrow></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>]</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>where, <inline-formula id="ieqn-1"><mml:math id="mml-ieqn-1"><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mo>&#x2227;</mml:mo></mml:mrow></mml:msubsup><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-2"><mml:math id="mml-ieqn-2"><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and &#x2018;i&#x2019; represents the bit position.</p>
<p>Using <xref ref-type="disp-formula" rid="eqn-7">(7)</xref>, critical delay of <italic>n</italic>-bit CLA (t<sub>CLA</sub>) is given by</p>
<p><disp-formula id="eqn-11">
<label>(8)</label>
<mml:math id="mml-eqn-11" display="block"><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>C</mml:mi><mml:mi>L</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mi>n</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo>)</mml:mo></mml:mrow><mml:mspace width="thinmathspace" /><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mi>D</mml:mi><mml:mo>+</mml:mo><mml:mn>2</mml:mn><mml:mrow><mml:mo>(</mml:mo><mml:mi>O</mml:mi><mml:mi>R</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>+</mml:mo><mml:mn>3</mml:mn><mml:mo stretchy="false">(</mml:mo><mml:mi>N</mml:mi><mml:mi>O</mml:mi><mml:mi>T</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula></p>
<p>Based on AOI representation in <xref ref-type="fig" rid="fig-1">Fig. 1</xref>, GC of <italic>n</italic>-bit CLA is given by</p>
<p><disp-formula id="eqn-12">
<label>(9)</label>
<mml:math id="mml-eqn-12" display="block"><mml:mi>G</mml:mi><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>C</mml:mi><mml:mi>L</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>7</mml:mn><mml:mi>n</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mi>D</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:mn>3</mml:mn><mml:mi>n</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>O</mml:mi><mml:mi>R</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:mn>12</mml:mn><mml:mi>n</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>N</mml:mi><mml:mi>O</mml:mi><mml:mi>T</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula></p>
<p><?A3B2 "tbl1",5,"anchor"?><xref ref-type="table" rid="table-1">Tab. 1</xref> compares the logic depth and area of the various adders discussed in subsections <xref ref-type="fig" rid="fig-2">Figs. 2a</xref>&#x2013;<xref ref-type="fig" rid="fig-2">2c</xref>, assuming delay and area of each gate to be 1 unit. Note from <xref ref-type="table" rid="table-1">Tab. 1</xref>, RCA performs better in terms of area saving while CSLA performs better in speed improvement. CLA can optimize area and delay significantly.</p>
<table-wrap id="table-1">
<label>Table 1</label>
<caption>
<title>Area and logic depth comparison of adder algorithms</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Adder algorithm</th>
<th>Logic depth</th>
<th>Area (gate count)</th>
</tr>
</thead>
<tbody>
<tr>
<td>RCA</td>
<td>5 n</td>
<td>21 n</td>
</tr>
<tr>
<td>CSLA</td>
<td>5(m &#x002B; n/m); m &#x003C;&#x003C; n</td>
<td>17 n &#x002B; 6 n/m</td>
</tr>
<tr>
<td>CLA</td>
<td>n &#x002B; 6</td>
<td>22 n</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>However, CLA exhibits high hardware complexity and insufficient regularity of cell arrangement compared to RCA for large n, making it unsuitable for CMOS realization. Limiting the number of bits in CLA improves the regularity of structure and reduces hardware complexity. Hence, in the proposed approach input bits are grouped into four, and PC logic-based addition is performed within the group in the accurate part. Furthermore, the approximate part of the adder is implemented with simple OR logic to reduce area and delay.</p>
</sec>
</sec>
<sec id="s3">
<label>3</label>
<title>Proposed Approximate Adders</title>
<p>This section briefs the proposed approximate adders that implement carry-free addition in the least significant approximate part (n/2 bits) and error-free addition in the most significant part. Accurate part of the adder is implemented with a hybrid approach that combines block-level addition with look ahead based fast parallel carry generation. <?A3B2 "fig3",5,"anchor"?><xref ref-type="fig" rid="fig-3">Fig. 3</xref> shows the block-level architecture of the proposed adder.</p>
<fig id="fig-3">

<label>Figure 3</label>
<caption>
<title>Block diagram of proposed adder</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_19789-fig-3.png"/>
</fig>
<p>In the proposed addition algorithm, for <italic>n</italic> bits in input operands and 4 bits in a block, the number of blocks in the exact part will be <italic>n</italic>/<italic>8</italic>. It is noted from <xref ref-type="disp-formula" rid="eqn-9">(9)</xref>, C<sub>i</sub> is a combination Primary Carry Propagate (PCP) term that propagates input carry-C<sub>in</sub> and Intermediate Carry Propagate (ICP) term that propagates in-between carry signals generated at lower significant bit positions. PCP and ICP are defined by <xref ref-type="disp-formula" rid="eqn-10">(10)</xref> &#x0026; <xref ref-type="disp-formula" rid="eqn-11">(11)</xref>, respectively.</p>
<p><disp-formula id="eqn-13">
<label>(10)</label>
<mml:math id="mml-eqn-13" display="block"><mml:mi>P</mml:mi><mml:mi>C</mml:mi><mml:mi>P</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup><mml:msub><mml:mi>p</mml:mi><mml:mrow><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-14">
<label>(11)</label>
<mml:math id="mml-eqn-14" display="block"><mml:mi>I</mml:mi><mml:mi>C</mml:mi><mml:mi>P</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>g</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>g</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mi>j</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup><mml:msub><mml:mi>p</mml:mi><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>Analysis of <xref ref-type="disp-formula" rid="eqn-10">(10)</xref> &#x0026; <xref ref-type="disp-formula" rid="eqn-11">(11)</xref> reveals that parallelism can be exploited at two levels viz., (i) within ICP by merging consecutive odd and even terms, (ii) between ICP and PCP by generating PCP AND terms in parallel with level-1 logic reduction in ICP. Generation of carry signals using proposed delay balanced carry look ahead algorithm for 4-bit addition is shown in <xref ref-type="disp-formula" rid="eqn-12">(12)</xref>&#x2013;<xref ref-type="disp-formula" rid="eqn-15">(15)</xref>, and the corresponding gate level diagram is shown in <?A3B2 "fig4",5,"anchor"?><xref ref-type="fig" rid="fig-4">Fig. 4</xref>.</p>
<p><disp-formula id="eqn-15">
<label>(12)</label>
<mml:math id="mml-eqn-15" display="block"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>|</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-16">
<label>(13)</label>
<mml:math id="mml-eqn-16" display="block"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>|</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo stretchy="false">&#x21D2;</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mn>10</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>|</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-17">
<label>(14)</label>
<mml:math id="mml-eqn-17" display="block"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>|</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mo>|</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo stretchy="false">&#x21D2;</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mo>&#x2212;</mml:mo><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>|</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-18">
<mml:math id="mml-eqn-18" display="block"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mn>2</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-19">
<label>(15)</label>
<mml:math id="mml-eqn-19" display="block"><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mn>32</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo>|</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mn>10</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo stretchy="false">&#x21D2;</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mn>4</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mn>3</mml:mn><mml:mo>&#x2212;</mml:mo><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>3</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mrow><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:mrow><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>Note from <xref ref-type="fig" rid="fig-4">Fig. 4</xref>, ICP use certain PCP AND terms generated in level-1 logic reduction in further stages and this in turn reduces the gate count and delay significantly. From <xref ref-type="fig" rid="fig-4">Fig. 4</xref>, the logic delay of C<sub>in</sub> to C<sub>1</sub>&#x2013;C<sub>4</sub> (t<sub>c</sub>) is given by</p>
<p><disp-formula id="eqn-20">
<label>(16)</label>
<mml:math id="mml-eqn-20" display="block"><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo stretchy="false">(</mml:mo><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mi>D</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:mn>1</mml:mn><mml:mo stretchy="false">(</mml:mo><mml:mi>O</mml:mi><mml:mi>R</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math>
</disp-formula></p>
<p>Also, it is noted from <xref ref-type="fig" rid="fig-4">Fig. 4</xref> that delay balanced PC logic maintains uniform gate delay from C<sub>in</sub> to carry out signals in the accurate part. Furthermore, it is observed that C<sub>in</sub> is used as an input in the pre-final stage and hence provides a choice for further delay reduction in subsequent higher weight adder blocks. Evaluations revealed that the Proposed PC (P-PC) adder incurs n/8 &#x002A; t<sub>c</sub> logic delay while conventional RCA implementation of accurate part incurs n/2(t<sub>c</sub>) logic delay. Hence, delay balanced PC logic in the proposed adder demonstrates 83.3% logic depth reduction compared to the standard implementation. In the least significant approximate part of the P-PC adder, carry signals are not generated, and sum signals are generated using approximate full adder (AFA) cells. Boolean expression that defines the logic of AFA cell is given by <xref ref-type="disp-formula" rid="eqn-17">(17)</xref>.</p>
<p><disp-formula id="eqn-21">
<label>(17)</label>
<mml:math id="mml-eqn-21" display="block"><mml:mi>S</mml:mi><mml:mi>u</mml:mi><mml:msub><mml:mi>m</mml:mi><mml:mrow><mml:mi>A</mml:mi><mml:mi>F</mml:mi><mml:mi>A</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>a</mml:mi><mml:mrow><mml:mo stretchy="false">|</mml:mo></mml:mrow><mml:mi>b</mml:mi></mml:math>
</disp-formula></p>
<fig id="fig-4">

<label>Figure 4</label>
<caption>
<title>Gate level diagram of Carry generation using PC logic for n &#x2212; 4</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_19789-fig-4.png"/>
</fig>
<p>Note from <xref ref-type="disp-formula" rid="eqn-10">(10)</xref> AFA in the approximate part generates an error for input a &#x003D; b &#x003D; logic high. However, approximate sum &#x003D; 1 for a &#x003D; b &#x003D; 1, which reduces the mean error distance (MED) due to the elimination of carry signal to 1 at the corresponding binary weight. Hence, the proposed approximation logic reduces delay and average error significantly. Evaluations reveal that the maximum error due to approximation in P-PC adder is not more than 1unit at bit position n/2 (UBP-n/2).</p>
<p>The novelty of the proposed design is that it trades-off error for penalty in area overhead, and this can be realized using an error recovery unit. In proposed design 2 (P-PCER), the algorithm generates an error recovery (E<sub>R</sub>) signal using Generate (G) and approximate sum signals and is added to the exact part of adder at least significant bit position. Boolean expression defining logic of E<sub>R</sub> is given by <xref ref-type="disp-formula" rid="eqn-18">(18)</xref>.</p>
<p><disp-formula id="eqn-22">
<label>(18)</label>
<mml:math id="mml-eqn-22" display="block"><mml:msub><mml:mi>E</mml:mi><mml:mrow><mml:mi>R</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:munderover><mml:msub><mml:mi>E</mml:mi><mml:mrow><mml:mfrac><mml:mi>n</mml:mi><mml:mn>2</mml:mn></mml:mfrac><mml:mo>&#x2212;</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-23">
<mml:math id="mml-eqn-23" display="block"><mml:msub><mml:mi>E</mml:mi><mml:mrow><mml:mfrac><mml:mi>n</mml:mi><mml:mn>2</mml:mn></mml:mfrac><mml:mo>&#x2212;</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>{</mml:mo><mml:mtable columnalign="left left" rowspacing=".2em" columnspacing="1em" displaystyle="false"><mml:mtr><mml:mtd><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mfrac><mml:mi>n</mml:mi><mml:mn>2</mml:mn></mml:mfrac><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mtd><mml:mtd><mml:mi>f</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi><mml:mtext>&#xA0;</mml:mtext><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:msubsup><mml:mi mathvariant="normal">&#x0026;</mml:mi><mml:mrow><mml:mi>x</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mfrac><mml:mi>n</mml:mi><mml:mn>2</mml:mn></mml:mfrac><mml:mo>&#x2212;</mml:mo><mml:mi>x</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:msub><mml:mi>G</mml:mi><mml:mrow><mml:mfrac><mml:mi>n</mml:mi><mml:mn>2</mml:mn></mml:mfrac><mml:mo>&#x2212;</mml:mo><mml:mi>j</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow><mml:mo>}</mml:mo></mml:mrow><mml:mo>,</mml:mo></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="italic">o</mml:mi><mml:mi mathvariant="italic">t</mml:mi><mml:mi mathvariant="italic">h</mml:mi><mml:mi mathvariant="italic">e</mml:mi><mml:mi mathvariant="italic">r</mml:mi><mml:mi mathvariant="italic">w</mml:mi><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">s</mml:mi><mml:mi mathvariant="italic">e</mml:mi></mml:mrow></mml:mtd></mml:mtr></mml:mtable><mml:mo fence="true" stretchy="true" symmetric="true"></mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>where &#x2018;j&#x2019; represents the count of most significant bits in the approximate part considered for E<sub>R</sub> signal generation. Note that, E<sub>R</sub> signal reduces the maximum error (ME) in the proposed adder to 2<sup>n/2 &#x2212; 1</sup> with probability P(ME) &#x003D; 2i/n and mean error significantly. <?A3B2 "fig5",5,"anchor"?><xref ref-type="fig" rid="fig-5">Fig. 5</xref> shows the block level architecture of the P-PCER design.</p>
<fig id="fig-5">

<label>Figure 5</label>
<caption>
<title>Block diagram of P-PCER adder</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_19789-fig-5.png"/>
</fig>
</sec>
<sec id="s4">
<label>4</label>
<title>Performance Evaluation</title>
<sec id="s4_1">
<label>4.1</label>
<title> Error Metrics</title>
<p>Error metrics are the essential parameters to evaluate the efficiency of an approximate design in error-tolerant applications. In this section, performance of the proposed approximate adders and state-of-the-art approximate designs is evaluated in terms of various error metrics [<xref ref-type="bibr" rid="ref-19">19</xref>,<xref ref-type="bibr" rid="ref-20">20</xref>] using standard output as the reference. The accuracy metrics considered are: Mean Error Distance (MED), Mean Relative Error Distance (MRED), Normalized Error Distance (NED) and Percentage Accuracy defined as follows.</p>
<p><disp-formula id="eqn-24">
<label>(19)</label>
<mml:math id="mml-eqn-24" display="block"><mml:mrow><mml:mi mathvariant="normal">M</mml:mi><mml:mi mathvariant="normal">E</mml:mi><mml:mi mathvariant="normal">D</mml:mi></mml:mrow><mml:mo>=</mml:mo><mml:msup><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac><mml:mrow><mml:mn>2</mml:mn><mml:mi>N</mml:mi></mml:mrow></mml:msup><mml:mrow><mml:mo>(</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mn>2</mml:mn><mml:mi>N</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:munderover><mml:mrow><mml:mo>|</mml:mo><mml:mi>E</mml:mi><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-25">
<label>(20)</label>
<mml:math id="mml-eqn-25" display="block"><mml:mrow><mml:mi mathvariant="normal">M</mml:mi><mml:mi mathvariant="normal">R</mml:mi><mml:mi mathvariant="normal">E</mml:mi><mml:mi mathvariant="normal">D</mml:mi></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mn>2</mml:mn><mml:mi>N</mml:mi></mml:mrow></mml:msup></mml:mfrac><mml:mrow><mml:mo>(</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mn>2</mml:mn><mml:mi>N</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:munderover><mml:mfrac><mml:mrow><mml:mi>E</mml:mi><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mfrac><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-26">
<label>(21)</label>
<mml:math id="mml-eqn-26" display="block"><mml:mrow><mml:mi mathvariant="normal">N</mml:mi><mml:mi mathvariant="normal">E</mml:mi><mml:mi mathvariant="normal">D</mml:mi></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mn>2</mml:mn><mml:mi>N</mml:mi></mml:mrow></mml:msup></mml:mfrac><mml:mrow><mml:mo>(</mml:mo><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:msup><mml:mn>2</mml:mn><mml:mrow><mml:mn>2</mml:mn><mml:mi>N</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:munderover><mml:mfrac><mml:mrow><mml:mi>E</mml:mi><mml:msub><mml:mi>D</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mi>D</mml:mi></mml:mfrac><mml:mo>)</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-27">
<label>(22)</label>
<mml:math id="mml-eqn-27" display="block"><mml:mrow><mml:mi mathvariant="italic">P</mml:mi><mml:mi mathvariant="italic">e</mml:mi><mml:mi mathvariant="italic">r</mml:mi><mml:mi mathvariant="italic">c</mml:mi><mml:mi mathvariant="italic">e</mml:mi><mml:mi mathvariant="italic">n</mml:mi><mml:mi mathvariant="italic">t</mml:mi><mml:mi mathvariant="italic">a</mml:mi><mml:mi mathvariant="italic">g</mml:mi><mml:mi mathvariant="italic">e</mml:mi></mml:mrow><mml:mtext>&#xA0;</mml:mtext><mml:mi>o</mml:mi><mml:mi>f</mml:mi><mml:mtext>&#xA0;</mml:mtext><mml:mrow><mml:mi mathvariant="italic">A</mml:mi><mml:mi mathvariant="italic">c</mml:mi><mml:mi mathvariant="italic">c</mml:mi><mml:mi mathvariant="italic">u</mml:mi><mml:mi mathvariant="italic">r</mml:mi><mml:mi mathvariant="italic">a</mml:mi><mml:mi mathvariant="italic">c</mml:mi><mml:mi mathvariant="italic">y</mml:mi></mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>A</mml:mi><mml:mi>C</mml:mi><mml:mi>C</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x2212;</mml:mo><mml:mfrac><mml:mrow><mml:mi>E</mml:mi><mml:mi>D</mml:mi></mml:mrow><mml:msub><mml:mi>R</mml:mi><mml:mrow><mml:mi>e</mml:mi></mml:mrow></mml:msub></mml:mfrac><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x00D7;</mml:mo><mml:mn>100</mml:mn><mml:mo>]</mml:mo></mml:mrow></mml:math>
</disp-formula></p>
<p>where, ED<sub>i</sub>-Error Distance at output combination corresponding to i<sup>th</sup> input combination, S<sub>i</sub>-exact output corresponding to i<sup>th</sup> combination, D-maximum possible ED of an approximate circuit and R<sub>e</sub>-exact result.</p>
<p><?A3B2 "tbl2",5,"anchor"?><xref ref-type="table" rid="table-2">Tab. 2</xref> compares MED, MRED and NED value of the proposed and prior adder designs. Note from <xref ref-type="table" rid="table-2">Tab. 2</xref>, that proposed adders fares better MED and MRED values compared to ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], HPETA1 [<xref ref-type="bibr" rid="ref-17">17</xref>], HPETA2 [<xref ref-type="bibr" rid="ref-17">17</xref>], SARA [<xref ref-type="bibr" rid="ref-18">18</xref>], AEPA-FTFA1 [<xref ref-type="bibr" rid="ref-19">19</xref>] and AEPA-FTFA2 [<xref ref-type="bibr" rid="ref-19">19</xref>] designs. AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>] design use exact full adder cell in approximate and accurate part. Hence MED, MRED and NED values of AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>] design are significantly better compared to proposed and state-of the art adder designs. P-PCER design fare better in MED, MRED and NED values compared to the P-PC design, thanks to the error recovery unit used in P-PCER design.</p>
<table-wrap id="table-2">
<label>Table 2</label>
<caption>
<title>Error metrics of proposed adder and prior designs for N &#x003D; 16</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Adder</th>
<th>ET-CSLA<break/> [<xref ref-type="bibr" rid="ref-16">16</xref>]</th>
<th>SAET-<break/>CSLA<break/> [<xref ref-type="bibr" rid="ref-16">16</xref>]</th>
<th>HPETA 1<break/> [<xref ref-type="bibr" rid="ref-17">17</xref>]</th>
<th>HPETA 2<break/> [<xref ref-type="bibr" rid="ref-17">17</xref>]</th>
<th>SARA<break/> [<xref ref-type="bibr" rid="ref-18">18</xref>]</th>
<th>AEPA-<break/>EFA<break/> [<xref ref-type="bibr" rid="ref-19">19</xref>]</th>
<th>AEPA-<break/>FTFA1<break/> [<xref ref-type="bibr" rid="ref-19">19</xref>]</th>
<th>AEPA-<break/>FTFA2<break/> [<xref ref-type="bibr" rid="ref-19">19</xref>]</th>
<th>P-PC</th>
<th>P-PCER</th>
</tr>
</thead>
<tbody>
<tr>
<td>MED</td>
<td>214.8</td>
<td>96.4</td>
<td>46.67</td>
<td>44.28</td>
<td>47.12</td>
<td>9.2</td>
<td>47.1</td>
<td>53.8</td>
<td>45.7</td>
<td>41.4</td>
</tr>
<tr>
<td>MRED</td>
<td>0.322</td>
<td>0.047</td>
<td>0.032</td>
<td>0.027</td>
<td>0.028</td>
<td>0.0017</td>
<td>0.025</td>
<td>0.028</td>
<td>0.024</td>
<td>0.018</td>
</tr>
<tr>
<td>NED</td>
<td>0.007</td>
<td>0.38</td>
<td>0.24</td>
<td>0.21</td>
<td>0.24</td>
<td>0.49</td>
<td>0.19</td>
<td>0.21</td>
<td>0.19</td>
<td>0.16</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s4_2">
<label>4.2</label>
<title>Area, Delay and Power Comparison</title>
<p>Performance of the proposed and state-of the art adders in terms of total power dissipation (power), area, delay, Area-Delay Product (ADP), Power-Delay Product (PDP) and Energy-Delay Product (EDP) are shown in <?A3B2 "tbl3",5,"anchor"?><xref ref-type="table" rid="table-3">Tab. 3</xref>. Note from <xref ref-type="table" rid="table-3">Tab. 3</xref>, P-PC and P-PCER adders demonstrates 30.8% &#x0026; 28.8%; 35.9% &#x0026; 34%; 54.7% &#x0026; 53.4%; 45.5% &#x0026; 43.8%; 44.7% &#x0026; 43.1%; 40.3% &#x0026; 38.5%;58.8% &#x0026; 57.6%;47.2% &#x0026; 45.6% and 42.1% &#x0026; 40.4% power reductions compared to conventional CLA [<xref ref-type="bibr" rid="ref-1">1</xref>], ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], HPETA1 [<xref ref-type="bibr" rid="ref-17">17</xref>], HPETA2 [<xref ref-type="bibr" rid="ref-17">17</xref>], SARA [<xref ref-type="bibr" rid="ref-18">18</xref>], AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>], AEPA-FTFA1 [<xref ref-type="bibr" rid="ref-19">19</xref>] and AEPA-FTFA2 [<xref ref-type="bibr" rid="ref-19">19</xref>] designs respectively. Though SARA [<xref ref-type="bibr" rid="ref-18">18</xref>] implement look ahead based carry propagation in the accurate part, it shows 11.2% and 9.8% high delay compared to P-PC and P-PCER adders, thanks to the parallel carry generation logic that reduce delay of the proposed designs significantly. ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], HPETA1 [<xref ref-type="bibr" rid="ref-17">17</xref>], HPETA2 [<xref ref-type="bibr" rid="ref-17">17</xref>], AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>], AEPA-FTFA1 [<xref ref-type="bibr" rid="ref-19">19</xref>] and AEPA-FTFA2 [<xref ref-type="bibr" rid="ref-19">19</xref>] adders use carry select algorithm and hence delay of these adders depends on the number of FA cells in a block and increases in the order of number of blocks and 2:1 Mux delay. In terms of PDP, P-PC and P-PCER adders demonstrates 50.8% &#x0026; 48.5%; 35.7% &#x0026; 32%; 57.3% &#x0026; 55.3%; 49.3% &#x0026; 46.9%; 48.9% &#x0026; 46.6%; 47% &#x0026; 44.5%; 68.1% &#x0026; 66.6%; 49.5% &#x0026; 47.2% and 44.2% &#x0026; 41.7% reductions compared to conventional CLA [<xref ref-type="bibr" rid="ref-1">1</xref>], ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], HPETA1 [<xref ref-type="bibr" rid="ref-17">17</xref>], HPETA2 [<xref ref-type="bibr" rid="ref-17">17</xref>],SARA [<xref ref-type="bibr" rid="ref-18">18</xref>], AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>], AEPA-FTFA1 [<xref ref-type="bibr" rid="ref-19">19</xref>] and AEPA-FTFA2 [<xref ref-type="bibr" rid="ref-19">19</xref>] adders respectively.</p>
<p>Also note from <xref ref-type="table" rid="table-3">Tab. 3</xref>, P-PC and P-PCER adders show 34.5% &#x0026; 32.4%; 41.1% &#x0026; 39.2%; 58.3% &#x0026; 56.8%; 47.4% &#x0026; 45.9%; 48.2% &#x0026; 46.5%; 20.8% &#x0026; 18.3%; 50% &#x0026; 48.4%; 45.4% &#x0026; 43.7% and 41.3% &#x0026; 39.5% area reductions compared to conventional CLA [<xref ref-type="bibr" rid="ref-1">1</xref>], ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], HPETA1 [<xref ref-type="bibr" rid="ref-17">17</xref>], HPETA2 [<xref ref-type="bibr" rid="ref-17">17</xref>], SARA [<xref ref-type="bibr" rid="ref-18">18</xref>], AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>], AEPA-FTFA1 [<xref ref-type="bibr" rid="ref-19">19</xref>] and AEPA-FTFA2 [<xref ref-type="bibr" rid="ref-19">19</xref>] adders respectively. AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>] shows the highest area compared to other approximate adders related. On the other hand, AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>] demonstrates the least error.</p>
<table-wrap id="table-3">
<label>Table 3</label>
<caption>
<title>Performance comparison of proposed and prior designs (N &#x003D; 16)-in 180 nm ASIC PDK technology</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Adder designs</th>
<th>Area<break/>(&#x03BC;m<sup>2</sup>)</th>
<th>Delay<break/>(ps)</th>
<th>Power<break/>(&#x03BC;W)</th>
<th>PDP<break/>(x10<sup>-15</sup> J)</th>
<th>ADP<break/>(x10<sup>-13</sup> m<sup>2</sup>-s)</th>
<th>EDP<break/>(X10<sup>-23</sup> Js)</th>
<th>Maximum<break/>error</th>
<th>% average<break/> error</th>
</tr>
</thead>
<tbody>
<tr>
<td>Conventional CLA [<xref ref-type="bibr" rid="ref-1">1</xref>]</td>
<td>1127.65</td>
<td>3235</td>
<td>68.83</td>
<td>222.67</td>
<td>36.48</td>
<td>72.03</td>
<td>&#x2013;</td>
<td>&#x2013;</td>
</tr>
<tr>
<td>ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>]</td>
<td>1254.05</td>
<td>2289</td>
<td>74.26</td>
<td>169.98</td>
<td>28.71</td>
<td>38.91</td>
<td>32768</td>
<td>32.23</td>
</tr>
<tr>
<td>SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>]</td>
<td>1639.92</td>
<td>2437</td>
<td>105.20</td>
<td>256.37</td>
<td>39.96</td>
<td>62.48</td>
<td>256</td>
<td>3.76</td>
</tr>
<tr>
<td>HPETA 1 [<xref ref-type="bibr" rid="ref-17">17</xref>]</td>
<td>1403.51</td>
<td>2473</td>
<td>87.31</td>
<td>215.92</td>
<td>34.71</td>
<td>53.40</td>
<td>256</td>
<td>1.96</td>
</tr>
<tr>
<td>HPETA 2 [<xref ref-type="bibr" rid="ref-17">17</xref>]</td>
<td>1424.34</td>
<td>2490</td>
<td>86.14</td>
<td>214.49</td>
<td>35.47</td>
<td>53.41</td>
<td>256</td>
<td>1.82</td>
</tr>
<tr>
<td>SARA [<xref ref-type="bibr" rid="ref-18">18</xref>]</td>
<td>932.00</td>
<td>2592</td>
<td>79.72</td>
<td>206.63</td>
<td>24.16</td>
<td>53.56</td>
<td>256</td>
<td>2.08</td>
</tr>
<tr>
<td>AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>]</td>
<td>1476.92</td>
<td>2968</td>
<td>115.63</td>
<td>343.10</td>
<td>43.84</td>
<td>101.83</td>
<td>17</td>
<td>0.23</td>
</tr>
<tr>
<td>AEPA-FTFA1 [<xref ref-type="bibr" rid="ref-19">19</xref>]</td>
<td>1353.84</td>
<td>2406</td>
<td>90.13</td>
<td>216.85</td>
<td>32.57</td>
<td>52.17</td>
<td>256</td>
<td>2.39</td>
</tr>
<tr>
<td>AEPA-FTFA2 [<xref ref-type="bibr" rid="ref-19">19</xref>]</td>
<td>1258.05</td>
<td>2386</td>
<td>82.31</td>
<td>196.39</td>
<td>30.02</td>
<td>46.86</td>
<td>256</td>
<td>2.75</td>
</tr>
<tr>
<td>P-PC</td>
<td>738.46</td>
<td>2301</td>
<td>47.62</td>
<td>109.57</td>
<td>16.99</td>
<td>25.21</td>
<td>256</td>
<td>2.42</td>
</tr>
<tr>
<td>P-PC ER</td>
<td>761.75</td>
<td>2337</td>
<td>49.03</td>
<td>114.58</td>
<td>17.80</td>
<td>26.78</td>
<td>128</td>
<td>1.72</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Furthermore, it is noted that ADP and EDP metrics of proposed adders are significantly better compared to other approximate adders considered for evaluation. <?A3B2 "tbl4",5,"anchor"?><xref ref-type="table" rid="table-4">Tab. 4</xref> gives a brief comparison of power, delay, area, PDP and EDP metrics of proposed adders for various bit-widths in input operands (n &#x003D; 8, 16, 32). It is noted from <xref ref-type="table" rid="table-4">Tab. 4</xref>, area, power and PDP increase at the least by 1.8X, 1.8X and 3X proportion for 2X increase in input operand bit-width. Also, it is noted, the proportion of PDP rise of proposed adders is more for higher bit-widths in input operands.</p>
<table-wrap id="table-4">
<label>Table 4</label>
<caption>
<title>Area, power, delay comparison of proposed adders for various input bits (N &#x003D; 16)-180 nm technology</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th rowspan="2">Parameter</th>
<th colspan="2">n &#x003D; 8</th>
<th colspan="2">n &#x003D; 16</th>
<th colspan="2">n &#x003D; 32</th>
</tr>
<tr>
<th>P-PC</th>
<th>P-PCER</th>
<th>P-PC</th>
<th>P-PC</th>
<th>P-PCER</th>
<th>P-PC</th>
</tr>
</thead>
<tbody>
<tr>
<td>Power (<inline-formula id="ieqn-3"><mml:math id="mml-ieqn-3"><mml:mo>&#x00D7;</mml:mo></mml:math></inline-formula>10<sup>&#x2212;6</sup> w)</td>
<td>20.70</td>
<td>21.47</td>
<td>37.62</td>
<td>39.03</td>
<td>82.76</td>
<td>85.86</td>
</tr>
<tr>
<td>Delay (ns)</td>
<td>1.37</td>
<td>1.39</td>
<td>2.29</td>
<td>2.31</td>
<td>4.81</td>
<td>4.853</td>
</tr>
<tr>
<td>Area (&#x03BC;m<sup>2</sup>)</td>
<td>406.31</td>
<td>418.96</td>
<td>738.46</td>
<td>761.75</td>
<td>1624.61</td>
<td>1675.85</td>
</tr>
<tr>
<td>PDP (<inline-formula id="ieqn-4"><mml:math id="mml-ieqn-4"><mml:mo>&#x00D7;</mml:mo></mml:math></inline-formula>10<sup>&#x2212;15</sup> J)</td>
<td>28.41</td>
<td>29.77</td>
<td>86.11</td>
<td>90.19</td>
<td>397.74</td>
<td>416.67</td>
</tr>
<tr>
<td>EDP (<inline-formula id="ieqn-5"><mml:math id="mml-ieqn-5"><mml:mo>&#x00D7;</mml:mo></mml:math></inline-formula>10<sup>&#x2212;24</sup> J)</td>
<td>39</td>
<td>41.29</td>
<td>197.11</td>
<td>208.42</td>
<td>1911.53</td>
<td>2022.09</td>
</tr>
</tbody>
</table>
</table-wrap>
<p><?A3B2 "fig6",5,"anchor"?><xref ref-type="fig" rid="fig-6">Fig. 6a</xref> shows the efficacy of proposed and other approximate adders in optimizing area and error through area <italic>vs</italic>. MRED plot, <xref ref-type="fig" rid="fig-6">Figs. 6b</xref> and <xref ref-type="fig" rid="fig-6">6c</xref> exhibits the capability of proposed adders in optimizing ADP, PDP against error through N<sub>ADP</sub>-MRED product and N<sub>PDP</sub>-MRED product plots. N<sub>ADP</sub> and N<sub>PDP</sub>, defined as normalized ADP and PDP metrics, respectively, are calculated using <xref ref-type="disp-formula" rid="eqn-23">(23)</xref> &#x0026; <xref ref-type="disp-formula" rid="eqn-24">(24)</xref>.</p>
<p><disp-formula id="eqn-28">
<label>(23)</label>
<mml:math id="mml-eqn-28" display="block"><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>A</mml:mi><mml:mi>D</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>A</mml:mi><mml:mi>D</mml:mi><mml:mi>P</mml:mi><mml:mtext>&#xA0;</mml:mtext><mml:mi>o</mml:mi><mml:mi>f</mml:mi><mml:mtext>&#xA0;</mml:mtext><mml:mrow><mml:mi mathvariant="italic">a</mml:mi><mml:mi mathvariant="italic">p</mml:mi><mml:mi mathvariant="italic">p</mml:mi><mml:mi mathvariant="italic">r</mml:mi><mml:mi mathvariant="italic">o</mml:mi><mml:mi mathvariant="italic">x</mml:mi><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">m</mml:mi><mml:mi mathvariant="italic">a</mml:mi><mml:mi mathvariant="italic">t</mml:mi><mml:mi mathvariant="italic">e</mml:mi></mml:mrow><mml:mtext>&#xA0;</mml:mtext><mml:mrow><mml:mi mathvariant="italic">s</mml:mi><mml:mi mathvariant="italic">y</mml:mi><mml:mi mathvariant="italic">s</mml:mi><mml:mi mathvariant="italic">t</mml:mi><mml:mi mathvariant="italic">e</mml:mi><mml:mi mathvariant="italic">m</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mi>A</mml:mi><mml:mi>D</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:mi>x</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:math>
</disp-formula></p>
<p><disp-formula id="eqn-29">
<label>(24)</label>
<mml:math id="mml-eqn-29" display="block"><mml:msub><mml:mi>N</mml:mi><mml:mrow><mml:mi>P</mml:mi><mml:mi>D</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>P</mml:mi><mml:mi>D</mml:mi><mml:mi>P</mml:mi><mml:mtext>&#xA0;</mml:mtext><mml:mi>o</mml:mi><mml:mi>f</mml:mi><mml:mtext>&#xA0;</mml:mtext><mml:mrow><mml:mi mathvariant="italic">a</mml:mi><mml:mi mathvariant="italic">p</mml:mi><mml:mi mathvariant="italic">p</mml:mi><mml:mi mathvariant="italic">r</mml:mi><mml:mi mathvariant="italic">o</mml:mi><mml:mi mathvariant="italic">x</mml:mi><mml:mi mathvariant="italic">i</mml:mi><mml:mi mathvariant="italic">m</mml:mi><mml:mi mathvariant="italic">a</mml:mi><mml:mi mathvariant="italic">t</mml:mi><mml:mi mathvariant="italic">e</mml:mi></mml:mrow><mml:mtext>&#xA0;</mml:mtext><mml:mrow><mml:mi mathvariant="italic">s</mml:mi><mml:mi mathvariant="italic">y</mml:mi><mml:mi mathvariant="italic">s</mml:mi><mml:mi mathvariant="italic">t</mml:mi><mml:mi mathvariant="italic">e</mml:mi><mml:mi mathvariant="italic">m</mml:mi></mml:mrow></mml:mrow><mml:mrow><mml:mi>P</mml:mi><mml:mi>D</mml:mi><mml:msub><mml:mi>P</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mi>a</mml:mi><mml:mi>x</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:math>
</disp-formula></p>
<p>where, ADP<sub>max</sub> and PDP<sub>max</sub> represent maximum ADP and PDP values of approximate systems compared.</p>
<fig id="fig-6">
<label>Figure 6</label>
<caption>
<title>Error-performance comparison of proposed and prior adders (a) Area <italic>vs</italic>. MRED, (b) NADP <italic>vs</italic>. MRED (c) NPDP <italic>vs</italic>. MRED</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_19789-fig-6.png"/>
</fig>
<p>Note from <xref ref-type="fig" rid="fig-6">Figs. 6b</xref> and <xref ref-type="fig" rid="fig-6">6c</xref>, P-PC and P-PCER adders demonstrate low N<sub>ADP</sub>-MRED and N<sub>PDP</sub>-MRED products compared to ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], HPETA1 [<xref ref-type="bibr" rid="ref-17">17</xref>], HPETA2 [<xref ref-type="bibr" rid="ref-17">17</xref>], SARA [<xref ref-type="bibr" rid="ref-18">18</xref>], AEPA-FTFA1 [<xref ref-type="bibr" rid="ref-19">19</xref>] and AEPA-FTFA2 [<xref ref-type="bibr" rid="ref-19">19</xref>] adders. AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>] design shows the least N<sub>ADP</sub>-MRED and N<sub>PDP</sub>-MRED products compared to the proposed and other approximate adders. Conversely, the area of AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>] is high, as noted from <xref ref-type="fig" rid="fig-6">Fig. 6a</xref>. Also, from <xref ref-type="fig" rid="fig-6">Fig. 6a</xref>, P-PC and P-PCER adders demonstrate the least area and MRED while ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>] demonstrate high MRED and moderate area. SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>] exhibits high area and moderate MRED, HPETA1 [<xref ref-type="bibr" rid="ref-17">17</xref>] and HPETA2 [<xref ref-type="bibr" rid="ref-17">17</xref>] demonstrates high area and small MRED.</p>
<p>In addition, we have extracted power dissipation of P-PC and P-PCER adders against various operating frequency and temperature using transistor level schematics designed in Cadence Virtuoso design environment and simulations using Cadence Spectre. <?A3B2 "fig7",5,"anchor"?><xref ref-type="fig" rid="fig-7">Fig. 7</xref> shows the Power vs. frequency plot of P-PC adder in 90 and 180 nm ASIC PDK technology.</p>
<fig id="fig-7">

<label>Figure 7</label>
<caption>
<title>Power dissipation of P-PC adder against various operating frequency in 90 and 180 nm ASIC PDK technology</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_19789-fig-7.png"/>
</fig>
<p>We have used supply voltage &#x003D; 1 &#x0026; 1.8 V for simulations with 90 and 180 nm ASIC PDK technology, respectively. Note from <xref ref-type="fig" rid="fig-7">Fig. 7</xref>, the power dissipation increases linearly for a rise in frequency. <?A3B2 "fig8",5,"anchor"?><xref ref-type="fig" rid="fig-8">Fig. 8</xref> shows the power vs. temperature plots of proposed P-PC and P-PCER adders in 180 nm ASIC PDK. Note from <xref ref-type="fig" rid="fig-8">Fig. 8</xref>, power dissipation of proposed adders decreases moderately for a rise in temperature in the range 27 to 120&#x00B0;C and exhibits functionality without any degradation.</p>
<fig id="fig-8">
<label>Figure 8</label>
<caption>
<title>Power dissipation of proposed designs against various operating temperature (Vdd &#x003D; 1.8 V &#x0026; f &#x003D; 400 MHz)</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_19789-fig-8.png"/>
</fig>
</sec>
</sec>
<sec id="s5">
<label>5</label>
<title>Implementation in Digital Image Processing</title>
<p>To determine the novelty of the proposed approximate adders in fault-tolerant applications and verify their driving competence, implementations in image enhancement applications viz., filtering and blending are done in FPGA platform. The Verilog HDL models of the proposed adders and state-of-the-art approximate designs defined in literature are synthesized using Xilinx ISE 14.2, tool and hardware for the application system is prototyped on Spartan 6 FPGA (XC6XLX45-CSG324) device. Input images of size 256 <inline-formula id="ieqn-6"><mml:math id="mml-ieqn-6"><mml:mo>&#x00D7;</mml:mo></mml:math></inline-formula> 256 are fed from MATLAB environment to FPGA hardware using Xilinx-MATLAB co-simulation with System Generator tool.</p>
<sec id="s5_1">
<label>5.1</label>
<title>Gaussian Filter</title>
<p>To assess the performance of proposed P-PC and P-PCER adders in recurring addition, an implementation in Gaussian Filter (GF) is done. The Gaussian-smoothing filter is a 2-D convolution filter used to remove Gaussian noise and smoothen the noisy input image. Statistical noise having Probability Density Function (PDF) equal to that of the normal distribution is known as Gaussian noise. The PDF of Gaussian random variable is given by:</p>
<p><disp-formula id="eqn-30">
<label>(25)</label>
<mml:math id="mml-eqn-30" display="block"><mml:msub><mml:mi>p</mml:mi><mml:mrow><mml:mi>G</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>(</mml:mo><mml:mi>z</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mrow><mml:mi>&#x03C3;</mml:mi><mml:msqrt><mml:mn>2</mml:mn></mml:msqrt><mml:mi>&#x03C0;</mml:mi></mml:mrow></mml:mfrac><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mfrac><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mi>z</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mi>&#x03BC;</mml:mi><mml:msup><mml:mo stretchy="false">)</mml:mo><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:msup><mml:mi>&#x03C3;</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfrac></mml:mrow></mml:msup></mml:math>
</disp-formula></p>
<p>where, z &#x003D; grey level, &#x03BC; &#x003D; mean value and &#x03C3; &#x003D; standard deviation.</p>
<p><?A3B2 "fig9",5,"anchor"?><xref ref-type="fig" rid="fig-9">Fig. 9</xref> shows the block-level architecture of GF. 256 <inline-formula id="ieqn-7"><mml:math id="mml-ieqn-7"><mml:mo>&#x00D7;</mml:mo></mml:math></inline-formula> 256 standard images corrupted with Gaussian noise (&#x03BC; &#x003D; 0.2 and &#x03C3; &#x003D; 0.005) are used as inputs to the GF system implemented with exact and various approximate adders. Processed outputs of various GFs are shown in <?A3B2 "fig10",5,"anchor"?><xref ref-type="fig" rid="fig-10">Fig. 10</xref>.</p>
<fig id="fig-9">

<label>Figure 9</label>
<caption>
<title>Block-level architecture of Gaussian filter</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_19789-fig-9.png"/>
</fig>
<p>The quality metrics Mean Square Error (MSE) and Structural Similarity Index (SSIM) between output images processed by approximate and exact GF systems are used as a measure to evaluate the performance of the proposed approximate designs. MSE is given by <xref ref-type="disp-formula" rid="eqn-26">(26)</xref>.</p>
<p><disp-formula id="eqn-31">
<label>(26)</label>
<mml:math id="mml-eqn-31" display="block"><mml:mi>M</mml:mi><mml:mi>S</mml:mi><mml:mi>E</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mrow><mml:mi>a</mml:mi><mml:mi>b</mml:mi></mml:mrow></mml:mfrac><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>a</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:munderover><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>b</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:munderover><mml:mo stretchy="false">[</mml:mo><mml:mi>S</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi>S</mml:mi><mml:mi mathvariant="normal">&#x2032;</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:msup><mml:mo stretchy="false">]</mml:mo><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msup></mml:math>
</disp-formula></p>
<p>where, a, b-size of the image; S (i, j) &#x0026; S&#x2019; (i, j) are the exact and approximate filter outputs. Note that, from <xref ref-type="fig" rid="fig-10">Fig. 10</xref>, SSIM and MSE values of output images processed by P-PC and P-PCER GFs are better compared to ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], HPETA1 [<xref ref-type="bibr" rid="ref-17">17</xref>], SARA [<xref ref-type="bibr" rid="ref-18">18</xref>], AEPA-FTFA1 [<xref ref-type="bibr" rid="ref-19">19</xref>] and AEPA-FTFA2 [<xref ref-type="bibr" rid="ref-19">19</xref>] GF systems. AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>] GF system fare better SSIM and MSE compared to proposed and other approximate GFs. This is due to the exact FA cells used in the approximate logic of AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>], limiting the maximum error to UBP-n/4.</p>
<p>On the other hand, ADP, PDP values of AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>] are significantly high compared to all other approximate GFs. Note from <xref ref-type="fig" rid="fig-10">Fig. 10</xref>, visual quality of processed images by P-PC, and P-PCER GFs are better and analogous with output image processed by standard GF. Output image processed by ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>] based GF exhibit poor visual quality with patches due to high error performance. In addition, we have extracted Average MSE (AMSE) metric using different output images processed by approximate GF systems with various standard input images-Lena, Boat, Cameraman, Bridge and Peppers and shown in <?A3B2 "fig11",5,"anchor"?><xref ref-type="fig" rid="fig-11">Fig. 11</xref>. It is noted from <xref ref-type="fig" rid="fig-11">Fig. 11</xref>, AMSE of P-PCER adder based GF is significantly reduced compared to P-PC and other approximate adder based GFs. AMSE of P-PC adder based GF is better compared to ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], HPETA1 [<xref ref-type="bibr" rid="ref-17">17</xref>], SARA [<xref ref-type="bibr" rid="ref-18">18</xref>], AEPA-FTFA1 [<xref ref-type="bibr" rid="ref-19">19</xref>] and AEPA-FTFA2 [<xref ref-type="bibr" rid="ref-19">19</xref>] based GFs.</p>
<fig id="fig-10">

<label>Figure 10</label>
<caption>
<title>(a) Standard Image (b) 5% noisy image (c)&#x2013;(n) Output images processed by Gaussian filter implemented using various adders (c) Conventional CLA [<xref ref-type="bibr" rid="ref-1">1</xref>] (d) ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>] (e) SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>] (f) HPETA1 [<xref ref-type="bibr" rid="ref-17">17</xref>] (g) HPETA2 [<xref ref-type="bibr" rid="ref-17">17</xref>] (h) SARA [<xref ref-type="bibr" rid="ref-18">18</xref>] (i) AEFA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>] (j) AEFA-FTFA1 [<xref ref-type="bibr" rid="ref-19">19</xref>] (k) AEFA-FTFA2 [<xref ref-type="bibr" rid="ref-19">19</xref>] (l) P-PC adder (m) P-PCER adder</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_19789-fig-10.png"/>
</fig>
<fig id="fig-11">
<label>Figure 11</label>
<caption>
<title>AMSE comparison of GF system implemented with proposed and prior approximate adders</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_19789-fig-11.png"/>
</fig>
<p>Since GF system involves more than one adder, we have evaluated area, ADP and PDP metrics of the system with proposed and prior approximate adders in ASIC platform using Cadence Encounter with 180 nm PDK Technology and shown in <?A3B2 "tbl5",5,"anchor"?><xref ref-type="table" rid="table-5">Tab. 5</xref>. Note from <xref ref-type="table" rid="table-5">Tab. 5</xref>, GF with P-PC and P-PCER adders fare 34.7% &#x0026; 32.6%; 40.8% &#x0026; 38.9%; 54.8% &#x0026; 53.3%; 46.8% &#x0026; 45.1%; 47.7% &#x0026; 46.1%; 20.9% &#x0026; 18.4%; 49.8% &#x0026; 48.2%; 45.3% &#x0026; 43.6% and 41% &#x0026; 39.1% area reductions compared to conventional CLA [<xref ref-type="bibr" rid="ref-1">1</xref>], ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], HPETA1 [<xref ref-type="bibr" rid="ref-17">17</xref>], HPETA2 [<xref ref-type="bibr" rid="ref-17">17</xref>], SARA [<xref ref-type="bibr" rid="ref-18">18</xref>], AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>], AEPA-FTFA1 [<xref ref-type="bibr" rid="ref-19">19</xref>] and AEPA-FTFA2 [<xref ref-type="bibr" rid="ref-19">19</xref>] based filter systems respectively.</p>
<table-wrap id="table-5">
<label>Table 5</label>
<caption>
<title>Area, power and delay estimates of GF system implemented with various adders in 180 nm ASIC technology</title>
</caption>
<table>
<colgroup>
<col charoff="7"/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>GF system with</th>
<th>Area (&#x03BC;m<sup>2</sup>)</th>
<th>Delay (ps)</th>
<th>Power (&#x03BC;W)</th>
<th>PDP (<inline-formula id="ieqn-8"><mml:math id="mml-ieqn-8"><mml:mo>&#x00D7;</mml:mo></mml:math></inline-formula>10<sup>&#x2212;15</sup> J)</th>
<th>ADP (<inline-formula id="ieqn-9"><mml:math id="mml-ieqn-9"><mml:mo>&#x00D7;</mml:mo></mml:math></inline-formula>10<sup>&#x2212;13</sup> m<sup>2</sup>-s)</th>
</tr>
</thead>
<tbody>
<tr>
<td>Conventional CLA [<xref ref-type="bibr" rid="ref-1">1</xref>]</td>
<td>9.208</td>
<td>13.09</td>
<td>564.41</td>
<td>73.88</td>
<td>120.5</td>
</tr>
<tr>
<td>ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>]</td>
<td>10.156</td>
<td>9.29</td>
<td>608.93</td>
<td>56.6</td>
<td>94.4</td>
</tr>
<tr>
<td>SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>]</td>
<td>13.286</td>
<td>10.04</td>
<td>862.64</td>
<td>86.6</td>
<td>133.4</td>
</tr>
<tr>
<td>HPETA 1 [<xref ref-type="bibr" rid="ref-17">17</xref>]</td>
<td>11.304</td>
<td>10.39</td>
<td>715.94</td>
<td>74.36</td>
<td>117.4</td>
</tr>
<tr>
<td>HPETA 2 [<xref ref-type="bibr" rid="ref-17">17</xref>]</td>
<td>11.5</td>
<td>10.62</td>
<td>706.35</td>
<td>75.01</td>
<td>122.1</td>
</tr>
<tr>
<td>SARA [<xref ref-type="bibr" rid="ref-18">18</xref>]</td>
<td>7.598</td>
<td>10.19</td>
<td>653.7</td>
<td>66.6</td>
<td>77.42</td>
</tr>
<tr>
<td>AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>]</td>
<td>11.97</td>
<td>12.47</td>
<td>947.92</td>
<td>118.16</td>
<td>149.27</td>
</tr>
<tr>
<td>AEPA-FTFA1 [<xref ref-type="bibr" rid="ref-19">19</xref>]</td>
<td>10.99</td>
<td>10.11</td>
<td>739.07</td>
<td>74.68</td>
<td>111.1</td>
</tr>
<tr>
<td>AEPA-FTFA2 [<xref ref-type="bibr" rid="ref-19">19</xref>]</td>
<td>10.18</td>
<td>10.02</td>
<td>674.94</td>
<td>67.64</td>
<td>102</td>
</tr>
<tr>
<td>P-PC</td>
<td>6.01</td>
<td>9.37</td>
<td>390.48</td>
<td>36.6</td>
<td>56.31</td>
</tr>
<tr>
<td>P-PC ER</td>
<td>6.203</td>
<td>9.48</td>
<td>402.05</td>
<td>38.11</td>
<td>58.8</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Consistently P-PC and P-PCER GFs demonstrate significant ADP reduction compared to all other approximate GF systems related. The processing speed of P-PC and P-PCER adder based GF systems are significantly improved compared to conventional CLA [<xref ref-type="bibr" rid="ref-1">1</xref>], SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], HPETA1 [<xref ref-type="bibr" rid="ref-17">17</xref>], HPETA2 [<xref ref-type="bibr" rid="ref-17">17</xref>], SARA [<xref ref-type="bibr" rid="ref-18">18</xref>], AEPA-EFA [<xref ref-type="bibr" rid="ref-19">19</xref>], AEPA-FTFA1 [<xref ref-type="bibr" rid="ref-19">19</xref>] and AEPA-FTFA2 [<xref ref-type="bibr" rid="ref-19">19</xref>] based system, thanks to the proposed parallel carry generation logic that reduces the critical delay of proposed adders. ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>] based GF system demonstrates the best processing speed compared to all other approximate systems. Conversely, the AMSE value of ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>] GF is enormous.</p>
</sec>
<sec id="s5_2">
<label>5.2</label>
<title>Image Blending</title>
<p>Image blending is the process of adding the corresponding pixel values of two input images of the same size. Pixel intensity values of the input images are scaled such that the pixel value of the resulting image does not exceed the maximum threshold. <?A3B2 "fig12",5,"anchor"?><xref ref-type="fig" rid="fig-12">Fig. 12</xref> shows the block-level architecture for image blending. Input images of size 256 <inline-formula id="ieqn-10"><mml:math id="mml-ieqn-10"><mml:mo>&#x00D7;</mml:mo></mml:math></inline-formula> 256, with pixel intensity values scaled by fractional constant &#x2018;&#x03B1;&#x2019; and &#x2018;1 &#x2212; &#x03B1;&#x2019; respectively, are fed to the adder unit. Adder unit sums the scaled images to produce the resultant blended image.</p>
<fig id="fig-12">

<label>Figure 12</label>
<caption>
<title>Block diagram of image blending system</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_19789-fig-12.png"/>
</fig>
<p><?A3B2 "fig13",5,"anchor"?><xref ref-type="fig" rid="fig-13">Fig. 13</xref> shows the output images processed by the image blending system implemented with proposed and prior approximate adders. Note from <xref ref-type="fig" rid="fig-13">Fig. 13</xref>, SSIM and MSE values of output images processed by image blending system implemented with P-PC and P-PCER adders are significantly better compared to output images processed by adders considered for comparison. In addition, we have extracted the AMSE metric of various blending systems using processed images for various combinations of standard input images-Lena, Boat, Cameraman, Bridge and Peppers with &#x03B1; &#x003D; 0.8, 0.5 &#x0026; 0.2 and shown in <?A3B2 "fig14",5,"anchor"?><xref ref-type="fig" rid="fig-14">Fig. 14</xref>. It is noted from <xref ref-type="fig" rid="fig-14">Fig. 14</xref>, AMSE values of P-PC and P-PCER adder systems are significantly reduced compared to conventional CLA [<xref ref-type="bibr" rid="ref-1">1</xref>], ET CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>], HPETA1 [<xref ref-type="bibr" rid="ref-17">17</xref>], HPETA2 [<xref ref-type="bibr" rid="ref-17">17</xref>], SARA [<xref ref-type="bibr" rid="ref-18">18</xref>], AEPA-FTFA1 [<xref ref-type="bibr" rid="ref-19">19</xref>] and AEPA-FTFA2 [<xref ref-type="bibr" rid="ref-19">19</xref>] systems.</p>
<fig id="fig-13">
<label>Figure 13</label>
<caption>
<title>(a)&#x2013;(b) Input images (c)&#x2013;(n) Output images processed by Image blending system implemented using various adders. (c) Conventional CLA [<xref ref-type="bibr" rid="ref-1">1</xref>] (d) ET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>] (e) SAET-CSLA [<xref ref-type="bibr" rid="ref-16">16</xref>] (f) HPETA1 [<xref ref-type="bibr" rid="ref-17">17</xref>] (g) HPETA2 [<xref ref-type="bibr" rid="ref-17">17</xref>] (h) SARA [<xref ref-type="bibr" rid="ref-18">18</xref>] (i) AEFA-EFA [<xref ref-type="bibr" rid="ref-20">20</xref>] (j) AEFA-FTFA1 [<xref ref-type="bibr" rid="ref-20">20</xref>] (k) AEFA-FTFA2 [<xref ref-type="bibr" rid="ref-20">20</xref>] (l) P-PC adder (m) P-PCER adder</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_19789-fig-13.png"/>
</fig>
<fig id="fig-14">
<label>Figure 14</label>
<caption>
<title>AMSE comparison of Image blending system using various adders</title>
</caption>
<graphic mimetype="image" mime-subtype="png" xlink:href="CMC_19789-fig-14.png"/>
</fig>
</sec>
</sec>
<sec id="s6">
<label>6</label>
<title>Conclusion</title>
<p>In this article, we have proposed area-delay and energy optimized faithful approximate adders using parallel carry generation logic with reduced critical node capacitance. To trade-off error for trifling increase in area, error recovery units are used in the approximate part. Synthesis results with n-16 and 4-bit parallel carry generation block revealed that the proposed P-PC and P-PCER adders outperformed standard and other approximate designs in terms of area, ADP and PDP reductions. Functionality and driving capability of the proposed adders are evaluated with implementation in image de-noising and image blending systems. The superiority of proposed adders in image processing is demonstrated through a visual examination and structural similarity of processed outputs. At an instance, the proposed adders with n &#x003D; 16 demonstrated at least 94% structural similarity and 42.4% &#x0026; 43.7% decline in ADP and PDP, respectively, compared to the recent approximate design.</p>
</sec>
</body>
<back>
<ack>
<p>Thanks to the reviewing committee for their valuable points and notes.</p>
</ack>
<fn-group>
<fn fn-type="other">
<p><bold>Funding Statement:</bold> The author received no specific funding for this study.</p>
</fn>
<fn fn-type="conflict">
<p><bold>Conflicts of Interest:</bold> The authors declare that they have no conflicts of interest to report regarding the present study.</p>
</fn>
</fn-group>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>[1]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><given-names>D.</given-names> <surname>Ercegovac</surname></string-name> and <string-name><given-names>T.</given-names> <surname>Lang</surname></string-name></person-group>, <source>Digital Arithmetic</source>, <edition>1<sup>st</sup> </edition> ed., vol. <volume>1</volume>. <publisher-loc>Amsterdam, Netherlands</publisher-loc>: <publisher-name>Elsevier</publisher-name>, pp. <fpage>51</fpage>&#x2013;<lpage>129</lpage>, <year>2004</year>.</mixed-citation></ref>
<ref id="ref-2"><label>[2]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><given-names>J. P.</given-names> <surname>Uyemura</surname></string-name></person-group>, <source>Introduction to VLSI Circuits and Systems</source>. vol. <volume>1</volume>. <publisher-loc>Hoboken, New Jersey</publisher-loc>: <publisher-name>John Wiley &#x0026; Sons</publisher-name>, pp. <fpage>467</fpage>&#x2013;<lpage>469</lpage>, <year>2002</year>.</mixed-citation></ref>
<ref id="ref-3"><label>[3]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><given-names>J. P.</given-names> <surname>Uyemura</surname></string-name></person-group>, <source>CMOS Logic Circuit Design</source>. vol. <volume>10</volume>. <publisher-loc>Berlin, Germany</publisher-loc>: <publisher-name>Springer Science &#x0026; Business Media</publisher-name>, pp. <fpage>392</fpage>&#x2013;<lpage>404</lpage>, <year>2007</year>.</mixed-citation></ref>
<ref id="ref-4"><label>[4]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>B.</given-names> <surname>Ramkumar</surname></string-name> and <string-name><given-names>H. M.</given-names> <surname>Kittur</surname></string-name></person-group>, &#x201C;<article-title>Low-power and area-efficient carry select adder</article-title>,&#x201D; <source>IEEE Trans. on Very Large Scale Integration (VLSI) Systems</source>, vol. <volume>20</volume>, no. <issue>2</issue>, pp. <fpage>371</fpage>&#x2013;<lpage>375</lpage>, <year>2012</year>.</mixed-citation></ref>
<ref id="ref-5"><label>[5]</label><mixed-citation publication-type="book"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Han</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Honglan</surname></string-name> and <string-name><given-names>L.</given-names> <surname>Liu</surname></string-name></person-group>, <source>Approximate Circuits Methodologies and CAD</source>. <publisher-loc>Berlin, Germany</publisher-loc>: <publisher-name>Springer</publisher-name>, pp. <fpage>67</fpage>&#x2013;<lpage>72</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-6"><label>[6]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>M.</given-names> <surname>Valinataj</surname></string-name></person-group>, &#x201C;<article-title>Fault-tolerant carry look-ahead adder architectures robust to multiple simultaneous errors</article-title>,&#x201D; <source>Microelectronics Reliability</source>, vol. <volume>55</volume>, no. <issue>12, Part B</issue>, pp. <fpage>2845</fpage>&#x2013; <lpage>2857</lpage>, <year>2015</year>.</mixed-citation></ref>
<ref id="ref-7"><label>[7]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>M. A.</given-names> <surname>Breuer</surname></string-name> and <string-name><given-names>H.</given-names> <surname>Zhu</surname></string-name></person-group>, &#x201C;<article-title>Error-tolerance and multi-media</article-title>,&#x201D; in <conf-name>Int. Conf. on Intelligent Information Hiding and Multimedia</conf-name>, <publisher-loc>Pasadena, CA, USA</publisher-loc>, pp. <fpage>521</fpage>&#x2013;<lpage>524</lpage>, <year>2006</year>. </mixed-citation></ref>
<ref id="ref-8"><label>[8]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Han</surname></string-name> and <string-name><given-names>M.</given-names> <surname>Orshansky</surname></string-name></person-group>, &#x201C;<article-title>Approximate computing: An emerging paradigm for energy-efficient design</article-title>,&#x201D; in <conf-name>18th IEEE European Test Symp.</conf-name>, <publisher-loc>Avignon, France</publisher-loc>, pp. <fpage>1</fpage>&#x2013;<lpage>6</lpage>, <year>2013</year>. </mixed-citation></ref>
<ref id="ref-9"><label>[9]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>N.</given-names> <surname>Zhu</surname></string-name>, <string-name><given-names>W. L.</given-names> <surname>Goh</surname></string-name> and <string-name><given-names>W.</given-names> <surname>Zhang</surname></string-name></person-group>, &#x201C;<article-title>Design of low-power high-speed truncation-error-tolerant adder and its application in digital signal processing</article-title>,&#x201D; <source>IEEE Trans. Very Large Scale Integration Systems</source>, vol. <volume>18</volume>, no. <issue>8</issue>, pp. <fpage>1225</fpage>&#x2013;<lpage>1229</lpage>, <year>2009</year>.</mixed-citation></ref>
<ref id="ref-10"><label>[10]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>O.</given-names> <surname>Akbari</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Kamal</surname></string-name> and <string-name><given-names>M.</given-names> <surname>Pedram</surname></string-name></person-group>, &#x201C;<article-title>RAP-CLA: A reconfigurable approximate carry look-ahead adder</article-title>,&#x201D; <source>IEEE Transactions on Circuits and Systems&#x2014;II: Express Briefs</source>, vol. <volume>65</volume>, no. <issue>8</issue>, pp. <fpage>1089</fpage>&#x2013;<lpage>1093</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-11"><label>[11]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>V.</given-names> <surname>Gupta</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Mohapatra</surname></string-name> and <string-name><given-names>A.</given-names> <surname>Raghunathan</surname></string-name></person-group>, &#x201C;<article-title>Low-power digital signal processing using approximate adders</article-title>,&#x201D; <source>IEEE Trans. on Computer-Aided Design of Integrated Circuits and Systems</source>, vol. <volume>32</volume>, no. <issue>1</issue>, pp. <fpage>124</fpage>&#x2013;<lpage>137</lpage>, <year>2013</year>.</mixed-citation></ref>
<ref id="ref-12"><label>[12]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>P.</given-names> <surname>Patali</surname></string-name> and <string-name><given-names>S. T.</given-names> <surname>Kassim</surname></string-name></person-group>, &#x201C;<article-title>High throughput FIR filter architectures using retiming and modified CSLA based adders, IET Circuits</article-title>,&#x201D; <source>Devices &#x0026; Systems</source>, vol. <volume>13</volume>, no. <issue>7</issue>, pp. <fpage>1007</fpage>&#x2013;<lpage>1017</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-13"><label>[13]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>V.</given-names> <surname>Gupta</surname></string-name>, <string-name><given-names>D.</given-names> <surname>Mohapatra</surname></string-name> and <string-name><given-names>S. P.</given-names> <surname>Park</surname></string-name></person-group>, &#x201C;<article-title>IMPACT: IMPrecise adders for low-power approximate computing</article-title>,&#x201D; in <conf-name>IEEE/ACM Int. Symp. on Low Power Electronics and Design</conf-name>, <publisher-loc>Fukuoka, Japan</publisher-loc>, pp. <fpage>409</fpage>&#x2013;<lpage>414</lpage>, <year>2011</year>. </mixed-citation></ref>
<ref id="ref-14"><label>[14]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>A. K.</given-names> <surname>Verma</surname></string-name>, <string-name><given-names>P.</given-names> <surname>Brisk</surname></string-name> and <string-name><given-names>P.</given-names> <surname>Ienne</surname></string-name></person-group>, &#x201C;<article-title>Variable latency speculative addition: A new paradigm for arithmetic circuit design</article-title>,&#x201D; in <conf-name>2008 Design, Automation and Test in Europe</conf-name>, <publisher-loc>Munich, Germany</publisher-loc>, pp. <fpage>1250</fpage>&#x2013;<lpage>1255</lpage>, <year>2008</year>. </mixed-citation></ref>
<ref id="ref-15"><label>[15]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><given-names>K.</given-names> <surname>Al-Maaitah</surname></string-name>, <string-name><given-names>G.</given-names> <surname>Tarawneh</surname></string-name> and <string-name><given-names>A.</given-names> <surname>Soltan</surname></string-name></person-group>, &#x201C;<article-title>Approximate adder segmentation technique and significance-driven error correction</article-title>,&#x201D; in <conf-name>27th Int. Symp. on Power and Timing Modeling, Optimization and Simulation</conf-name>, <publisher-loc>Thessaloniki, Greece</publisher-loc>, pp. <fpage>1</fpage>&#x2013;<lpage>6</lpage>, <year>2017</year>. </mixed-citation></ref>
<ref id="ref-16"><label>[16]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>R.</given-names> <surname>Jothin</surname></string-name> and <string-name><given-names>C.</given-names> <surname>Vasanthanayaki</surname></string-name></person-group>, &#x201C;<article-title>High performance significance approximation error tolerance adder for image processing applications</article-title>,&#x201D; <source>Journal of Electronic Testing</source>, vol. <volume>32</volume>, no. <issue>3</issue>, pp. <fpage>377</fpage>&#x2013;<lpage>383</lpage>, <year>2016</year>.</mixed-citation></ref>
<ref id="ref-17"><label>[17]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>R.</given-names> <surname>Jothin</surname></string-name> and <string-name><given-names>C.</given-names> <surname>Vasanthanayaki</surname></string-name></person-group>, &#x201C;<article-title>High performance error tolerant adders for image processing applications</article-title>,&#x201D; <source>IETE Journal of Research</source>, vol. <volume>67</volume>, no. <issue>2</issue>, pp. <fpage>205</fpage>&#x2013;<lpage>216</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-18"><label>[18]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>W.</given-names> <surname>Xu</surname></string-name>, <string-name><given-names>S. S.</given-names> <surname>Sapatnekar</surname></string-name> and <string-name><given-names>J.</given-names> <surname>Hu</surname></string-name></person-group>, &#x201C;<article-title>A simple yet efficient accuracy-configurable adder design</article-title>,&#x201D; <source>IEEE Trans. on Very Large Scale Integration Systems</source>, vol. <volume>26</volume>, no. <issue>6</issue>, pp. <fpage>1112</fpage>&#x2013;<lpage>1125</lpage>, <year>2018</year>.</mixed-citation></ref>
<ref id="ref-19"><label>[19]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>P.</given-names> <surname>Gnanambikai</surname></string-name>, <string-name><given-names>K. N.</given-names> <surname>Vijeyakumar</surname></string-name> and <string-name><given-names>S.</given-names> <surname>Kalaiselvi</surname></string-name></person-group>, &#x201C;<article-title>Area-efficient parallel adder with faithful approximation for image and signal processing applications</article-title>,&#x201D; <source>IET Image Processing</source>, vol. <volume>13</volume>, no. <issue>13</issue>, pp. <fpage>2587</fpage>&#x2013;<lpage>2594</lpage>, <year>2019</year>.</mixed-citation></ref>
<ref id="ref-20"><label>[20]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><given-names>J.</given-names> <surname>Liang</surname></string-name>, <string-name><given-names>J.</given-names> <surname>Han</surname></string-name> and <string-name><given-names>F.</given-names> <surname>Lombardi</surname></string-name></person-group>, &#x201C;<article-title>New metrics for the reliability of approximate and probabilistic adders</article-title>,&#x201D; <source>IEEE Trans. on Computers</source>, vol. <volume>62</volume>, no. <issue>9</issue>, pp. <fpage>1760</fpage>&#x2013;<lpage>1771</lpage>, <year>2013</year>.</mixed-citation></ref>
</ref-list>
</back>
</article>
