<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xml:lang="en" article-type="research-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">CMC</journal-id>
<journal-id journal-id-type="nlm-ta">CMC</journal-id>
<journal-id journal-id-type="publisher-id">CMC</journal-id>
<journal-title-group>
<journal-title>Computers, Materials &#x0026; Continua</journal-title>
</journal-title-group>
<issn pub-type="epub">1546-2226</issn>
<issn pub-type="ppub">1546-2218</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">65812</article-id>
<article-id pub-id-type="doi">10.32604/cmc.2025.065812</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>You KAN See through the Sand in the Dark: Uncertainty-Aware Meets KAN in Joint Low-Light Image Enhancement and Sand-Dust Removal</article-title>
<alt-title alt-title-type="left-running-head">You KAN See through the Sand in the Dark: Uncertainty-Aware Meets KAN in Joint Low-Light Image Enhancement and Sand-Dust Removal</alt-title>
<alt-title alt-title-type="right-running-head">You KAN See through the Sand in the Dark: Uncertainty-Aware Meets KAN in Joint Low-Light Image Enhancement and Sand-Dust Removal</alt-title>
</title-group>
<contrib-group>
<contrib id="author-1" contrib-type="author">
<name name-style="western"><surname>Wei</surname><given-names>Bingcai</given-names></name><xref ref-type="aff" rid="aff-1">1</xref></contrib>
<contrib id="author-2" contrib-type="author" corresp="yes">
<name name-style="western"><surname>Liu</surname><given-names>Hui</given-names></name><xref ref-type="aff" rid="aff-1">1</xref><email>loweliu@whu.edu.cn</email></contrib>
<contrib id="author-3" contrib-type="author">
<name name-style="western"><surname>Qian</surname><given-names>Chuang</given-names></name><xref ref-type="aff" rid="aff-2">2</xref></contrib>
<contrib id="author-4" contrib-type="author">
<name name-style="western"><surname>Shen</surname><given-names>Haoliang</given-names></name><xref ref-type="aff" rid="aff-3">3</xref></contrib>
<contrib id="author-5" contrib-type="author">
<name name-style="western"><surname>Chen</surname><given-names>Yibiao</given-names></name><xref ref-type="aff" rid="aff-3">3</xref></contrib>
<contrib id="author-6" contrib-type="author">
<name name-style="western"><surname>Wang</surname><given-names>Yixin</given-names></name><xref ref-type="aff" rid="aff-3">3</xref></contrib>
<aff id="aff-1"><label>1</label><institution>School of Computer Science, Wuhan University</institution>, <addr-line>Wuhan, 430072</addr-line>, <country>China</country></aff>
<aff id="aff-2"><label>2</label><institution>Intelligent Transport Systems Research Center, Wuhan University of Technology</institution>, <addr-line>Wuhan, 430062</addr-line>, <country>China</country></aff>
<aff id="aff-3"><label>3</label><institution>GNSS Research Center, Wuhan University</institution>, <addr-line>Wuhan, 430072</addr-line>, <country>China</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>&#x002A;</label>Corresponding Author: Hui Liu. Email: <email>loweliu@whu.edu.cn</email></corresp>
</author-notes>
<pub-date date-type="collection" publication-format="electronic">
<year>2025</year>
</pub-date>
<pub-date date-type="pub" publication-format="electronic">
<day>30</day><month>07</month><year>2025</year>
</pub-date>
<volume>84</volume>
<issue>3</issue>
<fpage>5095</fpage>
<lpage>5109</lpage>
<history>
<date date-type="received">
<day>21</day>
<month>3</month>
<year>2025</year>
</date>
<date date-type="accepted">
<day>11</day>
<month>6</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2025 The Authors.</copyright-statement>
<copyright-year>2025</copyright-year>
<copyright-holder>Published by Tech Science Press.</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_CMC_65812.pdf"></self-uri>
<abstract>
<p>Within the domain of low-level vision, enhancing low-light images and removing sand-dust from single images are both critical tasks. These challenges are particularly pronounced in real-world applications such as autonomous driving, surveillance systems, and remote sensing, where adverse lighting and environmental conditions often degrade image quality. Various neural network models, including MLPs, CNNs, GANs, and Transformers, have been proposed to tackle these challenges, with the Vision KAN models showing particular promise. However, existing models, including the Vision KAN models use deterministic neural networks that do not address the uncertainties inherent in these processes. To overcome this, we introduce the Uncertainty-Aware Kolmogorov-Arnold Network (UAKAN), a novel structure that integrates KAN with uncertainty estimation. Our approach uniquely employs Tokenized KANs for sampling within a U-Net architecture&#x2019;s encoder and decoder layers, enhancing the network&#x2019;s ability to learn complex representations. Furthermore, for aleatoric uncertainty, we propose an uncertainty coupling certainty module that couples uncertainty distribution learning and residual learning in a feature fusion manner. For epistemic uncertainty, we propose a feature selection mechanism for spatial and pixel dimension uncertainty modeling, which captures and models uncertainty by learning the uncertainty contained between feature maps. Notably, our uncertainty-aware framework enables the model to produce both high-quality enhanced images and reliable uncertainty maps, which are crucial for downstream applications requiring confidence estimation. Through comparative and ablation studies on our synthetic SLLIE6K dataset, designed for low-light enhancement and sand-dust removal, we validate the effectiveness and theoretical robustness of our methodology.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>Kolmogorov-arnold network</kwd>
<kwd>uncertainty-aware</kwd>
<kwd>distribution attention</kwd>
<kwd>image enhancement</kwd>
<kwd>feature selection</kwd>
</kwd-group>
<funding-group>
<award-group id="awg1">
<funding-source>National Key R&#x0026;D Program of China</funding-source>
<award-id>2023YFB2504400</award-id>
</award-group>
</funding-group>
</article-meta>
</front>
<body>
<sec id="s1">
<label>1</label>
<title>Introduction</title>
<p>The presence of sand-dust in the atmosphere severely degrades image quality through complex optical interactions [<xref ref-type="bibr" rid="ref-1">1</xref>]. Sand particles scatter and absorb light, which causes reduced contrast, hazy appearance, and color distortion in images. These degradations profoundly impact subsequent visual tasks [<xref ref-type="bibr" rid="ref-2">2</xref>,<xref ref-type="bibr" rid="ref-3">3</xref>]. For example, in object detection, obscured edges and diminished visibility hinder accurate object identification and localization. In instance segmentation, unclear boundaries due to sand-dust interference result in inaccurate pixel-level classification. In scene recognition, the weakening of critical visual cues such as texture and color gradients leads to unreliable semantic interpretation. Insufficient lighting compounds these issues by introducing high noise, narrow dynamic range, and loss of fine details that render feature extraction unreliable, as seen in low-light object detection where inadequate illumination challenges the detection of small or dark objects [<xref ref-type="bibr" rid="ref-4">4</xref>&#x2013;<xref ref-type="bibr" rid="ref-6">6</xref>]. The core challenge arises when sand-dust degradation and low-light conditions coexist. Existing methods either focus on dust removal under natural light or low-light enhancement without addressing dust contamination. These methods, however, are ill-equipped to disentangle the synergistic deteriorations caused by dust-induced light scattering and low-light photon starvation, as the two phenomena exacerbate each other in complex ways. Addressing this gap is critical for real-world applications, which include autonomous driving in desert sandstorms and remote sensing in arid regions. In these regions, both dust and low light frequently compromise camera visibility. Developing a method to jointly mitigate dust contamination and enhance low-light visibility while preserving fine details is essential.</p>
<p>To tackle these challenges, various neural network architectures have been developed. Convolutional Neural Networks (CNNs) exploit local spatial correlations via convolutional operations, performing well in many visual tasks. However, their fixed receptive fields limit the capture of long-range dependencies. Multi-Layer Perceptrons (MLPs), with fully connected structures, model complex nonlinear relationships but suffer from high computational costs and overfitting. Generative Adversarial Networks (GANs), comprising generators and discriminators, generate realistic images but face unstable training. Transformers leverage self-attention mechanisms to model global dependencies, enabling exceptional performance in long-range contextual reasoning. However, their effectiveness is often constrained by significant data dependency, necessitating substantial training datasets for optimal results.</p>
<p>Recently, Kolmogorov-Arnold Networks (KANs) have emerged as a promising alternative to traditional MLPs. Rooted in the Kolmogorov-Arnold superposition theorems, KANs offer distinct advantages in accuracy and interpretability. Unlike MLPs, KANs employ a more compact architecture to represent complex functions, thereby reducing computational complexity. Their design enables superior feature representation, enhancing performance in visual tasks. Moreover, KANs&#x2019; interpretability allows researchers to decipher how the network processes information.</p>
<p>However, within the domain of low-level vision, the application of KANs remains relatively unexplored, and their potential benefits are yet to be fully harnessed. In the realm of low-level vision, complex tasks that encompass substantial uncertainty, such as simultaneous low-light image enhancement and single-image sand-dust removal, have received scant attention in research. These tasks pose unique challenges due to the inherent variability and unpredictability of the data. Specifically, traditional methods often fail to adequately address these complexities. To address the aforementioned challenges, this paper introduces an uncertainty-aware KAN framework, designated as UAKAN. Specifically, we propose Downsample Tokenized KAN (DTKAN) and Upsample Tokenized KAN (UTKAN) that utilize KAN for the downsampling and upsampling process in U-Net. Unlike the interpolate operation in traditional U-Net, our DTKAN and UTKAN offer enhanced flexibility and interoperability. Concurrently, to tackle aleatoric uncertainty in joint low-light image enhancement and sand-dust removal, we introduce an uncertainty coupling certainty module. This mechanism leverages distribution modeling that is used to capture the uncertainty and a residual learning branch for certainty. By coupling these two branches, our module effectively utilizes uncertainty and certainty across data dimensions. Furthermore, to address the epistemic uncertainty associated with model dimensions, we propose an uncertainty-aware distributional spatial modulator. This mechanism selectively filters features through uncertainty modeling, identifying and discarding redundant features with high uncertainty to refine the model&#x2019;s feature selection.</p>
<p>In summary, the primary contributions of this paper can be categorized into four main aspects:
<list list-type="bullet">
<list-item>
<p>To the best of our knowledge, we are the first to explore the task of single-image Joint Low-Light Image Enhancement and Sand-dust Removal (JLLIESR), we are also the first to utilize uncertainty estimation and KAN for this task.</p></list-item>
<list-item>
<p>This is the first work to utilize tokenized KAN in the low-level vision task and we propose Downsample Tokenized KAN (DTKAN) and Upsample Tokenized KAN (UTKAN) implanted in a U-Net.</p></list-item>
<list-item>
<p>We propose an Uncertainty Coupling Certainty Module (UCCM) to handle the data uncertainty of the JLLIESR task, and we propose an Uncertainty-aware Distributional Spatial Modulator (UDSM) to model the uncertainty in the epistemic dimension for our model.</p></list-item>
<list-item>
<p>We propose a framework called Uncertainty-Aware KAN (UAKAN) for accurate single-image joint low-light image enhancement and sand removal. Through comprehensive experiments conducted on our proposed SLLIE6K dataset, we demonstrate that our proposed method outperforms current state-of-the-art techniques.</p></list-item>
</list></p>
</sec>
<sec id="s2">
<label>2</label>
<title>Related Works</title>
<sec id="s2_1">
<label>2.1</label>
<title>Sand Images Formulation</title>
<p>The widely utilized physical model for explaining the formation of an image affected by light transmission hazed [<xref ref-type="bibr" rid="ref-7">7</xref>] is typically defined as follows:
<disp-formula id="eqn-1"><label>(1)</label><mml:math id="mml-eqn-1" display="block"><mml:mi>I</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mi>J</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:mi>A</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x2212;</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula>where <inline-formula id="ieqn-1"><mml:math id="mml-ieqn-1"><mml:mi>I</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> represents the observed hazy image, <inline-formula id="ieqn-2"><mml:math id="mml-ieqn-2"><mml:mi>t</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> signifies the medium transmission map, <italic>A</italic> denotes the global atmosphere light, and <inline-formula id="ieqn-3"><mml:math id="mml-ieqn-3"><mml:mi>J</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> corresponds to the haze-free image. Further, <inline-formula id="ieqn-4"><mml:math id="mml-ieqn-4"><mml:mi>t</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> can be expressed as: <inline-formula id="ieqn-5"><mml:math id="mml-ieqn-5"><mml:mi>t</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mi>&#x03B2;</mml:mi><mml:mi>d</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:msup></mml:math></inline-formula>, where <inline-formula id="ieqn-6"><mml:math id="mml-ieqn-6"><mml:mi>&#x03B2;</mml:mi></mml:math></inline-formula> represents the scattering coefficient of the atmosphere, and <inline-formula id="ieqn-7"><mml:math id="mml-ieqn-7"><mml:mi>d</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> signifies the scene depth between the digital camera and the captured object for each pixel <inline-formula id="ieqn-8"><mml:math id="mml-ieqn-8"><mml:mi>x</mml:mi></mml:math></inline-formula> in the image. During periods of dusty conditions, the varying degradation rates of red, green, and blue color components result in characteristic attributes like displacement, density, and temporal variance in the affected images. It can be formulated as:
<disp-formula id="eqn-2"><label>(2)</label><mml:math id="mml-eqn-2" display="block"><mml:mrow><mml:mover><mml:mi>A</mml:mi><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mo>=&#x003C;</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>R</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>G</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>B</mml:mi></mml:mrow></mml:msub><mml:mo>&gt;</mml:mo></mml:math></disp-formula>where <inline-formula id="ieqn-9"><mml:math id="mml-ieqn-9"><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>G</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>k</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>R</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, <inline-formula id="ieqn-10"><mml:math id="mml-ieqn-10"><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>B</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>k</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:msub><mml:mi>A</mml:mi><mml:mrow><mml:mi>R</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, <inline-formula id="ieqn-11"><mml:math id="mml-ieqn-11"><mml:mrow><mml:mover><mml:mi>A</mml:mi><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow></mml:math></inline-formula> is the global color deviation value of the sand dust image, <inline-formula id="ieqn-12"><mml:math id="mml-ieqn-12"><mml:mi>b</mml:mi></mml:math></inline-formula> is the disturbance amount, and <inline-formula id="ieqn-13"><mml:math id="mml-ieqn-13"><mml:mi>k</mml:mi></mml:math></inline-formula> is the spatial distribution coefficient of the atmospheric light value of the three basic color spectrums. Based on the above formula, various degrees and types of sandy images can be synthesized. Then, with the help of these sand-dust images, we can get low-light degraded images. Specifically, for the generation of synthetic low-light images, we initially apply gamma correction to attenuate the luminance of the images. This process can be mathematically encapsulated by the following formula:
<disp-formula id="eqn-3"><label>(3)</label><mml:math id="mml-eqn-3" display="block"><mml:mi>I</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mrow><mml:mi>l</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>&#x03B2;</mml:mi><mml:mo>&#x00D7;</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x03B1;</mml:mi><mml:mo>&#x00D7;</mml:mo><mml:mi>I</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:msup><mml:mo stretchy="false">)</mml:mo><mml:mrow><mml:mi>&#x03B3;</mml:mi></mml:mrow></mml:msup></mml:math></disp-formula>where <inline-formula id="ieqn-14"><mml:math id="mml-ieqn-14"><mml:mi>I</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mrow><mml:mi>l</mml:mi><mml:mi>l</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the low-light sand-dust image, <inline-formula id="ieqn-15"><mml:math id="mml-ieqn-15"><mml:mi>&#x03B1;</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x03B2;</mml:mi></mml:math></inline-formula> and <inline-formula id="ieqn-16"><mml:math id="mml-ieqn-16"><mml:mi>&#x03B3;</mml:mi></mml:math></inline-formula> are the parameters of gamma correction. We randomly select these parameters to generate different levels of illumination for low-light images. <inline-formula id="ieqn-17"><mml:math id="mml-ieqn-17"><mml:mi>&#x03B1;</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x03B2;</mml:mi></mml:math></inline-formula> and <inline-formula id="ieqn-18"><mml:math id="mml-ieqn-18"><mml:mi>&#x03B3;</mml:mi></mml:math></inline-formula> are defined as <inline-formula id="ieqn-19"><mml:math id="mml-ieqn-19"><mml:mi>U</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mn>0.65</mml:mn><mml:mo>,</mml:mo><mml:mn>0.7</mml:mn><mml:mo stretchy="false">)</mml:mo><mml:mo>,</mml:mo><mml:mi>U</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mn>0.65</mml:mn><mml:mo>,</mml:mo><mml:mn>0.7</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> and <inline-formula id="ieqn-20"><mml:math id="mml-ieqn-20"><mml:mi>U</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mn>1.5</mml:mn><mml:mo>,</mml:mo><mml:mn>1.6</mml:mn><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>, respectively. The visual examples of synthetic low-light sand-dust images are shown in <xref ref-type="fig" rid="fig-1">Fig. 1</xref>. Furthermore, visual examples of real-world and synthetic dust images and their pixel distribution histograms are shown in <xref ref-type="fig" rid="fig-2">Fig. 2</xref>. The top two rows represent real sand dust images and the corresponding RGB (Red, Green, Blue) distribution histograms; The bottom two rows are the synthetic degraded images and the corresponding histograms. In <xref ref-type="fig" rid="fig-2">Fig. 2</xref>, we find the same pattern as in [<xref ref-type="bibr" rid="ref-8">8</xref>], that is, the distribution of sand dust images has obvious prior characteristics for shifting, concentration, and sequential. It can be seen that the distribution of the synthetic image satisfies all the properties of the real sandstorm image, proving the correctness and superiority of our method for synthesizing dust image data.</p>
<fig id="fig-1">
<label>Figure 1</label>
<caption>
<title>Visual examples of the used synthetic low-light sand-dust images in this paper</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_65812-fig-1.tif"/>
</fig><fig id="fig-2">
<label>Figure 2</label>
<caption>
<title>Pixel distribution comparison examples of real-world and synthetic sand dust images</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_65812-fig-2a.tif"/>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_65812-fig-2b.tif"/>
</fig>
</sec>
<sec id="s2_2">
<label>2.2</label>
<title>Image Recovery from Degraded Weather</title>
<p>Li et al. [<xref ref-type="bibr" rid="ref-9">9</xref>] introduced AOD-Net, a pioneering end-to-end CNN model for image dehazing, highlighting its potential for enhancing computer vision tasks in hazy conditions. Chen et al. [<xref ref-type="bibr" rid="ref-10">10</xref>] present NAFNet, a pioneering approach that simplifies image restoration by eliminating nonlinear activations, achieving comparable state-of-the-art results with reduced computational complexity. Cui et al. [<xref ref-type="bibr" rid="ref-11">11</xref>] introduced OKNet, an efficient convolutional network for image restoration that leverages an omni-kernel module to capture multi-scale receptive fields, demonstrating state-of-the-art performance across various restoration tasks including dehazing, desnowing, and defocus deblurring. Gao et al. [<xref ref-type="bibr" rid="ref-12">12</xref>] propose a single-stage design, based on a simple U-Net architecture, with a mountain-shaped structure. The DSANet proposed by Cui and Knoll [<xref ref-type="bibr" rid="ref-13">13</xref>] enhances representation learning through spatial and frequency strip attention units, offering a significant advancement in image restoration efficiency and effectiveness. Gao et al. [<xref ref-type="bibr" rid="ref-14">14</xref>] introduced ECFNet, an efficient image restoration framework that combines spatial and frequency attention mechanisms with multi-scale blocks to adaptively handle varying degradation levels across image regions, achieving superior performance on multiple benchmark datasets. Cui et al. [<xref ref-type="bibr" rid="ref-15">15</xref>] introduced FSNet, an image restoration framework that employs frequency selection mechanisms to dynamically decompose and emphasize frequency components for effective image recovery.</p>
</sec>
<sec id="s2_3">
<label>2.3</label>
<title>Uncertainty-Aware Probabilistic Modeling</title>
<p>Data uncertainty and model uncertainty in deep learning are key factors influencing performance, and probabilistic modeling approaches to address data and model uncertainty are gaining growing attention. Bayesian SegNet [<xref ref-type="bibr" rid="ref-16">16</xref>] leverages Monte Carlo Dropout [<xref ref-type="bibr" rid="ref-17">17</xref>] at test time to approximate posterior distributions, offering a measure of uncertainty crucial for decision-making in applications such as autonomous driving and robotic interaction. Wang et al. [<xref ref-type="bibr" rid="ref-18">18</xref>] introduce a data-uncertainty guided multi-phase learning approach for semi-supervised object detection that effectively leverages unlabeled data across varying difficulty levels. Zhou et al. [<xref ref-type="bibr" rid="ref-19">19</xref>] introduced an Uncertainty-Aware Edge Detector (UAED) that leverages the ambiguity in multiple annotations to enhance edge detection performance, representing a significant advancement in the field. Yang et al. [<xref ref-type="bibr" rid="ref-20">20</xref>] present UGTR, a pioneering approach that integrates Bayesian learning with Transformer-based [<xref ref-type="bibr" rid="ref-21">21</xref>] reasoning to enhance camouflaged object detection. Zhang et al. [<xref ref-type="bibr" rid="ref-22">22</xref>] introduced GLENet, a generative framework for modeling label uncertainty in 3D object detection using conditional variational autoencoders, which improves detection accuracy by capturing the diversity of potential bounding boxes for objects. Tang et al. [<xref ref-type="bibr" rid="ref-23">23</xref>] proposed UA-Track, an uncertainty-aware framework for 3D multi-object tracking that introduces probabilistic attention, query denoising, and uncertainty-reduced query initialization to address tracking challenges in complex scenarios. Dong et al. [<xref ref-type="bibr" rid="ref-24">24</xref>] proposed UAC, a semi-supervised learning method that combines multi-perturbation strategies and uncertainty estimation to improve medical image segmentation performance. Shao et al. [<xref ref-type="bibr" rid="ref-25">25</xref>] proposed UA-Fusion, an uncertainty-aware multimodal data fusion framework for 3-D object detection in autonomous vehicles. This framework effectively addresses challenges related to uncertainty in dynamic traffic environments through probabilistic cross-modal attention and query denoising strategies.</p>
</sec>
</sec>
<sec id="s3">
<label>3</label>
<title>Proposed Method</title>
<sec id="s3_1">
<label>3.1</label>
<title>Overall Pipeline</title>
<p>We propose UAKAN for the removal of sand-dust in a single-image under low-light conditions, as shown in <xref ref-type="fig" rid="fig-3">Fig. 3</xref>. To expand the application of Kolmogorov-Arnold Networks (KANs) in low-level vision, we propose novel Tokenized KAN modules, namely the Downsampling Tokenized KAN (DTKAN) and Upsampling Tokenized KAN (UTKAN) modules. These modules leverage Patch Embedding and Patch Unembedding techniques for tokenization, followed by the application of KAN layers for further embedding, thereby enhancing the representational capabilities of the network. Furthermore, to tackle the aleatoric uncertainty inherent in the data dimensions of the concurrent task of single-image low-light enhancement and sand-dust removal, we introduce a module that couples uncertainty awareness with deterministic representation termed the Uncertainty Coupling Certainty Module (UCCM). The UCCM achieves the integration of uncertainty and determinism by feature coupling between a distribution modeling representation module and a residual learning module, thereby characterizing the fusion of uncertainty and determinism. Finally, addressing uncertainties of the epistemic dimension, we introduce the Uncertainty-aware Distributional Spatial Modulator (UDSM) and integrate it into the backbone of our framework, which we term the Uncertainty-aware Feature Modeling Block (UFMBlock). The incorporation of UFMBlock equips our framework with the capability to model comprehensive uncertainty, subsequently augmenting its ability to encapsulate the intrinsic uncertainties associated with the dual tasks of low-light image enhancement and image sand-dust removal. Our network adopts the same MIMOUNet [<xref ref-type="bibr" rid="ref-26">26</xref>] architecture, which is a multi-input multi-output U-shaped encoder-decoder structure. The total structure of our proposed UAKAN is shown in <xref ref-type="fig" rid="fig-3">Fig. 3</xref>.</p>
<fig id="fig-3">
<label>Figure 3</label>
<caption>
<title>Overall structure of the proposed method. Our UAKAN employs the classic MIMOUNet [<xref ref-type="bibr" rid="ref-26">26</xref>] as its basic framework. MIMOUNet is a multi-input multi-output UNet architecture that is commonly seen in recent image restoration frameworks. The encoder accepts input images at three scales, and the decoder outputs images at three scales. We use UFMBlocks as the backbone network. In the encoder stage, we use DTKAN for feature downsampling, and in the decoder stage, we use UTKAN for feature upsampling</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_65812-fig-3.tif"/>
</fig>
</sec>
<sec id="s3_2">
<label>3.2</label>
<title>Tokenized KAN for Sampling</title>
<p>Analogous to a Multilayer Perceptron (MLP), a KAN with K layers can be described as a composition of successive KAN layers, this process can be formulated as following:
<disp-formula id="eqn-4"><label>(4)</label><mml:math id="mml-eqn-4" display="block"><mml:mi>K</mml:mi><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>I</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi mathvariant="normal">&#x03A6;</mml:mi><mml:mrow><mml:mi>N</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x2299;</mml:mo><mml:msub><mml:mi mathvariant="normal">&#x03A6;</mml:mi><mml:mrow><mml:mi>N</mml:mi><mml:mo>&#x2212;</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>&#x2299;</mml:mo><mml:mo>&#x22EF;</mml:mo><mml:mo>&#x2299;</mml:mo><mml:msub><mml:mi mathvariant="normal">&#x03A6;</mml:mi><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x2299;</mml:mo><mml:msub><mml:mi mathvariant="normal">&#x03A6;</mml:mi><mml:mrow><mml:mn>0</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula>where <italic>I</italic> is the input feature; <inline-formula id="ieqn-21"><mml:math id="mml-ieqn-21"><mml:msub><mml:mi mathvariant="normal">&#x03A6;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> denote the <inline-formula id="ieqn-22"><mml:math id="mml-ieqn-22"><mml:mi>i</mml:mi></mml:math></inline-formula>-th layer of the KAN framework. Each KAN-Layer, with <inline-formula id="ieqn-23"><mml:math id="mml-ieqn-23"><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>-dimensional input and <inline-formula id="ieqn-24"><mml:math id="mml-ieqn-24"><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mi>o</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>-dimensional output, <inline-formula id="ieqn-25"><mml:math id="mml-ieqn-25"><mml:mi mathvariant="normal">&#x03A6;</mml:mi></mml:math></inline-formula> compries <inline-formula id="ieqn-26"><mml:math id="mml-ieqn-26"><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>&#x00D7;</mml:mo><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mi>o</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> learnable activation functions <inline-formula id="ieqn-27"><mml:math id="mml-ieqn-27"><mml:mi>&#x03D5;</mml:mi></mml:math></inline-formula>:
<disp-formula id="eqn-5"><label>(5)</label><mml:math id="mml-eqn-5" display="block"><mml:mi mathvariant="normal">&#x03A6;</mml:mi><mml:mo>=</mml:mo><mml:mo fence="false" stretchy="false">{</mml:mo><mml:msub><mml:mi>&#x03D5;</mml:mi><mml:mrow><mml:mi>q</mml:mi><mml:mo>,</mml:mo><mml:mi>p</mml:mi></mml:mrow></mml:msub><mml:mo fence="false" stretchy="false">}</mml:mo><mml:mo>,</mml:mo><mml:mi>p</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x22EF;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mi>q</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x22EF;</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mi>o</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:math></disp-formula>where <inline-formula id="ieqn-28"><mml:math id="mml-ieqn-28"><mml:msub><mml:mi>&#x03D5;</mml:mi><mml:mrow><mml:mi>q</mml:mi><mml:mo>,</mml:mo><mml:mi>p</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> is the parameter that can be learned, and the computing process of the <inline-formula id="ieqn-29"><mml:math id="mml-ieqn-29"><mml:mi>i</mml:mi></mml:math></inline-formula>-layer to the <inline-formula id="ieqn-30"><mml:math id="mml-ieqn-30"><mml:mi>i</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula>-layer can be formulated in matrix from <inline-formula id="ieqn-31"><mml:math id="mml-ieqn-31"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="normal">&#x03A6;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>, where:
<disp-formula id="eqn-6"><label>(6)</label><mml:math id="mml-eqn-6" display="block"><mml:msub><mml:mi mathvariant="normal">&#x03A6;</mml:mi><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mtable columnalign="center center center center" rowspacing="4pt" columnspacing="1em"><mml:mtr><mml:mtd><mml:msub><mml:mi>&#x03D5;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x22C5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mi>&#x03D5;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x22C5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:mtd><mml:mtd><mml:mo>&#x2026;</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mi>&#x03D5;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x22C5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:msub><mml:mi>&#x03D5;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x22C5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mi>&#x03D5;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x22C5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:mtd><mml:mtd><mml:mo>&#x2026;</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mi>&#x03D5;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x22C5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mo>&#x22EE;</mml:mo></mml:mtd><mml:mtd><mml:mo>&#x22EE;</mml:mo></mml:mtd><mml:mtd /><mml:mtd><mml:mo>&#x22EE;</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:msub><mml:mi>&#x03D5;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x22C5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mi>&#x03D5;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x22C5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:mtd><mml:mtd><mml:mo>&#x2026;</mml:mo></mml:mtd><mml:mtd><mml:msub><mml:mi>&#x03D5;</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>,</mml:mo><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mi>k</mml:mi><mml:mo>+</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>n</mml:mi><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mo>&#x22C5;</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:mtd></mml:mtr></mml:mtable><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>by employing edge-learned activation functions and parameterized activation functions in place of weights, KAN distinguishes itself from traditional MLP, thereby eliminating the need for linear weight matrices. The architectural design bolsters the interpretability of KAN while maintaining high performance, rendering them apt for a broad spectrum of applications. The detailed framework of KAN layers is shown in <xref ref-type="fig" rid="fig-3">Fig. 3d</xref>.</p>
<p>To expand the application of KAN in low-level vision, we introduce a novel approach that integrates KANs with the sampling layers within the U-Net architecture, specifically the Down-sampling Tokenized KAN (DTKAN) and Up-sampling Tokenized KAN (UTKAN) modules. These modules utilize Patch Embedding (PE) and Patch Unembedding (PU) techniques for tokenization, followed by the application of KAN layers for advanced embedding processes, thereby enhancing the network&#x2019;s capability to handle complex visual tasks. The detailed structure of DTKAN and UTKAN is shown in <xref ref-type="fig" rid="fig-3">Fig. 3b</xref>,<xref ref-type="fig" rid="fig-3">c</xref>. For DTKAN, this process can be formulated as:
<disp-formula id="eqn-7"><label>(7)</label><mml:math id="mml-eqn-7" display="block"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>o</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>K</mml:mi><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>P</mml:mi><mml:mi>E</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula>meanwhile, for UTKAN, it can be expressed as:
<disp-formula id="eqn-8"><label>(8)</label><mml:math id="mml-eqn-8" display="block"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>o</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>K</mml:mi><mml:mi>A</mml:mi><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>P</mml:mi><mml:mi>U</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula>where <inline-formula id="ieqn-32"><mml:math id="mml-ieqn-32"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-33"><mml:math id="mml-ieqn-33"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>o</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> are the input and output features, respectively.</p>
</sec>
<sec id="s3_3">
<label>3.3</label>
<title>Uncertainty Coupling Certainty Module</title>
<p>To address the aleatoric uncertainty inherent in the data dimensions of the intricate visual task encompassing joint low-light image enhancement and sand-dust removal, we introduce the Uncertainty Coupling Certainty Module (UCCM). This module is designed to manage the stochastic elements present in these challenging image-processing tasks, thereby enhancing the robustness and reliability of the visual system. The detailed structure of our UCCM is shown in <xref ref-type="fig" rid="fig-4">Fig. 4</xref>. In the UCCM, we employ a coupling mechanism between an uncertainty modeling branch and a deterministic representation branch to estimate the stochastic uncertainty present in the data. For the deterministic branch, we utilize a residual block from the ResNet [<xref ref-type="bibr" rid="ref-27">27</xref>] architecture for certain representations. As a quintessential component within convolutional neural networks, residual blocks possess formidable learning capabilities. The detailed process of the residual block can be articulated through the following formula:<disp-formula id="eqn-9"><label>(9)</label><mml:math id="mml-eqn-9" display="block"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>c</mml:mi><mml:mi>e</mml:mi><mml:mi>r</mml:mi><mml:mi>t</mml:mi><mml:mi>a</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi><mml:mi>t</mml:mi><mml:mi>y</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mi>v</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>R</mml:mi><mml:mi>e</mml:mi><mml:mi>L</mml:mi><mml:mi>U</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mi>v</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula>meanwhile, for the uncertainty branch, we utilize a distribution modeling module for learning the uncertainty inside data-wise, it can be expressed as:
<disp-formula id="eqn-10"><label>(10)</label><mml:math id="mml-eqn-10" display="block"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>u</mml:mi><mml:mi>n</mml:mi><mml:mi>c</mml:mi><mml:mi>e</mml:mi><mml:mi>r</mml:mi><mml:mi>t</mml:mi><mml:mi>a</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi><mml:mi>t</mml:mi><mml:mi>y</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>S</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi>&#x03BC;</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:mi>&#x03F5;</mml:mi><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi>&#x03C3;</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula>where <italic>S</italic> is the sampling operator from the Gaussian distribution by the parameterization trick [<xref ref-type="bibr" rid="ref-19">19</xref>], <inline-formula id="ieqn-34"><mml:math id="mml-ieqn-34"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>c</mml:mi><mml:mi>e</mml:mi><mml:mi>r</mml:mi><mml:mi>t</mml:mi><mml:mi>a</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi><mml:mi>t</mml:mi><mml:mi>y</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-35"><mml:math id="mml-ieqn-35"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>u</mml:mi><mml:mi>n</mml:mi><mml:mi>c</mml:mi><mml:mi>e</mml:mi><mml:mi>r</mml:mi><mml:mi>t</mml:mi><mml:mi>a</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi><mml:mi>t</mml:mi><mml:mi>y</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> are the output features of the certainty branch and the uncertainty branch, respectively, and <inline-formula id="ieqn-36"><mml:math id="mml-ieqn-36"><mml:mi>&#x03F5;</mml:mi><mml:mrow><mml:mo>&#x223C;</mml:mo></mml:mrow><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mi>I</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. The pseudo-code of the uncertainty-aware sampling processing <italic>S</italic> is shown in Algorithm 1. We utilize a learnable distribution in the feature space to introduce uncertainty modeling into the sand-dust removal task. We use two independent convolutional layers <inline-formula id="ieqn-37"><mml:math id="mml-ieqn-37"><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi>&#x03BC;</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> and <inline-formula id="ieqn-38"><mml:math id="mml-ieqn-38"><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi>&#x03C3;</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> to achieve the feature distribution, and we model the distribution of <inline-formula id="ieqn-39"><mml:math id="mml-ieqn-39"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> at each pixel as Gaussian following [<xref ref-type="bibr" rid="ref-20">20</xref>]. The results are denoted as the mean <inline-formula id="ieqn-40"><mml:math id="mml-ieqn-40"><mml:mi>&#x03BC;</mml:mi></mml:math></inline-formula> and the variance <inline-formula id="ieqn-41"><mml:math id="mml-ieqn-41"><mml:mi>&#x03C3;</mml:mi></mml:math></inline-formula> of these two layers. Unlike other ordinary convolutional layers, these two convolutional layers learn a distribution <inline-formula id="ieqn-42"><mml:math id="mml-ieqn-42"><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x03BC;</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x03C3;</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula> parameterized by mean <inline-formula id="ieqn-43"><mml:math id="mml-ieqn-43"><mml:mi>&#x03BC;</mml:mi></mml:math></inline-formula> and variance <inline-formula id="ieqn-44"><mml:math id="mml-ieqn-44"><mml:mi>&#x03C3;</mml:mi></mml:math></inline-formula>. Meanwhile, the representation of each sample is not a deterministic point embedding but a stochastic embedding sampled from <inline-formula id="ieqn-45"><mml:math id="mml-ieqn-45"><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>&#x03BC;</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x03C3;</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>. Finally, we employ a convolutional layer to reduce the dimensionality of the concatenated features from these two sections, a process we refer to as &#x201C;coupling&#x201D;. This coupling process of our proposed UCCM can be shown as:
<disp-formula id="eqn-11"><label>(11)</label><mml:math id="mml-eqn-11" display="block"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>o</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mi>v</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mo stretchy="false">[</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>c</mml:mi><mml:mi>e</mml:mi><mml:mi>r</mml:mi><mml:mi>t</mml:mi><mml:mi>a</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi><mml:mi>t</mml:mi><mml:mi>y</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>u</mml:mi><mml:mi>n</mml:mi><mml:mi>c</mml:mi><mml:mi>e</mml:mi><mml:mi>r</mml:mi><mml:mi>t</mml:mi><mml:mi>a</mml:mi><mml:mi>i</mml:mi><mml:mi>n</mml:mi><mml:mi>t</mml:mi><mml:mi>y</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">]</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula></p>
<fig id="fig-4">
<label>Figure 4</label>
<caption>
<title>Detailed process of the uncertainty coupling certainty module (UCCM), uncertainty-aware distributional spatial modulator (UDSM), and uncertainty-aware feature modeling block (UFMBlock) in our UAKAN</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_65812-fig-4.tif"/>
</fig>
<fig id="fig-6">
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_65812-fig-6.tif"/>
</fig>
</sec>
<sec id="s3_4">
<label>3.4</label>
<title>Uncertainty-Aware Distributional Spatial Modulator</title>
<p>In order to make a proper response to the epistemic uncertainty in neural networks, that is, the model uncertainty, we embed an uncertainty-aware feature selective distribution learning module. This module is called the Uncertainty-aware Distributional Spatial Module (UDSM). We incorporate a distribution representation learning module, enabling our model to learn both a mean feature and a standard deviation feature. These two features are then combined to form a distribution from which we sample to obtain a single-channel selective map. This selective feature map is subsequently used to refine the expression of the input features selectively, yielding the final feature map for low-light enhancement and simultaneous sand-dust removal. The detailed structure of our proposed UDSM is shown in <xref ref-type="fig" rid="fig-4">Fig. 4</xref>, and the whole process of UDSM can be described in <xref ref-type="disp-formula" rid="eqn-12">Eq. (12)</xref>:
<disp-formula id="eqn-12"><label>(12)</label><mml:math id="mml-eqn-12" display="block"><mml:mi>U</mml:mi><mml:mi>D</mml:mi><mml:mi>S</mml:mi><mml:mi>M</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>=</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>&#x00D7;</mml:mo><mml:mi>S</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi>&#x03BC;</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:mi>&#x03F5;</mml:mi><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mi>&#x03C3;</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>x</mml:mi><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo></mml:math></disp-formula></p>
<p>Furthermore, we embed UDSM in the backbone of our UAKAN and call the backbone Uncertainty-aware Feature Modeling Block (UFMBlock). Inspired by the architecture of Transformers [<xref ref-type="bibr" rid="ref-21">21</xref>], our backbone structure can be divided into two analogous stages, each initiated with a layer normalization. Drawing from the design of residual blocks [<xref ref-type="bibr" rid="ref-27">27</xref>], we incorporate two convolutional layers within each stage, with an activation function interposed between them. Influenced by the NAFBlock [<xref ref-type="bibr" rid="ref-10">10</xref>], we introduce channel attention [<xref ref-type="bibr" rid="ref-28">28</xref>] and depth-wise convolution in the first stage to enhance the network&#x2019;s fitting and learning capabilities. Consequently, we integrate the UDSM into the second stage of our backbone to balance the computational load between the two stages. The specific process of our backbone can be delineated by the following formula:<disp-formula id="eqn-13"><label>(13)</label><mml:math id="mml-eqn-13" display="block"><mml:mtable columnalign="right left right left right left right left right left right left" rowspacing="3pt" columnspacing="0em 2em 0em 2em 0em 2em 0em 2em 0em 2em 0em" displaystyle="true"><mml:mtr><mml:mtd /><mml:mtd><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mi>i</mml:mi><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mo>&#x00D7;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>C</mml:mi><mml:mi>A</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>S</mml:mi><mml:mi>G</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>D</mml:mi><mml:mi>W</mml:mi><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mi>v</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mo>&#x00D7;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>L</mml:mi><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mtd></mml:mtr><mml:mtr><mml:mtd /><mml:mtd><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>o</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mo>&#x00D7;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>U</mml:mi><mml:mi>D</mml:mi><mml:mi>S</mml:mi><mml:mi>M</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mi>S</mml:mi><mml:mi>G</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mo>&#x00D7;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo stretchy="false">(</mml:mo><mml:mi>L</mml:mi><mml:mi>N</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mi>i</mml:mi><mml:mi>d</mml:mi></mml:mrow></mml:msub><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo><mml:mo stretchy="false">)</mml:mo><mml:mo>+</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mi>i</mml:mi><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>where <inline-formula id="ieqn-60"><mml:math id="mml-ieqn-60"><mml:mi>C</mml:mi><mml:mi>A</mml:mi><mml:mo>,</mml:mo><mml:mi>S</mml:mi><mml:mi>G</mml:mi><mml:mo>,</mml:mo><mml:mi>D</mml:mi><mml:mi>W</mml:mi><mml:mi>C</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mi>v</mml:mi></mml:math></inline-formula>, <inline-formula id="ieqn-61"><mml:math id="mml-ieqn-61"><mml:msub><mml:mi>f</mml:mi><mml:mrow><mml:mn>1</mml:mn><mml:mo>&#x00D7;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub></mml:math></inline-formula>, <italic>LN</italic>, and <inline-formula id="ieqn-62"><mml:math id="mml-ieqn-62"><mml:msub><mml:mi>I</mml:mi><mml:mrow><mml:mi>m</mml:mi><mml:mi>i</mml:mi><mml:mi>d</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> denote channel attention [<xref ref-type="bibr" rid="ref-28">28</xref>], simple gate activation function, depth-wise convolution, convolutional layer with kernel 1, layer normalization [<xref ref-type="bibr" rid="ref-29">29</xref>], and the middle features, respectively.</p>
</sec>
</sec>
<sec id="s4">
<label>4</label>
<title>Experiments</title>
<sec id="s4_1">
<label>4.1</label>
<title>Implementation Details</title>
<p>All the experiments are performed on the Ubuntu 20.04 operating system with a NVIDIA RTX4090 GPU (24 GB) and an Intel i7-13700k CPU processor. The network training and testing processes are implemented by using Python 3.10 language environment and encompass torch 2.3.1&#x002B;cu118 in conjunction with Anaconda. We train all the models for 200 epochs with the AdamW optimizer. The initial learning rate is set to <inline-formula id="ieqn-63"><mml:math id="mml-ieqn-63"><mml:mn>1</mml:mn><mml:mo>&#x00D7;</mml:mo><mml:msup><mml:mn>10</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>4</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula>, and we employ CosineAnnealingLR to adjust the learning rate. For data augmentation, we apply horizontal flipping and randomly rotate the image to 0, 90, 180, and 270 degrees. The batch size is set as 4 with a patch size of 128 <inline-formula id="ieqn-64"><mml:math id="mml-ieqn-64"><mml:mo>&#x00D7;</mml:mo></mml:math></inline-formula> 128. Flops are computed on a patch size of 128 <inline-formula id="ieqn-65"><mml:math id="mml-ieqn-65"><mml:mo>&#x00D7;</mml:mo></mml:math></inline-formula> 128.</p>
<p>For synthetic images used to train neural networks, we use a dataset named SLLIE6K that contains 6000 images for training and validation, and 2397 images for testing. We chose the Pascal VOC 2012 [<xref ref-type="bibr" rid="ref-30">30</xref>] dataset as clear images. For the training set, we picked 799 sharp images from the Pascal VOC 2012 dataset. Then, according to different Atmospheric Light, we synthesized three types of degraded images: dust, sand and sandstorm, and each type of image contained five degraded images with different degrees according to the Scattering Coefficient. As a result, 11,985 degraded images were obtained. Finally, in order to reduce the training data set and remove some training data with similar training effects, we selected 6000 images from all degraded images as training data. Then, for the test set, we also selected 799 clear images from the Pascal VOC 2012 dataset, and then, according to different Atmospheric Light, we synthesized three types of degraded images: Dust, Sand and Sandstorm. Therefore, the test data totals 2397 image pairs.</p>
<p>We use the Char loss [<xref ref-type="bibr" rid="ref-31">31</xref>] as the loss function for training all the neural networks. The Char loss [<xref ref-type="bibr" rid="ref-31">31</xref>] can be formulated as:<disp-formula id="eqn-14"><label>(14)</label><mml:math id="mml-eqn-14" display="block"><mml:mi>&#x2113;</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mrow><mml:mtext mathvariant="bold">I</mml:mtext></mml:mrow><mml:mrow><mml:mi mathvariant="normal">&#x2032;</mml:mi></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:mrow><mml:mover><mml:mrow><mml:mtext mathvariant="bold">I</mml:mtext></mml:mrow><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msqrt><mml:msup><mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo><mml:msup><mml:mrow><mml:mtext mathvariant="bold">I</mml:mtext></mml:mrow><mml:mrow><mml:mi mathvariant="normal">&#x2032;</mml:mi></mml:mrow></mml:msup><mml:mo>&#x2212;</mml:mo><mml:mrow><mml:mover><mml:mrow><mml:mtext mathvariant="bold">I</mml:mtext></mml:mrow><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow><mml:mo symmetric="true">&#x2016;</mml:mo></mml:mrow><mml:mn>2</mml:mn></mml:msup><mml:mo>+</mml:mo><mml:msup><mml:mi>&#x03F5;</mml:mi><mml:mn>2</mml:mn></mml:msup></mml:msqrt><mml:mo>,</mml:mo></mml:math></disp-formula>where <inline-formula id="ieqn-66"><mml:math id="mml-ieqn-66"><mml:msup><mml:mrow><mml:mtext mathvariant="bold">I</mml:mtext></mml:mrow><mml:mrow><mml:mi mathvariant="normal">&#x2032;</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula> is the restored image, <inline-formula id="ieqn-67"><mml:math id="mml-ieqn-67"><mml:mrow><mml:mover><mml:mrow><mml:mtext mathvariant="bold">I</mml:mtext></mml:mrow><mml:mo stretchy="false">&#x005E;</mml:mo></mml:mover></mml:mrow></mml:math></inline-formula> is the ground-truth image, and <inline-formula id="ieqn-68"><mml:math id="mml-ieqn-68"><mml:mi>&#x03F5;</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mn>10</mml:mn><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>3</mml:mn></mml:mrow></mml:msup></mml:math></inline-formula> is a constant in all the experiments.</p>
</sec>
<sec id="s4_2">
<label>4.2</label>
<title>Quantitative and Qualitative Results</title>
<p>In the comparative experiments with other state-of-the-art methods, we compare our proposed method with various cutting-edge approaches and compute PSNR, SSIM, MAE, and LPIPS scores using the RGB channel following [<xref ref-type="bibr" rid="ref-1">1</xref>]. In addition, to compare the computational complexity of these learning-based methods, we have also listed relevant measurement metrics, including Params, FLOPs, inference tims, and memory cost. As shown in <xref ref-type="table" rid="table-1">Table 1</xref>, our method achieves significant performance gains that are more than the state-of-the-art methods. Our method outperforms all comparison methods in all indicators. It outperforms the second-best FSNet by 0.11 dB in PSNR, and shows significant superiority in MAE and LPIPS. For the running time and memory consumption, we use a server equipped with an RTX4090 graphics card for inference time evaluation, and the image size is 128 <inline-formula id="ieqn-69"><mml:math id="mml-ieqn-69"><mml:mo>&#x00D7;</mml:mo></mml:math></inline-formula> 128 during inference. Although our method achieves SOTA, our method is obviously more computationally demanding than Sandformer [<xref ref-type="bibr" rid="ref-1">1</xref>], SANet [<xref ref-type="bibr" rid="ref-32">32</xref>], and OKNet [<xref ref-type="bibr" rid="ref-11">11</xref>], which is a disadvantage of our method. However, compared to the second-best FSNet, our method has less than half of the computational cost, whether in terms of Flops(G) or Params(M). Furthermore, the visual results shown in <xref ref-type="fig" rid="fig-5">Fig. 5</xref> are in close alignment with the quantitative findings, substantiating the superior image restoration capabilities of our proposed method. Given that all comparison methods yield similar results for real images, we employ error maps [<xref ref-type="bibr" rid="ref-33">33</xref>] to highlight the differences from the original clear images. As shown in the error maps, AODNet [<xref ref-type="bibr" rid="ref-9">9</xref>], NAFNet [<xref ref-type="bibr" rid="ref-10">10</xref>], SANet [<xref ref-type="bibr" rid="ref-32">32</xref>], OKNet [<xref ref-type="bibr" rid="ref-11">11</xref>], and ECFNet [<xref ref-type="bibr" rid="ref-14">14</xref>] deliver sub-par image restoration, with large-scale errors. In contrast, DSANet [<xref ref-type="bibr" rid="ref-13">13</xref>] and FSNet [<xref ref-type="bibr" rid="ref-15">15</xref>] produce smaller errors, yet struggle with details in texture-lacking areas like the sky or walls. Our method, however, stands out by restoring the most visually pleasing images, highlighting its superior performance. Therefore, <xref ref-type="fig" rid="fig-5">Fig. 5</xref> shows the superiority of our proposed method in the task of simultaneously removing the dust attachment from the image and performing low-light image enhancement.</p>
<table-wrap id="table-1">
<label>Table 1</label>
<caption>
<title>Comparative results with state-of-the-art methods. Red/Blue text indicates best/second-best results</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<thead>
<tr>
<th>Method</th>
<th>PSNR<inline-formula id="ieqn-70"><mml:math id="mml-ieqn-70"><mml:mo stretchy="false">&#x2191;</mml:mo></mml:math></inline-formula></th>
<th>SSIM<inline-formula id="ieqn-71"><mml:math id="mml-ieqn-71"><mml:mo stretchy="false">&#x2191;</mml:mo></mml:math></inline-formula></th>
<th>MAE<inline-formula id="ieqn-72"><mml:math id="mml-ieqn-72"><mml:mo stretchy="false">&#x2193;</mml:mo></mml:math></inline-formula></th>
<th>LPIPS<inline-formula id="ieqn-73"><mml:math id="mml-ieqn-73"><mml:mo stretchy="false">&#x2193;</mml:mo></mml:math></inline-formula></th>
<th align="center">Flops (G)</th>
<th align="center">Params (M)</th>
<th align="center">Inf. time (ms)</th>
<th align="center">Mem. cost (M)</th>
</tr>
</thead>
<tbody>
<tr>
<td>Degenerated images</td>
<td>8.29</td>
<td>0.228</td>
<td>0.3349</td>
<td>0.460</td>
<td>/</td>
<td>/</td>
<td>/</td>
<td>/</td>
</tr>
<tr>
<td>(TCE&#x2019;07) BPDHE [<xref ref-type="bibr" rid="ref-34">34</xref>]</td>
<td>16.13</td>
<td>0.748</td>
<td>0.1386</td>
<td>0.251</td>
<td>/</td>
<td>/</td>
<td>/</td>
<td>/</td>
</tr>
<tr>
<td>(TPAMI&#x2019;10) DCP [<xref ref-type="bibr" rid="ref-35">35</xref>]</td>
<td>8.18</td>
<td>0.239</td>
<td>0.3500</td>
<td>0.400</td>
<td>/</td>
<td>/</td>
<td>/</td>
<td>/</td>
</tr>
<tr>
<td>(arxiv&#x2019;17) AOD-Net [<xref ref-type="bibr" rid="ref-9">9</xref>]</td>
<td>18.01</td>
<td>0.765</td>
<td>0.1082</td>
<td>0.337</td>
<td>0.029</td>
<td>0.0017</td>
<td>0.13</td>
<td>0.01</td>
</tr>
<tr>
<td>(ICCV&#x2019;21) MIMOUNet [<xref ref-type="bibr" rid="ref-26">26</xref>]</td>
<td>30.18</td>
<td><styled-content style-type="color" style="color: #1900FF;">0.958</styled-content></td>
<td>0.0238</td>
<td>0.080</td>
<td>16.82</td>
<td>6.81</td>
<td>2.260</td>
<td>25.97</td>
</tr>
<tr>
<td>(CVPR&#x2019;22) Transweather [<xref ref-type="bibr" rid="ref-36">36</xref>]</td>
<td>29.39</td>
<td>0.949</td>
<td>0.0255</td>
<td>0.089</td>
<td>1.56</td>
<td>38.05</td>
<td>3.330</td>
<td>145.16</td>
</tr>
<tr>
<td>(ECCV&#x2019;22) NAFNet [<xref ref-type="bibr" rid="ref-10">10</xref>]</td>
<td>29.82</td>
<td>0.954</td>
<td>0.0243</td>
<td>0.083</td>
<td>4.03</td>
<td>17.11</td>
<td>6.493</td>
<td>65.28</td>
</tr>
<tr>
<td>(ICASSP&#x2019;23) Sandformer [<xref ref-type="bibr" rid="ref-1">1</xref>]</td>
<td>30.25</td>
<td>0.956</td>
<td>0.0241</td>
<td>0.082</td>
<td>2.69</td>
<td>4.28</td>
<td>8.177</td>
<td>16.32</td>
</tr>
<tr>
<td>(IJCAI&#x2019;23) SANet [<xref ref-type="bibr" rid="ref-32">32</xref>]</td>
<td>30.04</td>
<td>0.956</td>
<td>0.0241</td>
<td>0.082</td>
<td>9.93</td>
<td>3.81</td>
<td>2.557</td>
<td>14.54</td>
</tr>
<tr>
<td>(TVC&#x2019;24) M3SNet [<xref ref-type="bibr" rid="ref-12">12</xref>]</td>
<td>30.19</td>
<td>0.957</td>
<td>0.0237</td>
<td>0.078</td>
<td>4.71</td>
<td>16.73</td>
<td>7.630</td>
<td>63.83</td>
</tr>
<tr>
<td>(AAAI&#x2019;24) OKNet [<xref ref-type="bibr" rid="ref-11">11</xref>]</td>
<td>29.99</td>
<td>0.956</td>
<td>0.0243</td>
<td>0.083</td>
<td>9.93</td>
<td>4.72</td>
<td>1.768</td>
<td>18.02</td>
</tr>
<tr>
<td>(NN&#x2019;24) DSANet [<xref ref-type="bibr" rid="ref-13">13</xref>]</td>
<td>30.13</td>
<td>0.957</td>
<td>0.0238</td>
<td>0.081</td>
<td>9.46</td>
<td>3.86</td>
<td>3.960</td>
<td>14.74</td>
</tr>
<tr>
<td>(arxiv&#x2019;24) ECFNet [<xref ref-type="bibr" rid="ref-14">14</xref>]</td>
<td>29.23</td>
<td>0.950</td>
<td>0.0261</td>
<td>0.084</td>
<td>7.72</td>
<td>3.65</td>
<td>15.146</td>
<td>13.93</td>
</tr>
<tr>
<td>(TPAMI&#x2019;24) FSNet [<xref ref-type="bibr" rid="ref-15">15</xref>]</td>
<td><styled-content style-type="color" style="color: #1900FF;">30.30</styled-content></td>
<td><styled-content style-type="color" style="color: #FF000C;">0.959</styled-content></td>
<td><styled-content style-type="color" style="color: #1900FF;">0.0235</styled-content></td>
<td><styled-content style-type="color" style="color: #1900FF;">0.076</styled-content></td>
<td>27.68</td>
<td>13.28</td>
<td>14.155</td>
<td>50.66</td>
</tr>
<tr>
<td>Ours</td>
<td><styled-content style-type="color" style="color: #FF000C;">30.41</styled-content></td>
<td><styled-content style-type="color" style="color: #FF000C;">0.959</styled-content></td>
<td><styled-content style-type="color" style="color: #FF000C;">0.0232</styled-content></td>
<td><styled-content style-type="color" style="color: #FF000C;">0.073</styled-content></td>
<td>12.09</td>
<td>5.02</td>
<td>18.107</td>
<td>19.15</td>
</tr>
</tbody>
</table>
</table-wrap><fig id="fig-5">
<label>Figure 5</label>
<caption>
<title>Joint low-light image enhancement and sand-dust removal results on SLLIE6K Dataset</title>
</caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="CMC_65812-fig-5.tif"/>
</fig>
</sec>
<sec id="s4_3">
<label>4.3</label>
<title>Ablation Studies</title>
<p><xref ref-type="table" rid="table-2">Table 2</xref> presents ablation experimental metrics for our proposed DTKAN, UTKAN, UCCM, and UDSM. We conducted detailed ablation experiments on the components proposed in this paper. For the convenience and efficiency of the experiment, only 60% of the 6000 images used for training and validation in the SLLIE6K dataset were used for model training in the ablation experiment section, and the remaining images were used for testing. As <xref ref-type="table" rid="table-2">Table 2</xref> shown, it demonstrates that our proposed component enhances the performance of the model, underscoring the rationality of the model. For example, replacing the pooling convolution or strided convolution with the tokeniz KAN can enhance the network&#x2019;s ability of image restoration. Embedding the UCCM module can increase PSNR by 1.22 dB. Our proposed UDSM module outperforms the spatial attention module and plain convolution layers by 0.21 and 0.30 dB, respectively. Therefore, <xref ref-type="table" rid="table-2">Table 2</xref> also indicates the superiority of our proposed individual modules.</p>
<table-wrap id="table-2">
<label>Table 2</label>
<caption>
<title>Ablation studies on the independent components of our proposed method. PConv, UPConv, SConv, and TConv denote polling convolution, upsampling convolution, strided convolution and transposed convolution, respectively</title>
</caption>
<table>
<colgroup>
<col/>
<col/>
<col/>
<col/>
<col/>
</colgroup>
<tbody>
<tr>
<td>DTKAN/UTKAN</td>
<td>w/o</td>
<td>PConv/UPConv</td>
<td>SConv/TConv</td>
<td>Ours</td>
</tr>
<tr>
<td>PSNR<inline-formula id="ieqn-74"><mml:math id="mml-ieqn-74"><mml:mo stretchy="false">&#x2191;</mml:mo></mml:math></inline-formula> / SSIM<inline-formula id="ieqn-75"><mml:math id="mml-ieqn-75"><mml:mo stretchy="false">&#x2191;</mml:mo></mml:math></inline-formula></td>
<td>29.24/0.920</td>
<td>29.32/0.924</td>
<td>29.34/0.925</td>
<td>29.44/0.934</td>
</tr>
<tr>
<td>UCCM</td>
<td>w/o</td>
<td>w/o certainty</td>
<td>w/o uncertainty</td>
<td>Ours</td>
</tr>
<tr>
<td>PSNR<inline-formula id="ieqn-76"><mml:math id="mml-ieqn-76"><mml:mo stretchy="false">&#x2191;</mml:mo></mml:math></inline-formula> / SSIM<inline-formula id="ieqn-77"><mml:math id="mml-ieqn-77"><mml:mo stretchy="false">&#x2191;</mml:mo></mml:math></inline-formula></td>
<td>29.24/0.920</td>
<td>29.54/0.943</td>
<td>30.24/0.964</td>
<td>30.46/0.975</td>
</tr>
<tr>
<td>UDSM</td>
<td>w/o</td>
<td>Spatial attention [<xref ref-type="bibr" rid="ref-37">37</xref>]</td>
<td>Convolution</td>
<td>Ours</td>
</tr>
<tr>
<td>PSNR<inline-formula id="ieqn-78"><mml:math id="mml-ieqn-78"><mml:mo stretchy="false">&#x2191;</mml:mo></mml:math></inline-formula> / SSIM<inline-formula id="ieqn-79"><mml:math id="mml-ieqn-79"><mml:mo stretchy="false">&#x2191;</mml:mo></mml:math></inline-formula></td>
<td>29.24/0.920</td>
<td>29.41/0.930</td>
<td>29.32/0.923</td>
<td>29.62/0.945</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec id="s5">
<label>5</label>
<title>Conclusion</title>
<p>In this paper, we introduce the concept of uncertainty learning in the joint removal of sand-dust and low-light enhancement in a single-image. Building upon this, we present the UAKAN framework that integrates our innovative Downsampling Tokenized KAN (DTKAN) and Upsampling Tokenized KAN (UTKAN) with Uncertainty-aware Distribution Spatial Modulator (UDSM) and Uncertainty Coupling Certainty Module (UCCM). Extensive evaluations confirm our method&#x2019;s superiority over state-of-the-art image restoration techniques, with significant advantages in low-light sand-dust scenes. By modeling uncertainties in the JLLIESR task, UAKAN effectively addresses aleatoric uncertainty (data-level randomness) and epistemic uncertainty (model-level cognitive limitations). Meanwhile, UAKAN employs Tokenized KAN for sampling. Therefore, UAKAN can effectively perform the JLLIESR task and can effectively restore the original clear image from images where both sand-dust adhesion and low-light degradation occur simultaneously. However, our proposed method still has limitations. For example, our method still has room for improvement in image processing speed while keeping low computational complexity and memory occupation. This may lead to some potential failure cases or scenarios in which our proposed UAKAN may perform poorly, such as when deployed in terminals with high real-time requirements. Therefore, in the future, we will further optimize the running time of the reduced model and improve the performance of the model in order to achieve faster and more accurate image restoration.</p>
</sec>
</body>
<back>
<ack>
<p>Thanks for all the help of the teachers and students of the related universities.</p>
</ack>
<sec>
<title>Funding Statement</title>
<p>This research is supported by National Key R&#x0026;D Program of China (2023YFB2504400).</p>
</sec>
<sec>
<title>Author Contributions</title>
<p>Study conception and design: Bingcai Wei, Hui Liu, Chuang Qian; data collection: Bingcai Wei, Haoliang Shen, Yibiao Chen; analysis and interpretation of results: Bingcai Wei, Haoliang Shen, Yixin Wang; draft manuscript preparation: Bingcai Wei, Hui Liu. All authors reviewed the results and approved the final version of the manuscript.</p>
</sec>
<sec sec-type="data-availability">
<title>Availability of Data and Materials</title>
<p>The data that support the findings of this study are available from the corresponding author, Hui Liu, upon reasonable request.</p>
</sec>
<sec>
<title>Ethics Approval</title>
<p>Not applicable.</p>
</sec>
<sec sec-type="COI-statement">
<title>Conflicts of Interest</title>
<p>The authors declare no conflicts of interest to report regarding the present study.</p>
</sec>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>[1]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Shi</surname> <given-names>J</given-names></string-name>, <string-name><surname>Wei</surname> <given-names>B</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>G</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>L</given-names></string-name></person-group>. <article-title>Sandformer: CNN and transformer under gated fusion for sand dust image restoration</article-title>. In: <conf-name>ICASSP 2023&#x2013;2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</conf-name>; <year>2023 Jun 4&#x2013;10</year>; <publisher-name>Rhodes Island</publisher-name>, <publisher-loc>Greece</publisher-loc>. p. <fpage>1</fpage>&#x2013;<lpage>5</lpage>. doi:<pub-id pub-id-type="doi">10.1109/ICASSP49357.2023.10095242</pub-id>.</mixed-citation></ref>
<ref id="ref-2"><label>[2]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Tian</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Ye</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Doermann</surname> <given-names>D</given-names></string-name></person-group>. <article-title>YOLOv12: Attention-centric real-time object detectors. arXiv:2502.12524</article-title>. <year>2025</year>. doi:<pub-id pub-id-type="doi">10.48550/arXiv.2502.12524</pub-id>.</mixed-citation></ref>
<ref id="ref-3"><label>[3]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chen</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Yuan</surname> <given-names>X</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Wu</surname> <given-names>R</given-names></string-name>, <string-name><surname>Li</surname> <given-names>X</given-names></string-name>, <string-name><surname>Hou</surname> <given-names>Q</given-names></string-name>, <etal>et al</etal></person-group>. <article-title>YOLO-MS: rethinking multi-scale representation learning for real-time object detection</article-title>. <source>IEEE Trans Pattern Anal Mach Intell</source>. <year>2025</year>;<volume>47</volume>(<issue>6</issue>):<fpage>4240</fpage>&#x2013;<lpage>52</lpage>. doi:<pub-id pub-id-type="doi">10.1109/TPAMI.2025.3538473</pub-id>; <pub-id pub-id-type="pmid">40031746</pub-id></mixed-citation></ref>
<ref id="ref-4"><label>[4]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Guo</surname> <given-names>C</given-names></string-name>, <string-name><surname>Li</surname> <given-names>C</given-names></string-name>, <string-name><surname>Guo</surname> <given-names>J</given-names></string-name>, <string-name><surname>Change Loy</surname> <given-names>C</given-names></string-name>, <string-name><surname>Hou</surname> <given-names>J</given-names></string-name>, <string-name><surname>Kwong</surname> <given-names>S</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>Zero-reference deep curve estimation for low-light image enhancement</article-title>. In: <conf-name>Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition</conf-name>; <year>2020 Jun 13&#x2013;19</year>; <publisher-loc>Seattle, WA, USA</publisher-loc>. p. <fpage>1780</fpage>&#x2013;<lpage>9</lpage>. doi:<pub-id pub-id-type="doi">10.1109/CVPR42600.2020.00185</pub-id>.</mixed-citation></ref>
<ref id="ref-5"><label>[5]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wang</surname> <given-names>W</given-names></string-name>, <string-name><surname>Yin</surname> <given-names>B</given-names></string-name>, <string-name><surname>Li</surname> <given-names>L</given-names></string-name>, <string-name><surname>Li</surname> <given-names>L</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>H</given-names></string-name></person-group>. <article-title>A low light image enhancement method based on dehazing physical model</article-title>. <source>Comput Model Eng Sci</source>. <year>2025</year>;<volume>143</volume>(<issue>2</issue>):<fpage>1595</fpage>&#x2013;<lpage>616</lpage>. doi:<pub-id pub-id-type="doi">10.32604/cmes.2025.063595</pub-id>.</mixed-citation></ref>
<ref id="ref-6"><label>[6]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wang</surname> <given-names>M</given-names></string-name>, <string-name><surname>Li</surname> <given-names>J</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>C</given-names></string-name></person-group>. <article-title>Low-light image enhancement by deep learning network for improved illumination map</article-title>. <source>Comput Vis Image Underst</source>. <year>2023</year>;<volume>232</volume>(<issue>4</issue>):<fpage>103681</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.cviu.2023.103681</pub-id>.</mixed-citation></ref>
<ref id="ref-7"><label>[7]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Narasimhan</surname> <given-names>SG</given-names></string-name>, <string-name><surname>Nayar</surname> <given-names>SK</given-names></string-name></person-group>. <article-title>Vision and the atmosphere</article-title>. <source>Int J Comput Vis</source>. <year>2002</year>;<volume>48</volume>(<issue>3</issue>):<fpage>233</fpage>&#x2013;<lpage>54</lpage>. doi:<pub-id pub-id-type="doi">10.1023/A:1016328200723</pub-id>.</mixed-citation></ref>
<ref id="ref-8"><label>[8]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Si</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>F</given-names></string-name>, <string-name><surname>Guo</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>W</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>Y</given-names></string-name></person-group>. <article-title>A comprehensive benchmark analysis for sand dust image reconstruction</article-title>. <source>J Vis Commun Image Represent</source>. <year>2022</year>;<volume>89</volume>(<issue>12</issue>):<fpage>103638</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.jvcir.2022.103638</pub-id>.</mixed-citation></ref>
<ref id="ref-9"><label>[9]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Li</surname> <given-names>B</given-names></string-name>, <string-name><surname>Peng</surname> <given-names>X</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>J</given-names></string-name>, <string-name><surname>Feng</surname> <given-names>D</given-names></string-name></person-group>. <article-title>An all-in-one network for dehazing and beyond. arXiv:1707.06543</article-title>. <year>2017</year>. doi:<pub-id pub-id-type="doi">10.48550/arXiv.1707.06543</pub-id>.</mixed-citation></ref>
<ref id="ref-10"><label>[10]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Chen</surname> <given-names>L</given-names></string-name>, <string-name><surname>Chu</surname> <given-names>X</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>X</given-names></string-name>, <string-name><surname>Sun</surname> <given-names>J</given-names></string-name></person-group>. <article-title>Simple baselines for image restoration</article-title>. In: <conf-name>European Conference on Computer Vision</conf-name>; <year>2022 Oct 23&#x2013;27</year>; <publisher-loc>Tel Aviv, Israel</publisher-loc>. p. <fpage>17</fpage>&#x2013;<lpage>33</lpage>. doi:<pub-id pub-id-type="doi">10.1007/978-3-031-20071-7_2</pub-id>.</mixed-citation></ref>
<ref id="ref-11"><label>[11]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Cui</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Ren</surname> <given-names>W</given-names></string-name>, <string-name><surname>Knoll</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Omni-kernel network for image restoration</article-title>. <source>Proc AAAI Conf Artif Intell</source> <year>2024</year>;<volume>38</volume>(<issue>2</issue>):<fpage>1426</fpage>&#x2013;<lpage>34</lpage>. doi:<pub-id pub-id-type="doi">10.1609/aaai.v38i2.27907</pub-id>.</mixed-citation></ref>
<ref id="ref-12"><label>[12]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Gao</surname> <given-names>H</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>N</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Dang</surname> <given-names>D</given-names></string-name></person-group>. <article-title>A novel single-stage network for accurate image restoration</article-title>. <source>Vis Comput</source>. <year>2024</year>;<volume>40</volume>(<issue>10</issue>):<fpage>7385</fpage>&#x2013;<lpage>98</lpage>. doi:<pub-id pub-id-type="doi">10.1007/s00371-024-03599-6</pub-id>.</mixed-citation></ref>
<ref id="ref-13"><label>[13]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Cui</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Knoll</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Dual-domain strip attention for image restoration</article-title>. <source>Neural Netw</source>. <year>2024</year>;<volume>171</volume>(<issue>5</issue>):<fpage>429</fpage>&#x2013;<lpage>39</lpage>. doi:<pub-id pub-id-type="doi">10.1016/j.neunet.2023.12.003</pub-id>; <pub-id pub-id-type="pmid">38142482</pub-id></mixed-citation></ref>
<ref id="ref-14"><label>[14]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Gao</surname> <given-names>H</given-names></string-name>, <string-name><surname>Ma</surname> <given-names>B</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Dang</surname> <given-names>D</given-names></string-name></person-group>. <article-title>Emphasizing crucial features for efficient image restoration. arXiv:2405.11468</article-title>. <year>2024</year>. doi:<pub-id pub-id-type="doi">10.48550/arXiv.2405.11468</pub-id>.</mixed-citation></ref>
<ref id="ref-15"><label>[15]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Cui</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Ren</surname> <given-names>W</given-names></string-name>, <string-name><surname>Cao</surname> <given-names>X</given-names></string-name>, <string-name><surname>Knoll</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Image restoration via frequency selection</article-title>. <source>IEEE Trans Pattern Anal Mach Intell</source>. <year>2024</year>;<volume>46</volume>(<issue>2</issue>):<fpage>1093</fpage>&#x2013;<lpage>108</lpage>. doi:<pub-id pub-id-type="doi">10.1109/TPAMI.2023.3330416</pub-id>; <pub-id pub-id-type="pmid">37930909</pub-id></mixed-citation></ref>
<ref id="ref-16"><label>[16]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Kendall</surname> <given-names>A</given-names></string-name>, <string-name><surname>Badrinarayanan</surname> <given-names>V</given-names></string-name>, <string-name><surname>Cipolla</surname> <given-names>R</given-names></string-name></person-group>. <article-title>Bayesian segnet: model uncertainty in deep convolutional encoder-decoder architectures for scene understanding. arXiv:1511.02680</article-title>. <year>2015</year>. doi:<pub-id pub-id-type="doi">10.48550/arXiv.1511.02680</pub-id>.</mixed-citation></ref>
<ref id="ref-17"><label>[17]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Gal</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Ghahramani</surname> <given-names>Z</given-names></string-name></person-group>. <article-title>Dropout as a bayesian approximation: representing model uncertainty in deep learning</article-title>. In: <conf-name>Proceedings of The 33rd International Conference on Machine Learning</conf-name>; <year>2016 Jun 19&#x2013;24</year>; <publisher-loc>New York, NY, USA</publisher-loc>. Vol. <volume>48</volume>, p. <fpage>1050</fpage>&#x2013;<lpage>9</lpage>.</mixed-citation></ref>
<ref id="ref-18"><label>[18]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Wang</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Li</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Guo</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Fang</surname> <given-names>L</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>S</given-names></string-name></person-group>. <article-title>Data-uncertainty guided multi-phase learning for semi-supervised object detection</article-title>. In: <conf-name>Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition</conf-name>; <year>2021 Jun 20&#x2013;25</year>; <publisher-loc>Nashville, TN, USA</publisher-loc>. p. <fpage>4568</fpage>&#x2013;<lpage>77</lpage>. doi:<pub-id pub-id-type="doi">10.1109/CVPR46437.2021.00454</pub-id>.</mixed-citation></ref>
<ref id="ref-19"><label>[19]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Zhou</surname> <given-names>C</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Pu</surname> <given-names>M</given-names></string-name>, <string-name><surname>Guan</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>L</given-names></string-name>, <string-name><surname>Ling</surname> <given-names>H</given-names></string-name></person-group>. <article-title>The treasure beneath multiple annotations: an uncertainty-aware edge detector</article-title>. In: <conf-name>Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition</conf-name>; <year>2023 Jun 17&#x2013;24</year>; <publisher-loc>Vancouver, BC, Canada</publisher-loc>. p. <fpage>15507</fpage>&#x2013;<lpage>17</lpage>. doi:<pub-id pub-id-type="doi">10.1109/CVPR52729.2023.01488</pub-id>.</mixed-citation></ref>
<ref id="ref-20"><label>[20]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Yang</surname> <given-names>F</given-names></string-name>, <string-name><surname>Zhai</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Li</surname> <given-names>X</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>R</given-names></string-name>, <string-name><surname>Luo</surname> <given-names>A</given-names></string-name>, <string-name><surname>Cheng</surname> <given-names>H</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>Uncertainty-guided transformer reasoning for camouflaged object detection</article-title>. In: <conf-name>Proceedings of the IEEE/CVF International Conference on Computer Vision</conf-name>; <year>2021 Oct 11&#x2013;17</year>; <publisher-loc>Montreal, QC, Canada</publisher-loc>. p. <fpage>4146</fpage>&#x2013;<lpage>55</lpage>. doi:<pub-id pub-id-type="doi">10.1109/ICCV48922.2021.00411</pub-id>.</mixed-citation></ref>
<ref id="ref-21"><label>[21]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Vaswani</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Attention is all you need. In: Advances in neural information processing systems. 2017 [Internet]. [cited 2025 Jun 10]</article-title>. Available from: <ext-link ext-link-type="uri" xlink:href="https://proceedings.neurips.cc/paper_files/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html">https://proceedings.neurips.cc/paper_files/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html</ext-link>.</mixed-citation></ref>
<ref id="ref-22"><label>[22]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Zhang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Zhu</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Hou</surname> <given-names>J</given-names></string-name>, <string-name><surname>Yuan</surname> <given-names>Y</given-names></string-name></person-group>. <article-title>Glenet: boosting 3D object detectors with generative label uncertainty estimation</article-title>. <source>Int J Comput Vis</source>. <year>2023</year>;<volume>131</volume>(<issue>12</issue>):<fpage>3332</fpage>&#x2013;<lpage>52</lpage>. doi:<pub-id pub-id-type="doi">10.1007/s11263-023-01869-9</pub-id>.</mixed-citation></ref>
<ref id="ref-23"><label>[23]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Tang</surname> <given-names>T</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>L</given-names></string-name>, <string-name><surname>Hao</surname> <given-names>P</given-names></string-name>, <string-name><surname>He</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Ho</surname> <given-names>K</given-names></string-name>, <string-name><surname>Gu</surname> <given-names>S</given-names></string-name>, <etal>et al.</etal></person-group> <article-title>S2-Track: a simple yet strong approach for end-to-end 3D multi-object tracking. arXiv:2406.02147</article-title>. <year>2024</year>. doi:<pub-id pub-id-type="doi">10.48550/arXiv.2406.02147</pub-id>.</mixed-citation></ref>
<ref id="ref-24"><label>[24]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Dong</surname> <given-names>M</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>A</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Li</surname> <given-names>D</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Zhao</surname> <given-names>R</given-names></string-name></person-group>. <article-title>Uncertainty-aware consistency learning for semi-supervised medical image segmentation</article-title>. <source>Knowl Based Syst</source>. <year>2025</year>;<volume>309</volume>(<issue>1</issue>):<fpage>112890</fpage>. doi:<pub-id pub-id-type="doi">10.1016/j.knosys.2024.112890</pub-id>.</mixed-citation></ref>
<ref id="ref-25"><label>[25]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Shao</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>H</given-names></string-name>, <string-name><surname>Cai</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>L</given-names></string-name>, <string-name><surname>Li</surname> <given-names>Y</given-names></string-name></person-group>. <article-title>UA-fusion: uncertainty-aware multimodal data fusion framework for 3d object detection of autonomous vehicles</article-title>. <source>IEEE Trans Instrum Meas</source>. <year>2025</year>;<volume>74</volume>:<fpage>2514916</fpage>. doi:<pub-id pub-id-type="doi">10.1109/TIM.2025.3548184</pub-id>.</mixed-citation></ref>
<ref id="ref-26"><label>[26]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Cho</surname> <given-names>SJ</given-names></string-name>, <string-name><surname>Ji</surname> <given-names>SW</given-names></string-name>, <string-name><surname>Hong</surname> <given-names>JP</given-names></string-name>, <string-name><surname>Jung</surname> <given-names>SW</given-names></string-name>, <string-name><surname>Ko</surname> <given-names>SJ</given-names></string-name></person-group>. <article-title>Rethinking coarse-to-fine approach in single image deblurring</article-title>. In: <conf-name>Proceedings of the IEEE/CVF International Conference on Computer Vision</conf-name>; <year>2021 Oct 11&#x2013;17</year>; <publisher-loc>Montreal, QC, Canada</publisher-loc>. p. <fpage>4641</fpage>&#x2013;<lpage>50</lpage>. doi:<pub-id pub-id-type="doi">10.1109/ICCV48922.2021.00460</pub-id>.</mixed-citation></ref>
<ref id="ref-27"><label>[27]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>He</surname> <given-names>K</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>X</given-names></string-name>, <string-name><surname>Ren</surname> <given-names>S</given-names></string-name>, <string-name><surname>Sun</surname> <given-names>J</given-names></string-name></person-group>. <article-title>Deep residual learning for image recognition</article-title>. In: <conf-name>Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition</conf-name>; <year>2016 Jun 27&#x2013;30</year>; <publisher-loc>Las Vegas, NV, USA</publisher-loc>. p. <fpage>770</fpage>&#x2013;<lpage>8</lpage>. doi:<pub-id pub-id-type="doi">10.1109/CVPR.2016.90</pub-id>.</mixed-citation></ref>
<ref id="ref-28"><label>[28]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Hu</surname> <given-names>J</given-names></string-name>, <string-name><surname>Shen</surname> <given-names>L</given-names></string-name>, <string-name><surname>Sun</surname> <given-names>G</given-names></string-name></person-group>. <article-title>Squeeze-and-excitation networks</article-title>. In: <conf-name>Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition</conf-name>; <year>2018 Jun 18&#x2013;23</year>; <publisher-loc>Salt Lake City, UT, USA</publisher-loc>. p. <fpage>7132</fpage>&#x2013;<lpage>41</lpage>. doi:<pub-id pub-id-type="doi">10.1109/CVPR.2018.00745</pub-id>.</mixed-citation></ref>
<ref id="ref-29"><label>[29]</label><mixed-citation publication-type="other"><person-group person-group-type="author"><string-name><surname>Ba</surname> <given-names>JL</given-names></string-name>, <string-name><surname>Kiros</surname> <given-names>JR</given-names></string-name>, <string-name><surname>Hinton</surname> <given-names>GE</given-names></string-name></person-group>. <article-title>Layer normalization. arXiv:1607.06450</article-title>. <year>2016. </year>doi:<pub-id pub-id-type="doi">10.48550/arXiv.1607.06450</pub-id>.</mixed-citation></ref>
<ref id="ref-30"><label>[30]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Everingham</surname> <given-names>M</given-names></string-name>, <string-name><surname>Gool</surname> <given-names>LV</given-names></string-name>, <string-name><surname>Williams</surname> <given-names>CKI</given-names></string-name>, <string-name><surname>Winn</surname> <given-names>J</given-names></string-name>, <string-name><surname>Zisserman</surname> <given-names>A</given-names></string-name></person-group>. <article-title>The pascal visual object classes (voc) challenge</article-title>. <source>Int J Comput Vis</source>. <year>2010</year>;<volume>88</volume>(<issue>2</issue>):<fpage>303</fpage>&#x2013;<lpage>38</lpage>. doi:<pub-id pub-id-type="doi">10.1007/s11263-009-0275-4</pub-id>.</mixed-citation></ref>
<ref id="ref-31"><label>[31]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Charbonnier</surname> <given-names>P</given-names></string-name>, <string-name><surname>Blanc-Feraud</surname> <given-names>L</given-names></string-name>, <string-name><surname>Aubert</surname> <given-names>G</given-names></string-name>, <string-name><surname>Barlaud</surname> <given-names>M</given-names></string-name></person-group>. <article-title>Two deterministic half-quadratic regularization algorithms for computed imaging</article-title>. In: <conf-name>Proceedings of 1st International Conference on Image Processing</conf-name>; <year>1994 Nov 13&#x2013;16</year>; <publisher-loc>Austin, TX, USA</publisher-loc>. p. <fpage>168</fpage>&#x2013;<lpage>72</lpage>. doi:<pub-id pub-id-type="doi">10.1109/ICIP.1994.413553</pub-id>.</mixed-citation></ref>
<ref id="ref-32"><label>[32]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Cui</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Tao</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Jing</surname> <given-names>L</given-names></string-name>, <string-name><surname>Knoll</surname> <given-names>A</given-names></string-name></person-group>. <article-title>Strip attention for image restoration</article-title>. In: <conf-name>Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence Main Track</conf-name>; <year>2023 Aug 19&#x2013;25</year>; <publisher-loc>Macao, China</publisher-loc>. p. <fpage>645</fpage>&#x2013;<lpage>53</lpage>. doi:<pub-id pub-id-type="doi">10.24963/ijcai.2023/72</pub-id>.</mixed-citation></ref>
<ref id="ref-33"><label>[33]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Zheng</surname> <given-names>C</given-names></string-name>, <string-name><surname>Shi</surname> <given-names>D</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>Y</given-names></string-name></person-group>. <article-title>Windowing decomposition convolutional neural network for image enhancement</article-title>. In: <conf-name>Proceedings of the 29th ACM International Conference on Multimedia</conf-name>; <year>2021 Oct 20&#x2013;24</year>; <publisher-loc>Virtual Event, China</publisher-loc>. p. <fpage>424</fpage>&#x2013;<lpage>32</lpage>. doi:<pub-id pub-id-type="doi">10.1145/3474085.3475181</pub-id>.</mixed-citation></ref>
<ref id="ref-34"><label>[34]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ibrahim</surname> <given-names>H</given-names></string-name>, <string-name><surname>Kong</surname> <given-names>NSP</given-names></string-name></person-group>. <article-title>Brightness preserving dynamic histogram equalization for image contrast enhancement</article-title>. <source>IEEE Trans Consum Electron</source>. <year>2007</year>;<volume>53</volume>(<issue>4</issue>):<fpage>1752</fpage>&#x2013;<lpage>8</lpage>. doi:<pub-id pub-id-type="doi">10.1109/TCE.2007.4429280</pub-id>.</mixed-citation></ref>
<ref id="ref-35"><label>[35]</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>He</surname> <given-names>K</given-names></string-name>, <string-name><surname>Sun</surname> <given-names>J</given-names></string-name>, <string-name><surname>Tang</surname> <given-names>X</given-names></string-name></person-group>. <article-title>Single image haze removal using dark channel prior</article-title>. <source>IEEE Trans Pattern Anal Mach Intell</source>. <year>2010</year>;<volume>33</volume>(<issue>12</issue>):<fpage>2341</fpage>&#x2013;<lpage>53</lpage>. doi:<pub-id pub-id-type="doi">10.1109/TPAMI.2010.168</pub-id>; <pub-id pub-id-type="pmid">20820075</pub-id></mixed-citation></ref>
<ref id="ref-36"><label>[36]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Valanarasu</surname> <given-names>JMJ</given-names></string-name>, <string-name><surname>Yasarla</surname> <given-names>R</given-names></string-name>, <string-name><surname>Patel</surname> <given-names>VM</given-names></string-name></person-group>. <article-title>Transweather: transformer-based restoration of images degraded by adverse weather conditions</article-title>. In: <conf-name>Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition</conf-name>; <year>2022 Jun 18&#x2013;24</year>; <publisher-loc>New Orleans, LA, USA</publisher-loc>. p. <fpage>2353</fpage>&#x2013;<lpage>63</lpage>. doi:<pub-id pub-id-type="doi">10.1109/CVPR52688.2022.00239</pub-id>.</mixed-citation></ref>
<ref id="ref-37"><label>[37]</label><mixed-citation publication-type="conf-proc"><person-group person-group-type="author"><string-name><surname>Woo</surname> <given-names>S</given-names></string-name>, <string-name><surname>Park</surname> <given-names>J</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>JY</given-names></string-name>, <string-name><surname>Kweon</surname> <given-names>IS</given-names></string-name></person-group>. <article-title>Cbam: convolutional block attention module</article-title>. In: <conf-name>Proceedings of the European Conference on Computer Vision (ECCV)</conf-name>; <year>2018 Sep 8&#x2013;14</year>; <publisher-loc>Munich, Germany</publisher-loc>. p. <fpage>3</fpage>&#x2013;<lpage>19</lpage>. doi:<pub-id pub-id-type="doi">10.1007/978-3-030-01234-2_1</pub-id>.</mixed-citation></ref>
</ref-list>
</back></article>







