Documents
Presentation Slides
X-CAUNET: CROSS-COLOR CHANNEL ATTENTION WITH UNDERWATER IMAGE-ENHANCING TRANSFORMER
- DOI:
- 10.60864/6j0s-df93
- Citation Author(s):
- Submitted by:
- Sandipan Sarma
- Last updated:
- 6 June 2024 - 10:27am
- Document Type:
- Presentation Slides
- Document Year:
- 2024
- Event:
- Presenters:
- Sandipan Sarma
- Paper Code:
- IVMSP-L12.5
- Categories:
- Log in to post comments
Underwater image enhancement is essential to mitigate the environment-centric noise in images, such as haziness, color degradation, etc. With most existing works focused on processing an RGB image as a whole, the explicit context that can be mined from each color channel separately goes unaccounted for, ignoring the effects produced by the wavelength of light in underwater conditions. In this work, we propose a framework called X-CAUNET that addresses this
research gap by using cross-attention transformers. The input image is split into three channels (R-G-B), local context is captured using convolutional layers with different receptive field sizes, and a message-passing mechanism allows for context correlation between them. To maintain consistency, another transformer is used on the original image to aggregate global context, and a weighted combination of all the outputs enhances the input degraded image. Extensive experiments demonstrate we achieve state-of-the-art PSNR and SSIM with 2.66% and 2.11% relative gains. Code is available at: https://github.com/Alik033/X-CAUNET.