Professional Documents
Culture Documents
decoupler, QF predictor, flexible controller and image enhances the quality of the output for images with intricate
reconstructor. [10] (Fig. 2) degradation patterns. [14]
JPEG is one of the most widely-used image compression
algorithms and formats due to its simplicity and fast
D. RealESRGAN
encoding/decoding speeds. However, it is a lossy compression
algorithm and can introduce annoying artifacts. FBCNN is a RealESRGAN is an advancement of the ESRGAN
flexible (Fig. 3) blind JPEG artifacts removal network for real framework for practical application, specifically targeting the
JPEG image restoration. [11] (Fig. 4) restoration of real-world images. It addresses challenges such
as over-smoothing and the loss of fine details, which were
prevalent in predecessor models. RealESRGAN achieves this
by generating textures that closely resemble those found in the
original high-resolution images. This results in restored images
that retain the naturalness and authenticity of the original
images. (Fig. 5) [16]
for single image super-resolution tasks, and it demonstrates B. Transformers in Image Restoration
superior performance compared to state-of-the-art methods Transformers, on the other hand, use self-attention
from both Transformer and Convolutional networks. [23] mechanisms to weigh the influence of different parts of the
input data. For image restoration, this means they can capture
long-range dependencies and global context more effectively
than CNNs.
Strengths of Transformers:
- Excel at capturing global context and long-range
dependencies within the image.
- Self-attention allows them to focus on relevant parts of the
(a) Source Image (b) 4x Upscale SRFormer
- Degraded Images: These are images that have been [3] PyTorch documentation [Online] Available:
https://pytorch.org/docs/stable/index.html
purposefully downgraded through processes such as blurring, [4] PyTorch Wikipedia [Online] Available:
adding noise, compression artifacts, or downsampling. https://en.wikipedia.org/wiki/PyTorch
[5] What is PyTorch? (Machine/Deep Learning) – Video by IBM Technology
- 11 oct. 2023 [Online] Available:
- High-Quality Images: The dataset must also include the https://www.youtube.com/watch?v=fJ40w_2h8kk
original or enhanced images that the model aims to reproduce. [6] Different types of CNN models [Online] Available:
https://iq.opengenus.org/different-types-of-cnn-models/
[7] Convolutional Neural Network: Tout ce qu’il y a à savoir – 25 June 2023
Step 2: Model Architecture Selection [Online] Available: https://datascientest.com/convolutional-neural-
Selecting the right model architecture is crucial. This choice network
depends on the specific task (denoising, deblurring, super- [8] How to Develop Convolutional Neural Network Models for Time Series
Forecasting by Jason Brownlee on August 28, 2020 [Online] Available:
resolution) and the available computational resources. https://machinelearningmastery.com/how-to-develop-convolutional-
neural-network-models-for-time-series-forecasting/
Step 3: Loss Function Design [9] Convolutional neural network Wikipedia [Online] Available:
https://en.wikipedia.org/wiki/Convolutional_neural_network
The loss function measures the difference between the model's [10] Towards Flexible Blind JPEG Artifacts Removal - Jiaxi Jiang, Kai
output and the ground truth high-quality image. Common loss Zhang, Radu Timofte - 29 September 2021 [Online] Available:
functions include Mean Squared Error (MSE), Structural https://arxiv.org/abs/2109.14573
[11] Towards Flexible Blind JPEG Artifacts Removal (FBCNN, ICCV 2021)
Similarity Index (SSIM), and perceptual loss based on features Github Official Code [Online] Available: https://github.com/jiaxi-
extracted from pretrained networks. jiang/FBCNN
[12] Towards Flexible Blind JPEG Artifacts Removal (FBCNN, ICCV 2021)
for Windows Github Official Code [Online] Available:
Step 4: Model Training https://github.com/bycloudai/FBCNN-Windows
During training, the model learns to restore images by adjusting [13] What is a Generative Adversarial Network? by Thomas Wood [Online]
its parameters to minimize the loss function. This process Available: https://deepai.org/machine-learning-glossary-and-
terms/generative-adversarial-network
involves: [14] ERSGAN: A Beginner's Guide to Harnessing the Power of Advanced AI
Image Enhancement - Zaid Meccai – 24 July 2023 [Online] Available:
- Forward Pass: The model processes the input degraded https://blog.segmind.com/how-to-get-started-with-the-ersgan-ai-model/
[15] Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure
images to generate restored outputs. Synthetic Data Github Official Code [Online] Available:
- Loss Calculation: The loss between the restored output and https://github.com/xinntao/Real-ESRGAN
the high-quality target image is calculated. [16] Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure
Synthetic Data [Online] Available: https://arxiv.org/abs/2107.10833
- Backpropagation: The model's parameters are updated in the [17] What Is the Transformer Architecture and How Does It Work? [Online]
direction that reduces the loss, using optimization algorithms Available: https://datagen.tech/guides/computer-vision/transformer-
like stochastic gradient descent (SGD) or Adam. architecture/
[18] What is a transformer model? [Online] Available:
https://www.ibm.com/topics/transformer-model
Step 5: Model Evaluation [19] The Transformer Model by Stefania Cristina on January 6, 2023 [Online]
After training, the model is evaluated on a separate validation Available: https://machinelearningmastery.com/the-transformer-model/
[20] Transformer (machine learning model) [Online] Available:
set to assess its generalization capabilities. Performance metrics https://en.wikipedia.org/wiki/Transformer_%28machine_learning_model
might include Peak Signal-to-Noise Ratio (PSNR), SSIM, and %29
visual inspection. [21] SRFormer: Permuted Self-Attention for Single Image Super-Resolution
(ICCV2023) [Online] Available: https://github.com/HVision-
NKU/SRFormer
[22] SRFormer: Efficient Yet Powerful Transformer Network for Single Image
VI. CONCLUSION Super Resolution - Armin Mehri, Parichehr Behjati, Dario Carpio, Angel
Domingo Sappa - 27 October 2023 [Online] Available:
In conclusion, the landscape of automatic image restoration https://ieeexplore.ieee.org/document/10298198
[23] SRFormer: Permuted Self-Attention for Single Image Super-Resolution -
and upscaling has witnessed significant advancements with the Yupeng Zhou, Zhen Li, Chun-Le Guo, Song Bai, Ming-Ming Cheng, Qibin
advent of CNN and transformer-based models. The selection of Hou - 17 Mars 2023 [Online] Available https://arxiv.org/abs/2303.09735
an appropriate architecture, whether SRFormer, HAT, FBCNN, [24] Activating More Pixels in Image Super-Resolution Transformer - Xiangyu
Chen, Xintao Wang, Jiantao Zhou, Yu Qiao, Chao Dong – 19 Mars 2023
ERSGAN, or RealESRGAN, depends on the specific image [Online] Available: https://arxiv.org/abs/2205.04437
restoration task. The creation of potent models necessitates [25] HAT: Hybrid Attention Transformer for Image Restoration - Xiangyu
careful architecture choice, rigorous training, and continuous Chen, Xintao Wang, Wenlong Zhang, Xiangtao Kong, Yu Qiao, Jiantao
Zhou, Chao Dong – 11 September 2023 [Online] Available:
refinement, facilitated by frameworks such as PyTorch. Despite https://arxiv.org/abs/2309.05239
their potential, these methods contend with ongoing challenges, [26] HAT Github [Online] Available: https://github.com/XPixelGroup/HAT
driving continuous research to further advance the field of [27] A Survey of Deep Learning Approaches to Image Restoration - Jingwen
Su, Boyan Xu, Hujun Yin - 25 February 2022 [Online] Available:
image restoration. https://research.manchester.ac.uk/en/publications/a-survey-of-deep-
learning-approaches-to-image-restoration
[28] On-Demand Learning for Deep Image Restoration – Ruohan Gao,
KristenGrauman – 2 August 2017 [Online] Available:
REFERENCES https://arxiv.org/pdf/1612.01380
[1] Discover the differences between classical and AI upscaling methods, [29] A Comprehensive Review of Deep Learning-Based Real-World Image
their benefits, and applications in enhancing image quality – 13 May 2023 Restoration - Lujun Zhai, Yonghui Wang, Suxia Cui, Yu Zhou - 6 March
[Online] Available: https://unimatrixz.com/topics/ai-upscaler/upscaling- 2023 [Online] Available: https://ieeexplore.ieee.org/document/10056934
methods/
[2] chaiNNer Github Official Code [Online] Available:
https://github.com/chaiNNer-org/chaiNNer