ASID Project Page

Efficient Attention-Sharing Information Distillation Transformer for Lightweight Single Image Super-Resolution

Karam Park, Jae Woong Soh, and Nam Ik Cho

Accepted for AAAI 2025

Paper and Supplementary Material

Environments

  • Ubuntu 18.04
  • PyTorch 2.2.2
  • CUDA 9.0 & cuDNN 7.1
  • Python 3.10.9

Dependencies:

  • PyTorch > 1.10
  • opencv-python
  • Matplotlib 3.3.4
  • pyyaml
  • tqdm
  • numpy
  • torchvision
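
If the listed packages are not already available, they can typically be installed with pip; the PyPI names below are assumed to match the dependencies above, and the PyTorch build should be chosen to match your CUDA setup:

pip install torch torchvision numpy opencv-python matplotlib pyyaml tqdm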

Acknowledgement

We would like to express our thanks to the authors of Omni-SR for generously releasing their code to the public. Our code is built on Omni-SR. If you encounter any problems using the code, please refer to the Omni-SR Issue Threads first.

Abstract

Transformer-based Super-Resolution (SR) methods have demonstrated superior performance compared to convolutional neural network (CNN)-based SR approaches due to their capability to capture long-range dependencies. However, their high computational complexity necessitates the development of lightweight approaches for practical use. To address this challenge, we propose the Attention-Sharing Information Distillation (ASID) network, a lightweight SR network that integrates attention-sharing and an information distillation structure specifically designed for Transformer-based SR methods. We modify the information distillation scheme, originally designed for efficient CNN operations, to reduce the computational load of stacked self-attention layers, effectively addressing the efficiency bottleneck. Additionally, we introduce attention-sharing across blocks to further minimize the computational cost of self-attention operations. By combining these strategies, ASID achieves competitive performance with existing SR methods while requiring only around 300K parameters - significantly fewer than existing CNN-based and Transformer-based SR models. Furthermore, ASID outperforms state-of-the-art SR methods when the number of parameters is matched, demonstrating its efficiency and effectiveness.

Proposed Method

ASID delivers competitive performance compared to existing lightweight SR methods while utilizing significantly fewer model parameters.

Overall Structure

Experimental Results

Quantitative Results

Visualized Results

More results are available in ./SR_Results.

Guidelines for the Code

The prerequisites listed above should be installed beforehand.

Test

  1. Make sure the location configuration is correct in ./env/env.json (a quick path-check sketch follows this list)

  2. Evaluate the models with the following command

python test.py -v "Model_Name" -t tetser_Matlab -s 0 --test_dataset_name [Dataset]

Model_Name: ASID_XN_DIV2K, ASIDd8_XN_DIV2K (N = 2, 3, 4)
[Dataset]: Set5, Set14, B100, Urban100

Example:

python test.py -v "ASID_X2_DIV2K" -t tetser_Matlab -s 0 --test_dataset_name Set5
  3. Execute ./PSNR_SSIM_Evaluate.m for the PSNR/SSIM report. Make sure the location configuration and scale are correct in the Matlab file.
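
Before launching the tester, it can help to confirm that the paths configured in ./env/env.json actually exist on disk. The snippet below is a minimal sanity-check sketch, not part of the released code: it assumes env.json is a JSON object and simply flags top-level string values whose keys look like paths, so adapt the key matching to the actual fields in your file.

# check_env.py - illustrative sanity check for ./env/env.json; key names vary by setup
import json
from pathlib import Path

with open("./env/env.json", "r") as f:
    env = json.load(f)

# Flag top-level string entries whose key looks like a path; adjust to your env.json schema.
for key, value in env.items():
    if isinstance(value, str) and ("path" in key.lower() or "dir" in key.lower()):
        status = "OK" if Path(value).exists() else "MISSING"
        print(f"{status:8s}{key}: {value}")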

Train

  1. Prepare the training dataset (DIV2K), then set the location configuration in ./env/env.json

  2. Prepare a yaml file containing the training details in ./train_yamls/ (a small inspection sketch follows this list)

  3. Train the models with one of the following commands

[Training from scratch] python train.py -v "Model_Name" -p train --train_yaml "[Training_Setting.yaml]"
[Finetuning] python train.py -v "Model_Name" -p finetune -e [Epoch Number of the Pretrained Model] --train_yaml "[Training_Setting.yaml]"

Example:

python train.py -v "ASID_fromscratch_X2_DIV2K" -p train --train_yaml "train_ASID_X2_DIV2K.yaml"
python train.py -v "ASID_fromscratch_X2_DIV2K" -p finetune -e 500 --train_yaml "train_ASIDfinetune_X2_DIV2K.yaml"
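
Since the training details live entirely in the yaml file, a quick way to double-check them before a long run is to load the file with pyyaml (already in the dependency list) and print its contents. A minimal sketch; the file name is taken from the example above, and no assumption is made about specific key names:

# inspect_train_yaml.py - dump a training yaml before launching train.py (illustrative only)
import yaml  # provided by the pyyaml dependency

with open("./train_yamls/train_ASID_X2_DIV2K.yaml", "r") as f:
    cfg = yaml.safe_load(f)

# Print whatever keys the yaml defines (learning rate, epochs, batch size, etc. depend on the schema).
for key, value in cfg.items():
    print(f"{key}: {value}")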

How to reproduce ASID experimental results from scratch

Our model was trained from scratch using a learning rate of 5e-4 for 500 epochs, and then fine-tuned with a learning rate of 4e-4 for an additional 1000 epochs.
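
Combined with the example commands above, a full reproduction therefore runs in two stages. The yaml file names follow the earlier examples, and the learning rates (5e-4, then 4e-4) and epoch counts (500, then 1000 more) are assumed to be specified inside those yaml files rather than on the command line:

# Stage 1: train from scratch (lr 5e-4 for 500 epochs, set in the yaml)
python train.py -v "ASID_fromscratch_X2_DIV2K" -p train --train_yaml "train_ASID_X2_DIV2K.yaml"

# Stage 2: fine-tune from the epoch-500 checkpoint (lr 4e-4 for 1000 more epochs, set in the yaml)
python train.py -v "ASID_fromscratch_X2_DIV2K" -p finetune -e 500 --train_yaml "train_ASIDfinetune_X2_DIV2K.yaml"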

Citation

@inproceedings{park2025efficient,
  title={Efficient Attention-Sharing Information Distillation Transformer for Lightweight Single Image Super-Resolution},
  author={Park, Karam and Soh, Jae Woong and Cho, Nam Ik},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={},
  number={},
  pages={},
  year={2025}
}
