[ECCV 2024] Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration


🔥 Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration (ECCV 2024)

This is the official PyTorch code for the paper.

Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration
Chujie Qin, Ruiqi Wu, Zikun Liu, Xin Lin, Chunle Guo, Hyun Hee Park, Chongyi Li* ( * indicates corresponding author)
European Conference on Computer Vision (ECCV), 2024

[Figure: RAM framework overview]

🚀 Highlights:

  • RAM is a blind all-in-one image restoration framework that handles 7 restoration tasks simultaneously and achieves SOTA performance.
  • RAM focuses on extracting image priors, rather than degradation priors, from diverse corrupted images by leveraging Mask Image Modeling (MIM).

📰 News

  • Oct 20, 2024: Released pretrained weights on Google Drive.
  • Oct 3, 2024: Released the code for our paper.

🔧 Dependencies and Installation

  1. Clone and enter our repository:
    git clone https://github.com/Dragonisss/RAM.git RAM
    cd RAM
  2. Run install.sh to set up the environment:
    source install.sh
  3. Activate the environment each time before testing:
    conda activate RAM
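
After activating the environment, a quick sanity check can confirm that PyTorch is importable and whether a GPU is visible. The sketch below is not part of the repository: the helper name `check_environment` is hypothetical, and it assumes only that `install.sh` installs the `torch` package.

```python
import importlib.util


def check_environment(package: str = "torch") -> dict:
    """Report whether `package` is installed and, for torch, whether CUDA is usable."""
    report = {"package": package, "installed": False, "cuda_available": None}
    if importlib.util.find_spec(package) is None:
        # Package missing: the conda environment is probably not activated.
        return report
    report["installed"] = True
    if package == "torch":
        import torch  # imported lazily so the check works even without torch

        report["cuda_available"] = torch.cuda.is_available()
        report["version"] = torch.__version__
    return report
```

Calling `check_environment()` inside the activated environment should report `installed: True`; if `cuda_available` is `False`, training would fall back to CPU or fail, depending on the configuration.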

✨ Datasets and Pretrained Models

Given the number of datasets involved, we plan to offer a unified download link in the future to make it easier to access all datasets.

We combine datasets from various restoration tasks to form the training set. Here are the relevant links for all the datasets used:

| Dataset | Phase | Source | Task |
|---|---|---|---|
| OTS_ALPHA | Train | [Baidu Cloud(f1zz)] | Dehaze |
| Rain-13k | Train & Test | [Google Drive] | Derain |
| LOL-v2 | Train & Test | [Real Subset Baidu Cloud(65ay)] / [Synthetic Subset Baidu Cloud(b14u)] | Low Light Enhancement |
| GoPro | Train & Test | [Download] | Motion Deblur |
| LSDIR | Train & Test | [HomePage] | Denoise, DeJPEG, DeBlur |
| SOTS | Test | [Download] | Dehaze |
| CBSD68 | Test | [Download] | Denoise |

Collect the required datasets listed above and place them under the `./datasets` directory.

Symbolic links are the recommended approach, since they let you store the datasets anywhere you prefer.
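
As an illustration of the symbolic-link approach, the following sketch links an existing dataset directory into `./datasets`. The helper `link_dataset` and the example source path are hypothetical and not part of the repository.

```python
import os
from pathlib import Path


def link_dataset(source: str, datasets_root: str, name: str) -> Path:
    """Create datasets_root/name as a symlink to an existing dataset directory."""
    src = Path(source).expanduser().resolve()
    if not src.is_dir():
        raise FileNotFoundError(f"dataset directory not found: {src}")
    root = Path(datasets_root)
    root.mkdir(parents=True, exist_ok=True)
    link = root / name
    if link.is_symlink() or link.exists():
        return link  # leave any existing entry untouched
    os.symlink(src, link, target_is_directory=True)
    return link
```

For example, `link_dataset("/data/gopro", "./datasets", "gopro")` would expose a GoPro copy stored at the (assumed) path `/data/gopro` as `./datasets/gopro`.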

The final directory structure will be arranged as:

datasets
    |- CBSD68
        |- CBSD68
          |- noisy5
          |- noisy10
          |- ...
    |- gopro
        |- test
        |- train
    |- LOL-v2
        |- Real_captured
        |- Synthetic
    |- LSDIR
        |- 0001000
        |- 0002000
        |- ...
    |- OTS_ALPHA
        |- clear
        |- depth
        |- haze
    |- LSDIR-val
        |- 0000001.png
        |- 0000002.png
        |- ...
    |- rain13k
        |- test
        |- train
    |- SOTS
        |- outdoor
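
Before training, it may help to confirm that the layout above is in place. The sketch below checks only the directories named explicitly in the tree (entries shown as `...` are skipped); `EXPECTED_LAYOUT` and `missing_dataset_dirs` are hypothetical names, not repository code.

```python
from pathlib import Path

# Sub-directories copied from the tree above; "..." entries are not checked.
EXPECTED_LAYOUT = {
    "CBSD68": ["CBSD68"],
    "gopro": ["test", "train"],
    "LOL-v2": ["Real_captured", "Synthetic"],
    "LSDIR": [],
    "OTS_ALPHA": ["clear", "depth", "haze"],
    "LSDIR-val": [],
    "rain13k": ["test", "train"],
    "SOTS": ["outdoor"],
}


def missing_dataset_dirs(root: str = "./datasets") -> list:
    """Return the expected directories that are absent under `root`."""
    base = Path(root)
    missing = []
    for dataset, subdirs in EXPECTED_LAYOUT.items():
        for rel in [dataset] + [f"{dataset}/{sub}" for sub in subdirs]:
            # is_dir() follows symlinks, so linked datasets count as present.
            if not (base / rel).is_dir():
                missing.append(rel)
    return missing
```

An empty list from `missing_dataset_dirs()` means every directory named in the tree was found.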

Our pipeline can be applied to any image restoration network. We provide the pre-trained and fine-tuned model files for SwinIR and PromptIR mentioned in the paper:

| Method | Phase | Framework | Download Links | Config File |
|---|---|---|---|---|
| RAM | Pretrain | SwinIR | [GoogleDrive] | [options/RAM_SwinIR/ram_swinir_pretrain.yaml] |
| RAM | Finetune | SwinIR | [GoogleDrive] | [options/RAM_SwinIR/ram_swinir_finetune.yaml] |
| RAM | Pretrain | PromptIR | [GoogleDrive] | [options/RAM_PromptIR/ram_promptir_pretrain.yaml] |
| RAM | Finetune | PromptIR | [GoogleDrive] | [options/RAM_PromptIR/ram_promptir_finetune.yaml] |

📷 Quick Demo

We provide a script for running inference on your own images in inference/inference.py.
Run python inference/inference.py --help for details on its options.

🤖 Training RAM From Scratch!

Before proceeding, please ensure that the relevant datasets have been prepared as required.

1. Pretraining with MIM

We use the collected datasets for model training. First, run the following command:

python -m torch.distributed.launch \
--nproc_per_node=[num of gpus] \
--master_port=[PORT] ram/train.py \
-opt [OPT] \
--launcher pytorch

# e.g.
python -m torch.distributed.launch \
--nproc_per_node=8 \
--master_port=4321 ram/train.py \
-opt options/RAM_SwinIR/ram_swinir_pretrain.yaml \
--launcher pytorch
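
When launching from a Python script (for example, to sweep over several configs), the same command can be assembled as an argument list. This is a minimal sketch: `build_launch_command` is a hypothetical helper, and it only reproduces the flags shown in the command above.

```python
import sys


def build_launch_command(opt, nproc_per_node=1, master_port=4321, script="ram/train.py"):
    """Assemble the torch.distributed.launch command shown above as an argument list."""
    if nproc_per_node < 1:
        raise ValueError("nproc_per_node must be at least 1")
    return [
        sys.executable, "-m", "torch.distributed.launch",
        f"--nproc_per_node={nproc_per_node}",
        f"--master_port={master_port}",
        script,
        "-opt", opt,
        "--launcher", "pytorch",
    ]
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` from the repository root, which avoids shell quoting issues.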

2. Mask Attribute Conductance Analysis

We use the proposed Mask Attribute Conductance (MAC) analysis to estimate the importance of different layers for finetuning. Run the following command to conduct the MAC analysis:

python scripts/mac_analysis.py -opt [OPT] --launcher pytorch

# e.g.
python scripts/mac_analysis.py \
-opt options/RAM_SwinIR/ram_swinir_mac.yml --launcher pytorch

For convenience, we have provided the analysis results of the two models, RAM-SwinIR and RAM-PromptIR, mentioned in the paper. You can find them in ./mac_analysis_result/

3. Finetuning

python -m torch.distributed.launch --nproc_per_node=[num of gpus] --master_port=4321 ram/train.py \
-opt [OPT] --launcher pytorch

You can also prefix the command with CUDA_VISIBLE_DEVICES= (for example, CUDA_VISIBLE_DEVICES=0,1) to choose which GPUs to use.
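
GPU selection relies on the standard CUDA environment variable `CUDA_VISIBLE_DEVICES`, which takes a comma-separated list of device indices. When the launch is driven from Python, the variable can be set on a copy of the environment, as sketched below. The helper `gpu_env` is hypothetical and not part of the repository.

```python
import os


def gpu_env(gpu_ids, base_env=None):
    """Return a copy of the environment that exposes only the given GPU indices."""
    env = dict(os.environ if base_env is None else base_env)
    # CUDA reads this variable as a comma-separated list of device indices.
    env["CUDA_VISIBLE_DEVICES"] = ",".join(str(i) for i in gpu_ids)
    return env
```

Passing `env=gpu_env([0, 1])` to `subprocess.run` restricts the launched processes to GPUs 0 and 1; the `--nproc_per_node` value should then match the number of GPUs listed.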

📈 Evaluation

We have provided a script for fast evaluation:

python -m torch.distributed.launch \
--nproc_per_node=1 \
--master_port=[PORT] ram/test.py \
-opt [OPT] --launcher pytorch

To benchmark the performance of RAM on the test dataset, you can run the following command:

# RAM-SwinIR
python -m torch.distributed.launch \
--nproc_per_node=1 \
--master_port=4321 ram/test.py \
-opt options/test/ram_swinir_benchmark.yml --launcher pytorch

# RAM-PromptIR
python -m torch.distributed.launch \
--nproc_per_node=1 \
--master_port=4321 ram/test.py \
-opt options/test/ram_promptir_benchmark.yml --launcher pytorch

📖 Citation

If you find our repo useful for your research, please consider citing our paper:

@misc{qin2024restoremasksleveragingmask,
      title={Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration}, 
      author={Chu-Jie Qin and Rui-Qi Wu and Zikun Liu and Xin Lin and Chun-Le Guo and Hyun Hee Park and Chongyi Li},
      year={2024},
      eprint={2409.19403},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2409.19403}, 
}

📮 Contact

For technical questions, please contact chujie.qin[AT]mail.nankai.edu.cn
