xpixelgroup / classsr Goto Github PK

View Code? Open in Web Editor NEW

352.0 4.0 42.0 1.37 MB

(CVPR2021) ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic

Python 95.26% MATLAB 4.74%

classsr's Introduction

ClassSR

(CVPR2021) ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic.

Paper

Authors: Xiangtao Kong, Hengyuan Zhao, Yu Qiao, Chao Dong

Abstract

We aim at accelerating super-resolution (SR) networks on large images (2K-8K). The large images are usually decomposed into small sub-images in practical usages. Based on this processing, we found that different image regions have different restoration difficulties and can be processed by networks with different capacities. Intuitively, smooth areas are easier to super-solve than complex textures. To utilize this property, we can adopt appropriate SR networks to process different sub-images after the decomposition. On this basis, we propose a new solution pipeline -- ClassSR that combines classification and SR in a unified framework. In particular, it first uses a Class-Module to classify the sub-images into different classes according to restoration difficulties, then applies an SR-Module to perform SR for different classes. The Class-Module is a conventional classification network, while the SR-Module is a network container that consists of the to-be-accelerated SR network and its simplified versions. We further introduce a new classification method with two losses -- Class-Loss and Average-Loss to produce the classification results. After joint training, a majority of sub-images will pass through smaller networks, thus the computational cost can be significantly reduced. Experiments show that our ClassSR can help most existing methods (e.g., FSRCNN, CARN, SRResNet, RCAN) save up to 50% FLOPs on DIV8K datasets. This general framework can also be applied in other low-level vision tasks.

Dependencies

Python >= 3.6 (Recommend to use Anaconda)
PyTorch >= 1.5.0
NVIDIA GPU + CUDA
Python packages: pip install numpy opencv-python lmdb
[option] Python packages: pip install tensorboardX, for visualizing curves.

Codes

Our codes version based on BasicSR.

How to test a single branch

Clone this github repo.

git clone https://github.com/Xiangtaokong/ClassSR.git
cd ClassSR

Download the testing datasets (DIV2K_valid).
Download the divide_val.log and move it to .codes/data_scripts/.
Generate simple, medium, hard (class1, class2, class3) validation data.

cd codes/data_scripts
python extract_subimages_test.py
python divide_subimages_test.py

Download pretrained models and move them to ./experiments/pretrained_models/ folder.
Run testing for a single branch.

cd codes
python test.py -opt options/test/test_FSRCNN.yml
python test.py -opt options/test/test_CARN.yml
python test.py -opt options/test/test_SRResNet.yml
python test.py -opt options/test/test_RCAN.yml

The output results will be sorted in ./results.

How to test ClassSR

Clone this github repo.

git clone https://github.com/Xiangtaokong/ClassSR.git
cd ClassSR

Download the testing datasets (Test2K, 4K, 8K) Google Drive or Baidu Drive (Password: 7dw1) .
You can also download the source data DIV8K (Using this link http://data.vision.ee.ethz.ch/timofter/AIM19ExtremeSR/trainHR_001to200.zip and change the "001to200" to "201to400" "401to600" …………"1401to1500" for all the data.). Test8K contains the images (index 1401-1500) from DIV8K. Test2K/4K contain the images (index 1201-1300/1301-1400) from DIV8K which are downsampled to 2K and 4K resolution. (In this way, you need register for the competition (Ntire 2020 was held on 2020, but we can register now), then you can download DIV8K dataset.)
Download pretrained models and move them to ./experiments/pretrained_models/ folder.
Run testing for ClassSR.

cd codes
python test_ClassSR.py -opt options/test/test_ClassSR_FSRCNN.yml
python test_ClassSR.py -opt options/test/test_ClassSR_CARN.yml
python test_ClassSR.py -opt options/test/test_ClassSR_SRResNet.yml
python test_ClassSR.py -opt options/test/test_ClassSR_RCAN.yml

The output results will be sorted in ./results.

How to train a single branch

Clone this github repo.

git clone https://github.com/Xiangtaokong/ClassSR.git
cd ClassSR

Download the training datasets(DIV2K) and validation dataset(Set5).
Download the divide_train.log and move it to .codes/data_scripts/.
Generate simple, medium, hard (class1, class2, class3) training data.

cd codes/data_scripts
python data_augmentation.py
python generate_mod_LR_bic.py
python extract_subimages_train.py
python divide_subimages_train.py

Run training for a single branch (default branch1, the simplest branch).

cd codes
python train.py -opt options/train/train_FSRCNN.yml
python train.py -opt options/train/train_CARN.yml
python train.py -opt options/train/train_SRResNet.yml
python train.py -opt options/train/train_RCAN.yml

The experiments will be sorted in ./experiments.

How to train ClassSR

Clone this github repo.

git clone https://github.com/Xiangtaokong/ClassSR.git
cd ClassSR

Download the training datasets (DIV2K) and validation dataset(DIV2K_valid, index 801-810).
Generate training data (the all data(1.59M) in paper).

cd codes/data_scripts
python data_augmentation.py
python generate_mod_LR_bic.py
python extract_subimages_train.py

Download pretrained models(pretrained branches) and move them to ./experiments/pretrained_models/ folder.
Run training for ClassSR.

cd codes
python train_ClassSR.py -opt options/train/train_ClassSR_FSRCNN.yml
python train_ClassSR.py -opt options/train/train_ClassSR_CARN.yml
python train_ClassSR.py -opt options/train/train_ClassSR_SRResNet.yml
python train_ClassSR.py -opt options/train/train_ClassSR_RCAN.yml

The experiments will be sorted in ./experiments.

How to generate demo images

Generate demo images like this one:

Change the 'add_mask: False' to True in test_ClassSR_xxx.yml and run testing for ClassSR.

Citation

@InProceedings{Kong_2021_CVPR,
    author    = {Kong, Xiangtao and Zhao, Hengyuan and Qiao, Yu and Dong, Chao},
    title     = {ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {12016-12025}
}

Contact

Email: [email protected]

classsr's People

Contributors

Stargazers

Watchers

classsr's Issues

which directory to place the training image and the test imag

hello, where is the training data placed

iteration over a 0-d tensor

Traceback (most recent call last):
File "train_ClassSR.py", line 347, in
main()
File "train_ClassSR.py", line 277, in main
model.test()
File "/home/dsm/ClassSR-main/codes/models/ClassSR_model.py", line 199, in test
self.num_res = self.print_res(type_res)
File "/home/dsm/ClassSR-main/codes/models/ClassSR_model.py", line 352, in print_res
for i in torch.max(type_res, 1)[1].data.squeeze():
File "/home/dsm/anaconda3/envs/nsw2/lib/python3.7/site-packages/torch/tensor.py", line 451, in iter
raise TypeError('iteration over a 0-d tensor')
TypeError: iteration over a 0-d tensor
While running the training code in classSR it occurs an error on this file train_ClassSR.py in line277.How could i solve this .Thanks in advance

data processing for DIV8K

Hello, I have downloaded the DIV8K, I want to know how to generate the datasets (includes test2k(LR/HR), test4k(LR/HR), and test8k(LR/HR))?

作者您好，怎么使用我自己的数据集生成divided_train.log?

作者您好，我想用人脸的图像来试试您的这个方法的效果。所以我想知道如何如何用我自己的人脸数据集生成divide_train.log，谢谢啦~

when do you release the paper?

Really interested in your work, really looking forward to see the paper

如何取消裁剪GT？

作者您好，我用您的模型跑了一些测试代码，但生成的图都是裁剪过的，可以取消裁剪GT图吗？（我改了crop_border为none也还是会裁剪掉）

学习率问题

作者您好，这是一篇非常不错的文章。我有一个问题想问一下，训练的第三阶段（联合训练阶段）的学习率和前两阶段保持一致吗，都是1e-3吗？

How to get the trained weights ?

Thank for the great work.

I have trained on a custom dataset by using ClassSR repo. But not able to find the trained weights after each epochs . Should i wait to complete the entire epochs to get the trained weights ?

怎样运行？

请问一下论文里提到的baseline的结果是如何获得的？

请问一下如何复现论文中的baseline结果
FSRCNN-O论文里提到baseline是25.61dB。然后我试了两种方式来复现：（ 1）用你的训练集训自己FSRCNN然后再跑test2k（与classsr的区别是直接对整张图超分，没有分解和组合图片的操作）；（2）用你提供的预训练的classsr模型，然后测试时候强制都分到最复杂的超分分支上。
这两种结果获得的PSNR都比论文里提到的FSRCNN-O要高，方式（1）高0.4，方式（2）高0.2。

（我另外试了其他几个backbone，复现Original module的时候都有挺大差异的，因此想了解一下论文中的baseline是如何获得的 :)

DIV2K数据集问题

您好，DIV2K下载页面有多个数据集，请问你用的是哪个呢？

divide_train.log 的问题

你好，请问一下：我想训练一个新的训练集，那么划分训练集所需的divide_train.log要如何生成？

How to download DIV 8K dataset?

class_loss=0, average_loss=1

你好，请问一下为什么我将三个分支的预训练模型修改为自己的模型，然后没有切分子图，在训练分类器的时候class_loss=0, average_loss=1？

tensor问题

大佬您好，我在训练RCAN的一个branch的时候经常会在八倍上出现tensor上的问题，但是其他的都能正常的跑，改了很久都没有改出来，不知道您是否方便分享一下八倍的RCAN的三个分支的预训练模型，很抱歉麻烦您！

Some questions about class module and train

Thanks, it is a interesting work. I have some questions to ask. Looking forward to your answer.
(1) You propose the Average-Loss and Class-Loss to train class module, It seems the class module will be trained to output the class result simple:medium:hard with 1:1:1. But in your paper, it is not.
(2) After trained the Class Module independently, why train Class & SR Module jointly? I think maybe the pre-trained SR Module has been the best?
(3) how many 2080Ti GPUS need to train(and total memory)? About how long it takes to train?

How to create a divide_rain.log folder？

Hello, I have now generated sub-images. I want to use the generated sub-images to train my models of different branches. How can I create a divide_train.log file suitable for my model?

Perhaps the CARN model is incorrectly implemented?

https://github.com/Xiangtaokong/ClassSR/blob/0ebefe75b2d5d08cffd8613c78d6dbb323dbd2c5/codes/models/archs/arch_util.py#L103-L114
I face train problems when train class_carn, finally I located the problem in above code.
In CARN official repo, I found there are no bn layer, So may be the CARN model is incorrectly implemented?

https://github.com/nmhkahn/CARN-pytorch/blob/master/carn/model/ops.py#L33-L48

Scale 2 not working

Hi, first of all thanks for your work!
I have question about custom data. If I have HR images with resolution 512384 and lowres with 256192, and I want to convert LR->HR, what parameters should I change change in test_ClassSR_RCAN.yml for example apart from scale ( if I change only scale I've got this error:


sr_img[i*step*self.scale:i*step*self.scale+patch_size*self.scale,j*step*self.scale:j*step*self.scale+patch_size*self.scale,:]+=sr_list[index]
ValueError: operands could not be broadcast together with shapes (64,64,3) (128,128,3) (64,64,3)

数据集放哪

下载的test2k数据集应该放在什么目录下面

关于多倍率的问题

请问可以使用这个网络进行除了x4以外其他倍率的SR操作吗？如果可以，大致需要做怎样的修改呢；如果不行，需要做什么别的工作呢？

训练过程

您好，Class-Module我已经训练了两万多个Iteration了，但是验证的时候所有的图片还是都只进入一个分支，并没有平均进入三个分支，请问这样可能是什么原因造成的呢，（initial learning rate设置为10-3，the weights w1, w2, w3 are set to 2000,1, 6）

'numpy.ndarray' object has no attribute 'float'*

The following problems occurred when testing with Test8K
The output is all tensors, except for the last one, numpy, which feels like a problem with 'rlt', how can we solve it, please?
error:

Testing [test8k]...
<models.ClassSR_model.ClassSR_Model object at 0x7fc340af9470>
Traceback (most recent call last):
File "/root/PycharmProjects/ClassSR-main/codes/test.py", line 70, in
sr_img = util.tensor2img(visuals['rlt']) # uint8 question
File "/root/PycharmProjects/ClassSR-main/codes/utils/util.py", line 137, in tensor2img
*tensor = tensor.squeeze().float().cpu().clamp_(min_max) # clamp
AttributeError: 'numpy.ndarray' object has no attribute 'float'

这个训练所有的模型需要多久

关于推理速度

作者你好，首先感谢带来非常优秀的工作！
该项工作在理论计算量（flops）上有着非常明显的加速，但是在测试中发现，切块，判断，多分支推理的pipline实际消耗的推理时间似乎并没有优势，甚至更耗时。

切块用for循环效率最低
尝试过reshape图像[n,1,32,32]进GPU，送入网络直接获得全部分类序列，分组推理再合并，但实际耗时依旧达不到单分支的速度。

不知道作者是否有相关的优化经验可以交流下，万分感谢！

the training process seems to be stuck

Hi, @Xiangtaokong ,
Thanks very much for your interesting work.
Currently I want to reproduce your work. I follow the readme and finish the data processing. while during run training command "python train_ClassSR.py -opt options/train/train_ClassSR_FSRCNN.yml", it seems to be stuck. Can you give me some suggestions?

Problems encountered when I test and train ClassSR: Bad Results and Lack of LMDB file

Recently, I am impressed by your brilliant and impressive idea and work in this paper:" ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic". It groundbreakingly accelerates SR models by classing sub-images, which attracts me a lot.

Therefore, I download the released code on Github, follow the guidance in README.md, and test it. However, the results are weird. Both PSNR and SSIM are acceptable in branches testing, nevertheless, these indexes drop to ~9dB when it comes to the whole model, which confused me a lot.

And when I try to train the model, it shows that there is a lack of LMDB file to handle validating set "Set5" during the training procedure. 2 affixed pictures demonstrate the situation. I also try to read through the above paper several times, but little things are related to this problem.

So in a word, there are 2 problems I need your valuable help with:

Is there something wrong when I test the model?
How can I tackle LMDB issues?

Thank you for your reading and priceless help!

{"mode":"full","isActive":false}

Questions about combining sub-images

Thank you for your amazing work!
I've got a little question about the way you combine the output sub-images. Due to the ill-posed nature, simply concat sub-images may cause unnatural effects. I noticed in your code that you have done the following transformation sr_img[:,j*step*self.scale:j*step*self.scale+(patch_size-step)*self.scale,:]/=2. What considerations led you to carry out this operation and is there any basis for it?

模型参数量

您好，请问模型参数量Parameters以及FLOPs是用什么方法测的啊
我用profile函数测得好像跟论文不一样

数据集问题

作者你好，我注意到你的损失并没有使用交叉熵等分类常用的损失，而是平均整体概率以及放大某一概率。
那么文章开始用网络对数据划分的作用体现在哪里呢？
训练中是否强化了分类网络将某一未知类别的数据归化到它已知的标签类别呢？

计算FLOPs相关

您好！我有两个问题需要请教：

请问代码里为什么需要除以sum(num_ress) ?
为什么对FRCNN网络的flops还需要加上7.4 ?
期待您的解答，谢谢！！

def cal_FLOPs(which_model,num_ress):
    if which_model =='classSR_3class_fsrcnn_net':
        flops = ((141 * num_ress[0] + 304 * num_ress[1] + 468 * num_ress[2]) / sum(num_ress))+7.4
        percent=flops/468
    elif which_model=='classSR_3class_carn':
        flops = (0.38 * num_ress[0] + 0.77 * num_ress[1] + 1.15 * num_ress[2]) / sum(num_ress)
        percent = flops / 1.15
    elif which_model=='classSR_3class_srresnet':
        flops = (1.66 * num_ress[0] + 3.44 * num_ress[1] + 5.20 * num_ress[2]) / sum(num_ress)
        percent = flops / 5.20
    elif which_model=='classSR_3class_rcan':
        flops = (10.33 * num_ress[0] + 19.90 * num_ress[1] + 32.60 * num_ress[2]) / sum(num_ress)
        percent = flops / 32.60
    return flops,percent

当运行extract_subimages_test.py出现错误

E:\Anaconda\envs\torch11\python.exe D:/迅雷下载/ClassSR-main/codes/data_scripts/extract_subimages_test.py
process GT...
mkdir [D:/迅雷下载/ClassSR-main/data0/xtkong/data/DIV2K_valid_HR_sub/GT] ...
[ ] 0/100, elapsed: 0s, ETA:
Start...
All subprocesses done.
process LR...
mkdir [D:/迅雷下载/ClassSR-main/data0/xtkong/data/DIV2K_valid_HR_sub/LR] ...
[ ] 0/100, elapsed: 0s, ETA:
Start...
All subprocesses done.
Traceback (most recent call last):
File "D:/迅雷下载/ClassSR-main/codes/data_scripts/extract_subimages_test.py", line 143, in
main()
File "D:/迅雷下载/ClassSR-main/codes/data_scripts/extract_subimages_test.py", line 66, in main
assert len(data_util._get_paths_from_images(save_GT_folder)) == len(
File "D:\迅雷下载\ClassSR-main\codes\data\util.py", line 31, in _get_paths_from_images
assert images, '{:s} has no valid image file'.format(path)
AssertionError: D:/迅雷下载/ClassSR-main/data0/xtkong/data/DIV2K_valid_HR_sub/GT has no valid image file

loss权重的相关问题

 您好，我对您的工作非常感兴趣。
 我注意到您的模型在训练分类器的时候设置的loss权重 l1 : class : avg 是 2000:1:6 。我想请问这部分是怎么设置的。其次，我想请教一下在训练过程中，添加权重后的各个损失的比例要接近1:1:1还是在l1的比重更大呢？
 期待您的回复，感谢！

About Other Models

First of all, thank you for your excellent work!
Secondly I have some questions that I don't quite understand and would like to ask.

Is your proposed data feature accelerated super-resolution network applicable to any super-resolution network, because I want the framework to reduce the training computation of other networks, how can I do it?
If I use the framework to reduce the training computation of other networks do I need to train a pre-trained model first and how to train it?
I saw that the network did some data preprocessing work before training, can you explain the role of these work, why do I need to downsample the images to 0.6, 0.7, 0.8, 0.9?
I see that the classification model for simple, medium and complex classification is chosen by the criterion of psnr, what images are selected to calculate the psnr?
When testing, I see you say that some datasets are too small for crop testing, how many times the minimum dataset can be tested?

I look forward to your reply, thank you!

关于生成训练集

请问一下，论文里“Then we densely crop 1.59M sub-images with size 32 × 32 from LR images. These sub-images are
equally divided into three classes (0.53M for each) according to their PSNR values through MSRResNet. ”
这里的1.59M户和0.53M 指的是文件大小还是其他？
20-04-14 00:47:54.197 - INFO: 0558_s290 - PSNR: 26.473503 dB; SSIM: 0.751480; PSNR_Y: 27.808832 dB; SSIM_Y: 0.769544.
20-04-14 00:47:54.248 - INFO: 0558_s291 - PSNR: 26.580721 dB; SSIM: 0.764139; PSNR_Y: 27.929448 dB; SSIM_Y: 0.785679.
20-04-14 00:47:54.300 - INFO: 0558_s292 - PSNR: 25.955706 dB; SSIM: 0.801685; PSNR_Y: 27.237049 dB; SSIM_Y: 0.822988.
divide_train.log 里面scale=1的图名字直接是0558_s290，0558_s291。。。。但是extract_subimages_train.py生成的文件夹DIV2K_scale_sub里面的图名字是1_0558_s290，所以在运行divide_sub_images_train.py时会报错
论文里说“To prepare the training data, we first downsample3 the original images with scaling factors 0.6, 0.7, 0.8, 0.9 to generate the HR images.” 这个意思时scale=1的图不参与训练吗？

the output resolution differs from the GT

hello, I tested 100 images from DIV2k(index from 800--900), use ClassSR-FSRCNN pretrained model, it seems that the output HR images(1232x1048, 0801_x4.png)'s revolution differs from the GT(2040x1356, 0801.png) . so is there something with me or the code ?

验证集

请问验证集set5需要做什么处理吗

生成图片怎么利用gpu呢？

大佬您好！请问生成图像的时候，怎么把gpu利用起来呀，难道是我只有一块gpu没有设置好吗？

1 请问为什么要对原图下采样[1, 0.9, 0.8, 0.7, 0.6]倍

作者您好，我看了论文中提到先对原始数据下采样 0.9, 0.8, 0.7, 0.6，作为HR，再对这些数据下采样4倍得到LR，请问这样做是出于什么原因呢？

little typo in extract_subimgs_train.py

Thanks for your great work.

line 45 and line 47, might be a valid check for height and width?

about Test2K

请问一下是通过将DIV8K进行×4 downsampling操作得到Test 2K吗

GPU-Util about 10%

Is this normal？

divide_train.log

我想使用别的数据集进行训练，想问一下该文件里面的PSNR、SSIM、PSNR_Y、SSIM_Y怎么获取？

data_augmentation.py

data_augmentation.py has some problem.
such as (_get_paths_from_images) (imresize)

超分后图像尺寸问题

您好，按照您的测试流程测试test2K文件夹下的图片，x4超分，超分后的图像尺寸与HR尺寸不一致，如1299.png：

请问这种情况是什么原因呢？既然尺寸不一致，psnr指标如何计算的呢？

Training speed

NICE WORK！
I train classSR structure to process the denoising task. the branchs adopt small network, which calculation cost is only about 5 to 16 GMAC.
There are two problems that bother me：
First, the training speed is too slow. I use two 3090 cards to train models and it takes 12 minutes to run 100iter. However, when I increased to four 3090 cards, the speed did not increase, and the utilization rate of each GPU was very low. Could you please give me some suggestions？
Second, because the training is too slow, I used .pth at 30000iter for testing, and the classification is uneven. The image results have a strong sense of demarcation, which is mainly because different patches take different branches.Will this phenomenon be improved in the next training？

            if rank <= 0:
                logger.info('Number of val images in [{:s}]: {:d}'.format(
                    dataset_opt['name'], len(val_set)))

But not check in here.
train.py#L215

                # tensorboard logger
                if opt['use_tb_logger'] and 'debug' not in opt['name']:
                    tb_logger.add_scalar('psnr', avg_psnr, current_step)

It should change to

                # tensorboard logger
                if rank <= 0 and opt['use_tb_logger'] and 'debug' not in opt['name']:
                    tb_logger.add_scalar('psnr', avg_psnr, current_step)

the train_ClassSR has same bug.