clovaai / cutblur Goto Github PK

View Code? Open in Web Editor NEW

374.0 374.0 62.0 13.36 MB

Rethinking Data Augmentation for Image Super-resolution (CVPR 2020)

License: MIT License

Python 2.07% Jupyter Notebook 97.93%

cutblur's People

Contributors

Stargazers

Watchers

cutblur's Issues

There are serious problems in training when changing dataset

I was reproducing cutblur in DIV2K dataset successfully and the cutblur has indeed brought considerable improvement. However, when I changed the face dataset, there was a serious error in the training, and the PSNR value was very low. I can't find the reason for that temporarily. I hope the author can give me some enlightenment.

By the way, this is a awesome project!

Thanks in advance.

about the size of input、output and HR in the demo

Hi, thank you for your job!
In the demo , the size of input、output and HR is the same?

ESRGAN model with Cutblur

Hi,
your work is excellent.
I noticed that your paper mentions the GAN model, especially in ESRGAN. However, I didn't find the ESRGAN model in the codes, would you provide the code for ESRGAN model?
Thank you

Why is X2 scale pretraining necessary for DIV2K

Hi I was trying to reproduce cutblur and failed because I didn't use X2 scale pretraining. Then I noticed that you mentioned in the README that "To achieve the result in the paper, X2 scale pretraining is necessary".

I'm a bit curious about have you found out why is this necessary?

Thanks in advance.

How to use cutblur on video super resolution?

Thanks for your awesome work!
One question: How to use cutblur on video super resolution in which the input is sequential images?

Training Problem

Thanks for your sharing,
I met a problem when I ran the training code.
My comment is like the following python main.py --model EDSR --augs cutblur --dataset DIV2K_SR --div2k_range 1-800/801-810 --scale 2 --dataset_root ./dataset/DIV2K.

Thanks again

Is the ground truth image modified in cutblur

According to the code it seems that the input is cutblurred and the gt is untouched. However in Eq.(1) in the paper, it seems that the gt is also modified.

How to use cutblur while in classifying images in training loop ?

Here's how I write the training loop ...

`
def train_loop_fn(data_loader, model, optimizer, device, scheduler):
running_loss = 0.0
model.train()

for inputs,labels in data_loader:
    inputs = inputs.to(device, dtype=torch.float)
    labels = labels.to(device, dtype=torch.float)

    optimizer.zero_grad()

    outputs = model(inputs)
    loss = loss_fn(outputs, labels)

    loss.backward()
    optimizer_step(optimizer)

    running_loss += loss.item() * inputs.size(0)

train_loss = running_loss / float(len(train_dataset))
scheduler.step(train_loss)

print('training Loss: {:.4f}'.format(train_loss))

Please tell me how to use cutblur in this loop ?

No Output and It says 0it [00:00, ?it/s]

When I try to run inference.py for an image in Set14, it doesn't give any output and it says 0it [00:00, ?it/s].

Can you please help figure out what am I doing wrong? Thank you!

How did you use Cutblur in the GAN setting?

Hi,

It's not clear in the paper and it's not in the repository. Do you cutblur augment the images in the G phase? D phase? both? Something else?

Thank you!

about cutblur function

hi，I am confused the cutblur function
cut_ratio = np.random.randn() * 0.01 + alpha
why not
cut_ratio = np.random.rand()

Blend & RGB channel permutation seems cause PSNR metric drop

Blend & channel permutation seems to cause PSNR metric drop.

no improvement, but rather a decline

Hi!
I ran cutblur and the results did not improve but decreased. The baseline is EDSR, and i use the alpha=0.7.

question about the input shape

Hi, thanks very much for your solid work. I have a question about the training input patch size for single image superresolution. I just find that many works just use training patch size=96x96 for scale=2x SISR. However, many deeper networks (RCAN) have a larger Receptive Field. I wonder whether training patch size=96x96 for scale=2x is the best choice?

image size

when I only use the cutblur data augmentation to retrain the model ,augmens.py in cutblut show ValueError with the rum command : python main.py --model EDSR --augs cutblur --dataset DIV2K_SR --div2k_range 1-800/801-810 --scale 4 --pretrain ./pt/297.pt --dataset_root ./data/DIV2K

what is the value of alpha you use in your paper?

Hi,

Thanks for your code and idea, quite interesting.
I decide to try to apply on some of the models and test the effectiveness.

May I know what is the value for alpha you used in your paper, for the cutblur function?
also, did you verify how different alpha value will affect the effectiveness of cutblur?

Thanks for your help

What is the size of the image block you use at the x2, x3, and x4 scales?

What is the size of the image block you use at the x2, x3, and x4 scales?
Are they all 48x48?
Thank you!

Train using different dataset

How do we use this if I have to train using a different dataset?

The cutout function in augments.py

When doing im2 *= fim2， but shall we do the same in im1 *= fim1？

Patchsize 24

I am trying to learn to do the training with another dataset. Before that I tried repeating div2k training with patch-size of 24. After training, I used the checkpoint file for testing and the error was as follows:

RuntimeError: Error(s) in loading state_dict for Net:
Missing key(s) in state_dict: "tail.0.2.weight", "tail.0.2.bias".
size mismatch for head.1.weight: copying a param with shape torch.Size([256, 12, 3, 3]) from checkpoint, the shape in current model is torch.Size([256, 48, 3, 3]).
Is it ok to change the patch_size to 24? In that case why I do get this error? Thank you for your help!

Upsample LR images with F.interpolate will cause color changes

Hi, in your code, you seem to use F.interpolate to upsample the LR image to match the resolution of HR in order to apply Cutblur. But have you checked the upsampled image? Cause when I do it in your way, I will get an image with server color shift, and that should not be the case in your paper.

Due to some unkown reasons, I cannot upload the images, but I can share with you my testing code.

HR = io.imread('data/DIV2K/DIV2K_train_HR/0159.png')
LR = io.imread('data/DIV2K/DIV2K_train_LR_bicubic/X4/0159x4.png')
HR_plot = HR[0:400, 200:600]
LR_plot = LR[0:100, 50:150]
LR_tensor = im2tensor(LR_plot).unsqueeze(0)
LR_plot = F.interpolate(LR_tensor, scale_factor=4, mode="nearest")[0].numpy().transpose(1,2,0)

f, axarr = plt.subplots(1, 2, figsize=(10, 5))
axarr[0].imshow(LR_plot)
axarr[1].imshow(HR_plot)

Hope you can try this, and tell me where I did wrong.

Thank you so much!

Is the test result the average value of multiple models?

hi authors,
i‘ve only tested the performance of cutblur once by using python main.py --model CARN --augs cutblur --alpha 0.7 --dataset RealSR --scale 4 --camera all --dataset_root ./input/RealSR/ --ckpt_root ./pt/RealSR/cutblur/ --save_result --save_root ./output/RealSR/cutblur/.
the obtained result is 28.89 which is lower than the result 29.00 in the paper.

therefore, i would like to know if 29.00 is the average of multiple models tested.

Validation data in Table1

Hello!

I have some questions regarding Table 1

You have shown the results for DIV2K and RealSR. I wanted to know whether you use Div2K validation for both DIV2K and RealSR? i.e. Trained with DIV2K and RealSR but use DIV2K validation data for testing both RealSR and DIV2K.

Thanks.

Segmentation fault when inferencing large size image

thanks for your great work.
And there is the problem (↓) when the input image size is [2000,3000,3] (size[<1800, <1800,3] is OK)， if the size of input images is limited?
Segmentation fault (core dumped)
more infoi:

torch            1.5.0              
torchfile        0.1.0              
torchvision      0.6.0 

nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2018 NVIDIA Corporation
Built on Sat_Aug_25_21:08:01_CDT_2018
Cuda compilation tools, release 10.0, V10.0.130

GeForce RTX 2080Ti

RCAN X2 PSNR only 36.xx

Hi,

I just try to reproduce the result mentioned in the paper.

I follow the steps you listed in this repo, firstly train RCAN x2 then tune for RCAN x4. I use exactly the same code/command as you provided.

but RCAN x2 on DIV2K with MoA only gives PSNR 36.32, which i think is way too low.

is this normal? can provide more training details? like what PSNR you obtained when training RCAN x2?

Really appreciate.

Loss keeps oscillating during training

Hi！

I use the network I designed for training, but the loss keeps oscillating during the training process. I tried different learning rates, but it didn't work. Do you have any suggestions?

Thank you！

Pretrained model

All the links for pretrained model are not working. Can you please give us the new link? Thank you!

Why do you use nearest method for matching the resolution of (LR, HR) due to CutBlur ?

I have a question about how to match he resolution of (LR, HR) due to CutBlur.

When I check the code about matching the resolution of (LR, HR) due to CutBlur,
I found using nearest.

match the resolution of (LR, HR) due to CutBlur

        if HR.size() != LR.size():
            scale = HR.size(2) // LR.size(2)
            LR = F.interpolate(LR, scale_factor=scale, mode="nearest")

Why don't you use bicubic?

Most people use bicubic in super resolution.
Do you have some special things?

I am interest in your CutBlur.
Thank you for your attention.

train on custom dataset

hi i try to reproduce on my custom dataset,
i use "RealSR" settings to run, but got this error below from
File "main.py", line 27, in main
solver.fit()

raise ValueError("empty range for randrange() (%d, %d, %d)" % (istart, istop, width))
ValueError: empty range for randrange() (0, -36, -36)

anyone have idea about this error?

Division by zero error

I get division by zero error when I am trying to evaluate pretrained model using Set14. Can you please help me figure out why? I have put the Set14 in the directory for the dataset and mentioned Set14_SR for the dataset argument. Thank you!

File "/kaggle/working/cutblur/solver.py", line 139, in evaluate
return psnr/len(self.test_loader)
ZeroDivisionError: division by zero

why crop during evaluation?

Hi I am currently doing some research on data augmentation & SISR. I find that you crop a small margin of HR, SR image during evaluation. Is it because there might be some artifact of produced super-resoluted image that would impact the PSNR result?

Kind Regards,

clovaai / cutblur Goto Github PK

cutblur's People

Contributors

Stargazers

Watchers

Forkers

cutblur's Issues

match the resolution of (LR, HR) due to CutBlur

Recommend Projects

Recommend Topics

Recommend Org