kushajveersingh / resize_network_cv Goto Github PK

PyTorch implementation of the paper "Learning to Resize Images for Computer Vision Tasks" on Imagenette and Imagewoof datasets

License: Apache License 2.0

Shell 3.27% Python 96.73%

computer-vision pytorch pytorch-implementation pytorch-lightning

resize_network_cv's People

Contributors

Stargazers

Watchers

Forkers

bilzard xujinglin zyxjtu unique-chan jackzhousz w-arrior outbreak-hui chenpaopao tchaye59 doublehui97 tvgh-kid jiangyanpeng

resize_network_cv's Issues

What about different H * W

The situation you mentioned are the input image and out image are the same Height and Width.

What about if we want different Height and width?

cannot find /data/imagenette2.ckpt file

path = os.path.join(path, f"{cfg.data.name}.ckpt")
ckpt = torch.load(path)['state_dict']

The .ckpt file is not in the dataset I downloaded.
Where should I look for this file?Thanks！

Is this a bug in Resizer.forward()? My understanding is to return out here

def forward(self, x):
    residual = self.interpolate(x)

    out = self.module1(x)
    out_residual = self.interpolate(out)

    out = self.resblocks(out_residual)
    out = self.module3(out)
    out = out + out_residual

    out = self.module4(out)

    out = out + residual

    return x

Critical Bug: Resizer Outputs Original Image

I found this bug by printing model summary.
The model should return out instead of x (the original input image).

https://github.com/KushajveerSingh/resize_network_cv/blob/main/src/models/resizer.py#L76

Suggestion: Handle Input Image of Any Aspect Ratio

The original paper says:

... the proposed architecture allows for resizing an image to any target size and aspect ratio.

So the Resizer should handle input image with any aspect ratio.
In spite of that, the Resizer implementation (interpolation) seems to assume the input image is square (or, only handle uniform-scale resize: e.g. (512, 256) -> (256, 128)).

The current implementation interpolates by specifying scale_factor, though I think it's more useful to specify the output image size and calculate scale factor automatically.

The reference code:

# current implementation
self.interpolate = partial(F.interpolate,
                           scale_factor=self.scale_factor,
                           mode=self.interpolate_mode,
                           align_corners=False,
                           recompute_scale_factor=False)

# suggestion
self.interpolate = partial(F.interpolate,
                           size=output_size,
                           mode=self.interpolate_mode,
                           align_corners=False)

c.f. https://github.com/KushajveerSingh/resize_network_cv/blob/main/src/models/resizer.py#L56-L60

Is the module really useful?

I have run twice，one is follow you config，another is to extand the epoch,but the result is nothing different，
——————————————————
after try your config，the second try config is , ran the resize_net 800 epoch, learn rate step_size is 100，
and the fintune 50 epoch，but the result is the same，
is there any problem？thank you for your reply.
——————————————————
if needed, I will show the config

kushajveersingh / resize_network_cv Goto Github PK

resize_network_cv's People

Contributors

Stargazers

Watchers

Forkers

resize_network_cv's Issues

What about different H * W

cannot find /data/imagenette2.ckpt file

Is this a bug in Resizer.forward()? My understanding is to return out here

Critical Bug: Resizer Outputs Original Image

Suggestion: Handle Input Image of Any Aspect Ratio

Is the module really useful?

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent