zjcv / knowledgereview

[CVPR 2021] Distilling Knowledge via Knowledge Review

License: Apache License 2.0

Python 100.00%

feature-distillation pytorch zcls distillation knowledge-review

knowledgereview's Issues

Question about ABF model

import torch
import torch.nn as nn
import torch.nn.functional as F


class ABF(nn.Module):

    def __init__(self, in_channel, out_channel, mid_channel, is_fuse=True):
        super(ABF, self).__init__()

        # 1x1 conv: project the input feature to mid_channel channels
        self.conv_first = nn.Sequential(
            nn.Conv2d(in_channel, mid_channel, kernel_size=(1, 1), bias=False),
            nn.BatchNorm2d(mid_channel)
        )

        # 3x3 conv: map the (fused) feature to out_channel channels
        self.conv_last = nn.Sequential(
            nn.Conv2d(mid_channel, out_channel, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False),
            nn.BatchNorm2d(out_channel)
        )

        # attention over [x, y]; expects mid_channel * 2 input channels
        self.att_conv = None if not is_fuse else nn.Sequential(
            nn.Conv2d(mid_channel * 2, 2, kernel_size=(1, 1)),
            nn.Sigmoid()
        )

        self.__init_weights()

    def __init_weights(self):
        nn.init.kaiming_uniform_(self.conv_first[0].weight, a=1)
        nn.init.kaiming_uniform_(self.conv_last[0].weight, a=1)

    def forward(self, x, y=None, shape=None):
        assert len(x.shape) == 4
        N, _, H, W = x.shape[:4]

        x = self.conv_first(x)
        if self.att_conv is not None:
            # up sample residual features (shape is expected to equal (H, W))
            y = F.interpolate(y, shape, mode="nearest")
            # fusion
            z = torch.cat([x, y], dim=1)
            z = self.att_conv(z)

            x = (x * z[:, 0].view(N, 1, H, W) + y * z[:, 1].view(N, 1, H, W))
        y = self.conv_last(x)

        return y, x

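In the `forward` function, `self.att_conv` can only work if `y` has exactly `mid_channel` channels: `x` has `mid_channel` channels after `conv_first`, and `att_conv` expects `mid_channel * 2` input channels from the concatenation of `x` and `y`. But the input `y` is `res_features`, and nothing seems to guarantee that its number of channels equals `mid_channel`.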
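The constraint described above can be checked in isolation. The following is a minimal sketch, not the repository's code: `try_fuse` is a hypothetical helper, and the channel counts (8 and 32) are arbitrary values chosen for illustration. It rebuilds only the attention layer and shows that the fusion step succeeds when `y` has `mid_channel` channels and raises an error otherwise.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

mid_channel = 8  # arbitrary value for illustration

# Same layout as ABF.att_conv: expects mid_channel * 2 input channels.
att_conv = nn.Sequential(
    nn.Conv2d(mid_channel * 2, 2, kernel_size=1),
    nn.Sigmoid(),
)


def try_fuse(x, y):
    """Hypothetical helper: return the attention map shape, or None on a channel mismatch."""
    y = F.interpolate(y, x.shape[2:], mode="nearest")  # match x's spatial size
    z = torch.cat([x, y], dim=1)                       # channels: C_x + C_y
    try:
        return tuple(att_conv(z).shape)
    except RuntimeError:
        # Conv2d rejects inputs whose channel count differs from in_channels.
        return None


x = torch.randn(2, mid_channel, 16, 16)     # output of conv_first
y_ok = torch.randn(2, mid_channel, 8, 8)    # residual with mid_channel channels
y_bad = torch.randn(2, 32, 8, 8)            # residual with a different channel count

ok_shape = try_fuse(x, y_ok)
bad_shape = try_fuse(x, y_bad)
print(ok_shape, bad_shape)
```

Note that `forward` returns `y, x`, where the second value is the fused feature with `mid_channel` channels. If the caller passes that second value on as `res_features` to the next `ABF`, and every `ABF` is built with the same `mid_channel`, the channel counts would match. That is an assumption about the calling code, which is not shown in this issue.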

A Training Bug

There is a training bug in this project: I only call `requires_grad_(False)` on the teacher model, but still pass its parameters to the optimizer. So the teacher model could be updated during training even though no gradients are computed for it.

I don't plan to fix it, because the training results show that it still works well.
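Whether the teacher's weights really change can be tested directly. The sketch below is an assumption-laden toy setup, not the project's training loop: the two `nn.Linear` models, the SGD hyperparameters, and the squared-error loss are all stand-ins. It relies on the behavior of the stock `torch.optim` optimizers, which skip any parameter whose `.grad` is `None` in `step()`.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy stand-ins for the real teacher and student networks.
teacher = nn.Linear(4, 2)
student = nn.Linear(4, 2)
teacher.requires_grad_(False)  # freeze the teacher

# Reproduce the reported setup: frozen teacher parameters are still
# handed to the optimizer together with the student's.
optimizer = torch.optim.SGD(
    list(student.parameters()) + list(teacher.parameters()),
    lr=0.1, momentum=0.9, weight_decay=1e-4,
)

teacher_before = [p.detach().clone() for p in teacher.parameters()]
student_before = [p.detach().clone() for p in student.parameters()]

x = torch.randn(8, 4)
loss = (student(x) - teacher(x)).pow(2).mean()
optimizer.zero_grad()
loss.backward()   # teacher parameters keep .grad == None
optimizer.step()  # parameters without a gradient are skipped

teacher_unchanged = all(
    torch.equal(before, after)
    for before, after in zip(teacher_before, teacher.parameters())
)
student_changed = any(
    not torch.equal(before, after)
    for before, after in zip(student_before, student.parameters())
)
print(teacher_unchanged, student_changed)
```

If this holds for the real models too, it would explain why training still works. Passing only `[p for p in model.parameters() if p.requires_grad]` to the optimizer is still the cleaner setup. Separately, BatchNorm running statistics are updated in train mode regardless of `requires_grad`, so calling `teacher.eval()` is what keeps those fixed.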

Feature after ReLU or before ReLU?

Hi,

I noticed that you extract the feature before the ReLU (https://github.com/ZJCV/KnowledgeReview/blob/master/rfd/model/resnet/resnet.py#L35).

But the official repo extracts the feature after the ReLU (https://github.com/dvlab-research/ReviewKD/blob/master/CIFAR-100/model/resnet_cifar.py#L186).

Why does your implementation differ?
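To make the two tap points concrete, here is a minimal sketch with a hypothetical `TinyBlock` (it is not taken from either repository). The block returns both the pre-activation and the post-activation tensor, so either one could be used as the distillation feature.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class TinyBlock(nn.Module):
    """Hypothetical residual block that exposes both tap points."""

    def __init__(self, channels):
        super().__init__()
        self.conv = nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False)
        self.bn = nn.BatchNorm2d(channels)

    def forward(self, x):
        preact = self.bn(self.conv(x)) + x  # feature before ReLU (can be negative)
        out = F.relu(preact)                # feature after ReLU (non-negative)
        return out, preact


torch.manual_seed(0)
block = TinyBlock(4)
x = torch.randn(2, 4, 8, 8)
out, preact = block(x)

print(out.min().item() >= 0, (preact < 0).any().item())
```

The post-ReLU feature is just the pre-ReLU feature with negative values clipped to zero, so the two choices differ only in whether the student is also asked to match the teacher's negative responses.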

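Is the ABF class essentially there to match the feature vectors? Can the outputs of the model's intermediate layers be used directly for the computation?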
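As a rough illustration of why some matching layer is usually needed, the sketch below uses made-up shapes (16 student channels, 64 teacher channels) and a plain 1x1 convolution as the adapter. This is the generic projection used in feature distillation, not the ABF module itself, and `align` is a hypothetical name.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

# Made-up intermediate features: same spatial size, different channel counts.
student_feat = torch.randn(2, 16, 8, 8)
teacher_feat = torch.randn(2, 64, 8, 8)

# Hypothetical adapter: a 1x1 conv that maps student channels to teacher channels.
align = nn.Sequential(
    nn.Conv2d(16, 64, kernel_size=1, bias=False),
    nn.BatchNorm2d(64),
)

projected = align(student_feat)
loss = F.mse_loss(projected, teacher_feat)  # shapes now agree element-wise
print(projected.shape, loss.item())
```

Raw intermediate outputs can only be compared directly when the student and teacher features already have identical shapes. Otherwise a learned projection like this, or a fusion module such as ABF, is needed first.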

Did you reimplement this for object detection?

Hi,

Did you reimplement this for object detection? I tried ReviewKD on my own dataset with my own model, but the results were not good.

How do I obtain the latest model?
