aad_sfda's People

Contributors

albert0147

aad_sfda's Issues

Selecting beta with SND

Hi, I really appreciate you sharing your code here. I have a question: where in your code is the value of beta automatically selected using the SND method? Thank you!
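For context, my understanding of the SND (Soft Neighborhood Density) criterion is roughly the sketch below. This is only illustrative and under my own assumptions; `snd_score` and `tau` are my own names, not from this repository:

```python
import torch
import torch.nn.functional as F

def snd_score(features: torch.Tensor, tau: float = 0.05) -> float:
    """Soft Neighborhood Density: mean entropy of the softmax-normalized
    pairwise-similarity distribution over target samples. The hyperparameter
    value whose model gives the highest SND would be selected."""
    f = F.normalize(features, dim=1)           # L2-normalize target features
    sim = (f @ f.t()) / tau                    # pairwise similarities / temperature
    sim.fill_diagonal_(float('-inf'))          # exclude self-similarity
    p = F.softmax(sim, dim=1)                  # soft neighborhood distribution per row
    ent = -(p * (p + 1e-12).log()).sum(1)      # entropy of each row
    return ent.mean().item()
```

If this matches your setup, I would expect a loop over candidate beta values that keeps the one maximizing this score, but I could not locate it in the code.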

It seems the code of loss part does not match the paper

Hi Shiqi,

Thank you very much for providing the code, and congratulations on your paper's acceptance!

After reading the loss-function part, I could not find how the code reflects Eq. 5 of the paper. Please correct me if I missed anything:

First, the first term of Eq. 5 is the consistency between local neighbours. I understand you use a memory bank to save softmax outputs and then retrieve the K nearest neighbours:

softmax_out_un = softmax_out.unsqueeze(1).expand(-1, args.K, -1)  # batch x K x C

However, I cannot find the connection between the KL-divergence loss and the first term of Eq. 5:

loss = torch.mean((F.kl_div(softmax_out_un, score_near, reduction='none').sum(-1)).sum(1))

This line of code is not equal to the first term of Eq. 5. I am confused by this; could you please clarify?

Second, the second term is meant to disperse the predictions of potentially dissimilar features. However, your code does not seem to reflect this:

dot_neg = (dot_neg * mask.cuda()).sum(-1)  # batch

Given a sample, this line of code treats the rest of the samples in the batch as background, which also does not match the definition of the background set in the paper.

This work claims to "provide a surprisingly simple solution for source-free domain adaptation, which is an upperbound of the proposed clustering objective". Therefore, I would expect the code to correspond to the equation.

Please help me address this concern, and correct me if I have misunderstood anything.
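For concreteness, here is how I read Eq. 5 as code. This is only my own sketch of the objective, not the repository's implementation; `aad_loss`, `idx_near`, and `lam` are names I made up, and I assume the neighbour indices refer to positions within the batch:

```python
import torch

def aad_loss(p: torch.Tensor, idx_near: torch.Tensor, lam: float = 1.0):
    """My reading of Eq. 5: attract each prediction p_i to its K nearest
    neighbours C_i, and disperse it from the background set B_i
    (here taken as the rest of the batch, per the code in question).
    p: (B, C) softmax predictions; idx_near: (B, K) neighbour indices."""
    n = p.size(0)
    p_near = p[idx_near]                                   # (B, K, C)
    attract = -(p.unsqueeze(1) * p_near).sum(-1).sum(1).mean()
    dot = p @ p.t()                                        # (B, B) dot products
    mask = 1.0 - torch.eye(n)                              # exclude self-pairs
    disperse = (dot * mask).sum(-1).mean()
    return attract + lam * disperse
```

The attraction term here is a plain dot product between predictions, which is where I lose the connection to the F.kl_div call in the code.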

A question about dataloader for test

Hi, there!

I notice that in the test dataloader, the shuffle parameter of DataLoader is set to False for VisDA-2017, whereas for Office-Home and Office-31 it is set to True. Is this setting intentional?

i.e., line 120 of tar_adaptation.py, line 154 of office-home/train_tar.py, and line 197 of office-home/office31_tar.py.
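For what it's worth, since evaluation only aggregates per-sample statistics, the order should not change the reported accuracy; shuffle=False is just the conventional choice for a test loader, as in this illustrative snippet (not the repo's exact code):

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Toy evaluation dataset: 10 samples with labels 0..9.
ds = TensorDataset(torch.randn(10, 3), torch.arange(10))

# shuffle=False keeps a deterministic iteration order for evaluation;
# accuracy is a mean over samples, so shuffling would not alter the result.
test_loader = DataLoader(ds, batch_size=4, shuffle=False)
```

I am mainly asking in case the shuffling interacts with something else, e.g. the memory-bank updates during adaptation.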

A question about implementation details

Dear author,

I'm an undergraduate student who is quite interested in SFDA, and I have been following your work from G-SFDA and NRC to AaD. I really appreciate these works and the wonderful performance they achieve.

Recently, I have been trying to understand your work on AaD. However, I became a little confused about some implementation details while combining your paper with the source code.

From my understanding, the B_i in the dispersion term contains all other items in a mini-batch except those in C_i. In other words, items that are among the k nearest neighbours should be excluded from B_i, as presented in the paper. However, in your code, which I copied below:

mask = torch.ones((inputs_target.shape[0], inputs_target.shape[0]))
diag_num = torch.diag(mask)
mask_diag = torch.diag_embed(diag_num)
mask = mask - mask_diag  # zero out only the diagonal (self-pairs)
if args.noGRAD:
    copy = softmax_out.T.detach().clone()
else:
    copy = softmax_out.T
dot_neg = softmax_out @ copy  # batch x batch prediction dot products
dot_neg = (dot_neg * mask.cuda()).sum(-1)  # batch
neg_pred = torch.mean(dot_neg)
loss += neg_pred * alpha

it seems that only the diagonal entries of mask are set to 0, rather than those of the k nearest neighbours as well.

I suppose I must have misunderstood something, so I am creating this issue in the hope of getting an answer. I would appreciate it if you could answer my question. Looking forward to your reply.
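To make my expectation concrete, I would have written the mask roughly as below. This is purely my own sketch: `background_mask` is a name I invented, and I assume the neighbour indices refer to positions within the batch (whereas, if I read the repo correctly, neighbours come from a memory bank, so batch positions may not line up):

```python
import torch

def background_mask(batch_size: int, idx_near: torch.Tensor) -> torch.Tensor:
    """A (B, B) mask excluding both self-pairs and the K nearest neighbours
    from the dispersion term, which is how I read the definition of B_i.
    idx_near: (B, K) long tensor of in-batch neighbour indices (my assumption)."""
    mask = torch.ones(batch_size, batch_size)
    mask.fill_diagonal_(0)          # drop self-pairs, as the repo code does
    mask.scatter_(1, idx_near, 0)   # additionally drop k-nearest-neighbour pairs
    return mask
```

Is the diagonal-only mask an intentional approximation, or does the memory-bank indexing make excluding neighbours impractical here?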

Experimental setting for source-free partial-set DA

Hi, professor. I'm very interested in your paper. Could you provide the Office-Home experiment setting file for the partial-set DA setting, i.e., the hyper-parameters K and β? Thank you very much, and I look forward to your early reply.

Low accuracy of OfficeHome dataset

Thanks to the author for contributing the code!
Using the hyperparameters provided in the paper, I found in my experiments that the accuracy on the Office-Home dataset is very low (only 1.58% on task A->P), and the model seems unable to fit.

The source-domain model parameters I used were those provided by SHOT.
