Are the pretrained weights from the student or teacher model? about dino HOT 5 CLOSED

bfialkoff commented on August 19, 2024 5

Are the pretrained weights from the student or teacher model?

from dino.

Comments (5)

mathildecaron31 commented on August 19, 2024 5

The weights from the backbone only files are from the teacher, and results from our paper are obtained from the teacher weights as well. We have indeed shown in our paper that the teacher is performing better than the student in general.

Therefore, when using the video_generation script it is loading from teacher weights (though the visualization are nearly the same if you use the student weights in that case).

For any of our evaluation scripts, if you want to evaluate the student weights instead you can do so by specifying the path towards the full checkpoint with the --pretrained_weights argument and specifying --checkpoint_key student.

Hope that helps

from dino.

mathildecaron31 commented on August 19, 2024 3

@woctezuma thanks for helping to reply to this issue.

I have a minor remark. Our ultimate goal is to obtain the best model possible in an unsupervised way. We train the student with SGD and the teacher is an EMA of that student. We've found that the teacher is performing better than the student and that is why our final model used in downstream tasks is the teacher.

from dino.

woctezuma commented on August 19, 2024 3

@woctezuma thanks for helping to reply to this issue.

I have a minor remark. Our ultimate goal is to obtain the best model possible in an unsupervised way. We train the student with SGD and the teacher is an EMA of that student. We've found that the teacher is performing better than the student and that is why our final model used in downstream tasks is the teacher.

Oops, it looks like I was confused about that! Thanks for clearing that up!

Hopefully I have not confused others! Sorry about that, @bfialkoff!

from dino.

bfialkoff commented on August 19, 2024 1

Thanks for the clarification. I guess what I meant was in the video_generation script when we load a model, we are then loading the student or the backbone? Backbone to the base model and head refers to the part of the architecture that turns it into the student model?

from dino.

woctezuma commented on August 19, 2024

I dont understand which of the two models are later used for inference is it the student or teach?

~~The goal is to train a student. Same as in real life. The teacher is only an expendable mean towards that goal.~~
Edit: See the answer by the first author below!

Are the pretrained weights provided from the teacher or the student network?

Everything is provided.

from dino.

Recommend Projects

Are the pretrained weights from the student or teacher model? about dino HOT 5 CLOSED

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent