Git Product home page Git Product logo

Comments (5)

mathildecaron31 avatar mathildecaron31 commented on August 19, 2024 5

Hi @bfialkoff

The weights from the backbone only files are from the teacher, and results from our paper are obtained from the teacher weights as well. We have indeed shown in our paper that the teacher is performing better than the student in general.

Therefore, when using the video_generation script it is loading from teacher weights (though the visualization are nearly the same if you use the student weights in that case).

For any of our evaluation scripts, if you want to evaluate the student weights instead you can do so by specifying the path towards the full checkpoint with the --pretrained_weights argument and specifying --checkpoint_key student.

Hope that helps

from dino.

mathildecaron31 avatar mathildecaron31 commented on August 19, 2024 3

@woctezuma thanks for helping to reply to this issue.

I have a minor remark. Our ultimate goal is to obtain the best model possible in an unsupervised way. We train the student with SGD and the teacher is an EMA of that student. We've found that the teacher is performing better than the student and that is why our final model used in downstream tasks is the teacher.

from dino.

woctezuma avatar woctezuma commented on August 19, 2024 3

@woctezuma thanks for helping to reply to this issue.

I have a minor remark. Our ultimate goal is to obtain the best model possible in an unsupervised way. We train the student with SGD and the teacher is an EMA of that student. We've found that the teacher is performing better than the student and that is why our final model used in downstream tasks is the teacher.

Oops, it looks like I was confused about that! Thanks for clearing that up!

Hopefully I have not confused others! Sorry about that, @bfialkoff!

from dino.

bfialkoff avatar bfialkoff commented on August 19, 2024 1

Thanks for the clarification. I guess what I meant was in the video_generation script when we load a model, we are then loading the student or the backbone? Backbone to the base model and head refers to the part of the architecture that turns it into the student model?

from dino.

woctezuma avatar woctezuma commented on August 19, 2024

I dont understand which of the two models are later used for inference is it the student or teach?

The goal is to train a student. Same as in real life. The teacher is only an expendable mean towards that goal.
Edit: See the answer by the first author below!

Student

Are the pretrained weights provided from the teacher or the student network?

Everything is provided.

Everything

from dino.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.