Git Product home page Git Product logo

Comments (8)

gaopengcuhk avatar gaopengcuhk commented on July 21, 2024 2

The batch/GPU for 150 epoch provided in the github is 4 img/GPU or 2img/GPU?

from detr.

szagoruyko avatar szagoruyko commented on July 21, 2024 1

@gaopengcuhk sorry for the confusion in the gist! the 150 epoch model we provide to simplify reproducibility on a single 8-gpu machine, it was trained with batch size 4, thus total 32, to fit into 16GB cards.
Both the ablation 300-epoch (40.6) and the final 500-epoch (42.0) models were trained on 4 nodes with 2 im/gpu, thus total 64.

I will update non-DC5 cmd lines to submitit, and move the gist to the repo.

from detr.

fmassa avatar fmassa commented on July 21, 2024

Hi,

The ablations in the paper were conducted on 300 epochs, while the models reported in table 1 (main results) were trained for 500 epochs.

I believe I have answered your question, and as such I'm closing the issue but let us know if you have further questions.

from detr.

gaopengcuhk avatar gaopengcuhk commented on July 21, 2024

The final performance is trained for 500 epoches by 16GPU with 4img/GPU for DETR/DETR-DC5/DETR-R101/DETR-DC5-R101?

from detr.

gaopengcuhk avatar gaopengcuhk commented on July 21, 2024

If I understand correctly, the performance for 150 epoch is 39.5, for 500 epoch is 42.0. What's the performance of 300 epochs? I guess all models should be trained with the same batch size.

from detr.

gaopengcuhk avatar gaopengcuhk commented on July 21, 2024

If I understand correctly, the performance for 300 epochs should be 40.6?

from detr.

fmassa avatar fmassa commented on July 21, 2024

Yes, the performance for 300 epochs is 40.6, from table 2 in the paper.

from detr.

twangnh avatar twangnh commented on July 21, 2024

@szagoruyko same question as @gaopengcuhk, also, could you pls provide the pretrained model of 150epoch run?

from detr.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.