
Comments (7)

alcinos commented on July 2, 2024

@linzzzzzz You're only using one process, so you can remove the distributed launch entirely:

python main.py --batch_size 2 --no_aux_loss --eval \
    --resume https://dl.fbaipublicfiles.com/detr/detr-r50-e632da11.pth \
    --coco_path /path/to/coco
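To check which mode a run will end up in, here is a minimal sketch that inspects the environment variables a launcher exports. It assumes the launcher sets `RANK` and `WORLD_SIZE` (as `torch.distributed.launch --use_env` does) and that Slurm sets `SLURM_PROCID`. The function name `detect_launch_mode` and its return labels are hypothetical and are not part of DETR.

```python
import os


def detect_launch_mode(env=None):
    """Classify how this process was launched, based on environment variables.

    Hypothetical helper for illustration only; not DETR's own logic.
    """
    env = os.environ if env is None else env
    if "RANK" in env and "WORLD_SIZE" in env:
        # Exported by torch.distributed.launch when --use_env is passed.
        if int(env["WORLD_SIZE"]) > 1:
            return "distributed"
        return "single-process (via launcher)"
    if "SLURM_PROCID" in env:
        # Set by Slurm for each task it starts.
        return "slurm"
    # No launcher variables found: a plain `python main.py ...` run.
    return "single-process"
```

Calling `detect_launch_mode()` with no arguments reads the current process environment. With the plain `python main.py` command, none of these variables are normally set, so the result would be "single-process".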

from detr.

alcinos commented on July 2, 2024

Hi @gaopengcuhk,
If you are going to use Slurm, we recommend using submitit.
To run your eval on one node with one GPU, you can run:
python run_with_submitit.py --ngpus 1 --nodes 1 --timeout 360 --batch_size 2 --no_aux_loss --eval --resume https://dl.fbaipublicfiles.com/detr/detr-r50-e632da11.pth --coco_path ../../dataset/

As a side note, your version of PyTorch is a bit old; we recommend using PyTorch 1.5.

gaopengcuhk commented on July 2, 2024

Thank you very much. I need to modify part of the code to run DETR with PyTorch 1.3. I will switch to PyTorch 1.5 later.

Thank you very much for your quick response.

gaopengcuhk commented on July 2, 2024

python run_with_submitit.py --ngpus 1 --nodes 1 --timeout 360 --batch_size 2 --no_aux_loss --eval --resume ./saved_model/detr-r50-e632da11.pth --coco_path ../../dataset/

When I run your code, I run into the following error.

submitit INFO (2020-06-02 21:34:32,188) - Starting with JobEnvironment(job_id=587603, hostname=SH-IDC1-10-198-6-145, local_rank=0(1), node=0(1), global_rank=0(1))
submitit INFO (2020-06-02 21:34:32,188) - Loading pickle: experiments/587603/587603_submitted.pkl
Process group: 1 tasks, rank: 0
submitit ERROR (2020-06-02 21:35:20,838) - Submitted job triggered an exception
~

fmassa commented on July 2, 2024

@gaopengcuhk Can you paste the exception you got? This might be an issue with submitit rather than with DETR.
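One way to find the full traceback is to read the job's stderr logs from the job folder. This is a minimal sketch using only the standard library. It assumes the logs sit next to the submitted pickle (`experiments/587603/` in the log above) and end in `.err`; the exact file names may differ between setups. The helper name `tail_error_logs` is hypothetical.

```python
from pathlib import Path


def tail_error_logs(folder, max_lines=30):
    """Return the last `max_lines` lines of every *.err file in `folder`.

    Assumption: the job writes its stderr to files ending in .err
    inside the job folder.
    """
    tails = {}
    for log_path in sorted(Path(folder).glob("*.err")):
        # errors="replace" avoids crashing on stray non-UTF-8 bytes in logs.
        lines = log_path.read_text(errors="replace").splitlines()
        tails[log_path.name] = lines[-max_lines:]
    return tails
```

For example, `tail_error_logs("experiments/587603")` would return a dictionary mapping each stderr log name to its last lines, which can then be pasted into the thread.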

linzzzzzz commented on July 2, 2024

I came across a similar issue and ended up using the command below:

python -m torch.distributed.launch --nproc_per_node=1 --use_env main.py --batch_size 2 --no_aux_loss --eval \
    --resume https://dl.fbaipublicfiles.com/detr/detr-r50-e632da11.pth \
    --coco_path /path/to/coco

GlennCGL commented on July 2, 2024

python run_with_submitit.py --ngpus 1 --nodes 1 --timeout 360 --batch_size 2 --no_aux_loss --eval --resume ./saved_model/detr-r50-e632da11.pth --coco_path ../../dataset/

When I run your code, I run into the following error.

submitit INFO (2020-06-02 21:34:32,188) - Starting with JobEnvironment(job_id=587603, hostname=SH-IDC1-10-198-6-145, local_rank=0(1), node=0(1), global_rank=0(1))
submitit INFO (2020-06-02 21:34:32,188) - Loading pickle: experiments/587603/587603_submitted.pkl
Process group: 1 tasks, rank: 0
submitit ERROR (2020-06-02 21:35:20,838) - Submitted job triggered an exception
~

Hi, I am using on 1984 and get the same problem. Could you tell me the solution?
