Git Product home page Git Product logo

Comments (9)

rizar avatar rizar commented on June 29, 2024 2

Yes, it's the learning rate. It should be decreased to 1e-5, and then step 2 works. Note, that it will indeed use the groundtruth programs in Step 2.

from clevr-iep.

rizar avatar rizar commented on June 29, 2024 1

Same question here. According to TRAINING.md, in Step 2 "we train the execution engine, using programs predicted from the program generator in the previous step". In train_model.py line 238 says "train execution engine with ground-truth programs". Can you please explain this discrepancy?

In any case, when I train --model_type=EE without Step 1 pretraining, the learning doesn't really progress (still at ~50% accuracy after 100000) iterations.

from clevr-iep.

rizar avatar rizar commented on June 29, 2024 1

Can you try training longer?

from clevr-iep.

liuweide01 avatar liuweide01 commented on June 29, 2024

have you solve the problem?

from clevr-iep.

liuweide01 avatar liuweide01 commented on June 29, 2024

But how about the val accuracy?

from clevr-iep.

rizar avatar rizar commented on June 29, 2024

I get smth like 95-96%, which is what is reported in the paper.

from clevr-iep.

ankursikarwar avatar ankursikarwar commented on June 29, 2024

@rizar can you please tell how did you get 95-96% accuracy by directly training the execution engine using the ground truth programs (as in step 2). My accuracy is oscillating around 0.47 even after 5000 iterations when using lr = 1e-5

from clevr-iep.

ankursikarwar avatar ankursikarwar commented on June 29, 2024

Can you try training longer?

Thanks to you, I trained the execution engine for 100000 iterations with lr=1e-5 and got around 89% accuracy. Actually, accuracy increased quite slowly initially, and then between 40k and 60k iterations, it increased steeply.
accuracy graph

from clevr-iep.

smdp2000 avatar smdp2000 commented on June 29, 2024

minimum pc specs you all are using to train this model, can anyone suggest

from clevr-iep.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.