Comments (4)

oarriaga commented on July 30, 2024

It could be several things. One of them is that the model in train.py saves the complete state of the optimizer. How different are the file sizes?

from paz.

David9591 commented on July 30, 2024

With the default settings I save everything about the model, and the file is 46452 KB. With save_weights_only=True only the weights should be kept, but the file is still 15567 KB, while the official model is only 848 KB. That seems very strange. Could you please explain it? Thank you very much!
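As a rough sanity check on these numbers, the file sizes can be turned into approximate parameter counts. This is only a back-of-the-envelope sketch: it assumes float32 weights (4 bytes each), 1 KB = 1024 bytes, and ignores HDF5 metadata overhead. The helper name `approx_params` is made up for illustration.

```python
BYTES_PER_FLOAT32 = 4  # assumption: weights are stored as float32


def approx_params(size_kb):
    """Rough float32 parameter count for a file of size_kb kilobytes."""
    return size_kb * 1024 // BYTES_PER_FLOAT32


# File sizes reported in this thread, in KB.
full_kb, weights_kb, official_kb = 46452, 15567, 848

# Ratio of the full checkpoint to the weights-only checkpoint.
full_to_weights = full_kb / weights_kb
# Ratio of the locally trained weights to the official weights.
weights_to_official = weights_kb / official_kb

print("weights-only params:", approx_params(weights_kb))
print("official params:    ", approx_params(official_kb))
print("full / weights-only:", round(full_to_weights, 2))
print("weights / official: ", round(weights_to_official, 1))
```

The full file is close to three times the weights-only file. That would be consistent with an optimizer that keeps two extra values per weight (as Adam does), although which optimizer train.py uses is an assumption here. The much larger gap between the weights-only file and the official file cannot come from optimizer state, so it points to a difference in the number of parameters.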

oarriaga commented on July 30, 2024

It could be that they are actually different models, or that the TensorFlow HDF5 weight file format changed and models are now slightly bigger. One way to look into this would be to load both models, call model.summary() on each, and compare the outputs.
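Besides reading the two model.summary() printouts by eye, the per-layer parameter counts can be compared programmatically. The sketch below is not part of paz; `layer_param_counts` and `diff_models` are hypothetical helpers. It only relies on each model exposing `.layers`, and each layer exposing `.name` and `.count_params()`, which loaded Keras models do. It also assumes layer names are unique within a model.

```python
def layer_param_counts(model):
    """Return a list of (layer name, parameter count) pairs for a model."""
    return [(layer.name, layer.count_params()) for layer in model.layers]


def diff_models(model_a, model_b):
    """Map layer name -> (count in a, count in b) for layers that differ.

    A count of None means the layer name is missing from that model.
    """
    counts_a = dict(layer_param_counts(model_a))
    counts_b = dict(layer_param_counts(model_b))
    differences = {}
    for name in sorted(set(counts_a) | set(counts_b)):
        if counts_a.get(name) != counts_b.get(name):
            differences[name] = (counts_a.get(name), counts_b.get(name))
    return differences


# Usage sketch (file names are placeholders, not from this thread):
# trained = keras.models.load_model("my_trained_model.hdf5")
# official = keras.models.load_model("official_model.hdf5")
# for name, (a, b) in diff_models(trained, official).items():
#     print(name, a, b)
```

If the two models were built independently, auto-generated layer names may not line up, in which case comparing the summaries side by side is more reliable.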

David9591 commented on July 30, 2024

Yes, you are right! After printing out the structure of the model, I found that there is indeed a big difference, and the difference in the number of model parameters can also be explained. I will continue to study it in depth. Thank you for your patient answer!
