Git Product home page Git Product logo

Comments (2)

dpressel avatar dpressel commented on August 18, 2024

It has been a long time since I have run with the same defaults as the original paper, but these numbers you post seem quite low. I typically see quite higher precision @ 1 than in the paper and slightly lower @ 2 (for me, this tradeoff is worth it), but this will depend on the params you pass. The numbers you show here are much lower than I typically see... It didnt help that I made it hard to tweak the hyper parameters without modding the code (see below). I think the performance really depends on the hyper params (batch size LR, LR decay, steps per decay).

To make it a bit easier, I have changed some of the defaults for the hyper params to be closer to original paper, and I made it easier to change the LR decay and steps per decay by offering flags. This should help you match closer to the original paper (and tweak as you see fit). I will probably change the default batch size as well shortly (lowering to 50).

In terms of the implementation, I believe that the L/H model should be identical to the original Caffe code. There is a slight difference in how we handle the mean image -- in TF I use https://www.tensorflow.org/api_docs/python/tf/image/per_image_standardization where original code uses the true mean image I believe.

from rude-carnie.

dpressel avatar dpressel commented on August 18, 2024

I have been able now to improve over the paper @ 1 and @ 2 with a higher learning rate, but similar decay. Try eta=0.01 and rest defaulted. You should see better results

from rude-carnie.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.