
Comments (2)

jaybdub commented on June 7, 2024

Hi @Mediumcore ,

This would require distilling a new model. You may be able to do that by following these steps.

Disclaimer: I haven't tested these, so let me know if you run into issues.

Step 1 - Register a new model

Register a new model, and ensure that it outputs features of shape 256x64x64 for your desired input resolution. For example, for an input of size 512x512, your model must have an output stride of 8.
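The stride requirement is plain arithmetic: the output stride is the input side length divided by the 64-cell feature map side. A minimal sketch of that check follows; the helper name `required_output_stride` is made up for illustration and is not part of nanosam.

```python
def required_output_stride(input_size: int, feature_size: int = 64) -> int:
    """Return the stride that maps an input side length to the feature map side.

    Hypothetical helper for illustration; feature_size=64 comes from the
    256x64x64 feature shape mentioned above.
    """
    if input_size % feature_size != 0:
        raise ValueError(
            f"input size {input_size} is not a multiple of {feature_size}"
        )
    return input_size // feature_size


# Compare the default student size (1024) with the lower resolution (512).
for size in (1024, 512):
    print(f"{size}x{size} input -> output stride {required_output_stride(size)}")
```

So a model that works at 1024x1024 with stride 16 would need stride 8 to produce the same 64x64 feature map from a 512x512 input.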

You can register the model similar to here:

You could try registering a new model with a different stride, and setting the student size to a lower resolution (e.g. 512x512).
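As a rough illustration of what "registering" a model variant can look like, here is a generic decorator-based registry. Everything in it (`MODEL_REGISTRY`, `register_model`, `create_model`, the model name, and the dictionary returned in place of a real network) is an assumption made for this sketch; nanosam's actual registration helper and model constructors may differ.

```python
from typing import Callable, Dict

# Hypothetical registry for illustration only; not nanosam's actual API.
MODEL_REGISTRY: Dict[str, Callable[[], dict]] = {}


def register_model(name: str):
    """Decorator that stores a model factory under the given name."""
    def decorator(factory: Callable[[], dict]) -> Callable[[], dict]:
        if name in MODEL_REGISTRY:
            raise KeyError(f"model '{name}' is already registered")
        MODEL_REGISTRY[name] = factory
        return factory
    return decorator


@register_model("my_encoder_stride8")  # made-up name
def my_encoder_stride8() -> dict:
    # Stand-in for building a real network: only the two properties that
    # matter for the feature shape are recorded here.
    return {"output_stride": 8, "feature_channels": 256}


def create_model(name: str) -> dict:
    """Look up a registered factory and call it."""
    return MODEL_REGISTRY[name]()


spec = create_model("my_encoder_stride8")
side = 512 // spec["output_stride"]
print(f"feature shape: {spec['feature_channels']}x{side}x{side}")
```

In a real setup the factory would return the network itself, and the name would be what you pass to the training script.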

Step 2 - Train the distilled model

Next, you'll need to train the model on unlabeled images. Follow the training instructions in the README, but set the "student_size" parameter to the desired size (512).

parser.add_argument("--student_size", type=int, default=1024, help="The size of image to feed to the student during distillation.")
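To show how that argument behaves, here is a self-contained sketch that rebuilds only the `--student_size` option and parses it from a list instead of the real command line. The `build_parser` wrapper is an assumption for the example; the actual training script defines more arguments.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Only the --student_size option from the training script is reproduced;
    # the wrapper function itself is hypothetical.
    parser = argparse.ArgumentParser(description="Distillation options (sketch)")
    parser.add_argument(
        "--student_size",
        type=int,
        default=1024,
        help="The size of image to feed to the student during distillation.",
    )
    return parser


# Equivalent to passing "--student_size 512" on the command line.
args = build_parser().parse_args(["--student_size", "512"])
print(args.student_size)
```

Omitting the flag leaves `student_size` at its default of 1024.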

Step 3 - Evaluate the distilled model

Follow the evaluation instructions in the README to compare the accuracy for small / medium / large objects.

Note that distillation only applies to the image encoder. It's worth benchmarking the mask decoder first to see whether a smaller encoder is worthwhile, as the image encoding speed is approaching the decoding speed and may no longer be the performance bottleneck.
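One way to make that comparison is to time both stages with the same harness. The sketch below uses only the standard library; `run_image_encoder` and `run_mask_decoder` are hypothetical stand-ins that you would replace with your actual encoder and decoder calls.

```python
import time
from typing import Callable


def mean_latency_ms(fn: Callable[[], object], warmup: int = 3, iters: int = 20) -> float:
    """Average wall-clock time of fn() in milliseconds, after a few warmup calls."""
    for _ in range(warmup):
        fn()
    start = time.perf_counter()
    for _ in range(iters):
        fn()
    return (time.perf_counter() - start) * 1000.0 / iters


# Hypothetical stand-ins: replace with real encoder / decoder inference calls.
def run_image_encoder() -> int:
    return sum(i * i for i in range(20000))


def run_mask_decoder() -> int:
    return sum(i * i for i in range(10000))


encoder_ms = mean_latency_ms(run_image_encoder)
decoder_ms = mean_latency_ms(run_mask_decoder)
print(f"image encoder: {encoder_ms:.3f} ms, mask decoder: {decoder_ms:.3f} ms")
```

If the real calls run on a GPU, they typically need to be synchronized before reading the clock, otherwise the measurement may only cover the kernel launch.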

Hope this helps. If we end up releasing a lower resolution model I will update this thread, but we have no plans to do so at the moment.
John

from nanosam.

Mediumcore commented on June 7, 2024

Understood, thank you very much for the reply.

