
Comments (8)

felixfuu commented on July 29, 2024

@nqanh Thanks for sharing! Very nice work, but how do I create the .sm files? Could you also share an example so that I can prepare my own data?

from affordance-net.

Tetsujinfr commented on July 29, 2024

Is the pre-trained model trained on the IIT-AFF dataset you cite in your paper, i.e. with the following classes?

[image]

If so, are you planning to train it on other datasets, for instance VOC2012 or something with people in it? I do not have 11 GB available to train the model on a dataset :(

thank you

Tets


nqanh commented on July 29, 2024

Currently, we use a confidence threshold of 0.9. If no box scores above 0.9, we choose the highest-scoring one, regardless of its confidence. You can lower the parameter CONF_THRESHOLD = 0.9 if you want to see more objects with confidence below 0.9 (it really depends on the scene). You may also want to change or disable the part of the code that chooses the highest-confidence box: lines 154-159 in demo_img.py.
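A rough Python sketch of that selection logic, for illustration only. This is not the code from demo_img.py; the helper name `select_boxes` and the flag `keep_best_fallback` are hypothetical, and only the value of CONF_THRESHOLD comes from the description above.

```python
import numpy as np

CONF_THRESHOLD = 0.9  # value quoted above; lower it to keep more boxes


def select_boxes(scores, threshold=CONF_THRESHOLD, keep_best_fallback=True):
    """Return indices of the detections to keep (hypothetical helper)."""
    scores = np.asarray(scores, dtype=float)
    keep = np.flatnonzero(scores > threshold)
    if keep.size == 0 and keep_best_fallback and scores.size > 0:
        # No box passed the threshold: fall back to the single best box,
        # regardless of how low its confidence is.
        keep = np.array([int(np.argmax(scores))])
    return keep
```

Passing a lower `threshold` keeps more boxes, and `keep_best_fallback=False` corresponds to disabling the highest-confidence fallback, so a scene with no confident detection returns nothing.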

Yes, we train AffordanceNet on the IIT-AFF dataset, and the object classes are as you posted. We can configure AffordanceNet to train on Pascal VOC, but please note that the mask in Pascal VOC is binary (i.e. background or foreground), so you will not see the full power of AffordanceNet. We designed AffordanceNet to handle multiple affordance classes within each object, not only binary masks.
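To make the difference concrete, here is a toy Python example. The 4x4 mask and its label IDs are made up for illustration and are not taken from IIT-AFF or Pascal VOC.

```python
import numpy as np

# Hypothetical 4x4 affordance mask for one object: 0 = background,
# 1 and 2 = two different affordance classes (label IDs are illustrative).
multiclass_mask = np.array([
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 2, 2, 0],
    [0, 2, 2, 0],
])

# A binary mask only says foreground (1) or background (0).
binary_mask = (multiclass_mask > 0).astype(np.uint8)

labels_multiclass = np.unique(multiclass_mask).tolist()
labels_binary = np.unique(binary_mask).tolist()
```

Both masks cover the same pixels, but the binary one no longer tells the two affordance regions apart, which is the information the mask branch is meant to predict.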

I'll release a smaller version of the net soon, so you can train on any dataset (with any objects) you want.


nqanh commented on July 29, 2024

Thanks for your interest @felixfuu! I just added the utils folder. There you can find the script to create .sm files and all the relevant information for training AffordanceNet on your own data.


felixfuu commented on July 29, 2024

@nqanh Thanks for your reply! Another question: how can I make sure that a very small mask is detected?


nqanh commented on July 29, 2024

The affordance mask depends on the size of the object, and the object size depends on the anchor parameters (scale and ratio) of the object detector. The concept of anchors was proposed in the Faster R-CNN paper; here we use 15 anchors, as in the Mask R-CNN paper.

If you want to detect very small objects (e.g., 5x5), you should change the anchor-related parameters in the prototxt file. You can experiment with Faster R-CNN first before doing the same for AffordanceNet, because if the object detector fails, the mask branch will fail too.
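As a rough illustration of how scale and ratio determine anchor sizes, here is a minimal Python sketch. The function `make_anchors` and all of its default values (base size 16, five scales, three ratios) are assumptions chosen to give 5 x 3 = 15 anchors; they are not read from the AffordanceNet prototxt, so check the actual values there before changing anything.

```python
import numpy as np


def make_anchors(base_size=16, scales=(2, 4, 8, 16, 32), ratios=(0.5, 1.0, 2.0)):
    """Return (width, height) for every scale/ratio pair (illustrative only)."""
    anchors = []
    for scale in scales:
        side = base_size * scale      # side of the square anchor at ratio 1
        area = side * side
        for ratio in ratios:          # ratio = height / width
            width = np.sqrt(area / ratio)
            height = width * ratio
            anchors.append((width, height))
    return np.array(anchors)


# Assumed defaults: 5 scales x 3 ratios = 15 anchors per location.
anchors = make_anchors()

# Hypothetical smaller setting aimed at tiny objects.
small = make_anchors(base_size=4, scales=(1, 2, 4, 8, 16))
```

With these assumed defaults the smallest anchor side is well above 5 pixels, so a 5x5 object would have no anchor of comparable size, while the smaller setting includes a 4x4 anchor. This is the kind of change to try on the detector first.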


felixfuu commented on July 29, 2024

@nqanh OK, thanks! Mask R-CNN uses FPN as the feature extraction model, but in this project I can't find the implementation of FPN. Is there any information about FPN?


nqanh commented on July 29, 2024

No, we use a VGG16 backbone to extract features and mainly focus on the mask branch (for multiclass affordances). The object detection branch is quite simple (only 2 fully connected layers are used). You can extend AffordanceNet with ResNet and FPN.

