Comments (8)
@nqanh Thanks for you share! Very nice work,but how to make .sm file? Can you also share a example then i can make my own data?
from affordance-net.
is the pre-trained model trained on the IIT-AFF dataset ou quote in your paper, i.e. with the following classes?
If so, are yu planning on training it on other dataset? for instance VOC2012 or something with people in it? I do not have 11GB available to train the model on a dataset :(
thank you
Tets
from affordance-net.
Currently, we use threshold=0.9, if no box > 0.9, we choose the highest one - no matter how big the confidence is. You can change the param CONF_THRESHOLD = 0.9
to lower if you want to see more objects < 0.9 (really depends on the scene). You also may want to change/disable the part of code that choose the highest confidence box - line 154
- 159
in demo_img.py
Yes, we train AffordanceNet in IIT-AFF dataset, and the object class is as you posted. We can config AffordanceNet to train on Pascal VOC, but please note the mask in Pascal VOC is binary (i.e. background or foreground), and you will not see all the power of AffordanceNet. We design AffordanceNet to handle mutilclass in each object, not only binary.
I'll release a smaller version of the net soon, so you can train in any dataset (that has any objects) you want.
from affordance-net.
Thanks for your interest @felixfuu! I just added the utils folder. You can find the script to create .sm files and all relevant information to train AffordanceNet on your own data.
from affordance-net.
@nqanh Thanks for you reply!Another question,how to ensure the mask which is very small can be detected?
from affordance-net.
The affordance mask depends on the size of the object, and the object size depends on the anchor parameters (scale and ratio) of the object detector. The concept of anchor was proposed in the Faster R-CNN paper, here we use 15 anchors as in Mask R-CNN paper.
If you want to detect very small objects (e.g, 5x5), you should change the params related to anchor in the prototxt file. You can play with Faster RCNN first before doing for AffordanceNet, because if the object detector fails, then the mask branch will fail.
from affordance-net.
@nqanh Ok ,thanks! The Mask RCNN use FPN as feature extra model , but in this project,i can't fint the implement of FPN,is there any information about FPN?
from affordance-net.
No, we use VGG16 backbone to extract features and mainly focus on the mask branch (for multiclass affordances). The object detection branch is quite simple (only 2 fully connected layers are used). You can extend AffordanceNet with ResNet and FPN.
from affordance-net.
Related Issues (20)
- What is the difference between roi_alignment and roi_alignment2? HOT 2
- A question about .sm file HOT 6
- way to calculate F score HOT 6
- Regarding cudnn version and caffe installation HOT 5
- Sorry, Can you tell me how to create several .sm files in image with pascal voc datasets?
- A question about loss_mask layer
- I get some errors while doing python demo_img.py HOT 1
- src/caffe/layers/cudnn_relu_layer.cu(19): error: identifier "activ_desc_" is undefined HOT 2
- Add cudnn7 support
- How to create several .sm files in an image with pascal voc datasets?
- Camera parameters of IIT dataset HOT 1
- Check failed: registry.count(type) == 1 (0 vs. 1) Unknown layer type HOT 2
- How to change number of classes
- WARNING: filter_boxes() remove ALL proposal.
- One Object with multiple affordances
- Missing Affordance Masks
- Ros version Problem
- Color-coded affordance label HOT 2
- Caffe Installation
- AffordanceNet architecture implementation
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from affordance-net.