Comments (8)
Hi Luca Bertinetto,
When I train with the downloaded imdb.mat, I get this error: "Reference to non-existent field 'id'." What should I do?
from siamese-fc.
Hi @LinHungShi,
Sorry for the very late answer; I am currently taking a break from my PhD to do an internship.
- The concept of scale is related to the size of the object in the previous frame. The update can be written, for example, as new_size = s*old_size. If s>1 the object is increasing in size; if s<1 it is decreasing. At each frame we only search over 3 "scales": s=1, s=1.02 (or something similar) and s=1/1.02.
- I'm not sure I understand what you are asking. Yes, all the images have been processed in the same way during data curation, and we produced 2 crops of different size per frame. The procedure and the code are available in the ILSVRC15-curation folder.
- The procedure to convert pixels in the response map to pixels in frame coordinates is detailed and documented in tracker_step.m.
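The scale update described in the first point can be sketched roughly as follows. This is only an illustration, not the code from the repository: the function name update_size and its parameters are hypothetical, the step 1.02 is the approximate value given above, and the damping factor 0.35 is borrowed from the paper sentence quoted elsewhere in this thread (whether the released tracker uses exactly these values is an assumption).

```python
def update_size(old_size, scale_scores, scale_step=1.02, damping=0.35):
    """Pick the best of three candidate scales and apply a damped size update.

    scale_scores holds one peak response per scale, ordered as
    [1/scale_step, 1, scale_step]. All names here are illustrative.
    """
    scales = [1.0 / scale_step, 1.0, scale_step]
    if len(scale_scores) != len(scales):
        raise ValueError("expected one score per scale")
    # Index of the scale whose response peak is highest.
    best = max(range(len(scales)), key=lambda i: scale_scores[i])
    # Undamped candidate: new_size = s * old_size.
    candidate = scales[best] * old_size
    # Linear interpolation: damping=1 jumps straight to the candidate,
    # damping=0 keeps the old size unchanged.
    new_size = (1.0 - damping) * old_size + damping * candidate
    return scales[best], new_size


# The largest scale wins here, so the size grows, but by less than 2%.
s, size = update_size(100.0, [0.2, 0.5, 0.9])
print(s, size)
```

With damping below 1 the estimated size moves only part of the way towards the winning scale at each frame, which is one simple way to keep the size estimate from jittering.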
Hi,
Apologies for the late answer: I was at CVPR + holidays.
- Yes, we do upsample the score map during tracking.
- Yes, we want to limit the stride of the network to avoid reducing the spatial resolution too much. Do you need to use a pretrained network with a large stride? You can try simply upsampling the activations, or you can train a head of the network that performs the upsampling.
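As a rough illustration of what upsampling a score map means, here is a minimal pure-Python sketch. It uses bilinear interpolation, which is an assumption made for simplicity; the thread does not say which interpolation the tracker uses. The function name upsample_bilinear is hypothetical, and the 17x17 map and factor 16 are taken from the 17x17 to 272x272 figures mentioned elsewhere in this thread.

```python
def upsample_bilinear(grid, factor):
    """Upsample a 2D list of floats by an integer factor (bilinear)."""
    h, w = len(grid), len(grid[0])
    out = []
    for i in range(h * factor):
        # Map the output pixel centre back to input coordinates, clamped.
        y = min(max((i + 0.5) / factor - 0.5, 0.0), h - 1.0)
        y0 = int(y)
        y1 = min(y0 + 1, h - 1)
        wy = y - y0
        row = []
        for j in range(w * factor):
            x = min(max((j + 0.5) / factor - 0.5, 0.0), w - 1.0)
            x0 = int(x)
            x1 = min(x0 + 1, w - 1)
            wx = x - x0
            top = (1 - wx) * grid[y0][x0] + wx * grid[y0][x1]
            bottom = (1 - wx) * grid[y1][x0] + wx * grid[y1][x1]
            row.append((1 - wy) * top + wy * bottom)
        out.append(row)
    return out


# A 17x17 map with a single peak in the centre, upsampled 16x to 272x272.
score = [[0.0] * 17 for _ in range(17)]
score[8][8] = 1.0
up = upsample_bilinear(score, 16)
peak_row, peak_col = max(
    ((r, c) for r in range(len(up)) for c in range(len(up[0]))),
    key=lambda rc: up[rc[0]][rc[1]],
)
print(len(up), len(up[0]), peak_row, peak_col)
```

The finer grid lets the peak be located with sub-stride precision, which is the practical reason for upsampling before taking the maximum.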
Hi, thanks for the reply. CVPR is really a great conference; I hope you enjoyed your trip.
If it's no bother, I have a few more questions about the paper that I need your help with.
- In the paper, you talk about multiple scales, for example: "Multiple scales are searched in a single forward-pass by assembling a mini-batch of scaled images", "Tracking through scale space is achieved by processing several scaled versions of the search image. Any change in scale is penalized and updates of the current scale are damped" and "To handle scale variations, we also search for the object over five scales, and update the scale by linear interpolation with a factor of 0.35 to provide damping". I don't quite understand the meaning of "scale" in this context. Did you change the candidate/search image sizes? Could you explain the concept in more detail?
- My understanding of the sizes of the exemplar and candidate images is that you extract 127x127 and 256x256 patches from the image. However, in Data Curation, you scale the images. Did you scale both the exemplar and the candidate images? Since only the scale factor s is specified, the area of the scaled image will be 127*127 (the area of the exemplar image), but the width and height might be different. Could you give me the general procedure for how you process the images?
- After getting the score map, you upsample it from 17x17 to 272x272. Since the candidate image is 256x256, which is smaller than the upsampled score map, how do we know which score corresponds to which pixel?
I'd really appreciate it if you could give me some hints. Thanks.
Hi Luca Bertinetto,
I have run into the same problem as hanjainglong. When I train with the downloaded imdb.mat, I get this error: "Reference to non-existent field 'id'." Besides, I don't understand the purpose of save_crops.m. What kind of crops does it generate, and what are these crops used for? Could you please give me a hint?
Hi @bertinetto,
I'm afraid you have not clarified one of @LinHungShi's questions. The response map is 17x17 and the network's stride is 8. In your code, disp_instanceInput = disp_instanceFinal * p.totalStride / p.responseUp, so the maximum value of disp_instanceInput is 68, which is much smaller than x_crop's half size of 127. This means that an object lying further than 68 pixels away cannot be detected.
Could you please explain this in detail? Thanks.
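For readers following the arithmetic in this question, the figure of 68 can be reproduced with a small sketch. The formula is the one quoted above; the Python function name is hypothetical, and the upsampling factor of 16 is an assumption inferred from the 17x17 to 272x272 sizes mentioned earlier in this thread.

```python
def response_to_input_displacement(disp_final, total_stride=8, response_up=16):
    """Convert a displacement in upsampled score-map pixels to crop pixels.

    Mirrors: disp_instanceInput = disp_instanceFinal * totalStride / responseUp
    """
    return disp_final * total_stride / response_up


score_size = 17
response_up = 16  # assumption: inferred from 17x17 -> 272x272
upsampled_size = score_size * response_up  # 272
# Largest displacement from the centre of the upsampled map.
max_disp_final = upsampled_size / 2  # 136.0
max_disp_input = response_to_input_displacement(max_disp_final, 8, response_up)
print(max_disp_input)  # 68.0
```

This only checks the numbers in the question; it does not say how the displacement is subsequently mapped back to frame coordinates, which is documented in tracker_step.m.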
I think I have figured it out myself. Apologies for the bother.