Comments (7)
No, it looks like not a problem of dataloader worker.
Maybe there is some problem in your evaluation data.
Use python run.py --type evaluate --config_file your_config_file_path
to reproduce this bug.
It occurs at File "/ldap_shared/home/s_zyt/inseg/code/snake/lib/datasets/collate_batch.py", line 7
from snake.
Yes, actually I use the code reproduced this bug.
python run.py --type evaluate --cfg_file configs/sbd_snake.yaml model custom_model train.dataset CocoTrain test.dataset CocoVal
But I have no idea why would this happend, the dataset I use can be trained on other model like the mmdet version mask rcnn.
from snake.
I think maybe the image passed into function default_collate
must have the same shape.
My problem is their shapes are not the same.
>>> [b['inp'].shape for b in batch]
[(3, 544, 800), (3, 480, 608), (3, 544, 800), (3, 544, 800), (3, 544, 800), (3, 544, 800), (3, 544, 800), (3, 448, 608), (3, 544, 800), (3, 544, 800), (3, 544, 800), (3, 544, 800), (3, 544, 800), (3, 448, 608), ...]
>>> default_collate(a)
RuntimeError: invalid argument 0: Sizes of tensors must match except in dimension 0. Got 544 and 480 in dimension 2 at /pytorch/aten/src/TH/generic/THTensor.cpp:689
How should I fix this problem?
from snake.
Set batch_size = 1
in test
.
Our code does not support batch evaluation.
from snake.
I fixed this problem and the network training is on going.
But I think the training process is very slow and the GPU utilization rate is very low. I have two 2080ti GPU card, but the GPU-Util
is 0% in most of the time.
Is that because of the CirConv
not being fully optimized by the modern dl framework pytorch?
Or other problem.
I wander how long it will take for your training on coco
or sbd
dataset?
My dataset is far smaller than them but it also take almost two days.
training batchsize=16
.
from snake.
Set a bigger num_workers.
The data loading of your training is very slow.
from snake.
The training on sbd takes about 8 hours.
from snake.
Related Issues (20)
- 疑问 HOT 1
- snake_config里面的ro的含义? HOT 3
- ImportError: cannot import name '_ext' from 'lib.csrc.dcn_v2' HOT 2
- Unable to open shared memory object HOT 2
- 数据部分疑问 HOT 2
- RuntimeError: Error(s) in loading state_dict for Network HOT 1
- 模型推理部分疑问 HOT 2
- 关于测试部分的问题 HOT 1
- 使用自己的数据集训练 HOT 1
- Direction of Octagon Initalization HOT 5
- 请问uniform_upsample(poly, p_num):与def uniformsample(pgtnp_px2, newpnum):的区别只是数据输入格式不一样吗? HOT 1
- 训练的时候初始化报错 HOT 5
- 以bbox+img作为输入测试snake时,bbox的储存形式为? HOT 1
- Configuration problem
- the problem of the extreme_utils HOT 1
- change backbone to dla102 HOT 2
- 关于affine transform中scale和center的疑问 HOT 1
- can I only train the snake? HOT 1
- get_affine_transform() is buggy HOT 1
- I can not find those files, can you help me ? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from snake.