Comments (29)
finetuning很简单,你在开头加data层,结尾加loss/acc层,改变fc6的名字和类别。
注意用param { lr_mult: 0 decay_mult: 0 }
固定batchnorm层的参数
from densenet-caffe.
我训练的时候,accuracy一直为0,这个是怎么回事
from densenet-caffe.
我是用python重写了一遍网络结构再生成的prototxt 没用这里的
from densenet-caffe.
All these models are converted from https://github.com/liuzhuang13/DenseNet
So you can ask Zhuang Liu for more details.
from densenet-caffe.
我是想基于你发布的caffe模型进行微调,因为看到你发布的prototxt的layer名与Zhuang Liu发布的配置文件不同,所以想问问你进行训练时用的配置文件
from densenet-caffe.
那请问一下,data层的crop_size必须指定为ImageNet里面的224*224的吗?你这个应该是deploy.prototxt吧?100多MB的caffemodel是准确率最高的caffemodel吗?然后我能否利用你已经上传的DenseNet的caffemodel来进行图片的性别分类呢?还是需要重新利用你给的网络模型结构来进行训练呢?
from densenet-caffe.
data层crop_size在训练的时候是224,你可以不指定到这个大小。
这是deploy文件,准确率已经写在表格里。
可以,你可以利用这个预训练模型,也可以自己从头训练,你自己看着办都行。
from densenet-caffe.
使用161 fine-tuning 的时候,使用的图片是224*224,
train batchsize=64,出现error == cudaSuccess (2 vs. 0) out of memory
改成batchsize=2,还是报错,检查显卡使用了11G多,显卡用的是K40
改batchsize=1,运行train成功,显卡使用2000M的样子,奇怪,怎么改成batchsize=1,会降低那么多,是哪里有问题吗?
另外,finetuning的时候发现loss震荡较大。
from densenet-caffe.
@shicai
你好 网络有实验过检测任务吗 效果会怎样?
from densenet-caffe.
@limin1130 关于显存,作者最近提出了一个内存优化的方案,见论文《Memory-Efficient Implementation of DenseNets》
@OPPOA113 我自己没用DenseNet做过检测,不过作者透露将很快发布一篇论文,使用DenseNet做检测也能获得state of the art的结果。所以,DenseNet做检测应该没什么问题。
from densenet-caffe.
@OPPOA113
用DenseNet做检测任务已经做好了吗?
from densenet-caffe.
@gittigxuy
NO 做过自己的数据的分类实验,参数没调试好,效果不理想。你那边可有检测?
from densenet-caffe.
@OPPOA113 ,我还没做检测,我做图片2分类,但是效果不太理想,可能是初学的原因吧,再略加修改一下,你那的图片分类的项目代码可否给我看一下?如果可以的话,麻烦发邮件到[email protected],谢谢了
from densenet-caffe.
@gittigxuy
就是caffe的代码训练的cifar10 没有什么其他的代码
from densenet-caffe.
@shicai ,求大佬给个用到DenseNet的项目案例,我train - loss图波动很大,找不到原因
from densenet-caffe.
Hi, 我从零开始训练DenseNet121,将fc6换成InnerProduct layer,然后加上Accuracy和SoftmaxWithLoss,loss基本不变,accuracy一直为0。是什么原因呢?
from densenet-caffe.
@yefanhust 只凭你的只言片语,我也不确定原因是什么。即使是随机的初始化,accuracy也会接近0.1%,不会是0的。
from densenet-caffe.
DenseNet121.txt
@shicai 这里附上我修改过的DenseNet121.prototxt neng 能麻烦帮我看下吗?训练中loss和accuracy一直停留在最初的水平。
from densenet-caffe.
@shicai 再附上solver的config,完全是按照原始paper中的数据设置的。
solver.txt
from densenet-caffe.
@yefanhust (1) 确定你的batch size是256,我看文件中是8;(2) 如果从头训练,需要给Convolution和Scale层加上param参数,控制lr和wd,有几个参数就加几个param,比如conv不带bias就加一个,scale层带bias就加两个:param { lr_mult: 1 decay_mult: 0 }
from densenet-caffe.
@shicai (1) 在服务器上训练时是256,8是在单机上测试用的 (2) 好的,我会根据你的建议加上相应的参数
谢谢
from densenet-caffe.
@yefanhust 请问你训练的时候256,为什么我训练的时候batch_size为10,都out of memory,我的显卡是GeForce GTX TITAN X
from densenet-caffe.
@nerddd 我是在DGX-1上训练的,用的8块P100
from densenet-caffe.
@shicai
你好。我用你提供的deploy改写的train_val进行训练,loss一直是2.5左右,不下降。我改train_val只改了开头的数据层和在结尾加了accuracy和softmaxwithloss,请问还有哪里需要修改?
from densenet-caffe.
@yefanhust 想问一下你问题解决了吗?解决了的话,可不可以给看一下修改后的prototxt文件?
from densenet-caffe.
I modified the prototxt according to the comments under this issue. The only difference is that my batchsize is 10, and the loss does not converge and it explodes. I wonder must the batchsize be 256? And is there a correct version of train_val.prototxt for densenet?
from densenet-caffe.
@yefanhust 你可以把你的生成densenet的python脚本分享一下吗?或者将densenet-caffe的train_test.prototxt和solver.prototxt分享一下,我的邮箱 [email protected]
谢谢
from densenet-caffe.
@yefanhust 四块P100表示hold不住batchsize256 多显卡跑caffe 你的cudnn禁用了么
from densenet-caffe.
@shicai
你好。我用你提供的deploy改写的train_val进行训练,loss一直是2.5左右,不下降。我改train_val只改了开头的数据层和在结尾加了accuracy和softmaxwithloss,请问还有哪里需要修改?
请问能加个联系方式吗?我也准备finetuning ,也只是改了开头和加了结尾,我的QQ1443563995
from densenet-caffe.
Related Issues (20)
- Is there anyone successed? My caffe version is not support! HOT 1
- how to use your deploy.prototoxt to 2 classification HOT 3
- Do you have Network definition in Python? HOT 1
- Where can I get the train_val.prototxt ? HOT 2
- about some details HOT 2
- The shapes in netscope are wrong? HOT 1
- About some computing problem HOT 1
- Scope name problem when conventing from LiuZhuang13's code HOT 3
- DenseNet和Faster R-CNN的结合 HOT 3
- { lr_mult: 1 decay_mult: 1}的含义 HOT 16
- Not achieving your reported performance! HOT 3
- when i train my own dataset,the loss is not decrease and it is always about 6.97 HOT 1
- cuDNN does not allow an eps value less than 1e-5
- Hello, I was using caffe once, using a custom version of Microsoft, I do not know how to solve this problem, I hope you can help me.
- Question about scale=0.017 HOT 1
- problems about the data preprocessing HOT 2
- Fine-tune Dense 201 problem
- 关于densenet训练阶段BN层的设置 HOT 1
- 能不能提供提供训练代码以及测试代码
- DenseNet161 deploy prototxt (ceil_mode and round_mode) error HOT 9
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from densenet-caffe.