Git Product home page Git Product logo

Comments (10)

AlfredXiangWu avatar AlfredXiangWu commented on May 13, 2024
  1. 特征层对于A是eltwise6,对于B是eltwise_fc1
  2. 错误信息的字面意思,unknown blob input label to layer 1,你最好自己检查一下网络配置吧

from face_verification_experiment.

tianboguangding avatar tianboguangding commented on May 13, 2024

多谢

训练测试集我是把它分成9:1,而没有把所有数据拿去训练,结果按这个训练出的模型,在lfw上正确率只有65.8%,原因是什么?(孙祎的五点定位,检测,再按照您的方式alignment,最后是144*144的灰度图片,样本准备过程应该没有什么问题,lr:0.001,weight_decay:0.005)

from face_verification_experiment.

AlfredXiangWu avatar AlfredXiangWu commented on May 13, 2024

提取fc1 512维的特征是93.8%的准确率, 而变成eltwise6的256维就只有65.8%?

from face_verification_experiment.

tianboguangding avatar tianboguangding commented on May 13, 2024

您的模型变成eltwise6的256维准确率是97.7%;65.8%是我自己按上边网络重新训练模型并在lfw的测试结果,不知道问题出在哪儿?

from face_verification_experiment.

AlfredXiangWu avatar AlfredXiangWu commented on May 13, 2024

65.8%这个结果太低了,我这边网络的迭代4w次,validation只有30%~40%的时候在lfw也可以到90%左右。我觉得首先还是仔细检查下数据预处理吧,训练集和测试集预处理的方法是否一致,因为训练集裁出来的144x144的图像,测试集lfw应该是128x128的。另外看一下你网络在训练时的收敛状况,你这个65.8%结果的网络validation的准确率是多少呢?

from face_verification_experiment.

tianboguangding avatar tianboguangding commented on May 13, 2024

hi,吴老师:
是的,预处理昨天我又重新做了一遍,先港中文的批处理一遍,之后训练集是按照您的说明裁剪的144X144灰度图,lfw也是128X128的灰度图
训练集是全部的img,val集是从中选取了三万多张,下面是训练的部分收敛情况:
......
I0331 12:59:07.620589 4824 solver.cpp:214] Iteration 230900, loss = 2.82966
I0331 12:59:07.621589 4824 solver.cpp:229] Train net output #0: accuracy = 0.515625
I0331 12:59:07.621589 4824 solver.cpp:486] Iteration 230900, lr = 0.001
I0331 13:00:46.470243 4824 solver.cpp:294] Iteration 231000, Testing net (#0)
I0331 13:01:48.563794 4824 solver.cpp:330] Test loss: 1.67944
I0331 13:01:48.564795 4824 solver.cpp:343] Test net output #0: accuracy = 0.7663
I0331 13:01:48.940816 4824 solver.cpp:214] Iteration 231000, loss = 3.35271
I0331 13:01:48.940816 4824 solver.cpp:229] Train net output #0: accuracy = 0.375
I0331 13:01:48.941817 4824 solver.cpp:486] Iteration 231000, lr = 0.001
I0331 13:03:28.806529 4824 solver.cpp:214] Iteration 231100, loss = 3.02206
I0331 13:03:28.806529 4824 solver.cpp:229] Train net output #0: accuracy = 0.4375
I0331 13:03:28.807528 4824 solver.cpp:486] Iteration 231100, lr = 0.001
I0331 13:05:08.690242 4824 solver.cpp:214] Iteration 231200, loss = 2.77543
I0331 13:05:08.691241 4824 solver.cpp:229] Train net output #0: accuracy = 0.484375
......
这个是不是不太正常?
而在lfw上的正确率还是很低。

from face_verification_experiment.

AlfredXiangWu avatar AlfredXiangWu commented on May 13, 2024

我觉得你这个log变化很正常啊,然后你测试的时候lfw才只有65.8%? 你检查下你提特征的代码吧,提特征的时候图像像素要除255归一化到[0,1]

from face_verification_experiment.

tianboguangding avatar tianboguangding commented on May 13, 2024

hi,吴老师:
用我提特征的代码载入您的model从而生成的lfw.mat,检测正确率是96.8%,所以感觉提特征代码好像没什么问题;但载入我自己训练好的模型不是生成lfw.mat失败(Cannot copy param 0 weights from layer 'conv1'; shape mismatch. Source param shape is 96 3 9 9 (23328); target param shape is 96 1 9 9 (7776). To learn this layer's parameters from scratch rather than copying from a saved net, rename the layer)就是生成了正确率很低(65.8%),而之前您也说我训练的log变化是正常的,那这个问题是出现在哪里呢?

from face_verification_experiment.

AlfredXiangWu avatar AlfredXiangWu commented on May 13, 2024

Cannot copy param 0 weights from layer 'conv1'; shape mismatch. Source param shape is 96 3 9 9 (23328); target param shape is 96 1 9 9 (7776). To learn this layer's parameters from scratch rather than copying from a saved net, rename the layer

这个错误就是字面意思,彩色图像和灰度图像的问题。。。。
65.8%到底是什么问题我也不是特别确定,看log我觉得没问题。所以我还是觉得你提特征的部分有问题,尤其是你说你的代码有时候提取不成功,有时候提出来特征准确率是65.8%,这么不稳定的代码明显里面是存在bug的

from face_verification_experiment.

tianboguangding avatar tianboguangding commented on May 13, 2024

Hi,吴老师:
sorry,是您说的彩色图像和灰度图像的问题。现在好了,迭代12万次时,lfw测试准确率有94%。
不好意思,谢谢了。

from face_verification_experiment.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.