charrin / retinaface-cpp Goto Github PK

View Code? Open in Web Editor NEW

393.0 393.0 127.0 20.45 MB

RetinaFace detector with C++

C++ 94.49% Shell 5.51%

retinaface-cpp's People

Contributors

Stargazers

Watchers

Forkers

yemenr chaoso zhly0 vgg4resnet hanson-young runningj mx2017 tj1116 zhushansheng zhy520xp alakia sanyuanliu jackcc linecode qaz734913414 queryor githubcrj guanliulong shanliwa1 allensmile friendshipity nieshaoshuai github-luffy yjinyyzyq smuzyg zyg11 ahlfors binwanggit starstylesky mike07026 yanghedada slzhly wuwenlong168 howave niuxiaozhang duckj zyelite zhukkang ailib sunjunlishi mrbigdog shiyuan0806 column6942 xwyangjshb 1018365842 chenzhi1992 miaochenguo sy19870112 xuguozhi panda-lab gq124 yjingyu xiaoye77 youngergao pilotbear wu-ruijie liuguoyou pijiu1234 wzjai2018 eewenbinwu guruij yueyilia westnight laulian abeltianxiong azuredsky zhongtb leo-xxx freewind2016 joshuazero tony109060581 zhuqiming678 lqs19881030 bingqingsuimeng romeosgood imagednn fb-wh class8hawk sanrahul micricket datakalp cosmoshua 51w bill007bill timverion xiaqing10 tyro-mm luwei6896 happyday-lkj happythinker loveandhope knightofdawn daojishigailvlun ajiang17 winterxx alexjuncpp shafiahmed xrosliang pllin alleny1987

retinaface-cpp's Issues

使用vulkan 进行GPU加速

您好, 已经跑通您的这个程序, 但我使用Vulkan的时候, 发现不工作 (ncnn 提供的例子可以使用gpu加速),
ncnn::Net _net;

ncnn::create_gpu_instance();
_net.opt.use_vulkan_compute = 1;
int gpu_count = ncnn::get_gpu_count();
ncnn::VulkanDevice vkdev; 
_net.set_vulkan_device(&vkdev);

_net.load_param(param_path.data());
_net.load_model(bin_path.data());

why resize to 300x300 before extraction

hi @Charrin ,

ncnn::Mat input = ncnn::Mat::from_pixels_resize(img.data, ncnn::Mat::PIXEL_BGR2RGB, img.cols, img.rows, 300, 300);
cv::resize(img, img, cv::Size(300, 300));

300x300 is really small for bigger images, is it the network size? however, I found the param is 640:
Input data 0 1 data 0=640 1=640 2=3

可以提供一下caffe版本的预测sample代码吗？

您好，请问您能提供一下caffe版本的预测sample代码吗，我用您的mnet模型写出来的c++预测结果不太对。

inference time 11.890137
inference time 77.138916
inference time 14.939941
inference time 153.685059
inference time 11.651123
inference time 11.635986
inference time 18.880127
inference time 21.743896
inference time 18.770996
inference time 210.691895
inference time 15.924072
inference time 13.067871
inference time 11.803955
inference time 11.813965
inference time 12.151123
face detection time cost: min = 8.04ms max = 733.61ms avg = 64.47ms
why the inference time are not stable ?

error C2065: “anchor”: 未声明的标识符

void AnchorGenerator::landmark_pred(const CRect2f anchor, const std::vectorcv::Point2f& delta, std::vectorcv::Point2f& pts) {
float w = anchor[2] - anchor[0] + 1;
float h = anchor[3] - anchor[1] + 1;
float x_ctr = anchor[0] + 0.5 * (w - 1);
float y_ctr = anchor[1] + 0.5 * (h - 1);
pts.resize(delta.size());
for (int i = 0; i < delta.size(); ++i) {
pts[i].x = (delta[i].x*w + x_ctr)ratiow;
pts[i].y = (delta[i].yh + y_ctr)*ratioh;
}
}中出现该问题， @Charrin

模型问题

请问
RetinaFace-Cpp/Demo/ncnn/models/
这个目录下的
retina.bin 和 retina.param
是对应于您该链接中：
I convert R50 mxnet model to caffe model BaiDuYun密码:6evh | Google Drive
中提供的模型吗

Different size for input image

when i modify the size of input image to 640 * 640, got no result for stride8, but got the result for stride16 and stride32. but I don't know why.

Got different output between Mxnet and caffe anything wrong?

I'm using mobilenet-0.25 Retinaface on TensorRT & caffe. They getting the same output value but no bbox output. Then I compare the output of caffe to Mxnet, it is different(Mxnet can get the bbox).
The preprocess is like below
`cv::Mat img = cv::imread("./test.jpg");

    cv::cvtColor(img, img, CV_BGR2RGB);

cv::resize(img, img, cv::Size(INPUT_H, INPUT_W));

cv::Mat frame_copy = img.clone();

cv::resize(frame_copy, frame_copy, cv::Size(INPUT_W, INPUT_H));

float* data = new float[INPUT_C*INPUT_H*INPUT_W];

std::vector<cv::Mat> fd_input_channels;

for (int i = 0; i < 3; ++i) {
	cv::Mat channel(INPUT_H, INPUT_W, CV_32FC1, data);
	fd_input_channels.push_back(channel);
	data += INPUT_H * INPUT_W;
}

cv::Mat sample_float;

frame_copy.convertTo(sample_float, CV_32FC3, 1.0 / 128, -127.5 / 128);

cv::split(sample_float, fd_input_channels);`

The input is normalize to -1~1 planar,right?

mxnet model convert to ncnn

您好,
请问你的ncnn模型是用的 mnet.25-symbol.json mnet.25-0000.params 这两个文件转的吗,
我用ncnn最新代码转,运行会报错.查看了一下转换后的ncnn模型和你工程中的差别蛮大的, 你转换的时候用了什么技巧吗,你ncnn的tag 是多少

谢谢

Cmake or Make file ?

Hi,

How we can compile ? any Makefile or Cmake option ?

Best

Is the output same?

Hi,
Thanks for your good job.
I tried your retinaface-R50 caffe model, the output size is not same as the MXNET output.
For example, the stride 8 output size, if the input size is 640x640, the caffe model output size is 81x81, but MXNET output size is 80x80

Is it same as your caffe model output?

Any Speedups Compared to Python?

Thanks for the conversion.

Is there any speedups compared to the Python version? Can you provide some simple benchmarks?

Thanks

How you convert mxnet to caffe model, Can you share your code?

'inference_utils.hpp' file not found

./anchor_generator.h:8:10: fatal error: 'inference_utils.hpp' file not found

is crop necessary?

both input of the crop layer have identical shape. Do we really need crop layer?

关于用MNN推理框架在树莓派4B推理速度

测试了一下caffe的mnet模型在树莓派4B上的速度，推理框架用的阿里的MNN，树莓派4B的cpu型号是BCM2711（四核Cortex A72，主频1.5GHz），测试分辨率为VGA (640*480)，loop10次取平均：

核心数	fp32计算耗时（ms）	量化后int8计算耗时（ms）
1	167	183
2	116	102
3	105	76
4	96	61

MNN框架的加速优化还是做得挺不错的。

How to run inference using provided models

How to run inference using the model provided in convert_models folder ?
What piece of code should I place in the space provided in "detect.cpp" ?
What are the preferred libraries to be used ?
The detect.cpp file mentions "Inference ,NetContext and Cube objects" . Where can I find these classes ? (Line number 16 17 and 37 respectively

How to run at win10? thank you !

about Cube struct

Cube cls;
        Cube reg;
        Cube pts;

        // get blob output
        char clsname[100]; sprintf(clsname, "face_rpn_cls_prob_reshape_stride%d", _feat_stride_fpn[i]);
        char regname[100]; sprintf(regname, "face_rpn_bbox_pred_stride%d", _feat_stride_fpn[i]);
        char ptsname[100]; sprintf(ptsname, "face_rpn_landmark_pred_stride%d", _feat_stride_fpn[i]);

        net.Extract(nc, clsname, cls);
        net.Extract(nc, regname, reg);
        net.Extract(nc, ptsname, pts);
```when i use caffe C++, i find Cube not defined. Can you show me its structure？

onnx model when convert model

你好！
我在转换模型时遇到一些问题，请问可以给我您转换过程中用到的onnx模型吗
或者方便的话发送到我的邮箱[email protected] 感谢

How can I elevate the model performance？

The inference cost time is more than 200ms in my device... while I use 4 threads to match the core num... (network input size is 640*480)
However , when I use the inference in "SpeedTest", The forward time is less than 100ms, remains 4 thread...
Could U share the SpeedTest's source code？ Thank you

About arm platform (ncnn model)

Do you have any plans to transfer to ncnn on arm platform? I failed to convert ncnn with caffe model which you provide.

:~/Documents/3rdpart/ncnn/build/tools/caffe$ ./caffe2ncnn ./mnet.prototxt ./mnet.prototxt.caffemodel ./retina.param ./retina.bin
Segmentation fault (core dumped)

Widerface valuation

Hi，Charrin：
what format should I save the widerface test result in order to test the widerface val dataset，I could not find a doc that explain it.Thanks.

Questions about caffe converter

I use this repo: https://github.com/cypw/MXNet2Caffe to convert caffe model from mxnet.
But I found the BN op gives a big error like max bias = 0.##, Have you got some ideas to solve this problem?

批量裁剪图片时，预测出错

您好，我更改了您的代码用于批量裁剪图片。结果在裁剪了数张图之后，预测出现问题，返回错误坐标。

这是我更改后的Main.cpp

#include <stdio.h>
#include <vector>
#include <opencv2/core/core.hpp>
#include <opencv2/highgui/highgui.hpp>
#include <opencv2/imgproc/imgproc.hpp>

#include <iostream>
#include <string>
#include <fstream>
#include <sstream>
#include <stdlib.h>
#include <cstdio>

#include "platform.h"
#include "net.h"
#include<windows.h>
#include "anchor_generator.h"
#include "config.h"
#include "tools.h"

using namespace std;

const int target_size = 300;
//------------------------------------------------------------------------------------------
struct Object
{
	cv::Rect_<float> rect;
	int label;
	float prob;
};

static int detect_mobilenet(const cv::Mat& bgr, std::vector<Anchor>& result, const char* paramPath, const char* modelPath)
{
	ncnn::Net retina;

	retina.load_param(paramPath);
	retina.load_model(modelPath);

	int img_w = bgr.cols;
	int img_h = bgr.rows;

	ncnn::Mat in = ncnn::Mat::from_pixels_resize(bgr.data, ncnn::Mat::PIXEL_BGR2RGB, bgr.cols, bgr.rows, img_w, img_h);
	//ncnn::Mat in = ncnn::Mat::from_pixels_resize(bgr.data, ncnn::Mat::PIXEL_BGR, bgr.cols, bgr.rows, target_size, target_size);
	
	//const float mean_vals[3] = { 103.939f, 116.779f, 123.68f }; //retina
	//const float norm_vals[3] = { 1.0 / mean_vals[0],1.0 / mean_vals[1],1.0 / mean_vals[2] };
	const float mean_vals[3] = { 0,0,0 }; //retina
	const float norm_vals[3] = { 1,1,1 };
	in.substract_mean_normalize(mean_vals, norm_vals);

	ncnn::Extractor ex = retina.create_extractor();

	ex.input("data", in);

	//--------------------------------------------------------
	std::vector<AnchorGenerator> ac(_feat_stride_fpn.size());
	for (int i = 0; i < _feat_stride_fpn.size(); ++i) {
		int stride = _feat_stride_fpn[i];
		ac[i].Init(stride, anchor_cfg[stride], false);
	}
	std::vector<Anchor> proposals;
	proposals.clear();

	for (int i = 0; i < _feat_stride_fpn.size(); ++i) {
		ncnn::Mat cls;
		ncnn::Mat reg;
		ncnn::Mat pts;

		char clsname[100]; sprintf(clsname, "face_rpn_cls_prob_reshape_stride%d", _feat_stride_fpn[i]);
		char regname[100]; sprintf(regname, "face_rpn_bbox_pred_stride%d", _feat_stride_fpn[i]);
		char ptsname[100]; sprintf(ptsname, "face_rpn_landmark_pred_stride%d", _feat_stride_fpn[i]);

		ex.extract(clsname, cls);
		ex.extract(regname, reg);
		ex.extract(ptsname, pts);

		ac[i].FilterAnchor(cls, reg, pts, proposals);

	}
	
	nms_cpu(proposals, nms_threshold, result);

	retina.clear();
	ac.clear();
	proposals.clear();

	return 0;
}

void crop_objects(cv::Mat& bgr, std::vector<Anchor> result, const char* imagepath) {
	//resize(bgr, bgr, cv::Size(target_size, target_size));
	cv::Mat image = bgr.clone();
	
	if (result.size() == 0) {
		cout << "[ " << imagepath << " ] crop fail!\n" << endl;
		system("pause");
		exit(0);
	}
	Anchor res = result[0];
	for (int i = 1; i < result.size(); i++)
	{	
		cout << res.score << endl;
		cout << result[i].score << endl;

		if (res.score < result[i].score) {
			res = result[i];
			cout << "!" << endl;
		}

	}
    
	cout << res.score << endl << endl;
	cout << res.finalbox.x << " " << res.finalbox.y << " " << res.finalbox.width << " " << res.finalbox.height << endl;

	float w,h;
	w = h = 0;
	w = res.finalbox.width - res.finalbox.x + 1;
	h = res.finalbox.height - res.finalbox.y + 1;

	if (res.finalbox.x < 0) {
		res.finalbox.x = 0;
	}
	if ((res.finalbox.x + w) > image.cols) {
		w = image.cols - res.finalbox.x;
	}

	if (res.finalbox.y < 0) {
		res.finalbox.y = 0;
	}
	if ((res.finalbox.y + h) > image.rows) {
		h = image.rows - res.finalbox.y;
	}

	cout << res.finalbox.x << " " << res.finalbox.y << " " << w << " " << h << endl;
	image = bgr(cvRect(res.finalbox.x, res.finalbox.y, w, h));

	cv::imwrite(imagepath, image);
	//cv::imshow("t", image);
	//cv::waitKey(true);
	//Sleep(200);
	cout << "[ " << imagepath << " ] crop complete!\n" << endl;
}

int main(int argc, char** argv)
{
	string cropListPathStr = "C:/Users/SEARECLUSE/Desktop/RetinaFaceDemo/x64/Release/dataList.txt";
	string paramPathStr = "C:/Users/SEARECLUSE/Desktop/RetinaFaceDemo/x64/Release/model/mnet.ncnn.param";
	string modelPathStr = "C:/Users/SEARECLUSE/Desktop/RetinaFaceDemo/x64/Release/model/mnet.ncnn.bin";

	if (argc > 1) {
		cropListPathStr = argv[1];
		paramPathStr = argv[2];
		modelPathStr = argv[3];
	}

	cout << "确认裁剪列表: " << cropListPathStr << endl;
	cout << "确认模型结构: " << paramPathStr << endl;
	cout << "确认模型文件：" << modelPathStr << endl;
	cout << "准备就绪。" << endl;
	system("pause");

	const char* cropListPath = cropListPathStr.data();
	const char* paramPath = paramPathStr.data();
	const char* modelPath = modelPathStr.data();

	ifstream infile;
	infile.open(cropListPath, ios::in);
	while (!infile.eof()) {
		string str;
		getline(infile, str);

		const char* imagepath = str.data();
		cv::Mat m = cv::imread(imagepath, 1);

		cout << imagepath << endl;

		if (m.empty())
		{
			fprintf(stderr, "cv::imread %s failed\n", imagepath);
			return -1;
		}

		std::vector<Anchor> result;
		detect_mobilenet(m, result, paramPath, modelPath);
		crop_objects(m, result, imagepath);
		result.clear();
	}

	system("pause");
	return 0;
}

crop

作者你好，crop时特征图有时大小刚好相等，也有特征图不等的情况，那么offset是怎么设置的呢，prototxt中没有offset的值，caffe这个层不太懂，没有设置offset是默认居中裁剪吗？

Run on CPU only

Hello,
will this run on CPU?
I can't spot where you are loading the model.

No "inference.hpp" header file

there is no inference.hpp header file.

R50 mxnet convert to ncnn parameter file has inconsistent

1、convert R50 use mxnet2ncnn then load model
std::string param_path = "./models/retina-R50.param";
std::string bin_path = "./models/retina-R50.bin";
ncnn::Net _net;
_net.load_param(param_path.data());
_net.load_model(bin_path.data());

2、when run it error in _extractor.extract(clsname, cls) error:load_model error at layer 282, parameter file has inconsistent
clsname char [100] "face_rpn_cls_prob_reshape_stride16"

Cube not defined

int FilterAnchor(const Cube* cls, const Cube* reg, const Cube* pts, std::vector& result);

@Charrin you must forget somthing...

Onnx model

Hi! Could you provide onnx model for R50?

Could you introduce ROKIDCNN briefly？

Hi Charrin：

  RokidCNN is so fast compared to NCNN, could you introduce it briefly? many thanks.

About the result test on WiderFace_val set for SINGLE SCALE

Good job! But I am confused about the result tested on wider face val set. Accord to this deepinsight/insightface#669, the result tested on WIDER Face Hard using mnet0.25 model is 0.791, and your readme showed is about 0.36. Could you tell me why the result has so big difference??

运行环境

您好，这个代码运行的环境，您能详细说下吗，非常感谢！