610265158 / peppa_pig_face_landmark Goto Github PK

View Code? Open in Web Editor NEW

524.0 22.0 116.0 73.4 MB

A simple face detect and alignment method, which is easy and stable.

License: Apache License 2.0

Python 99.95% Shell 0.05%

facedetect face-alignment face-keypoint face-mask keypoint pytorch

peppa_pig_face_landmark's Introduction

Peppa_Pig_Face_Engine

introduction

It is a simple demo including face detection and face aligment, and some optimizations were made to make the result better.

click the gif to see the video:

and with face mask:

requirment

PyTorch
onnxruntime
opencv

model

1 face detector

yolov5-face

2 landmark detector

Simple keypoints detector.

WFLW	NME	Flops(G)	Params(M)	Pose	Exp.	Ill.	Mu.	Occ.	Blur	pretrained
Student@128	4.80	0.35	3.25	8.53	5.00	4.61	4.81	5.80	5.36	skps
Teacher@128	4.17	1.38	11.53	7.14	4.32	4.01	4.03	4.98	4.68	skps
Student@256	4.35	1.39	3.25	7.53	4.52	4.16	4.21	5.34	4.93	skps
Teacher@256	3.95	5.53	11.53	7.00	4.00	3.81	3.78	4.85	4.54	skps

I will release new model when there is better one.

By default student@256 is used in this project

install

git clone https://github.com/610265158/Peppa_Pig_Face_Landmark
python setup.py install

useage

# by code:

from Skps import FaceAna
facer = FaceAna()

result= facer.run(image)

## detect images, tracing is not used, add
##  facer.reset()

More details refer to demo.py

run python demo.py --cam_id 0 use a camera
or python demo.py --video test.mp4 detect for a video
or python demo.py --img_dir ./test detect for images dir no track

How to train

The codes are placed in folder TRAIN/face_landmark

Refer to TRAIN/face_landmark/README.md to train the model.

peppa_pig_face_landmark's People

Contributors

Stargazers

Watchers

Forkers

ooozxv2 xiaooquanwu baiyuang l1129433134 chentyjpm lzchuan ahuirecome meismaomao trendingtechnology xubin19939 aliushn magnetstone zhiqiang-li miwaliu luckynote backto77 jittapoo youngallien qaz734913414 zhaoyk1986 guyafeng fb-wh undercontroller freewind2016 jingziyou fondasup tkxcrf chapterfive-hero dex1990 seemaywang sudabai666 yhl2018 qcqcm jumpycat lifei-cn xiaowenhe lwcode akfheaven thandongtb oftenliu aodamiaomiao shaunzheng mornydew 1996scarlet spacetral jjandnn 3d2ydym dreadlord1984 jinseyun saddambinsyed rogerluojie match08 starstylesky mstaryi chdzhen bhaskarnil29 hxhh xuexuetong1993 wqz960 zhly0 zcc720 xhimileo dorniwang guoquanhao zhang-shi-chang johnbhlm ashimroy88 nickalg idesignitx jnulzl nsl2014fm adteven andrew940 jienimi xhwnobody qiwj1000 itrms wujinke brucema7 paranoth wolfworld6 jinjicheng intjun qiaone yyqgood adas-eye haoqiwansigou hzwhh liumarcus70s joe2hpimn incandescenced1 irvingshu mowenli ouya-bytes sinianyutian jxncyym cv-nlp zzz-xg wf1024966 zwilsonss

peppa_pig_face_landmark's Issues

跟踪问题

您好，在调试您的代码的时候发现：当突然遮挡住人脸的时候就会出现bug，貌似是
for i in range(landmarks.shape[0]):
track.append([np.min(landmarks[i][:, 0]), np.min(landmarks[i][:, 1]), np.max(landmarks[i][:, 0]),
np.max(landmarks[i][:, 1])])

这块代码的问题，我还在调试，没找到解决方案

请问关键点检测是用什么算法实现的

Confuse about model inference time

hi, dear Peppa man, I found you have listed those model inference time,

shufflenetv2_0.75 including tflite model,
(time cost: mac [email protected]， tf2.0 5ms+， tflite 3.7ms+- model size 2.5M)
but when I run the demo in my 1080/ cpu，they spent the almost same time.

In CPU（Intel(R) Xeon(R) Gold 6132 CPU @ 2.60GHz）：
one iamge cost 0.016803 s
facebox detect cost 0.0021691322326660156
in 1080:
one iamge cost 0.011924 s
facebox detect cost 0.001332998275756836

It seems to your face landmark model inference time is Shorter？

C++

你好,作者。
请问这个demo有C++实现的吗？

What does each of the [1,4200,5] output from the detector.tflite model represent?

关于使用OpenFace

能不能考虑直接使用OpenFace的检测代码，在Python上实现调用呢

窗外的麻雀在电线杆上多嘴，你说这一句很有夏天的感觉

Do we need a QQ group to communication？

与facebox 结合的问题

在您另外一个facelandmark 的工程中，我看到您的训练数据（landmark的输入img）的截取是根据landmark的外框来的，并没有去结合根据人脸检测的框，在peppa pig 的demo 用的landamrk就是用这种数据处理方式得到的吗？
观察到当我把人脸检测的方法换成别的时，精度明显没有用你的人脸检测的做出来的landmark 高，请问您是在训练landmark的数据预处理时参考了人脸检测框吗？

如果歪着头，检测效果似乎就不太好

我在使用的时候，发现这几张照片都检测不到关键点。歪着头的图片，检测效果会比较差吗？

TensorRT

你好，作者。
我想请教下，你试过在TensorRT中使用人脸特征点模型吗？

How to use GPU?

Thank you very much for your open source, I want to ask how to use GPU to detect faces, looking forward to your reply!

When I use multi-face video to perform inference based on the tflite model, the error is as follows

Traceback (most recent call last):
File "/home/changshuai/code/Peppa_test/demo.py", line 165, in
video("/home/lianping/sunxiaohu/face-detection-tensorrt-yolov3-tiny/test_video/20191010155113.avi")
File "/home/changshuai/code/Peppa_test/demo.py", line 30, in video
boxes, landmarks, states = facer.run(image)
File "/home/changshuai/code/Peppa_test/lib/core/api/facer.py", line 61, in run
landmarks,states=self.face_landmark.batch_call(image,boxes)
File "/home/changshuai/code/Peppa_test/lib/core/api/face_landmark.py", line 140, in batch_call
self.model.set_tensor(self.input_details[0]['index'], images_batched)
File "/home/lianping/develop_environment/anaconda2/envs/tensorflow_1.14_cuda9.0/lib/python3.5/site-packages/tensorflow/lite/python/interpreter.py", line 197, in set_tensor
self._interpreter.SetTensor(tensor_index, value)
File "/home/lianping/develop_environment/anaconda2/envs/tensorflow_1.14_cuda9.0/lib/python3.5/site-packages/tensorflow/lite/python/interpreter_wrapper/tensorflow_wrap_interpreter_wrapper.py", line 136, in SetTensor
return _tensorflow_wrap_interpreter_wrapper.InterpreterWrapper_SetTensor(self, i, value)
ValueError: Cannot set tensor: Dimension mismatch. Got 4 but expected 1 for dimension 0 of input 411.

eyeglasses

when I wear eyeglasses, the result is not good. How to improve?

C++ preprocess help needed

您好，我在尝试用mnn C++做keypoint模型的c++移植，对于这里的预处理不理解：https://github.com/610265158/Peppa_Pig_Face_Engine/blob/master/lib/core/api/face_landmark.py#L102
add = int(max(bbox_width, bbox_height))
bimg = cv2.copyMakeBorder(image, add, add, add, add, borderType=cv2.BORDER_CONSTANT, value=cfg.DATA.pixel_means)
bbox += add

请问能否协助指点C++的预处理吗？

face detect

Hi， the face detection algorithm used in the project is faceboxes？？

convert tensorflowLite error

I went to convert models into the TensorFlow Lite
keypoints success
but detector get error
packages\tensorflow_core\lite\python\lite.py", line 428, in convert
"invalid shape '{1}'.".format(_get_tensor_name(tensor), shape_list))
ValueError: None is only supported in the 1st dimension. Tensor 'images' has invalid shape '[None, None, None, None]'.
can help me?
Thinks.

The meaning of the “states” derived from this keypoint model

hi，I don’t understand the meaning of the “states” derived from this keypoint model, and why is this variable not used？

stable landmarks results like demo?

Nice job. I download the project, and test it with a video, but the landmarks is untstable like demo.

本来好好的，一段时间不用变成这样了

视频或多张图片都是一样，第一帧是正确的 landmark，后来很快变成一条近似直线。单张图测试是可以。

上次测试还是可以的，期间没动过代码，今天就出现这种问题。不知道是不是pip install 动了某些库……

2024-07-23_06-41-41.mp4

develope headpose

what a great work you have done! May I ask if you have finish developing headpose? or could you tell some details about it to me?
Thanks a lot.

looking forward to your model on WFLW dataset!

Thanks to your pretty good works , and i wish to see your model containing pupil-detection ,...

Anyone knows how to process the face that the eyes is in different status, such as one open, one close.

except for cropping the eyes region and then predict

视频关键点抖动问题

您好，
我看了您给的demo，关键点抖动稳定性比较好。我看在实际代码中，是通过利用前一帧的结果对人脸框和关键点做平滑来进行处理的。我按照同样的方式，在其他视频上测试，抖动稍微好些，但还是有抖动。请问有关视频中关键点抖动问题有什么更好的方法吗？

Do you need assistance to complete your work?

Hello, great effort on this implementation.

Suggestion - updating face detection to use YoloNas for facial detection.
Is the Main Teacher model outperforming Spiga?
Would you need GPU resources / other assistance to complete your work? How far are you from achieving SoTa results?

HeadPose calculation

Hi, I'm checking the head pose calculation of the face, and I have a problem when with a webcam I change the yaw of my face the pitch and roll change even though my head is not changing pitch and roll (the pitch is increasing for example). I've calibrated my webcam to minimize the distortion from the lenses and changed the D value.
I've seen the "object_pts" variable is an array with the points reference. How did you found these values?
It is possible to change from the "object_pts" which points pick to reduce the error? (for example, add the 4 points of the nose)

这用的是什么跟踪算法呢

关键点smooth中的OneEuroFilter 问题？

在lib/core/LK/lk.py中，
line86: result.append(self.filter(now_landmarks[i], previous_landmarks[i])) 使用OneEuroFilter 来smooth;
line144 : self.dx_prev = dx_hat 使用68个关键点中的上一个点的dx_prev，而不这个点对应上一帧的dx_prev。
这么写没问题么？虽然smooth对结果影响不是很大。

The TFLite model is unable to run on GPU

face detection accuracy low

Thanks for your good work.

When I testing the Tf1 version on ubuntu 18.04 OS then pepa-face engine application some time detecting nonface things as a face like below image

Is there any configuration I can increase the threshold for strict face detection?
TF2 version is better than TF1 version implementation ( face detect and landmark detect)?

please advise.

Confused about the code

https://github.com/610265158/Peppa_Pig_Face_Engine/blob/b2a39f9f5db13d833c263a6d2aafd4c2e4b41502/lib/core/api/facer.py#L65
Comments on this line of code said ‘calculate the head pose for the whole image‘，but It seems not right when I watch it. This function is for update landmark, do I understand right?

Is there a way to obtain the confidence of each landmark?

keypoints.tflite outpute details

Hi, Thanks for your awesome repository.
I compared your keypoints.tflite model to mediapipe face_landmark.tflite provided by mediapipe.
mediapipe output_details has 2 row containing landmark points and confidence(not sure).

but keypoints.tflite model output_details has 3 row. I don't know what are the first 2 rows but the third row is landmark points.
is the first 2 rows related to confidence? what are they?

竟然这样填充，代价很大的哟

填充之前图片还没缩小，bbox_width / bbox_height 可能很大，一下子扩大 9 倍以上

How to get each Facial landmarks of face?

Thanks for great work

could you please advise how to get the left and right eye landmark?

I want to calculate the EAR of the left and right eyes.

I followed the DLIB facial points to I am getting below error

cannot reshape array of size 600 into shape (1,10,20,1)

code

`def predict_eye_state(self, image):
global graph
with graph.as_default():
image = cv2.resize(image, (20, 10))
image = image.astype(dtype=np.float32)

        image_batch = np.reshape(image, (1, 10, 20, 1))
        image_batch = keras.applications.mobilenet.preprocess_input(image_batch)

        return np.argmax(self.eye_model.predict(image_batch)[0])

Help highly appreciated.

训练代码和数据会公布吗？

Will the training code and data be released?

lr 不随着step 增加而改变

1.我看到你在实现optimizer的时候是用apply_gradient那些自己去实现的，而不是直接用的minimize，请问为啥要自己写呢？有什么考虑
2.我跑了一下你的training发现lr 并没有在boudaries处变化，global step超过第一个节点150000之后lr还是0.001哎。甚至到了90k 也没变，是不是minize 有什么bug？

你的模型有做过量化的测试吗？比如在WFLW的结果是多少？还是只是肉眼测试？

Launching time long

the launching time is so long(after 1 minute) or I am doing something wrong

输出的Landmark:坐标和dlib的差别很大

输出Landmark坐标和dlib输出的差别很大

TypeError: load() missing 1 required positional argument: 'export_dir'

face_detector.py ave me above error:
self.model = tf.saved_model.load(self.model_path) in this line. In cofig.py as path I gave model/detector
Tensorflow 1.13

Can give me some improve opinions?

Hi, i checked your keypoints model, it's great! But i found that the 37th point(index:36) with a bigger deviation, what's more, eye can not close well when the angle of yaw bigger than about 30 degree. So what can i do to reduce these problems?
For me i want to use euler angle balance the half face, calculate eye area based on opencv to balance the eye close, and make a bigger weights for 37th point. Also remove the additional outputs, just for the 136 landmarks. My train is time consuming and i don't know whether my ideas is more effective or not, so can you give me some advices? Thx!