diffpose_video's Introduction

[DiffPose: Video Setting]

[Paper] | [Project Page] | [SUTD-VLG Lab]

Environment

The code is developed and tested under the following environment:

Python 3.8.2
PyTorch 1.7.1
CUDA 11.0

You can create the environment via:

conda env create -f environment.yml

Dataset

Our datasets are based on 3d-pose-baseline and Video3D data. We provide the GMM format data generated from the above datasets here. You should put the downloaded files into the ./data directory. Note that we only change the format of the Video3D data to make them compatible with our GMM-based DiffPose training strategy, and the value of the 2D pose in our dataset is the same as them.

Video-based experiments

Evaluating pre-trained models for frame-based experiments

We provide the pre-trained diffusion model (with CPN-dected 2D Pose as input) here. To evaluate it, put it into the ./checkpoint directory and run:

CUDA_VISIBLE_DEVICES=0 python main_diffpose_video.py \
--config human36m_diffpose_uvxyz_cpn.yml --batch_size 1024 \
--model_pose_path checkpoints/mixste_cpn_243f.bin \
--model_diff_path checkpoints/diffpose_video_uvxyz_cpn.pth \
--doc t_human36m_diffpose_uvxyz_cpn --exp exp --ni \
>exp/t_human36m_diffpose_uvxyz_cpn.out 2>&1 &

We also provide the pre-trained diffusion model (with Ground truth 2D pose as input) here. To evaluate it, put it into the ./checkpoint directory and run:

CUDA_VISIBLE_DEVICES=0 python main_diffpose_video.py \
--config human36m_diffpose_uvxyz_gt.yml --batch_size 1024 \
--model_pose_path checkpoints/mixste_cpn_243f.bin \
--model_diff_path checkpoints/diffpose_video_uvxyz_gt.pth \
--doc t_human36m_diffpose_uvxyz_gt --exp exp --ni \
>exp/t_human36m_diffpose_uvxyz_gt.out 2>&1 &

Bibtex

If you find our work useful in your research, please consider citing:

@InProceedings{gong2023diffpose,
    author    = {Gong, Jia and Foo, Lin Geng and Fan, Zhipeng and Ke, Qiuhong and Rahmani, Hossein and Liu, Jun},
    title     = {DiffPose: Toward More Reliable 3D Pose Estimation},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2023},
}

Acknowledgement

Part of our code is borrowed from DDIM, VideoPose3D, Graformer, MixSTE and PoseFormer. We thank the authors for releasing the codes.

diffpose_video's People

Contributors

Stargazers

Watchers

diffpose_video's Issues

Issiues about GMM-formatted Data and Associated Preprocessing Code

Thanks for your great work! How can I obtain the data you provided in GMM format, such as the file named data_2d_h36m_cpn_ft_h36m_dbb.npz? Could you please share the preprocessing code required for this specific data? Your assistance is much appreciated!

issues about the tested mpjpe per action

Thank you for deliver the outstanding work. But I have a problem when I try to test the diffpose_video project. After the pretrained models are reloaded, the mpjpe per action does not match the results listed in paper. For example, testing mpjpe of smoking is 40.72mm.
"----Smoking----
Test time augmentation: False
Protocol #1 Error (MPJPE): 40.72313231684394 mm
Protocol #2 Error (P-MPJPE): 32.65488504479497 mm
Protocol #3 Error (N-MPJPE): 39.919384486847 mm
Velocity Error (MPJVE): 2.8684418194658927 mm
----------“
but the mpjpe listed in paper for smoking is 37.3mm. What causes the difference, is there something wrong with my experimental settings?

Recommend Projects

gongjia0208 / diffpose_video Goto Github PK

diffpose_video's Introduction

[DiffPose: Video Setting]

Environment

Dataset

Video-based experiments

Evaluating pre-trained models for frame-based experiments

Bibtex

Acknowledgement

diffpose_video's People

Contributors

Stargazers

Watchers

diffpose_video's Issues

Issiues about GMM-formatted Data and Associated Preprocessing Code

issues about the tested mpjpe per action

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent