Git Product home page Git Product logo

yolov5-vitpose-video-annotator's People

Contributors

fan23j avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar

yolov5-vitpose-video-annotator's Issues

wild video infer too slow

Hello, I have seen you from the issues of motionbert. Now I am using your warehouse to obtain json data, but the speed is very slow when I reason my own video on motionbert, 10s video takes more than ten minutes, and there are some warning messages, may I ask why?

MAGEIO FFMPEG_WRITER WARNING: input image is not divisible by macro_block_size=16, resizing from (923, 924) to (928, 928) to ensure video compatibility with most codecs and players. To prevent resizing, make your input image divisib

le by the macro_block_size or set the macro_block_size to 1 (risking incompatibility).

0% | ▋ | 1/296 [00:00 & lt; 04:16, 1.15 it/s] [

swscaler @ 000001df1adc4300] Warning: data is not aligned! This can lead to a speed loss

My testloader Settings are as follows: testloader_params = {

'batch_size': 1,

'shuffle': False,

'num_workers': 0,

'pin_memory': True,

'prefetch_factor': 2,

'persistent_workers': False,

'drop_last': False

}

How to train a custom halpe whobody model

Hi,
I download your ViTPose with Halpe model, and it works well. This model(vitpose-halpe-256x192.pth) supports halpe 26 keypoints, but I want 2 more keypoints(one in lefthand, another in righthand, both of which don't have to be very accurate ). Since vitpose-halpe-256x192.pth is so big( about 7.5GB), is it very difficult to train it? or more difficult for a custom 28 kpts model ? So, Could you give me some advice on how to train it? or is it a good idea to wait for someone else to share a hape wholebody model with hands keypoints?

Looking forward to your reply.

Yolov5 model file

Thank you for your project, I am a novice, I would like to ask which yolov5 model file should be downloaded ?

Support for Selective Person Tracking in Multi-Person Videos

Hi there!

Firstly, I'd like to extend my heartfelt gratitude for the incredible work on this project. The functionality and performance have been outstanding, and it's been instrumental in my current work involving pose estimation.

I'm currently utilizing the script to generate Alphapose-like outputs from VitPose, subsequently feeding this into MotionBert for 3D pose estimation. The results have been promising; however, I've encountered a specific challenge that I'd like to discuss.

In my current setup, I'm dealing with videos that feature multiple people. VitPose efficiently identifies all individuals, but for my purposes, I need to track a specific person through the video. This is crucial because, as noted in MotionBert documentation, they currently support single-person analysis only. In MotionBert, it's mentioned: "Note: Currently we only support single person. If your video contains multiple person, you may need to use the Pose Tracking Module for AlphaPose and set --focus to specify the target person id."

Given this requirement, I'm curious if we have a similar functionality in VitPose or if there's a workaround that can be implemented. Specifically, I'm looking for a feature that would allow me to select and track a single person in a video with multiple individuals, akin to the '--focus' option in AlphaPose.

Thank you once again for your amazing work and support.

support onnx/tensorrt

Hi,
I have another question for you.
Your ViTPose with Halpe model works well with Pytorch. I sucessfully exported some official vitpose models to onnx/tensorrt file by mmdeploy(command is like this), but I don't know how to export from your Haple model. Could you share me some more infomation about your Halpe model?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.