Comments (9)

ZSC2017IM avatar ZSC2017IM commented on August 25, 2024

Unfortunately, this is not a bug in QSVEnc, because I got the same results with the Media SDK sample tool:
sample_encode.exe h265 -dGfx -timeout 86400 -lowpower:on -hw -icq 28 -u speed -i 3.mkv -o 3.mp4 -w 720 -h 576

About 200 FPS and 20% CPU usage.

from qsvenc.

rigaya avatar rigaya commented on August 25, 2024

I got around 960fps at 12900K + Arc A380, with around 5% CPU utilization (as 12900K has 24 threads).

At such a high frame rate there are many tasks to be done on the CPU side too, so it is no wonder that CPU usage gets a little high.
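
To compare the CPU figures in this thread across machines, it can help to convert the overall utilization percentage into an approximate number of fully busy hardware threads. A minimal Python sketch, assuming the reported percentage is averaged over all logical processors; `busy_threads` is a hypothetical helper name, not part of QSVEnc:

```python
def busy_threads(total_cpu_percent: float, logical_threads: int) -> float:
    """Convert a whole-system CPU percentage into equivalent busy threads."""
    if logical_threads <= 0:
        raise ValueError("logical_threads must be positive")
    return total_cpu_percent / 100.0 * logical_threads


# Figures from the comment above: about 5% on a 12900K with 24 threads.
print(f"{busy_threads(5, 24):.1f} threads")  # prints "1.2 threads"
```

Seen this way, 5% on a 24-thread CPU is a little more than one thread of work, so the same workload would show up as a larger percentage on a CPU with fewer threads.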

ZSC2017IM avatar ZSC2017IM commented on August 25, 2024

@rigaya
Thanks for your test! I corrected my earlier mental arithmetic by timing with a stopwatch: 10000 frames in 12 seconds, at 20% CPU. According to your results, the high CPU usage is not specific to my platform and driver.
I also use ffmpeg with hevc_nvenc for transcoding, with similar parameters. A single instance cannot keep the Tesla P4's codec engines fully busy, so I ran four instances to fully load one encoder and obtained a similar FPS (about 960). The total CPU utilization of all instances was still below 3%. As an excellent developer, do you think this difference is really caused by Intel not adopting a scheme similar to NVIDIA's?
https://developer.nvidia.com/blog/nvidia-ffmpeg-transcoding-guide/
I know that you also have NVIDIA GPUs. Maybe you could also test how little CPU NVDEC/NVENC use in transcoding tasks.
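
The stopwatch measurement in this comment turns into an average frame rate by simple division. A small Python sketch of that arithmetic, assuming the count refers to 10000 encoded frames; `average_fps` is a name made up for illustration:

```python
def average_fps(frames: int, seconds: float) -> float:
    """Average frame rate over a timed run."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return frames / seconds


# 10000 frames timed at 12 seconds with a stopwatch.
print(round(average_fps(10000, 12)))  # prints 833
```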

ZSC2017IM avatar ZSC2017IM commented on August 25, 2024

this is really caused by INTEL not adopting a similar scheme of NVIDIA?
@rigaya
https://github.com/Intel-Media-SDK/MediaSDK/wiki/Media-SDK-Shaders-(EU-Kernels)#gpu-copy-and-horizontal-mirroring-kernels
I guess this Intel feature is what is called "GPU copy", but QSVEnc does not provide it. Enabling it in ffmpeg and sample_encode did not improve anything for me, and ffmpeg reports "GPU-accelerated memory copy only works in system memory mode." I think this may be related to DG1 using LPDDR4 memory as its graphics memory. Can you test it on your machine? Parameters:
sample_encode.exe h265 -dGfx -timeout 86400 -lowpower:on -hw -icq 28 -u speed -i 3.mkv -o 3.mp4 -w 720 -h 576 -gpucopy::on
sample_encode.zip
ffmpeg -hwaccel qsv -hwaccel_output_format qsv -hwaccel_device 0 -vcodec h264_qsv -gpu_copy on -i 3.mkv -acodec copy -c:v hevc_qsv -low_power 1 -preset veryfast -profile:v rext -global_quality 28 -y 3.mp4

quamt avatar quamt commented on August 25, 2024

@ZSC2017IM
Here is the result:
encoded 6000 frames, 1213.35 fps, 499.26 kbps, 14.28 MB encode time 0:00:04, CPU: 0.2%, VD: 99.2%
Task Manager shows around 24% CPU while it runs, of which about 20% is for this task.
But keep in mind that this system only has an AMD Ryzen 5 5600G.
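
For comparing several runs, the summary line quoted above can be picked apart with a short script. This is only a sketch in Python: the pattern is inferred from the single line in this comment, not from any documented log format, and `parse_summary` is a hypothetical helper.

```python
import re

# Pattern inferred from the one summary line quoted in this thread (an assumption).
SUMMARY_RE = re.compile(
    r"encoded\s+(?P<frames>\d+)\s+frames,\s+(?P<fps>[\d.]+)\s+fps,"
    r".*?CPU:\s+(?P<cpu>[\d.]+)%,\s+VD:\s+(?P<vd>[\d.]+)%"
)


def parse_summary(line: str) -> dict:
    """Extract frame count, fps, CPU % and VD % from a summary line."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError("line does not match the assumed summary format")
    return {
        "frames": int(m["frames"]),
        "fps": float(m["fps"]),
        "cpu_percent": float(m["cpu"]),
        "vd_percent": float(m["vd"]),
    }


line = ("encoded 6000 frames, 1213.35 fps, 499.26 kbps, 14.28 MB "
        "encode time 0:00:04, CPU: 0.2%, VD: 99.2%")
stats = parse_summary(line)
# Elapsed time implied by frames / fps, in seconds.
print(f"{stats['frames'] / stats['fps']:.1f}")  # prints "4.9"
```

The implied duration of about 4.9 seconds is in line with the reported encode time of 0:00:04 if that value is truncated to whole seconds.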

ZSC2017IM avatar ZSC2017IM commented on August 25, 2024

@quamt
Thanks! This is very helpful! Could you retest with the following command? If "gpucopy" also fails to reduce CPU usage, I think this is indeed a common problem with Intel GPUs. I am submitting this issue to the Intel community.

sample_encode.exe h265 -dGfx -timeout 86400 -lowpower:on -hw -icq 28 -u speed -i 3.mkv -o 3.mp4 -w 720 -h 576 -gpucopy::on
sample_encode.zip

quamt avatar quamt commented on August 25, 2024

@ZSC2017IM
Running that, Task Manager shows overall CPU usage of around 15%, of which the task itself uses around 13%.

ZSC2017IM avatar ZSC2017IM commented on August 25, 2024

@quamt
Thanks! I have reported this issue to Intel.

rigaya avatar rigaya commented on August 25, 2024

I think CPU utilization of 20% is acceptable for high-fps transcoding, so I have closed this issue.
