Comments (6)
Hi @marcreichman-pfi, thanks for raising this. I have encountered the same issue before. I think this is an upstream bug in nvidia-ml-py when it is used with an incompatible NVIDIA driver: nvidia-ml-py returns invalid PIDs.
In [1]: import pynvml
In [2]: pynvml.nvmlInit()
In [3]: handle = pynvml.nvmlDeviceGetHandleByIndex(0)
In [4]: [p.pid for p in pynvml.nvmlDeviceGetComputeRunningProcesses(handle)]
Out[4]:
[1184,
0,
4294967295,
4294967295,
16040,
0,
4294967295,
4294967295,
19984,
0,
4294967295,
4294967295,
20884,
0,
4294967295,
4294967295,
26308,
0,
4294967295,
4294967295,
16336,
0,
4294967295,
4294967295,
5368,
0,
4294967295,
4294967295,
19828,
0,
4294967295]
I haven't found a solution for this yet. This may be due to an internal API change in the NVML library. We may need to wait for the next nvidia-ml-py release.
As a temporary workaround, you could downgrade your NVIDIA driver version.
See also:
from nvitop.
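For anyone comparing their own output against the listing above, here is a minimal sketch of a check for the same symptom. It assumes that 0 and 4294967295 (the largest unsigned 32-bit value) are never real PIDs; the helper names `split_pids` and `looks_mismatched` are hypothetical and not part of pynvml or nvitop. It only detects the symptom and does not repair the other process fields.

```python
# Values that look like sentinels in the listing above.
# Treating them as invalid PIDs is an assumption made for this sketch.
UINT32_MAX = 0xFFFFFFFF  # 4294967295


def split_pids(pids):
    """Separate plausible PIDs from sentinel-looking values (0 and UINT32_MAX)."""
    valid = [pid for pid in pids if 0 < pid < UINT32_MAX]
    invalid = [pid for pid in pids if not 0 < pid < UINT32_MAX]
    return valid, invalid


def looks_mismatched(pids):
    """Return True if any reported PID is a sentinel-looking value."""
    _, invalid = split_pids(pids)
    return bool(invalid)


# First eight entries copied from the Out[4] listing above.
reported = [1184, 0, 4294967295, 4294967295, 16040, 0, 4294967295, 4294967295]
valid, invalid = split_pids(reported)
print(valid)
print(looks_mismatched(reported))
```

On a machine with a GPU, the list passed in would be the result of the `nvmlDeviceGetComputeRunningProcesses` comprehension shown in the session above.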
Hi @XuehaiPan and thanks for your response and excellent tool!
We cannot downgrade because we need newer CUDA version support, so for now we'll just have to wait for an updated version with the NVML library fix.
Hi @marcreichman-pfi, a new release of nvidia-ml-py (version 12.535.77) came out several hours ago. You can upgrade your nvidia-ml-py package with the command:
python3 -m pip install --upgrade nvidia-ml-py
This should resolve the unrecognized PIDs with CUDA 12 drivers. I will also make a new release of nvitop to support CUDA 12 drivers.
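To confirm that the upgrade took effect in a given environment, a small sketch like the following may help. It reads the installed nvidia-ml-py version through the standard-library `importlib.metadata` module and compares it with 12.535.77. The helpers `parse_version` and `has_fix` are hypothetical names, and treating every later version as containing the fix is an assumption.

```python
import re
from importlib import metadata

# Release named in the comment above; later versions are assumed to keep the fix.
FIXED_VERSION = (12, 535, 77)


def parse_version(text):
    """Turn '12.535.77' into (12, 535, 77), keeping only leading digits of each part."""
    parts = []
    for piece in text.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break  # stop at the first non-numeric component
        parts.append(int(match.group()))
    return tuple(parts)


def has_fix(version_text):
    """Return True if the version is at least FIXED_VERSION."""
    return parse_version(version_text) >= FIXED_VERSION


try:
    installed = metadata.version("nvidia-ml-py")
    print(installed, "includes the fix" if has_fix(installed) else "predates the fix")
except metadata.PackageNotFoundError:
    print("nvidia-ml-py is not installed in this environment")
```

Run it with the same interpreter that runs nvitop (for example, inside the virtual environment or container), since each environment has its own installed packages.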
Thanks @XuehaiPan - is there a way to do this in the docker version?
@marcreichman-pfi You could upgrade nvidia-ml-py in your docker container.
Thanks, this did the trick! Here is what I changed in your Dockerfile:
$ git diff Dockerfile
diff --git a/Dockerfile b/Dockerfile
index c3194cf..96874da 100644
--- a/Dockerfile
+++ b/Dockerfile
@@ -32,6 +32,7 @@ COPY . /nvitop
 WORKDIR /nvitop
 RUN . /venv/bin/activate && \
   python3 -m pip install . && \
+  python3 -m pip install --upgrade nvidia-ml-py && \
   rm -rf /root/.cache
 
 # Entrypoint