Runner coyote10 is unstable.
The CUDA device may become hidden at random in the middle of the testsuite:
103/184 Test #10: test_checkpoint__therm_lb__p3m_gpu__lj__lb_gpu_binary .........***Failed 1.70 sec
ERROR: CUDA error: no CUDA-capable device is detected
--------------------------------------------------------------------------
MPI_ABORT was invoked on rank 0 in communicator MPI_COMMUNICATOR 3
with errorcode 1.
NOTE: invoking MPI_ABORT causes Open MPI to kill all MPI processes.
You may or may not see output from other processes, depending on
exactly when Open MPI kills them.
--------------------------------------------------------------------------
Or before the testsuite starts:
==================================================
START TEST
==================================================
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "script_interface.pyx", line 413, in espressomd.script_interface.ScriptInterfaceHelper.__setattr__
  File "script_interface.pyx", line 181, in espressomd.script_interface.PScriptInterface.set_params
RuntimeError: CUDA error: no CUDA-capable device is detected
An error occurred. Exiting...
Command that failed: exit 1
In both cases the GPU was detected at the beginning:
/usr/bin/nvidia-smi
Fri Dec 2 02:07:29 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 515.65.01    Driver Version: 515.65.01    CUDA Version: 11.7     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  NVIDIA GeForce ...  Off  | 00000000:21:00.0 Off |                  N/A |
| 35%   24C    P8    N/A /  75W |      6MiB /  4096MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
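Since the device is visible at the start of the job and only disappears later, one way to narrow down the moment it vanishes would be to re-check visibility between CI stages. The following is a minimal sketch, not part of ESPResSo or its CI scripts: the function names `count_listed_gpus` and `gpu_is_visible` are hypothetical, and it assumes `nvidia-smi` is on the runner's `PATH`. It relies on `nvidia-smi -L`, which prints one `GPU <index>: ...` line per detected device.

```python
import shutil
import subprocess


def count_listed_gpus(listing: str) -> int:
    """Count devices in `nvidia-smi -L` output (one 'GPU <n>: ...' line each)."""
    return sum(1 for line in listing.splitlines() if line.startswith("GPU "))


def gpu_is_visible(timeout: float = 30.0) -> bool:
    """Return True if nvidia-smi currently lists at least one device.

    Hypothetical helper: returns False when nvidia-smi is missing,
    fails, or hangs longer than `timeout` seconds.
    """
    exe = shutil.which("nvidia-smi")
    if exe is None:
        return False
    try:
        result = subprocess.run(
            [exe, "-L"],
            capture_output=True,
            text=True,
            timeout=timeout,
            check=False,
        )
    except (OSError, subprocess.TimeoutExpired):
        return False
    return result.returncode == 0 and count_listed_gpus(result.stdout) > 0


if __name__ == "__main__":
    # Call this before and after each test stage to bracket the failure.
    print("GPU visible:", gpu_is_visible())
```

Note that this only probes the driver through `nvidia-smi`; a positive result does not guarantee that the CUDA runtime inside the test process can also see the device.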
Hasn't happened again in two months. Closing.