Comments (5)
Hey Gilly,
Yeah this is an on going issue we have raise with setonix on several occasions. For now we are stuck with build bare metal builds on setonix. I will upload some instructions for it later today.
from underworld2.
After digging further into the Pawsey doco I found this https://pawsey.org.au/technical-newsletter/ (see 13 March 2023 entry):
Parallel IO within Containers
Currently there are issues running MPI-enabled software that makes use of parallel IO from within a container being run by the Singularity container engine. The error message seen will be similar to:Example of error message
Assertion failed in file ../../../../src/mpi/romio/adio/ad_cray/ad_cray_adio_open.c at line 520: liblustreapi != NULL /opt/cray/pe/mpich/default/ofi/gnu/9.1/lib-abi-mpich/libmpi.so.12(MPL_backtrace_show+0x26) [0x14ac6c37cc4b] /opt/cray/pe/mpich/default/ofi/gnu/9.1/lib-abi-mpich/libmpi.so.12(+0x1ff3684) [0x14ac6bd2e684] /opt/cray/pe/mpich/default/ofi/gnu/9.1/lib-abi-mpich/libmpi.so.12(+0x2672775) [0x14ac6c3ad775] /opt/cray/pe/mpich/default/ofi/gnu/9.1/lib-abi-mpich/libmpi.so.12(+0x26ae1c1) [0x14ac6c3e91c1] /opt/cray/pe/mpich/default/ofi/gnu/9.1/lib-abi-mpich/libmpi.so.12(MPI_File_open+0x205) [0x14ac6c38e625]
Currently it is unclear exactly what is causing this issue. Investigations are ongoing.
Workaround:
There is no workaround that does not require a change in the workflow. Either the container needs to be rebuilt to not make use of parallel IO libraries (e.g. the container was built using parallel HDF5) or if that is not possible, the software stack must be built “bare-metal” on Setonix itself (see How to Install Software).
I guess I'm about to install UW2 from source on Setonix... Would you have any step-by-step recipe at hands for this specific Cray machine? I found the one you put together for Magnus a few years back.
from underworld2.
Hey Gilly,
To update you on this.
Setonix's permission setup means I can't install things for a project I'm not a user in. So I'm trying to put together bare metal instructions for you that make things as smooth as possible from your end.
I'm testing some instructions I have put together this afternoon and if things work out I'll send them though later.
from underworld2.
Hi Jules,
I have been off grid for the past couple weeks and back in the office now. If you have a recipe at hand for the install I would love to give it to!
Cheers
Gilly
from underworld2.
Hi Gilly,
https://support.pawsey.org.au/documentation/display/US/Containers+changes
I'm going to rebuild the docker image and try singularity again on setonix. I'll keep you posted.
cheers,
J
from underworld2.
Related Issues (20)
- RuntimeError: There were left over keyword arguments. kwargs = [{'_fn_fssa': None}]
- communication between the UW2 model and the Badlands surface processes model
- UWGeodynamics velocitySurface_2D Bug HOT 1
- Coupled Evolution of Deformation, Pore Fluid Pressure, and Fluid Flow in the thrust-fold mountain belt ? HOT 4
- Uncoupled temperature and compositional field in UW Geodynamics HOT 2
- Adding two plasticities to one material HOT 3
- SystemError: <built-in function SystemLinearEquations_UpdateSolutionOntoNodes> returned a result with an exception set HOT 1
- Parallel computing and An installation problem (MpiRun) HOT 4
- 'surface' integration HOT 2
- Model.mesh.deform_mesh() error with mpirun. Possibly two nodes are in an identical location.
- Installation error HOT 11
- Compressible mantle convection with tala approximation HOT 1
- A question about strata results from coupled model HOT 3
- May I ask how is the second-order invariant of strain rate(function: fn.tensor.second_invariant)in underworld2.13 calculated?
- Question about using mesh.specialSets['MinI_VertexSet'] as a boundary index with parallel run
- Converging issue for 3D subduction model with oblique trench shape. HOT 1
- May I ask how to use Underworld2 modeling the melt-related process?
- An issue when using trapezoid mesh HOT 1
- issue with surface processes
- To find the example to generate collision_wedge.gif HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from underworld2.