Comments (7)
I think the failures started with this AMReX change:
commit 5dfb0400581e0e2deb7bb0dc11b8f7efb14c8d17
Author: Weiqun Zhang <[email protected]>
Date: Mon Jul 24 20:43:04 2023 -0700
Disable Managed Memory for The_Arena by default. (#3438)
It used to be that The_Arena was managed for CUDA and SYCL, but not for
HIP. The users can still turn it on with
`amrex.the_arena_is_managed=1`. They can also use The_Managed_Arena
explicitly.
from maestroex.
in particular, it looks like we are using GPU::ManagedVector
a lot in Basestate.H
, so we need to use the managed arena.
from maestroex.
Okay, running with amrex.the_arena_is_managed=1
fixes the main issue and the test initializes correctly now.
However there is a new issue when we try to advance a timestep:
Timestep 0 starts with TIME = 0 DT = 0.0004741795139
Cell Count:
Level 0, 1327104 cells
inner sponge: r_sp , r_tp : 186468750, 224718750
<<< STEP 1 : react state >>>
<<< STEP 2 : make w0 >>>
<<< STEP 3 : create MAC velocities >>>
MLMG: Initial rhs = 2728631.973
MLMG: Initial residual (resid0) = 2728631.973
MLMG: Final Iter. 1 resid, resid/bnorm = -1.797693135e+308, -6.588257971e+301
MLMG: Timers: Solve = 0.175576918 Iter = 0.145792589 Bottom = 0.00399682
<<< STEP 4 : advect base >>>
: density_advance >>>
: tracer_advance >>>
Erroneous arithmetic operation
See Backtrace.0 file for details
I'll look more into this, to see exactly where it is breaking
from maestroex.
The Backtrace is pointing to MaestroBaseStateGeometry.cpp:55
When I build an executable with DEBUG=FALSE it happily continues running.
A DEBUG=TRUE executable takes so long to get there (stuck in the nodal solver) I gave up and built with TEST=TRUE which immediately got to the Erroneous arithmetic operation
from maestroex.
I found it - there is a race condition in SlopeZ where near certain physical boundaries the code is using slopes computed in neighboring cells which haven't necessarily been computed yet if running on a gpu. Interestingly SlopeX and SlopeY had already been fixed. I'll work on a PR.
from maestroex.
@biboyd test it with #402 this should fix it.
from maestroex.
Everything seems to work now, thanks for all the help!
from maestroex.
Related Issues (20)
- deprecation warnings HOT 1
- xrb_mixed sponge seems wrong
- enthalpy_pred_type logic is wrong HOT 2
- yt load issues HOT 1
- segfaults when using init_iters > 1 HOT 6
- identical if branches in BaseStateGeometrycpp HOT 1
- burner doesn't abort if fail on GPUs HOT 1
- change the Strang burn to no longer subtract off the energy offset
- SDC no longer compiles HOT 1
- Anelastic cutoff issues in Urca sims HOT 1
- drive_initial_convection not fully implemented HOT 1
- base state long vs. int HOT 3
- Issues with Species Prediction at Edges HOT 10
- enthalpy_pred_types that predict T seem to be missing T on edge?
- create an `eos_rh_t` type to speed up EOS calls HOT 1
- add more problems to CI
- MAESTROeX doesn't check if cells are square
- wdconvect stopped early & segment fault on GPU HOT 7
- eliminate simple_log?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from maestroex.