Comments (3)
@mjkanji the intended behavior here is that run retries should only trigger for runs that failed (status "Failure" in the UI:
not runs that were manually terminated from the Dagster UI or using the GraphQL API (status "Canceled" in the UI:
)
It sounds like this is different than what you're experiencing though - can you provide debug information for a run where this happened on a manually terminated run using the "Download debug information" dropdown on the runs page?
Or if it's a run in Dagster Cloud you could also email [email protected] with a link to the run.
from dagster.
Hi @gibsondan, please see the screenshot below. I terminated the run (see the DagsterExecutionInterruptedError
in the logs). However, as you can see from the right side pane, a child run was automatically started in the background, which is showing up as a "re-execution" in the Runs page like a retry normally would.
Unless I somehow pressed the "re-execute" button by mistake (which, I guess, is possible), as far as I can tell, the child run was triggered automatically. The run is also showing up as a Failure and not Canceled. I'm not sure why that's the case.
I've also emailed [email protected] with the links to the actual runs for you to review directly.
from dagster.
We identified the issue here (the pod not getting terminated cleanly in time before the process was killed, then our monitoring functionality incorrectly moving it into a failure state instead of a cancelled state, triggering a retry) - and rolled out a fix for this in the most recent cloud release. No upgrade to anything required if you're using cloud. Thanks for the report!
from dagster.
Related Issues (20)
- `dagster-mlflow` tracking config not used from run_config when already configured in definitions HOT 2
- make pyright error HOT 3
- Test Schedule "Open in Launchpad" not working HOT 2
- Incorrect return type of functions in load_assets_from_modules
- Auto-materialization of non-existing partitions
- dagster/max_runtime tag does not seem to work HOT 2
- Could not load job definition error "TypeError: unhashable type: 'list'" HOT 5
- Include Asset Description in the Dagster web-server search functionality HOT 1
- Dagster Serverless Deploy not working when importing dbt project HOT 3
- Add `YearlyPartitionsDefinition` HOT 2
- Unexpected GraphQL error when deployed on AWS ECS HOT 1
- MLflow hook updates MLflow run state based on all op events, not the just the final op
- execute_in_process with FakeDatetimes do not work
- Number of assets are different in Dagster+ HOT 1
- Make it easier to define freshness checks for dbt models
- ImportError: cannot import name 'GenericAlias' from partially initialized module 'types' HOT 6
- Could not load job definition. dagster._check.CheckError: Invariant failed. Description: No metadata found for CacheableAssetsDefinition with unique_id airbyte / Using I/O Manager BigQuery
- `_get_infer_single_to_multi_dimension_deps_result` is to restrictive, can't handle identityMapped staticPartitionDefintions
- Freshness sensor does not run when assets are pending or failed HOT 3
- Table IO Managers should capture column schemas with appropriate metadata tag HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from dagster.