Comments (5)
I would happily work on this with a nudge in the right direction.
from dagster.
@leejlFG, I see your point about these being only run_tags
.
I hadn't considered that aspect before, good point 🤔 An ECSStepLauncher
would be nice to have, but it's not a requirement for me. It's acceptable if all assets for the same run share the same resources.
What I care most about is having some sort of tag per asset so that auto-materializations/ad-hoc runs are launched with the necessary resources.
from dagster.
Would this work for you? #14544
from dagster.
The lack of this feature is a significant drawback for us because we would have to apply the highest hardware requirements of all assets to every run, regardless of whether they're needed and, therefore, overprovision.
In addition, it would also be beneficial to provide capacityProvider
on an asset-level basis.
@leejlFG What I don't understand is why this isn't working. Shouldn't the tags be applied with the get_cpu_and_memory_overrides
function?
from dagster.
@yuhan using op_tags
would be fine for our purposes, but since we use the ECSRunLauncher
the maximum resources for any asset/op in the run needs to take priority for the entire ECS task, which is exactly the issue @ammerzon describes.
I may be wrong, but op_tags
and plain tags
on assets don't propagate and merge with run_tags
in the ECSRunLauncher
. Tags applied to jobs are run_tags
, nothing else is. The get_cpu_and_memory_overrides
function references only the run_tags
. If one new set of tags needs to be merged with run_tags
I think it makes the most sense to avoid using op_tags
as they are basically not used at all outside k8s and op-based workflows.
@ammerzon you really need an ECSStepLauncher
instead to have different assets inside the same run spin up with different resources, right?
Getting some sort of tag on assets so ad hoc and automaterializations launch with the necessary resources instead of the default ones sounds like a different but related problem than getting individual assets within a run to spawn with fresh containers in a step launcher, also with those specified resources.
from dagster.
Related Issues (20)
- Support for DBT Unit tests
- Consolidate the gRPC servers used by the Dagster webserver and daemon when running `dagster dev`
- Automaterialization with cpu/memory requirements per asset HOT 3
- Auto-materialization should gracefully recover from an unavailable code location HOT 6
- [Documentation Feedback] Problem on /guides/limiting-concurrency-in-data-pipelines page HOT 1
- Issue in `get_s3_keys` sensor function
- Missing asset checks tab in UI
- Schedules and Runs pages are unloadable in UI after upgrading to 1.7.11 HOT 1
- Make asset check failures with severity WARN more prominent
- Option to add asset code as metadata for software defined assets HOT 1
- [email protected] breaks `dagster dev` HOT 9
- ForwardRef._evaluate() missing 1 required keyword-only argument: 'recursive_guard' HOT 5
- Nested Resources in Jobs result in CheckError HOT 1
- Deleting materialization history in Lesson 8: Creating a Partition
- UnicodeDecodeError in -w mode, versions 1.6.4 to 1.7.13 HOT 1
- Lesson 8: ImportError: cannot import name 'utc_datetime_from_timestamp' from 'dagster._utils' HOT 1
- The dagster-pandera integration cannot handle `Annotated` types
- Dagster daemon heartbeat randomly stops HOT 1
- Dagster-slack is missing py.typed marker
- Examples for k8s configuration
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from dagster.