Comments (2)
The issue you're describing seems to be related to how Flyte handles caching across different projects when using reference launch plans. According to the Flyte documentation on remote caching, cache keys are composed of the Project, Domain, Cache Version, Task Signature, and Task Input Values. This design ensures that task executions are isolated by project and domain to prevent data corruption.
However, your observation indicates that the cache key is being derived from the reference launch plan's identifier, which includes the project, version, domain, and name of the launch plan. This could lead to cache hits across different projects, which is not the intended behavior.
Expected Behavior
Cache keys should be specific to the project and domain of the current task execution, not the reference launch plan. This would ensure that tasks do not inadvertently read cached results from executions in other projects.
Rationale
The current design might be intended to facilitate reusability and efficiency, but it compromises the isolation between projects. Changing the cache key derivation to include the current task's project and domain would align with the documented behavior and ensure data integrity.
Would you like more details on how to potentially address this issue or any specific part of the caching mechanism?
Sources:
This is an AI-generated response and your feedback is appreciated! Please leave a 👍 if this is helpful and 👎 if it is not.
Mention @RunLLM to ask follow-up questions. See runllm.com for more!
from flyte.
@ysysys3074 , this is working as intended. This has deep implications about how Flyte entities (e.g. tasks and launch plans) are registered in the backend and how we refer to them. In the specific case of reference entities, they are tied to their original project+domain+name+version.
from flyte.
Related Issues (20)
- [BUG] python task retries=n causes "Requests overridden" error log message for interruptible task HOT 1
- [BUG] Handler for .well-known/openid-configuration constructs redirect path incorrectly HOT 2
- [Core feature] LiteralBlob and StructuredDataset metadata HOT 1
- More flexible configuration of SecurityContext for Pods/Containers started by flyte HOT 10
- [Core feature] `@dynamic` should accept all (?) of `@workflow` attributes HOT 6
- [BUG] New versions of viper breaks config loading HOT 3
- [BUG] nested dynamic won't bind pydantic models or dictionaries as inputs HOT 2
- [Core feature] Build multiple ImageSpec in parallel HOT 2
- [Housekeeping] Distributed Tracing Should Support OTLP Exporters HOT 1
- [Integration] NIM
- [BUG] ArrayNodes downloads all inputs for every subtasks HOT 2
- Flytekit checkpoint improvement- pytorch HOT 1
- [BUG] Union types fail for e.g. two different dataclasses HOT 4
- [BUG] Missing inputs when using datetime.date type hint. HOT 3
- [Core feature] UX improvement: `pyflyte run` includes imported local modules in the target workflow file HOT 2
- [Housekeeping] Remove the need of kwtypes in user code HOT 1
- [Core feature] pyflyte run --remote should support a url HOT 2
- [BUG] Error in workflow compilation logic for special variable names HOT 4
- SubWorkflow Error handling HOT 11
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from flyte.