Comments (4)
@vara-bonthu Can you scope this down to just what's missing/ wrong in Ray Serve? Is this really an issue that's required to change Ray code? Or if you already know what's the fix, also feel free to contribute to the codebase
from ray.
@vara-bonthu those are great findings! If you are develop on a mac, you can try those instructions to setup it up locally https://docs.ray.io/en/master/ray-contribute/development.html#building-ray-on-linux-macos-full
If this has to go onto a cluster, I think you can raise a draft PR and one of the CI step will generate a wheel that you can use to build the docker image for testing. This is an example of such build that generates the wheel.
from ray.
Does Ray for neuron support autoscaling based on neuron devices - represented by the device plugin as aws.amazon.com/neuron ?
from ray.
@GeneDer The issue is related solely to scaling new nodes for inf2 instances. It appears node_types
is not set, causing the code to break at this line in here .
What is the easiest way to debug the code to print the values passed to this method?
I will try to dig further and keep you posted
from ray.
Related Issues (20)
- There are some deprecated items that need to be removed HOT 1
- CI test linux://rllib:examples/evaluation/evaluation_parallel_to_training_multi_agent_duration_auto_torch_envrunner is flaky HOT 4
- Ray v2.11.0 missing windows distribution HOT 7
- [Ray Tune/ Train] Auth with aws_web_identity_token or use the provided file system provider in runtime config HOT 5
- [<Ray component: Core>] ray raise error on nvidia cuda machine for amdgpu missing HOT 2
- [Tune] Trials on pre-started game instances HOT 2
- Release test chaos_torch_batch_inference_16_gpu_300gb_raw.aws failed HOT 3
- [Data] Add `override_num_blocks` parameter to `from_pandas`
- Release test dataset_shuffle_push_based_sort_1tb.aws failed HOT 2
- [Core] Actor/Task cannot be scheduled on worker node. HOT 1
- [Ray core] Stopped job leaks worker HOT 5
- [serve] Support resource-based autoscaling HOT 1
- [Observability / Doc] Add support of ray debugger on windows HOT 6
- [Doc] Python 3.12 `docs` env has conflicts with `..scripts/format.sh` HOT 2
- [Dashboard] `py-spy` profiling initiated from the Ray Dashboard fails if `sudo` is not installed HOT 3
- [Ray Data] map_batches with actors is 25% slower than manually consuming with iter_batches
- [Core][Actors] Duplicate named actor exception should not be lazy if possible HOT 1
- [Core] `ray.wait` not actually wait until ready when the task is longer than 12 days HOT 3
- [tune] `tune.with_resources` with `PlacementGroupFactory` cannot find GPUs in `train_fn` HOT 4
- CI test linux://python/ray/dashboard:test_dashboard is flaky HOT 12
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ray.