Draft
845 commits
7bb2a1a
[serve] swap toolz for collections.defaultdict in tests (#56249)
zcin Sep 4, 2025
b78555d
[core] Fix cancel race that leads to RAY_CHECK it->second.submitted_t…
dayshah Sep 5, 2025
88293d4
[Serve.llm] Gracefully return timeouts as HTTPException (#56264)
kouroshHakha Sep 5, 2025
ce32813
[core][gpu-objects] Fix wrong skipif in test_gpu_objects_gloo (#56258)
Qiaolin-Yu Sep 5, 2025
3303031
gc collect from a gc_thread (#55838)
yancanmao Sep 5, 2025
731c179
[Core] Remove the unnecessary redirection of get_protocols_provider (…
jjyao Sep 5, 2025
5278961
Enable ruff lint for `workers/`, `workflow/`, `setup-dev.py`, and `cl…
czgdp1807 Sep 5, 2025
9c7028b
[CORE][DOC] Fix documentation typos, grammar, and formatting issues i…
OneSizeFitsQuorum Sep 5, 2025
d9709c8
[core][autoscaler] Reword `Total Demands` and `Total Constraints` to …
rueian Sep 5, 2025
4a9c49c
[data] Adding in updated code to from uris release test (#56091)
omatthew98 Sep 5, 2025
1c637ac
add tests and DLQ business logic (#55608)
harshit-anyscale Sep 5, 2025
e363caa
[llm] Vllm bump -> 0.10.1.1 (#56099)
nrghosh Sep 5, 2025
91d9ac6
[CI] Add text embed benchmark tests to group: data-tests. (#56256)
crypdick Sep 5, 2025
669c938
[Core] [TaskEvent] Fix Missing Events Issue in Task Events (#55916)
MengjinYan Sep 5, 2025
5e85227
[core] (cgroups 3/n) Creating CgroupManager to setup Ray's cgroup hie…
israbbani Sep 5, 2025
45baf10
[CORE][DOC] Fix grammar, spelling, and formatting issues in Ray Core …
OneSizeFitsQuorum Sep 5, 2025
ba21a9b
[CORE][DOC] Fix typos and grammatical issues in Ray Core documentatio…
OneSizeFitsQuorum Sep 5, 2025
c01c880
[ci] raydepsets: single depset compilation with dependencies (#56263)
elliot-barn Sep 5, 2025
ccb4ce9
[core][gpu-objects] Add warning when GPU object refs passed back to t…
daiping8 Sep 5, 2025
61ad346
[Data] Bump test_json timeout (#56267)
srinathk10 Sep 5, 2025
befc4ba
[DOC] Fix grammar, syntax, and formatting issues in ray-more-libs doc…
OneSizeFitsQuorum Sep 5, 2025
beb70bf
[Core] Fix ABFSS (Azure Blob File System Secure) protocol support pro…
gangsf Sep 5, 2025
e5743a0
[DOC] Fix grammar, syntax, and formatting issues in ray-air and ray-c…
OneSizeFitsQuorum Sep 5, 2025
ad906af
[data] refactor download expression to use inheritance from `Abstract…
omatthew98 Sep 5, 2025
9341bd4
[core] Proper typing for ObjectRef (#55566)
minerharry Sep 5, 2025
fa2643c
[RLlib] [DOC] Fix documentation typos and grammatical issues in RLlib…
OneSizeFitsQuorum Sep 6, 2025
704c579
[core] (cgroups 4/n) adding constraint bounds checking to the CgroupM…
israbbani Sep 6, 2025
a5591b5
[Core] Remove test_head_node_job_agent_always_used (#56292)
jjyao Sep 6, 2025
adcad13
[core] Add a warning when returning an object w/ num_returns=0 (#56213)
KaisennHu Sep 6, 2025
7950cd1
[Data] Fixed `ParquetDatasource` encoding ratio estimation (#56268)
alexeykudinkin Sep 6, 2025
9b68241
[lint] change API annotations and policy check triggers (#56289)
aslonnie Sep 6, 2025
7095933
[deps] upgrade boto3 (#56291)
elliot-barn Sep 6, 2025
1fa52d2
[Data] Add total input/output row counts of Operator in the output of…
daiping8 Sep 6, 2025
369c780
[serve] Fix buffered logging reusing request context (Fixes #55851) (…
vaishdho1 Sep 6, 2025
7a3da11
[core] replace node_manager_client with raylet_client_lib (#56261)
rueian Sep 7, 2025
d25f4ea
[core] Delete unnecessary plasma.cc + macro (#56308)
dayshah Sep 7, 2025
32b778e
[serve] Require prefix `RAY_SERVE_` for env vars + value verification…
axreldable Sep 8, 2025
87a17dd
[docs] serve llm deployment examples refinement (#56287)
Aydin-ab Sep 8, 2025
bdac8cf
[CORE][DOC] Fix grammar, typos, and formatting issues in Ray Core doc…
OneSizeFitsQuorum Sep 8, 2025
106761e
[core][event/04] node event: add proto schema (#56031)
can-anyscale Sep 8, 2025
452a575
[core][metric] Redefine STATS_tasks using Metric interface (#56015)
can-anyscale Sep 8, 2025
9536fca
[core] (cgroups 5/n) Adding clean up methods to CgroupDriverInterface…
israbbani Sep 8, 2025
e740b36
[Docs] Include CR UID in KubeRay metrics reference (#56312)
win5923 Sep 8, 2025
f8e9645
[Data] Fixing empty projection handling in `ParquetDataSource` (#56299)
alexeykudinkin Sep 8, 2025
a3254a3
increase timeout for failed task test (#56328)
harshit-anyscale Sep 8, 2025
57fd3b5
[Core] Add PID to structured logs for tasks and actors (#55176)
pavitrabhalla Sep 8, 2025
47eae75
[Data] Refactor `from_torch` unit tests (#56331)
bveeramani Sep 8, 2025
33ce9e6
[core] Make RequestWorkerLease RPC Fault Tolerant (#56191)
Sparks0219 Sep 8, 2025
6cefec7
[DOC] Fix documentation issues in _includes and _templates directorie…
OneSizeFitsQuorum Sep 8, 2025
d55b35d
[release-test][data][train] Preload a subset of modules for torch dat…
justinvyu Sep 8, 2025
010791e
[Core] [Doc] Add OSS Document for Task Events (#56203)
MengjinYan Sep 8, 2025
fc73181
[serve] deflake test pow 2 router (#56342)
zcin Sep 8, 2025
5b70e3a
[Data] Remove deprecated `Dataset.to_torch` (#56333)
bveeramani Sep 8, 2025
4d45859
[core] disable test db for cpp tests (#56348)
can-anyscale Sep 8, 2025
ed8ccd6
[Data] Remove redundant `ignore_missing_paths` and partitioning tests…
bveeramani Sep 8, 2025
83bb39f
[cpp] performs rename when generating (#56338)
aslonnie Sep 8, 2025
1fb6965
[ci] raydepsets check lock files (#55856)
elliot-barn Sep 8, 2025
c67cec8
[data] Rename env var for enforce schemas (#56254)
iamjustinhsu Sep 9, 2025
668c4e9
[codeowner] remove aslonnie individual from many ownerships (#56364)
aslonnie Sep 9, 2025
907be9d
[TRAIN][DOC] Fix documentation issues in train/examples directory: ty…
OneSizeFitsQuorum Sep 9, 2025
6e265ba
[TRAIN][DOC] Fix typos, grammar, and formatting issues in Ray Train d…
OneSizeFitsQuorum Sep 9, 2025
3c4230e
[core] Clean up `actor_creator` targets (#56336)
edoakes Sep 9, 2025
c8f0279
[Data] Remove filesystem parameterizations of `test_csv` tests (#56345)
bveeramani Sep 9, 2025
e8d87af
[core] Banning implementation deps (#56359)
Sparks0219 Sep 9, 2025
0fbe017
[RLlib][release test] Upgrade g3 to g4 machine for aws release test (…
liulehui Sep 9, 2025
f17ba98
[core] Fix error handling for plasma put errors (#56070)
codope Sep 9, 2025
3d08229
[Tech-debt] Unify the Deprecation APIs across Ray libraries (#56326)
KaisennHu Sep 9, 2025
24834f3
[core] Fix batching logic in `CoreWorkerPlasmaStoreProvider::Get` (#5…
codope Sep 9, 2025
646c4dc
[deps] upgrade boto3 to 1.29.x (#56363)
elliot-barn Sep 9, 2025
7dab777
Enable ruff lint for the entire code base (#56080)
czgdp1807 Sep 9, 2025
d2ed918
fixing deployment scoped custom autoscaling (#56192)
harshit-anyscale Sep 9, 2025
f8d32c3
[core] initialize opentelemetry.metrics once (#56347)
can-anyscale Sep 9, 2025
d72d509
[dashboard] fix grafana dashboard generation bug (#56346)
eric-higgins-ai Sep 9, 2025
d107d71
[Data]: Fix mock_server cleanup on error (#56330)
srinathk10 Sep 9, 2025
6d0a650
[core] fix lint on test_deprecation (#56386)
aslonnie Sep 9, 2025
4efdd35
[serve] Add a test to ensure calling await multiple times on response…
akyang-anyscale Sep 9, 2025
248d069
Fixing broken CI due to linter issues (#56385)
israbbani Sep 9, 2025
b080aa1
[core] Allow task manager access with submitter mutex + unify retry (…
dayshah Sep 9, 2025
82b026a
[serve] Explicitly close `choose_replicas_with_backoff` async generat…
akyang-anyscale Sep 9, 2025
cd767bd
[Data] Add `max_task_concurrency`, `min_scheduling_resources`, and `p…
bveeramani Sep 9, 2025
3193145
[Data] Add hash and `to_resource_dict` to `ExecutionResources` (#56383)
bveeramani Sep 9, 2025
b5a6ad9
[Core] Rewrite JobManager _monitor_job_internal to fix hanging issue …
jjyao Sep 9, 2025
4ac91b3
[Core] Add S3 public bucket fallback to handle NoCredentialsError (#5…
gangsf Sep 9, 2025
6ff42bf
[release-test] Disable `drop_last` flag to fix division by zero in to…
justinvyu Sep 10, 2025
818a93a
[ci] remove old wheel building logic (#56375)
aslonnie Sep 10, 2025
7bb884c
[docs] [serve] [llm] Fix serve llm examples (#56382)
Aydin-ab Sep 10, 2025
119aa6c
Revert "[core] Correct bytes in flight when objects <5mb (#54349)" (#…
dayshah Sep 10, 2025
427100d
[core] Improve docs for custom serialization for exceptions + add tes…
sampan-s-nayak Sep 10, 2025
4f93b8f
[core][gpu objects] Rename GPU objects -> RDT objects in user-facing …
stephanie-wang Sep 10, 2025
4bfee4b
[bazel] change core BUILD files to BUILD.bazel (#56406)
aslonnie Sep 10, 2025
cf3389c
[wheel] remove nvm from manylinux build (#56376)
aslonnie Sep 10, 2025
d9b61ae
[bazel] rename train/tune/air BUILD files to BUILD.bazel (#56410)
aslonnie Sep 10, 2025
66ca1c5
[core][rdt] Fix check crash on gpu obj free if driver knows actor is …
dayshah Sep 10, 2025
a8ba8a6
[data.llm][API] Allow tuple for concurrency arg (#55867)
axreldable Sep 10, 2025
e88926c
[bazel] change all BUILD files from BUILD to BUILD.bazel (#56337)
aslonnie Sep 10, 2025
1a6c5a9
Make `ray-llm` code owner for `ray.data.llm` (#56420)
bveeramani Sep 10, 2025
212367d
foundation work for aggregating metrics on controller (#56295)
abrarsheikh Sep 10, 2025
dc954ed
[Serve] Allow access to request context of each request in the batch …
landscapepainter Sep 10, 2025
a6ee029
[data] fix metrics query for iteration + scheduling loop (#56390)
iamjustinhsu Sep 10, 2025
08a7068
[Data] Emit events rather than just logs for detected issues (#55717)
coqian Sep 10, 2025
dd7b78c
[Data] Support initial concurrency value (#56370)
owenowenisme Sep 10, 2025
f4dc12e
[ci] raydepsets: adding pre hooks for depsets (#56180)
elliot-barn Sep 10, 2025
48a6e7f
[Data] Fix resource reservation by excluding completed operators' usa…
xinyuangui2 Sep 10, 2025
33a540f
nccl change
Sep 10, 2025
62cf604
integrate deployment ranks with deployment state (#55829)
abrarsheikh Sep 10, 2025
752f894
[Data] Remove filesystem variants from JSON, NumPy, and Delta tests (…
bveeramani Sep 10, 2025
d28b3f4
[Data] Add `average_num_inputs_per_task` and `num_output_blocks_per_t…
bveeramani Sep 10, 2025
37ad9f5
[core] Migrate ray_logging from _private to _common[#53478] (#56368)
vaishdho1 Sep 10, 2025
6ca981d
working version
Sep 10, 2025
a86f6f9
lint
avigyabb Sep 10, 2025
9afb0bc
lint
avigyabb Sep 10, 2025
41d8268
moved conversion logic into collective group
avigyabb Sep 10, 2025
d199cf3
lint
avigyabb Sep 10, 2025
fecdac5
fix cpu test
avigyabb Sep 10, 2025
07505d6
init
avigyabb Sep 10, 2025
274a9dd
[core][1eventx/02] job event: add an interface for ray event recorder…
can-anyscale Sep 10, 2025
117f0de
test fix
avigyabb Sep 10, 2025
c1251d9
[core] [actor-event-01] Actor event: add proto schema (#56221)
sampan-s-nayak Sep 10, 2025
d5826d0
lint
avigyabb Sep 10, 2025
0047e72
[data] ignore metadata for pandas block (#56402)
iamjustinhsu Sep 11, 2025
ef7169a
[core][otel] clear gauge metric cache at export time (#56405)
can-anyscale Sep 11, 2025
65c9839
[Data] Make operator `target_max_block_size` optional and rename as o…
bveeramani Sep 11, 2025
857bea4
[core] Introduce env var to set rpc failure prob for all rpc's (#56413)
dayshah Sep 11, 2025
7d5a29c
[DOC] Fix documentation issues in ray-observability directory (#56069)
OneSizeFitsQuorum Sep 11, 2025
6e0e96c
[release] Add base image build step to release configs (#56438)
khluu Sep 11, 2025
aadd794
[core] Breaking up task_common + lease target no longer depends on ta…
Sparks0219 Sep 11, 2025
0409101
[serve] fix release tests that use locust (#56354)
zcin Sep 11, 2025
5e8107b
[Serve] Refactor test_deploy_app_2.py and add port attributes (#55748)
landscapepainter Sep 11, 2025
9f71321
[Doc][Core] Fix the Environment Variable Name in the Ray Event Export…
MengjinYan Sep 11, 2025
bdeb368
[core] deflake darwin://python/ray/tests:test_metric_cardinality_otel…
can-anyscale Sep 11, 2025
f3f75d5
Add operator panels id list to dataset export (#56428)
alanwguo Sep 11, 2025
a1a2bf2
[RLlib] Fix Metrics/Stats lifetime count and throughput measurement f…
sven1977 Sep 11, 2025
e88b3f8
[train][checkpoint] Add checkpoint_upload_mode to ray.train.report (#…
TimothySeah Sep 11, 2025
d92dcf6
Support ray.put() and ray.get() with nixl in gpu objects (#56146)
Qiaolin-Yu Sep 11, 2025
2a2a492
[core] Make Object Manager Unit Testable (#56315)
Sparks0219 Sep 11, 2025
f900567
[core] (cgroups 6/n) CgroupManager cleans up the entire cgroup hierar…
israbbani Sep 11, 2025
4c0ec7f
[release] Change back to g4dn.12x for 4 GPU (#56469)
liulehui Sep 11, 2025
a9a57a6
[data] Download op fusion / removal of interleaved partitioners (#56462)
omatthew98 Sep 11, 2025
d27f403
[core] Fix HandleRefRemoved thread safety (#56445)
dayshah Sep 11, 2025
8b5c898
[core] Starting log_monitor before starting the raylet. (#56471)
israbbani Sep 12, 2025
e829eaf
[core][1eventx/03] job event: send job events to the aggregator (#55213)
can-anyscale Sep 12, 2025
3f124af
[data.llm] Fix sglang byod on release (#55885)
kouroshHakha Sep 12, 2025
c05037e
[Data] Update image embedding benchmark to use `download` (#56245)
bveeramani Sep 12, 2025
d8be1b2
[core][otel] fix default value for missing metric tags (#56467)
can-anyscale Sep 12, 2025
e9670ed
[train] Add hf trainer support for dictionary of datasets (#56484)
wyhong3103 Sep 12, 2025
df5951e
[Data] - Improve performance for `unify_schemas` (#55880)
goutamvenkat-anyscale Sep 12, 2025
f2d046b
[Data] Add text embedding release test (#56459)
bveeramani Sep 12, 2025
fe0c6eb
[deps] changing compile llm requirements image (#56172)
elliot-barn Sep 12, 2025
35f3a69
[data] large schema release test (#56353)
iamjustinhsu Sep 12, 2025
b535134
[core][rdt] Wait on nccl id with event (#56322)
dayshah Sep 12, 2025
30899ba
[core] Fix ruff for #56322 (#56488)
dayshah Sep 12, 2025
6e7c96b
Revert "[llm] disable sglang release test (#55884)" (#56475)
eicherseiji Sep 12, 2025
baa70c2
add tests for replica ranks (#56120)
abrarsheikh Sep 12, 2025
08afa53
[data] add missing doc strings for DataContext (#56460)
iamjustinhsu Sep 12, 2025
268634f
[data] Wrong name for Inqueue Panel (#56463)
iamjustinhsu Sep 12, 2025
dd43876
[core][rdt] Support tensor transfer from outside owners of actors (#5…
dayshah Sep 12, 2025
2aec370
[ci] 2 different pydoclints (1 for ci and 1 for local) (#56464)
elliot-barn Sep 12, 2025
ef9168e
[LLM][Serve] Allow setting `data_parallel_size=1` in engine_kwargs (#…
lk-chen Sep 12, 2025
3054b71
[ci] removing python ver check for llm lockfile compile (#56495)
elliot-barn Sep 12, 2025
55dd659
fixed gloo test - take look
avigyabb Sep 12, 2025
1028dcc
[Data.llm] Fix multimodal image extraction when no system prompt is p…
nrghosh Sep 13, 2025
7f52b3a
working gloo_group_isolation
avigyabb Sep 13, 2025
bae1d45
[core][gpu-objects] Add initial docs (#55981)
stephanie-wang Sep 13, 2025
f5003e4
lint
avigyabb Sep 13, 2025
a900ae1
[Data] Refactor batch inference release test definitions (#56489)
bveeramani Sep 13, 2025
e26d21a
[ci] upgrading uv ver 0.8.17 (latest) (#56494)
elliot-barn Sep 13, 2025
ce4e473
[core] Fix ASAN issues in object manager test (#56492)
Sparks0219 Sep 13, 2025
895d78b
[core] Don't hold shared ptr to client in actor submitter queues (#56…
dayshah Sep 14, 2025
25bb624
[core] Fixing timeout in test_object_spilling_3.py (#56512)
Sparks0219 Sep 14, 2025
8259540
[core] Fix UBSAN errors in object_manager_test (#56521)
Sparks0219 Sep 15, 2025
f896b68
[core] Creating non-linux implementation for sysfs_cgroup_driver. (#…
israbbani Sep 15, 2025
33200dd
[image] add label for ray version and commit (#56493)
aslonnie Sep 15, 2025
7845f4c
[core] Fix `clang-format` pre-commit step (#56534)
edoakes Sep 15, 2025
f7ddcbe
[ci][deps] raydepsets: building ray img lockfiles (#56444)
elliot-barn Sep 15, 2025
03e5cd9
add more tests for async inf (#56408)
harshit-anyscale Sep 15, 2025
97e2b32
[SERVE] Proxy Actor Interface (#56288)
ok-scale Sep 15, 2025
237f792
stop ray instance in serve test logging (#56480)
abrarsheikh Sep 15, 2025
a78952e
[core][ci] Don't build cpp api in ci test container (#56517)
dayshah Sep 15, 2025
fba6fe7
[Data] Remove redundant check for initial size of actor pool (#56440)
owenowenisme Sep 15, 2025
72eb7a6
[core] Move `gcs_client` out of `gcs` directory (#56515)
edoakes Sep 15, 2025
e0548b1
Fix accidentally using task throughput instead of row throughput (#56…
alanwguo Sep 15, 2025
ee89e8c
use default gc frequency for proxy (#56511)
abrarsheikh Sep 15, 2025
ab72665
[ci] updating raydepsets llm check (#56439)
elliot-barn Sep 15, 2025
1d858e9
added shutdown fixture
avigyabb Sep 15, 2025
ac90fa0
[image] allow using explicit base type (#56545)
aslonnie Sep 15, 2025
28dd809
fix merge issue
avigyabb Sep 15, 2025
dc01c8a
[core] Remove `gcs_server` directory nesting (#56516)
edoakes Sep 15, 2025
212ce29
[image] change tag methods of container class to private (#56551)
aslonnie Sep 15, 2025
e5e4ae3
[image] add ray-llm image type check (#56542)
aslonnie Sep 15, 2025
13af56a
Add optional APIType filter to /api/serve/applications/ endpoint (#56…
jugalshah291 Sep 15, 2025
09b0826
Bump vLLM to 0.10.2 (#56535)
eicherseiji Sep 16, 2025
1ab7340
[core][ci] Install cpp api for java tests (#56571)
dayshah Sep 16, 2025
0c62bdb
[Data] - Optimize memory usage for One Hot Encoder (#56565)
goutamvenkat-anyscale Sep 16, 2025
47d6413
[release] Separate custom BYOD image build & set up image build depen…
khluu Sep 16, 2025
0b94c81
[core] Make Free Objects RPC Fault Tolerant (#56293)
Sparks0219 Sep 16, 2025
d2a3372
[data] skip loading dynamic modules for HF datasets >= v4.0.0 (#55908)
tohtana Sep 16, 2025
19bfc16
[Data][llm] Add chat_template_kwargs as option when building processo…
ahao-anyscale Sep 16, 2025
252c9e4
[docs] run apt-get update before install for ray debugger (#56579)
vie-serendipity Sep 16, 2025
aa04cb0
[Data] Fix a typo (#56587)
jjyao Sep 16, 2025
c11c858
Add parallelism parameter to read_snowflake (#56588)
gvspraveen Sep 16, 2025
b52ca5a
[ci] disable running release tests that have been long failing (#56561)
aslonnie Sep 16, 2025
de049f7
[Serve] Resolve test_deploy_app.py flakiness (#55721)
landscapepainter Sep 16, 2025
b69c091
[image] add test rules for image building files (#56554)
aslonnie Sep 16, 2025
bfe8139
[Core] Update cluster scheduler to handle label selector hard node id…
ryanaoleary Sep 16, 2025
f95c202
[Core][Autoscaler] Add `labels` to KubeRay autoscaling config (#56532)
ryanaoleary Sep 16, 2025
ff717bf
[Data] Allow `BlockOutputBuffer` to accept nullable output_block_size…
alexeykudinkin Sep 16, 2025
b25b7e2
[core] (cgroups 9/n) end-to-end integration of cgroups with ray start…
israbbani Sep 16, 2025
3a1750b
[core][1eventx/04] node event: send node events to the aggregator (#5…
can-anyscale Sep 16, 2025
3247290
[image] remove slim's dependency on normal bases (#56544)
aslonnie Sep 16, 2025
0516aa2
[Data][doc] Update on ray.data.Dataset.map() type hints (#52455)
chuang0221 Sep 16, 2025
23abd59
[data] Small typo fix (#56560)
omatthew98 Sep 17, 2025
a329636
[Data] Avoid unnecessary copying of blocks (#56569)
alexeykudinkin Sep 17, 2025
5bb44ca
[docs] Add guidance for matching Ray and Python versions with uv envi…
jmajety-dev Sep 17, 2025
7c28be2
[core] Make PinObjectIDs RPC Fault Tolerant (#56443)
Sparks0219 Sep 17, 2025
abd40b3
[core] (cgroups 10/n) Adding support in CgroupManager and CgroupDrive…
israbbani Sep 17, 2025
f8c60d2
[Data] Add documentation for `iter_torch_batches` `collate_fn` callab…
srinathk10 Sep 17, 2025
44a2360
[ci] rename `docker_login` to `ecr_docker_login` (#56622)
aslonnie Sep 17, 2025
416e365
[train] Abort reconciliation thread catches ray.util.state.get_actor …
TimothySeah Sep 17, 2025
220236f
[Train] Add PyTorch local mode support for multi-process training wit…
xinyuangui2 Sep 18, 2025
fe61bc9
[core][ci] Fix install cpp api for java tests (#56592)
dayshah Sep 18, 2025
e8e72e4
[image] add support for building and publishing ray-extra images (#56…
aslonnie Sep 18, 2025
8243e00
[core] Disable GcsNodeManager:TestRayEventRecorder (#56659)
can-anyscale Sep 18, 2025
c2bcf51
[core][otel] (yet another) clean up gauge metric cache (#56502)
can-anyscale Sep 18, 2025
a308e00
[Core][TaskEventFollowup/05] Make RayEventsTuple a Data Class (#56630)
MengjinYan Sep 18, 2025
dc96678
[ci] raydepsets: prehook args shlex split (2/?) (#56637)
elliot-barn Sep 18, 2025
9539786
[fix][core]A timeout should be set when submitting patch requests for…
daiping8 Sep 18, 2025
29bc824
[RLlib] Fix failing env step in `MultiAgentEnvRunner`. (#55567)
kamil-kaczmarek Sep 18, 2025
85420e1
remove PyBullet (#56698)
kamil-kaczmarek Sep 18, 2025
556ac28
[dashboard] Use pynvml for GPU metrics (#56000)
Bye-legumes Sep 18, 2025
4568cde
[core] Deleting unused variables inside the NodeManager. (#56641)
israbbani Sep 18, 2025
2330b62
[core] Cleanup metrics stuff + fix gcc >10 build (#56514)
dayshah Sep 18, 2025
492f383
[core] Migrate serialization from _private to _common (#56590)
vaishdho1 Sep 18, 2025
b03a8b1
[core] Fixing windows build with cgroups. Broken in cgroups 10/n. (#5…
israbbani Sep 18, 2025
77dead0
[Server] omit unnecessary newlines in the config generated by serve b…
daiping8 Sep 18, 2025
3142f3d
[core] Default main service metrics disabled (#56461)
ZacAttack Sep 18, 2025
41fdf01
[Data] Added support for projection pushdown into Parquet reads (#56500)
alexeykudinkin Sep 18, 2025
24013eb
Merge branch 'master' into avigyabb/util-collective-torch-gloo
avigyabb Sep 18, 2025
f2e6b29
[serve] Allow ProxyActor to return true/false for health check (#56660)
akyang-anyscale Sep 18, 2025
7f2065e
[ci][RLlib] [test utils] Improve error message on some test-failures …
Daraan Sep 18, 2025
ad2362e
[Data] [1/n] Predicate Expression Support (#56313)
goutamvenkat-anyscale Sep 18, 2025
74c93a8
[data] Reset external queue metrics (#56604)
iamjustinhsu Sep 18, 2025
77c83d2
Merge branch 'master' into avigyabb/util-collective-torch-gloo
Qiaolin-Yu Sep 18, 2025
a07ac97
lint
avigyabb Sep 18, 2025
7d46e48
fix failing test
avigyabb Sep 19, 2025
cbd5d30
revert change
avigyabb Sep 19, 2025
062cf0b
revert change
avigyabb Sep 19, 2025
6b03c4a
Update nixl_tensor_transport.py
avigyabb Sep 19, 2025
f2faccb
Update nixl_tensor_transport.py
avigyabb Sep 19, 2025
b413274
Update nixl_tensor_transport.py
avigyabb Sep 19, 2025
90a5cfa
removed exception for obj in a struct
avigyabb Sep 19, 2025
24 changes: 18 additions & 6 deletions .bazelrc
@@ -1,12 +1,11 @@
# Must be first. Enables build:windows, build:linux, build:macos, build:freebsd, build:openbsd
build --enable_platform_specific_config

build:linux --workspace_status_command="bash ./bazel/workspace_status.sh"

# Provides users an option to turn on strict action env.
# TODO(aslonnie): make this default; fix the python tests..
build --incompatible_strict_action_env
build:strict --incompatible_strict_action_env

build:linux --workspace_status_command="bash ./bazel/workspace_status.sh"

# To distinguish different incompatible environments.
build --action_env=RAY_BUILD_ENV

@@ -81,8 +80,6 @@ build:iwyu --output_groups=report
build:windows --attempt_to_print_relative_paths
# Save disk space by hardlinking cache hits instead of copying
build:windows --experimental_repository_cache_hardlinks
# Clean the environment before building, to make builds more deterministic
build:windows --incompatible_strict_action_env
# For colored output (seems necessary on Windows)
build:windows --color=yes
# For compiler colored output (seems necessary on Windows)
@@ -165,6 +162,18 @@ test:ci-base --test_output=errors
test:ci-base --test_verbose_timeout_warnings
test:ci-base --flaky_test_attempts=3

# Sending in PATH is required for tests to run on CI, after we enable
# --incompatible_strict_action_env, until we either convert all Python tests to
# hermetic tests -- which not only requires pinning all Python dependencies with bazel,
# but also requires building ray(test) wheel with bazel. Alternatively, we can
# also stop using bazel test to run ray's Python tests.
#
# This PATH test_env is intentionally not enabled on non-CI so that C/C++
# tests, which are all hermetic, can build, test and cache as intended, ray
# Python developers do not really use bazel test to run tests locally, but more
# often just run tests with "pytest" directly.
test:ci-base --test_env=PATH

build:ci --color=yes
build:ci --curses=no
build:ci --keep_going
@@ -220,6 +229,9 @@ build:cgroup --sandbox_writable_path=/sys/fs/cgroup --config=llvm
# ci/env/install-llvm-dependencies.sh
try-import %workspace%/.llvm-local.bazelrc

# Allow users to define custom options.
try-import %workspace%/.user.bazelrc

# Even with sandbox mode bazel prioritizes system headers over the ones in the sandbox.
# It picks up the system headers when someone has protobuf installed via Homebrew.
# Work around for https://github.com/bazelbuild/bazel/issues/8053
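Reviewer note on the `test:ci-base --test_env=PATH` hunk above: the comment there says CI needs the caller's `PATH` forwarded to tests once `--incompatible_strict_action_env` is on, while non-CI runs deliberately keep the fixed environment so hermetic C/C++ tests cache well. The following is only a toy Python model of that difference, not Bazel's implementation. `ASSUMED_STRICT_PATH` and `build_test_env` are hypothetical names, and the static `PATH` value is an assumption that may differ by platform and Bazel version.

```python
# Toy model only: how a forwarded PATH differs from a fixed one.
# ASSUMED_STRICT_PATH is an assumption, not a value taken from the diff.
ASSUMED_STRICT_PATH = "/bin:/usr/bin:/usr/local/bin"


def build_test_env(client_env, forwarded_vars=()):
    """Hypothetical helper: return the PATH-related env a test would see.

    Starts from a fixed PATH (modelling a strict action env) and then
    copies over only the variables named in forwarded_vars, which models
    a flag such as --test_env=PATH given without an explicit value.
    """
    env = {"PATH": ASSUMED_STRICT_PATH}
    for name in forwarded_vars:
        if name in client_env:
            env[name] = client_env[name]
    return env


# A developer shell whose PATH points at a virtualenv (made-up example path).
client = {"PATH": "/home/dev/venv/bin:/usr/bin"}

local_env = build_test_env(client)                        # non-CI: fixed PATH
ci_env = build_test_env(client, forwarded_vars=("PATH",))  # CI: PATH forwarded

print(local_env["PATH"])
print(ci_env["PATH"])
```

In this model the non-CI environment is identical for every caller, which is what makes cached test results reusable, while the CI environment depends on the caller's `PATH` and can therefore find Python tools installed outside Bazel.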
94 changes: 1 addition & 93 deletions .buildkite/_forge.rayci.yml
@@ -1,100 +1,8 @@
group: forge
sort_key: "_forge"
steps:
  - name: forge
    wanda: ci/docker/forge.wanda.yaml

  - name: manylinux
    wanda: ci/docker/manylinux.wanda.yaml

  - name: raycudabase
    label: "wanda: ray.py{{matrix.python}}.cu{{matrix.cuda}}.base"
    tags:
      - python_dependencies
      - docker
    wanda: ci/docker/ray.cuda.base.wanda.yaml
    matrix:
      setup:
        python:
          - "3.9"
          - "3.10"
          - "3.11"
          - "3.12"
        cuda:
          - "11.7.1-cudnn8"
          - "11.8.0-cudnn8"
          - "12.1.1-cudnn8"
          - "12.3.2-cudnn9"
          - "12.4.1-cudnn"
          - "12.5.1-cudnn"
          - "12.8.1-cudnn"
    env:
      PYTHON_VERSION: "{{matrix.python}}"
      CUDA_VERSION: "{{matrix.cuda}}"


  - name: raycpubase
    label: "wanda: ray.py{{matrix}}.cpu.base"
    tags:
      - python_dependencies
      - python
      - docker
      - tune
      - serve
    wanda: ci/docker/ray.cpu.base.wanda.yaml
    matrix:
      - "3.9"
      - "3.10"
      - "3.11"
      - "3.12"
    env:
      PYTHON_VERSION: "{{matrix}}"

  - name: ray-llmbase
    label: "wanda: ray-llm.py{{matrix.python}}.cu{{matrix.cuda}}.base"
    tags:
      - python_dependencies
      - docker
    wanda: ci/docker/ray-llm.base.wanda.yaml
    depends_on: raycudabase
    matrix:
      setup:
        python:
          - "3.11"
        cuda:
          - "12.8.1-cudnn"
    env:
      PYTHON_VERSION: "{{matrix.python}}"
      CUDA_VERSION: "{{matrix.cuda}}"

  - name: ray-mlcudabase
    label: "wanda: ray-ml.py{{matrix.python}}.cu{{matrix.cuda}}.base"
    tags:
      - python_dependencies
      - docker
    wanda: ci/docker/ray-ml.cuda.base.wanda.yaml
    depends_on: raycudabase
    matrix:
      setup:
        python:
          - "3.9"
          - "3.10"
          - "3.11"
        cuda:
          - "12.1.1-cudnn8"
    env:
      PYTHON_VERSION: "{{matrix.python}}"
      CUDA_VERSION: "{{matrix.cuda}}"

  - name: ray-mlcpubase
    label: "wanda: ray-ml.py{{matrix}}.cpu.base"
    tags:
      - python_dependencies
      - docker
    wanda: ci/docker/ray-ml.cpu.base.wanda.yaml
    depends_on: raycpubase
    matrix:
      - "3.9"
      - "3.10"
      - "3.11"
    env:
      PYTHON_VERSION: "{{matrix}}"
225 changes: 225 additions & 0 deletions .buildkite/_images.rayci.yml
@@ -0,0 +1,225 @@
group: images
sort_key: "_images"
steps:
  - name: raycpubase
    label: "wanda: ray-py{{matrix}}-cpu-base"
    tags:
      - python_dependencies
      - docker
    wanda: docker/base-deps/cpu.wanda.yaml
    matrix:
      - "3.9"
      - "3.10"
      - "3.11"
      - "3.12"
    env:
      PYTHON_VERSION: "{{matrix}}"
      ARCH_SUFFIX: ""

  - name: raycpubaseextra
    label: "wanda: ray-py{{matrix}}-cpu-base-extra"
    wanda: docker/base-extra/cpu.wanda.yaml
    matrix:
      - "3.9"
      - "3.10"
      - "3.11"
      - "3.12"
    env:
      PYTHON_VERSION: "{{matrix}}"
      IMAGE_TYPE: "ray"
      ARCH_SUFFIX: ""
    depends_on: raycpubase

  - name: raycudabase
    label: "wanda: ray-py{{matrix.python}}-cu{{matrix.cuda}}-base"
    tags:
      - python_dependencies
      - docker
    wanda: docker/base-deps/cuda.wanda.yaml
    matrix:
      setup:
        python:
          - "3.9"
          - "3.10"
          - "3.11"
          - "3.12"
        cuda:
          - "11.7.1-cudnn8"
          - "11.8.0-cudnn8"
          - "12.1.1-cudnn8"
          - "12.3.2-cudnn9"
          - "12.4.1-cudnn"
          - "12.5.1-cudnn"
          - "12.6.3-cudnn"
          - "12.8.1-cudnn"
    env:
      PYTHON_VERSION: "{{matrix.python}}"
      CUDA_VERSION: "{{matrix.cuda}}"
      ARCH_SUFFIX: ""

  - name: raycudabaseextra
    label: "wanda: ray-py{{matrix.python}}-cu{{matrix.cuda}}-base-extra"
    wanda: docker/base-extra/cuda.wanda.yaml
    matrix:
      setup:
        python:
          - "3.9"
          - "3.10"
          - "3.11"
          - "3.12"
        cuda:
          - "11.7.1-cudnn8"
          - "11.8.0-cudnn8"
          - "12.1.1-cudnn8"
          - "12.3.2-cudnn9"
          - "12.4.1-cudnn"
          - "12.5.1-cudnn"
          - "12.6.3-cudnn"
          - "12.8.1-cudnn"
    env:
      PYTHON_VERSION: "{{matrix.python}}"
      CUDA_VERSION: "{{matrix.cuda}}"
      IMAGE_TYPE: "ray"
      ARCH_SUFFIX: ""
    depends_on: raycudabase

  - name: ray-llmbase
    label: "wanda: ray-llm-py{{matrix.python}}-cu{{matrix.cuda}}-base"
    tags:
      - python_dependencies
      - docker
    wanda: docker/ray-llm/cuda.wanda.yaml
    depends_on: raycudabase
    matrix:
      setup:
        python:
          - "3.11"
        cuda:
          - "12.8.1-cudnn"
    env:
      PYTHON_VERSION: "{{matrix.python}}"
      CUDA_VERSION: "{{matrix.cuda}}"

  - name: ray-llmbaseextra
    label: "wanda: ray-llm-py{{matrix.python}}-cu{{matrix.cuda}}-base-extra"
    wanda: docker/base-extra/cuda.wanda.yaml
    matrix:
      setup:
        python:
          - "3.11"
        cuda:
          - "12.8.1-cudnn"
    env:
      PYTHON_VERSION: "{{matrix.python}}"
      CUDA_VERSION: "{{matrix.cuda}}"
      IMAGE_TYPE: "ray-llm"
      ARCH_SUFFIX: ""
    depends_on: ray-llmbase

  - name: ray-mlcpubase
    label: "wanda: ray-ml-py{{matrix}}-cpu-base"
    tags:
      - python_dependencies
      - docker
    wanda: docker/ray-ml/cpu.wanda.yaml
    depends_on: raycpubase
    matrix:
      - "3.9"
      - "3.10"
      - "3.11"
    env:
      PYTHON_VERSION: "{{matrix}}"

  - name: ray-mlcpubaseextra
    label: "wanda: ray-ml-py{{matrix}}-cpu-base-extra"
    wanda: docker/base-extra/cpu.wanda.yaml
    matrix:
      - "3.9"
      - "3.10"
      - "3.11"
    env:
      PYTHON_VERSION: "{{matrix}}"
      IMAGE_TYPE: "ray-ml"
      ARCH_SUFFIX: ""
    depends_on: ray-mlcpubase

  - name: ray-mlcudabase
    label: "wanda: ray-ml-py{{matrix.python}}-cu{{matrix.cuda}}-base"
    tags:
      - python_dependencies
      - docker
    wanda: docker/ray-ml/cuda.wanda.yaml
    depends_on: raycudabase
    matrix:
      setup:
        python:
          - "3.9"
          - "3.10"
          - "3.11"
        cuda:
          - "12.1.1-cudnn8"
    env:
      PYTHON_VERSION: "{{matrix.python}}"
      CUDA_VERSION: "{{matrix.cuda}}"

  - name: ray-mlcudabaseextra
    label: "wanda: ray-ml-py{{matrix.python}}-cu{{matrix.cuda}}-base-extra"
    wanda: docker/base-extra/cuda.wanda.yaml
    matrix:
      setup:
        python:
          - "3.9"
          - "3.10"
          - "3.11"
        cuda:
          - "12.1.1-cudnn8"
    env:
      PYTHON_VERSION: "{{matrix.python}}"
      CUDA_VERSION: "{{matrix.cuda}}"
      IMAGE_TYPE: "ray-ml"
      ARCH_SUFFIX: ""
    depends_on: ray-mlcudabase

  - name: ray-slimcpubase
    label: "wanda: ray-slim-py{{matrix}}-cpu-base"
    tags:
      - python_dependencies
      - docker
      - skip-on-release-tests
    wanda: docker/base-slim/cpu.wanda.yaml
    matrix:
      - "3.9"
      - "3.10"
      - "3.11"
      - "3.12"
    env:
      PYTHON_VERSION: "{{matrix}}"
      ARCH_SUFFIX: ""

  - name: ray-slimcudabase
    label: "wanda: ray-slim-py{{matrix.python}}-cu{{matrix.cuda}}-base"
    tags:
      - python_dependencies
      - docker
      - skip-on-release-tests
    wanda: docker/base-slim/cuda.wanda.yaml
    matrix:
      setup:
        python:
          - "3.9"
          - "3.10"
          - "3.11"
          - "3.12"
        cuda:
          - "11.7.1"
          - "11.8.0"
          - "12.1.1"
          - "12.3.2"
          - "12.4.1"
          - "12.5.1"
          - "12.6.3"
          - "12.8.1"
    env:
      PYTHON_VERSION: "{{matrix.python}}"
      CUDA_VERSION: "{{matrix.cuda}}"
      ARCH_SUFFIX: ""
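Reviewer note on the matrix steps in `.buildkite/_images.rayci.yml`: the sketch below estimates how many image builds the `raycudabase` step fans out to. It assumes the `setup` matrix expands as a full Cartesian product of `python` and `cuda` and that `{{matrix.*}}` placeholders are plain string substitutions; the diff does not confirm either. `expand_matrix` is a hypothetical helper, not part of rayci or wanda. The version lists and the label template are copied from the step definition.

```python
from itertools import product

# Values copied from the `raycudabase` step in _images.rayci.yml.
PYTHON_VERSIONS = ["3.9", "3.10", "3.11", "3.12"]
CUDA_VERSIONS = [
    "11.7.1-cudnn8", "11.8.0-cudnn8", "12.1.1-cudnn8", "12.3.2-cudnn9",
    "12.4.1-cudnn", "12.5.1-cudnn", "12.6.3-cudnn", "12.8.1-cudnn",
]
LABEL_TEMPLATE = "wanda: ray-py{{matrix.python}}-cu{{matrix.cuda}}-base"


def expand_matrix(template, python_versions, cuda_versions):
    """Hypothetical helper: one job per (python, cuda) pair.

    Assumes Cartesian-product expansion and literal placeholder
    substitution, which is how this sketch models the matrix.
    """
    jobs = []
    for py, cuda in product(python_versions, cuda_versions):
        label = (
            template.replace("{{matrix.python}}", py)
            .replace("{{matrix.cuda}}", cuda)
        )
        env = {"PYTHON_VERSION": py, "CUDA_VERSION": cuda, "ARCH_SUFFIX": ""}
        jobs.append({"label": label, "env": env})
    return jobs


jobs = expand_matrix(LABEL_TEMPLATE, PYTHON_VERSIONS, CUDA_VERSIONS)
print(len(jobs))         # 4 Python versions x 8 CUDA versions
print(jobs[0]["label"])  # first combination in declaration order
```

Under those assumptions the step produces 32 jobs. `raycudabaseextra` lists the same 4 Python and 8 CUDA versions, so the same model gives it 32 jobs as well.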