Commit 48a6e7f
[Data] Fix resource reservation by excluding completed operators' usages (#56319)
<!-- Thank you for your contribution! Please review
https://github.com/ray-project/ray/blob/master/CONTRIBUTING.rst before
opening a pull request. -->
<!-- Please add a reviewer to the assignee section when you create a PR.
If you don't have the access to it, we will shortly find a reviewer and
assign them to your PR. -->
## Problem
The `ReservationOpResourceAllocator` was incorrectly accounting for
resource usage when calculating available resources for reservation.
Specifically, it wasn't properly handling completed operators who have
blocks in the output queue.
The `ReadFiles` operator below consumes 50 GB of object store memory and
should be excluded from reservation, but it is currently not.
<img width="1628" height="281" alt="image"
src="https://github.com/user-attachments/assets/8a80902d-7f88-4263-bc97-a3dee519b401"
/>
## Solution
Added logic to identify and subtract resource usage specifically from
completed physical operators:
## Testing results
Before the fix
<img width="958" height="653" alt="image"
src="https://github.com/user-attachments/assets/432dc94e-bbe1-4ecb-b1c3-a6e201da724a"
/>
After the fix
<img width="1567" height="700" alt="image"
src="https://github.com/user-attachments/assets/2b780d68-a208-4c6b-a250-dee0823e9083"
/>
## Checks
- [ ] I've signed off every commit(by using the -s flag, i.e., `git
commit -s`) in this PR.
- [ ] I've run `scripts/format.sh` to lint the changes in this PR.
- [ ] I've included any doc changes needed for
https://docs.ray.io/en/master/.
- [ ] I've added any new APIs to the API Reference. For example, if I
added a
method in Tune, I've added it in `doc/source/tune/api/` under the
corresponding `.rst` file.
- [ ] I've made sure the tests are passing. Note that there might be a
few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
- [x] Unit tests
- [ ] Release tests
- [ ] This PR is not tested :(
---------
Signed-off-by: xgui <xgui@anyscale.com>
Signed-off-by: Xinyuan <43737116+xinyuangui2@users.noreply.github.com>
Co-authored-by: Alexey Kudinkin <alexey.kudinkin@gmail.com>1 parent f4dc12e commit 48a6e7f
File tree
2 files changed
+393
-2
lines changed- python/ray/data
- _internal/execution
- tests
2 files changed
+393
-2
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
493 | 493 | | |
494 | 494 | | |
495 | 495 | | |
| 496 | + | |
| 497 | + | |
| 498 | + | |
| 499 | + | |
| 500 | + | |
| 501 | + | |
| 502 | + | |
| 503 | + | |
| 504 | + | |
| 505 | + | |
| 506 | + | |
| 507 | + | |
| 508 | + | |
| 509 | + | |
| 510 | + | |
| 511 | + | |
| 512 | + | |
| 513 | + | |
| 514 | + | |
| 515 | + | |
| 516 | + | |
| 517 | + | |
| 518 | + | |
| 519 | + | |
| 520 | + | |
| 521 | + | |
| 522 | + | |
| 523 | + | |
| 524 | + | |
| 525 | + | |
| 526 | + | |
496 | 527 | | |
497 | | - | |
| 528 | + | |
498 | 529 | | |
499 | 530 | | |
500 | 531 | | |
501 | 532 | | |
502 | 533 | | |
503 | | - | |
504 | 534 | | |
505 | 535 | | |
506 | 536 | | |
507 | 537 | | |
| 538 | + | |
| 539 | + | |
| 540 | + | |
| 541 | + | |
| 542 | + | |
| 543 | + | |
| 544 | + | |
| 545 | + | |
508 | 546 | | |
509 | 547 | | |
510 | 548 | | |
| |||
0 commit comments