ray icon indicating copy to clipboard operation
ray copied to clipboard

[RLlib] Add NPU and HPU support to RLlib

Open liuxsh9 opened this issue 11 months ago • 9 comments

Why are these changes needed?

Support for more accelerators in RLlib by allowing learners to configure custom_resources. By building on Ray Train's existing compatibility feature (#44086 ), extended RLlib to support NPU and HPU.

Related issue number

Checks

  • [ ] I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
  • [ ] I've run scripts/format.sh to lint the changes in this PR.
  • [ ] I've included any doc changes needed for https://docs.ray.io/en/master/.
    • [ ] I've added any new APIs to the API Reference. For example, if I added a method in Tune, I've added it in doc/source/tune/api/ under the corresponding .rst file.
  • [ ] I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • [ ] Unit tests
    • [ ] Release tests
    • [ ] This PR is not tested :(

liuxsh9 avatar Jan 02 '25 11:01 liuxsh9

thanks

chenfei8888 avatar Jan 07 '25 08:01 chenfei8888

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.

  • If you'd like to keep this open, just leave any comment, and the stale label will be removed.

stale[bot] avatar Feb 25 '25 00:02 stale[bot]

thanks

chenfei8888 avatar Feb 27 '25 13:02 chenfei8888

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.

  • If you'd like to keep this open, just leave any comment, and the stale label will be removed.

stale[bot] avatar Apr 26 '25 02:04 stale[bot]

Is there any active plan for rllib to adapt to more hardware? Thx. @jcotant1

liuxsh9 avatar May 21 '25 05:05 liuxsh9

Hey @liuxsh9 thanks for the question, I've flagged it to the RLlib team and someone should be reaching out soon

jcotant1 avatar May 28 '25 21:05 jcotant1

Hi @liuxsh9 thanks for this PR! I have taken a look at it. Is there any chance this could be easily tested within our CI tests? I have to find a way there to test with custom resources. Which custom resource would you see there?

simonsays1980 avatar May 29 '25 09:05 simonsays1980

This pull request has been automatically marked as stale because it has not had any activity for 14 days. It will be closed in another 14 days if no further activity occurs. Thank you for your contributions.

You can always ask for help on our discussion forum or Ray's public slack channel.

If you'd like to keep this open, just leave any comment, and the stale label will be removed.

github-actions[bot] avatar Jun 13 '25 00:06 github-actions[bot]

thx

chenfei8888 avatar Jun 15 '25 04:06 chenfei8888

This pull request has been automatically marked as stale because it has not had any activity for 14 days. It will be closed in another 14 days if no further activity occurs. Thank you for your contributions.

You can always ask for help on our discussion forum or Ray's public slack channel.

If you'd like to keep this open, just leave any comment, and the stale label will be removed.

github-actions[bot] avatar Jul 01 '25 00:07 github-actions[bot]

This pull request has been automatically closed because there has been no more activity in the 14 days since being marked stale.

Please feel free to reopen or open a new pull request if you'd still like this to be addressed.

Again, you can always ask for help on our discussion forum or Ray's public slack channel.

Thanks again for your contribution!

github-actions[bot] avatar Jul 15 '25 12:07 github-actions[bot]