codeflare-sdk
codeflare-sdk copied to clipboard
Updating training script in basic_interactive notebook
Issue link
What changes have been made
When running GPU utilising workloads using the basic_interactive demo notebook I encountered a lot of errors while running the training script provided in the notebook. As it was added more than a year ago and may be out of date I have updated the notebook to use the recently added mnist_fashion.py training script instead.
Verification steps
- Install RHOAI with the CodeFlare components set to
Managed - Ensure that your cluster has GPU nodes as required.
- Within a data science project workbench clone this branch
git clone https://github.com/Fiona-Waters/codeflare-sdk.git -b udpate-notebook-script
- Run the basic_interactive notebook and ensure that it works as expected.
Checks
- [ ] I've made sure the tests are passing.
- Testing Strategy
- [ ] Unit tests
- [X] Manual tests
- [ ] Testing is not required for this change
/retest
This PR just needs a rebase and it should pass the e2e test
/lgtm
/approve
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: astefanutti, Bobbins228
The full list of commands accepted by this bot can be found here.
The pull request process is described here
- ~~OWNERS~~ [Bobbins228,astefanutti]
Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment