feat: end-to-end supervised fine-tuning for text classification
Description
Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly:
- [ X] Follow the
CONTRIBUTINGGuide. - [ X] You are listed as the author in your notebook or README file.
- [ X] Your account is listed in
CODEOWNERSfor the file(s).
- [ X] Your account is listed in
- [ X] Make your Pull Request title in the https://www.conventionalcommits.org/ specification.
- [ X] Ensure the tests and linter pass (Run
nox -s formatfrom the repository root to format). - [ X] Appropriate docs were updated (if necessary)
Fixes #<issue_number_goes_here> 🦕
Check out this pull request on ![]()
See visual diffs & provide feedback on Jupyter Notebooks.
Powered by ReviewNB
Is this PR supposed to be a replacement for #787 ?
Yes. Currently fixing the notebook format and lint errors. Will update it soon.
@holtskinner I went through all the errors from lint code base and lint notebook errors and fixed them. The remaining ones make no sense to me. As per spelling errors, the errors are words likeICML (where dataset was published) last names from the authors of the dataset, automl and lopgrobs, which are not spelling errors.
I've added some of the terms to the spelling allowlist. You can see the remaining errors in the summary. It explains what the errors are and what they should be.
https://github.com/GoogleCloudPlatform/generative-ai/actions/runs/10254677291?pr=919
Thanks Holt! All Spelling errors are now fixed. The Lint Notebook and Lint Code errors remain, as I have made different changes but nothing fix them.
I would also recommend putting the methods in the python file in the notebook itself so the users won't need to download another file when using colab.
@holtskinner fixed all errors and refactored as per suggestions. Erwin has recommended to have the "migration" functions separately for portability, but overall I agree that having inside the same colab is easier, so just put them back in the notebook as well and deleted the separate file.
Many thanks for all the great feedback. LMK if sth else needs to be changed to finally merge it into main :)