Charles Srisuwananukorn comments

Results 56 comments of


                                            Charles Srisuwananukorn

change wikipedia folder name

Actually, @mauriceweber, can you take a look?

Can I fine tune GPT-Neo-XT-Chat-Base-20B with 8 A100?

We train this model on 8x A100 80GB GPUs. I'll update the README. > I... submit a request for a mini model to do sanity checks on local systems and...

Can I fine tune GPT-Neo-XT-Chat-Base-20B with 8 A100?

About an hour per 100 steps. Usually, we fine-tune for a couple days.

Fix typos and add a typo checker to GitHub Actions

Thank you for the PR, @shirayu! This looks great. I'd like to review a couple things tomorrow before merging. Please stay tuned.

Conda takes too long to install dependencies

After some research, many projects seem to be recommending `mamba` for faster installation (see [this article](https://pythonspeed.com/articles/faster-conda-install/) for more details). I just tested it, and it does seem much faster. Installing...

Conda takes too long to install dependencies

Training ran fine. I'll update the README to suggest using mamba.

One issue on env ResolvePackageNotFound

I believe it also does not work on macOS. These packages require NVIDIA GPUs, which most Macs do not have.

Add print statements to `pretrained/GPT-NeoX-20B/prepare.py` to show progress

@LorrinWWW, this is a version of your script for sharding the base model. Could you please take a look?

Add documentation for running inference on multiple GPUs

I've seen this issue when running out of GPU RAM. Unfortunately, the model requires an A100 80GB right now. Are you using an A100 40GB?