Charles Srisuwananukorn
For inference, I saw that some folks on Discord were able to run the model on multiple cards [in this thread](https://discordapp.com/channels/1082503318624022589/1082510608123056158/1084210191635058759). I haven't had a chance to try it myself. For the...
Of course! https://discord.gg/9Rk6sSeWEG
I'm re-purposing this issue to track adding multi-GPU inference documentation to the repo.
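In the meantime, here's a rough sketch of what multi-GPU inference could look like using Hugging Face Accelerate's `device_map="auto"`, which shards the model across all visible GPUs. The model name and the `<human>:`/`<bot>:` prompt format match what we publish, but treat the dtype and generation settings as assumptions rather than the repo's official inference path:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "togethercomputer/GPT-NeoXT-Chat-Base-20B"

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map="auto",   # shard layers across all visible GPUs (needs `pip install accelerate`)
    torch_dtype="auto",  # keep the checkpoint's native dtype
)

prompt = "<human>: Hello!\n<bot>:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

No promises this matches what folks did in the Discord thread, but it's a reasonable starting point until we land proper docs.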
@zhangce, any updates on this?
Thanks for the patch, @juncongmoo! Unfortunately, the `from_raw_prompt` method that we released was only half implemented. It needs to create an instance of `Conversation`, passing a human id and a...
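For reference, a completed version might look roughly like the sketch below. The class skeleton, constructor signature, and `_prompt` attribute name are assumptions about how `Conversation` is put together, not the final implementation:

```python
class Conversation:
    def __init__(self, human_id, bot_id):
        self._human_id = human_id
        self._bot_id = bot_id
        self._prompt = ""

    @classmethod
    def from_raw_prompt(cls, raw_prompt, human_id, bot_id):
        # Hypothetical sketch: build a Conversation with the given ids
        # and seed it with the existing raw prompt text.
        conversation = cls(human_id, bot_id)
        conversation._prompt = raw_prompt
        return conversation
```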
Sorry, the README is out of date. I'll delete the README.
You'll need more than 40GB of VRAM to run the model. An 80GB A100 is definitely enough. A 48GB A40 might work, but that would be cutting it a little...
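For a quick sanity check (assuming the 20B-parameter chat model): 20B parameters at 2 bytes each in fp16 is already about 37 GiB for the weights alone, before activations and the KV cache. Something like this prints the back-of-envelope number alongside your GPU's actual capacity:

```python
import torch

# Weights alone: 20B params * 2 bytes (fp16), before activations/KV cache.
weight_gib = 20e9 * 2 / 1024**3
print(f"weights alone: ~{weight_gib:.0f} GiB")  # ~37 GiB

# Compare against what the GPU actually has.
if torch.cuda.is_available():
    total = torch.cuda.get_device_properties(0).total_memory
    print(f"GPU 0 capacity: {total / 1024**3:.0f} GiB")
```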
Thank you for the detailed bug report. Let me try to reproduce this.
It looks like NVIDIA's [nccl](https://github.com/NVIDIA/nccl) only supports Linux, I'm afraid. I don't see any packages built for Windows on conda-forge.
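If you want to experiment anyway, PyTorch's `gloo` backend does ship on Windows and can stand in for NCCL in `torch.distributed`, though it's CPU-oriented and I haven't tested it with this repo. A minimal sketch:

```python
import platform

import torch.distributed as dist

# NCCL is Linux-only; fall back to gloo elsewhere. Assumes the usual
# env:// rendezvous variables (MASTER_ADDR, MASTER_PORT, RANK,
# WORLD_SIZE) are already set.
backend = "nccl" if platform.system() == "Linux" else "gloo"
dist.init_process_group(backend=backend)
print(f"initialized {backend} backend, rank {dist.get_rank()}")
```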
Thanks for responding, @davismartens!