
Tensor Parallel Inside notebook

Open nivibilla opened this issue 1 year ago • 3 comments

Hi,

I'm trying to get an example working with Ray on Databricks, essentially running multiple replicas of the model. Is it possible to load a model with tensor parallelism inside a notebook?

Thanks

nivibilla avatar Apr 29 '24 19:04 nivibilla

Yes, you can. gpt-fast uses torchrun to automatically dispatch work across the GPUs. You can run it just like a regular Python script.

yanboliang avatar Sep 16 '24 04:09 yanboliang

Are notebooks compatible with torchrun? Last I heard, they have limited (or no) multiprocessing support.

kwen2501 avatar Sep 16 '24 05:09 kwen2501

Right, I think you have to find a notebook environment that supports multiprocessing.

yanboliang avatar Sep 16 '24 05:09 yanboliang
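
One possible workaround, sketched here as an assumption rather than something confirmed in this thread: instead of relying on multiprocessing inside the notebook kernel, a cell can start torchrun as an external process with Python's `subprocess` module. The helper names `build_torchrun_cmd` and `launch` are hypothetical, and the checkpoint path in the usage comment is a placeholder. Whether this works on a given Databricks or Ray setup depends on the cluster and has not been verified here.

```python
import subprocess


def build_torchrun_cmd(script, nproc_per_node, script_args=()):
    """Build a single-node torchrun command as an argument list.

    `script` and `script_args` are whatever you would normally pass on the
    command line; nothing here is specific to gpt-fast.
    """
    return [
        "torchrun",
        "--standalone",                        # single node, no external rendezvous
        f"--nproc_per_node={nproc_per_node}",  # one worker process per GPU
        script,
        *script_args,
    ]


def launch(cmd):
    """Run the command as a child process and return its exit code.

    The distributed workers live outside the notebook kernel, so the
    kernel itself never has to fork or spawn them.
    """
    completed = subprocess.run(cmd, check=False)
    return completed.returncode


# Example (placeholder checkpoint path, adjust to your own setup):
# cmd = build_torchrun_cmd(
#     "generate.py", 2, ["--checkpoint_path", "checkpoints/model.pth"]
# )
# exit_code = launch(cmd)
```

With this approach the model lives in the torchrun worker processes, not in the notebook kernel, so results have to come back through stdout, files, or some other channel.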