Vitaliy Chiley

Results 64 comments of Vitaliy Chiley

This can't be done within a `setup.py` file in my project...

If you are able to modify the code, could you try setting `inplace` [here](https://github.com/mosaicml/llm-foundry/blob/main/llmfoundry/models/layers/attention.py#L175) to `False`?
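For context on what flipping that flag means: an `inplace` option typically controls whether an op mutates its input buffer or returns a fresh copy; mutating in place can interact badly with other code that still holds the original values. A minimal stdlib-only sketch of that flag pattern (the function and names here are illustrative, not llm-foundry's actual code):

```python
def scale(x, factor, inplace=True):
    """Scale a list of values, optionally in place.

    Mirrors the common `inplace` flag convention: when True, the
    input list is mutated and returned; when False, a new list is
    built and the input is left untouched.
    """
    if inplace:
        for i in range(len(x)):
            x[i] *= factor
        return x
    return [v * factor for v in x]

a = [1.0, 2.0]
b = scale(a, 2.0, inplace=False)
# a is untouched: [1.0, 2.0]; b holds the scaled copy: [2.0, 4.0]
```

Setting `inplace=False` trades a little extra memory for the guarantee that nothing else holding a reference to the input sees it change underneath them.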

@samhavens should we also add the option to not predict BOS (assuming the previous token is the end of the previous sequence)?
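The idea above can be sketched as a loss mask over next-token targets: when sequences are packed back to back, a position whose target is BOS would ask the model to predict the start of an unrelated sequence, so that position's loss is zeroed. A minimal sketch, assuming BOS id 0 and simple Python lists (a real implementation would operate on tensors):

```python
BOS = 0  # assumed BOS token id for illustration

# Two packed sequences, each starting with BOS.
tokens = [0, 5, 6, 7, 0, 8, 9]

# Next-token-prediction targets: each position predicts the following token.
targets = tokens[1:]

# Zero the loss wherever the target is BOS, i.e. where the "previous
# token" is actually the end of the previous packed sequence.
loss_mask = [0 if t == BOS else 1 for t in targets]
# loss_mask == [1, 1, 1, 0, 1, 1]
```

Whether to expose this as an option depends on the packing scheme; with attention already blocked between packed sequences, predicting BOS carries no useful signal.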

> The implementation currently supports multihead and grouped query attention. I was not able to find a good way to parallelize the attention bias with ALiBi in this setting -...