Suraj Subramanian
@jiaohuix - thanks for your work! I can help you write a blog about this for the wider community to learn from. Please email me at subramen-at-meta-dot-com if you're interested!
There's a fine-tuning script at https://github.com/facebookresearch/llama-recipes/blob/main/llama_finetuning.py which you could adapt for pretraining. Section 2 of the paper (https://arxiv.org/pdf/2307.09288.pdf) has the hyperparameters used for pretraining.
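For quick reference, here is a sketch of the Section 2 pretraining hyperparameters collected into a plain dict. The dict keys are illustrative only (they are not llama-recipes config fields), so you'd need to map them onto whatever config object the fine-tuning script exposes:

```python
# Pretraining hyperparameters reported in Section 2 of the Llama 2 paper.
# Key names are hypothetical -- map them onto the recipe's own train config.
llama2_pretraining_hparams = {
    "optimizer": "AdamW",
    "adam_beta1": 0.9,
    "adam_beta2": 0.95,
    "adam_eps": 1e-5,
    "lr_schedule": "cosine",
    "warmup_steps": 2000,
    "final_lr_fraction": 0.10,   # decay to 10% of the peak learning rate
    "peak_lr": 3e-4,             # 7B/13B; the 34B/70B models used 1.5e-4
    "weight_decay": 0.1,
    "grad_clip_norm": 1.0,
    "context_length": 4096,
}
```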
Hello @Vatsal1106Virani, take a look at https://github.com/facebookresearch/llama-recipes/tree/main/demo_apps which has many code samples to get you started.
The error seems to be occurring because the checkpoint directory path has spaces that haven't been escaped: `AssertionError: no checkpoint files found in D:\Coding\Environment\Llama`. Maybe try quoting the path, e.g. `--ckpt_dir "D:\Coding\Environment\Llama 2\llama-main\llama-2-13b"`, or...
The default MP sharding for llama-2-70b-chat is 8, so you shouldn't be facing this error. llama-2-13b has an MP of 2... is it possible that you are accidentally using that model instead?...
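One way to double-check which weights you actually downloaded is to count the `*.pth` shards in the checkpoint directory, since the loader expects one shard per model-parallel rank. A minimal sketch (the local path below is just a placeholder):

```python
from pathlib import Path

def count_mp_shards(ckpt_dir: str) -> int:
    """Count the .pth checkpoint shards in a directory.

    There is one shard per model-parallel rank, so llama-2-13b should
    show 2 shards and llama-2-70b-chat should show 8.
    """
    return len(sorted(Path(ckpt_dir).glob("*.pth")))

# Hypothetical local path -- point this at your own download.
print(count_mp_shards("llama-2-70b-chat"))  # expect 8; 2 means you grabbed the 13B weights
```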
Closing this issue. @bhargavanubavam, feel free to reopen when you have more information.
Serving frameworks like vLLM or TGI are mainly optimized for GPU usage, but they also have support for parallel inference and memory management that might be useful. Some examples [here](https://github.com/facebookresearch/llama-recipes/blob/main/demo_apps/llama-on-prem.md)
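As a minimal sketch of what serving with vLLM looks like (the model name and sampling settings below are just placeholders, not a recommendation):

```python
# Minimal vLLM sketch -- model name and sampling settings are placeholders.
from vllm import LLM, SamplingParams

llm = LLM(model="meta-llama/Llama-2-7b-chat-hf")  # any HF-format Llama 2 checkpoint
sampling = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=256)

outputs = llm.generate(["Explain model parallelism in one paragraph."], sampling)
for out in outputs:
    print(out.outputs[0].text)
```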
Hi @dnatarajan00! Using `total_len = min(params.max_seq_len, max_gen_len + min_prompt_len)` appears to be semantically more correct, but note that it **reduces the effective `max_gen_len` for longer prompts**. In your example, the...
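To make the tradeoff concrete, here is a small worked example with made-up numbers (not the values from the original issue): once `total_len` is capped by the shortest prompt, any longer prompt in the batch has less than `max_gen_len` tokens of generation budget left.

```python
# Illustrative numbers only -- not the values from the original issue.
max_seq_len = 512
max_gen_len = 128
prompt_lens = [20, 100]            # a short and a long prompt in the same batch
min_prompt_len = min(prompt_lens)  # 20

# Proposed variant: cap the buffer at max_gen_len + min_prompt_len
total_len = min(max_seq_len, max_gen_len + min_prompt_len)  # min(512, 148) = 148

for p in prompt_lens:
    room = total_len - p
    print(f"prompt_len={p}: at most {room} new tokens (max_gen_len was {max_gen_len})")
# prompt_len=20:  at most 128 new tokens -- the full max_gen_len
# prompt_len=100: at most 48 new tokens -- the longer prompt loses generation budget
```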
Hi @sgsharma2000, firstly, thank you for submitting this (fairly large) PR! Reviewing your proposed changes will be much easier if you could split them across multiple smaller and narrowly-scoped...
@samuelselvan - can you take a look, please?