stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
https://fairscale.readthedocs.io/en/stable/_modules/fairscale/nn/data_parallel/fully_sharded_data_parallel.html#FullyShardedDataParallel.consolidate_shard_weights
How to run inference after finetuning?
```
./main -m ./models/ggml-vicuna-13b-4bit.bin --color -f ./prompts/alpaca.txt -ins -b 256 --top_k 10000 --temp 0.2 --repeat_penalty 1 -t 7
```
Below is the conversation, which is good as far as it goes. If...
response is none and the generated_token_ids are all 7
As the title says, I found that the content of llama-Xb-hf/tokenizer_config.json is the following:
```
{"bos_token": "", "eos_token": "", "model_max_length": 1000000000000000019884624838656, "tokenizer_class": "LLaMATokenizer", "unk_token": ""}
```
How did your team modify...
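For readers hitting the same empty-token config, a minimal standard-library sketch of inspecting and patching it. The config values mirror the snippet quoted above; the patched special-token strings (`<s>`, `</s>`, `<unk>`) are an assumption based on the usual LLaMA vocabulary, not something stated in this issue.

```python
import json

# tokenizer_config.json contents as quoted in the issue above
config = {
    "bos_token": "",
    "eos_token": "",
    "model_max_length": 1000000000000000019884624838656,
    "tokenizer_class": "LLaMATokenizer",
    "unk_token": "",
}

# Hypothetical patch: fill in the special tokens that the converted
# config leaves empty (token strings assumed from the LLaMA vocab).
config.update({"bos_token": "<s>", "eos_token": "</s>", "unk_token": "<unk>"})

# Serialize back out the way transformers would read it
print(json.dumps(config, indent=2))
```

Writing the patched dict back to `tokenizer_config.json` with `json.dump` would then take effect the next time the tokenizer is loaded from that directory.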
Python: 3.9.5, Ubuntu 20.04.6 LTS
```
WARNING:torch.distributed.run:
*****************************************
Setting OMP_NUM_THREADS environment variable for each process to be 1 in default, to avoid your system being overloaded, please further tune the...
```
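The warning above is informational: the launcher defaults `OMP_NUM_THREADS` to 1 per process unless you set it yourself. A minimal sketch of setting it explicitly before launching (the value 8 is only an example to tune per machine):

```shell
# Hypothetical: choose a per-process OpenMP thread count explicitly,
# which also silences the torch.distributed.run default-value warning.
export OMP_NUM_THREADS=8
echo "OMP_NUM_THREADS=$OMP_NUM_THREADS"
```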
Bumped into this whilst looking at the file. Surely this must have been a typo introduced whilst assembling it?
I used to run Alpaca successfully on my own dataset last year. But when I reran train.py recently, both the Alpaca dataset and my own dataset failed for...
When I was training, this error occurred
Test
Test needs to be removed.

This change is [Reviewable](https://reviewable.io/reviews/tatsu-lab/stanford_alpaca/311)