Kevin Canwen Xu
This is for top-p sampling: we mask out tokens that fall outside the top-p nucleus (p = 0.95 by default).
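In case it helps, here is a minimal sketch of that masking step in PyTorch; the function name `top_p_mask` and the shapes are illustrative, not the exact code from the demo.

```python
# Minimal top-p (nucleus) masking sketch; illustrative, not the demo's exact implementation.
import torch

def top_p_mask(logits: torch.Tensor, top_p: float = 0.95) -> torch.Tensor:
    """Set logits outside the top-p nucleus to -inf so they can't be sampled."""
    sorted_logits, sorted_idx = torch.sort(logits, descending=True)
    cum_probs = torch.cumsum(torch.softmax(sorted_logits, dim=-1), dim=-1)
    drop = cum_probs > top_p            # tokens past the cumulative-probability cutoff
    drop[..., 1:] = drop[..., :-1].clone()  # shift right so the token crossing p is kept
    drop[..., 0] = False                # always keep the most probable token
    # Map the mask back to the original (unsorted) token order.
    mask = torch.zeros_like(drop).scatter(-1, sorted_idx, drop)
    return logits.masked_fill(mask, float("-inf"))

# Usage: sample the next token from the masked distribution.
logits = torch.randn(1, 32000)                      # fake vocabulary logits
probs = torch.softmax(top_p_mask(logits), dim=-1)
next_token = torch.multinomial(probs, num_samples=1)
```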
Are you using LLaMA as the foundation model? If so, since LLaMA has no Chinese pretraining data, it's not very surprising that the outcome isn't very good. BLOOMZ may be a better choice, since its pretraining data includes Chinese.
Simply:

```bash
pip uninstall transformers
pip install transformers
```
You can try installing from source! Follow this page: https://huggingface.co/docs/transformers/installation#installing-from-source or run `pip install git+https://github.com/huggingface/transformers.git`
> However, a search of that Hugging Face repository finds no mention of `LlamaForCausalLM`

It's defined here: https://github.com/huggingface/transformers/blob/main/src/transformers/models/llama/modeling_llama.py
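If it still seems missing locally, a quick import check can confirm your installed version; `LlamaForCausalLM` was only added in transformers 4.28.0, so older installs won't have it. This snippet is just a sanity check, not part of the Baize code:

```python
# Sanity check: confirm the installed transformers version includes the LLaMA classes.
import transformers
print(transformers.__version__)            # needs to be >= 4.28.0
from transformers import LlamaForCausalLM  # raises ImportError on older versions
print(LlamaForCausalLM)
```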
In https://github.com/project-baize/baize/blob/ccf0bb8485657b7c16a57456bbb835503bac2456/demo/app.py#L18, change that line to `tokenizer, model, device = load_tokenizer_and_model(base_model, adapter_model, load_8bit=True)` to load the model in 8-bit.
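For reference, here is a rough sketch of what `load_8bit=True` amounts to: load the base model in 8-bit via bitsandbytes and attach the LoRA adapter with peft. The model names below are example placeholders, and the actual `load_tokenizer_and_model` in the demo may differ in details.

```python
# Rough sketch of 8-bit loading with a LoRA adapter; the real
# load_tokenizer_and_model in demo/app.py may differ in details.
# Requires bitsandbytes and accelerate to be installed.
import torch
from transformers import LlamaForCausalLM, LlamaTokenizer
from peft import PeftModel

base_model = "decapoda-research/llama-7b-hf"   # example base model name
adapter_model = "project-baize/baize-lora-7B"  # example adapter name

tokenizer = LlamaTokenizer.from_pretrained(base_model)
model = LlamaForCausalLM.from_pretrained(
    base_model,
    load_in_8bit=True,       # quantize weights to int8 with bitsandbytes
    torch_dtype=torch.float16,
    device_map="auto",       # let accelerate place layers on available devices
)
model = PeftModel.from_pretrained(model, adapter_model)
model.eval()
```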
Of course, but it won't be as good as a Baize model trained on a Chinese foundation model.
Not completely sure, but this may be helpful: https://stackoverflow.com/questions/68652157/how-do-i-debug-overflowerror-value-too-large-to-convert-to-int32-t
Thanks for the pointer. This data may have copyright issues, so we are being very cautious about it and can't provide checkpoints trained on it at this moment. But feel...
Hi @Keldos-Li, thanks for reaching out. Big fan of your project. Indeed, I tried to fix the CSS for Gradio on the HF Space, but it's not completely working. Thanks for...