One


Any updates on this PR? It's important for evaluating chat/instruction-finetuned models.

I manually modified the hash of AESLC in tensorflow-datasets, and it worked fine.
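For reference, this is roughly the kind of manual patch I mean. The file locations are from my environment and will differ depending on where tensorflow-datasets is installed and which version you have, so treat it as a sketch rather than an exact recipe:

```python
import hashlib
from pathlib import Path

# Placeholder paths from my setup -- adjust to your environment.
archive = Path("~/tensorflow_datasets/downloads/enron_aeslc.zip").expanduser()
checksum_file = Path(
    "/path/to/site-packages/tensorflow_datasets/url_checksums/aeslc.txt"
)

# sha256 of the archive that was actually downloaded.
new_hash = hashlib.sha256(archive.read_bytes()).hexdigest()
print("new sha256:", new_hash, "size:", archive.stat().st_size)

# Copy the stale hash out of the existing aeslc.txt entry, swap it for the
# new one, then re-run download_and_prepare.
stale_hash = "<stale sha256 from the aeslc.txt entry>"
text = checksum_file.read_text()
checksum_file.write_text(text.replace(stale_hash, new_hash))
```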

What is the seed file for the WizardLM and WizardCoder datasets?

BTW, the scripts seem to be missing the error checker and iterative evolution described in the paper. Are these parts necessary?
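For context, my (possibly wrong) reading of the paper is a loop along these lines; `evolve_fn` and `failed_fn` are stand-ins for the evolution prompt and the elimination/error check, which I don't see in the released scripts:

```python
from typing import Callable, List

def evol_instruct(
    seeds: List[str],
    evolve_fn: Callable[[str], str],        # LLM call with an in-depth/in-breadth prompt
    failed_fn: Callable[[str, str], bool],  # elimination ("error") check from the paper
    rounds: int = 4,
) -> List[str]:
    """Sketch of the iterative evolution I expected to find in the scripts."""
    pool = list(seeds)
    current = list(seeds)
    for _ in range(rounds):
        next_round = []
        for instruction in current:
            candidate = evolve_fn(instruction)
            if failed_fn(instruction, candidate):
                # Failed evolution: keep the original instruction for the next round.
                next_round.append(instruction)
            else:
                next_round.append(candidate)
                pool.append(candidate)
        current = next_round
    return pool
```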

@Green-Sky We observed that fine-tuning may still cause performance degradation. It is better to have a model natively pretrained with an 8192 context length.

Thanks! How does it compare to natively long-context base models such as StarCoder (8192)? BTW, if we want an 8192 version of OpenLLaMA, maybe we need a JAX FlashAttention...
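To be clear, I don't mean anything fancy; even query-chunked attention would already help at 8192. A minimal sketch (not a real FlashAttention kernel, and the shapes are just assumptions):

```python
import jax
import jax.numpy as jnp

def chunked_attention(q, k, v, chunk_size: int = 1024):
    """Plain softmax attention computed one query chunk at a time.

    q, k, v: [seq_len, num_heads, head_dim]. This only reduces peak memory
    from O(L^2) to O(chunk * L); a real FlashAttention kernel also fuses the
    softmax so the full logits matrix is never materialized.
    """
    seq_len, _, head_dim = q.shape
    scale = head_dim ** -0.5
    out_chunks = []
    for start in range(0, seq_len, chunk_size):
        q_chunk = q[start:start + chunk_size] * scale
        logits = jnp.einsum("qhd,khd->hqk", q_chunk, k)
        weights = jax.nn.softmax(logits, axis=-1)
        out_chunks.append(jnp.einsum("hqk,khd->qhd", weights, v))
    return jnp.concatenate(out_chunks, axis=0)
```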

Also excited to see V2 13B! It would be even better with stronger coding ability and a native 8192 context length.

I think it's related to how Transformers loads models: it checks the Hugging Face Hub for model updates every time. One temporary solution would be downloading to a local folder and...
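Something along these lines should work as the temporary workaround (the model name is just an example):

```python
from huggingface_hub import snapshot_download
from transformers import AutoModelForCausalLM, AutoTokenizer

# Download once into a local folder (example model name).
local_dir = snapshot_download("openlm-research/open_llama_7b")

# Afterwards, load from disk only -- no Hub request on every startup.
# (Setting the HF_HUB_OFFLINE=1 environment variable has a similar effect.)
tokenizer = AutoTokenizer.from_pretrained(local_dir, local_files_only=True)
model = AutoModelForCausalLM.from_pretrained(local_dir, local_files_only=True)
```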