Mooler0410
Got it. It seems to be a prompting issue. I will try modifying the prompts to see if it makes any difference. Thank you for your clarification!
Same question about longchat-v1.5. I cannot find any details about longchat-v1.5.
Thanks for the clarification!
Hi! If the model mentioned is CohereForAI/c4ai-command-r-v01, we believe it's possible. It uses typical RoPE. We quickly checked its implementation in Hugging Face's Transformers library. It looks pretty similar to...
Thank you for sharing your experience with the Ghost 8B Beta model! It’s wonderful to hear it's performing well. Your feedback helps drive meaningful research impact! The team
Hi, if possible, could you please share the scripts? We are happy to assist you in figuring out what is happening.