TencentPretrain
TencentPretrain copied to clipboard
Any plan of supporting larger LLAMA models ?
It seems like that the current script only support the smallest (7B) LLAMA model. Really expect to see extending to larger models.
coming soon
any updates on the timeline for this?