Baibaifan

Results 1 comments of Baibaifan

> when you use --use-mcore-models,, you cannot use local. --use-flash-attn decides whether to use the OSS flash attention implmentation or cudnn implmementation. hi @ethanhe42 ,I understand the process you mentioned,...