Results: 1 comment by Baibaifan
> When you use --use-mcore-models, you cannot use local. --use-flash-attn decides whether to use the OSS flash attention implementation or the cuDNN implementation.

Hi @ethanhe42, I understand the process you mentioned,...