LLaDA
LLaDA copied to clipboard
What is the difference between mask-predict and llada?
Impressive preference. I wonder what is the difference between llada and mask-predict in NAR?