Shan Tang
Results: 2 issues of Shan Tang
Please add examples using local open-source models, like LLaMA or ChatGLM. Thanks.
According to the ASTRA-sim 2.0 paper, it simulates based on Chakra traces, to "decouple parallelization strategies from the ASTRAsim implementation". Does that mean we have to trace a real 10,000...
question