priyanka-ganesha
Results
3 issues of priyanka-ganesha
Inference decode configurations intended for CPU, for model sizes of 1B, 4B, 8B, and 16B parameters.
* Cloud monitoring prototype
* Checkpoint initialization metrics emitting