정다혜 comments

Repositories
Issues
Comments

Results 1 comments of


                                            정다혜

[Feature]: FP6

@twaka hi there, I've tried it myself and also observed the model weights reduced using Llama3-8B. But for me, although the memory consumption decreased, the latency seems to increase significantly...