ds5t5
@olegklimov please help review and feel free to test. Inference is extremely fast thanks to the work in llama.cpp.
/attempt https://github.com/smallcloudai/refact/issues/77
thanks. let me know when the model weights are ready. i will rebase my llama.cpp PR onto the latest branch of llama.cpp.
@JegernOUTT can I ask why we decided to make this weight change? It doesn't seem aligned with other popular models; they (Falcon, LLaMA) usually keep mlp.linear_1 and mlp.linear_3 as separate weights....
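For context, a minimal sketch of what I mean, assuming a SwiGLU-style gated MLP (as in LLaMA); the module/field names and sizes here are illustrative, not the actual refact config, and the fused variant is just my reading of the change:

```python
# Sketch: separate vs. fused gate/up projections in a gated (SwiGLU-style) MLP.
# Sizes and names are placeholders for illustration only.
import torch
import torch.nn as nn
import torch.nn.functional as F


class SeparateMLP(nn.Module):
    """Falcon/LLaMA-style layout: gate and up projections stored as two weights."""
    def __init__(self, hidden: int, inter: int):
        super().__init__()
        self.linear_1 = nn.Linear(hidden, inter, bias=False)  # gate projection
        self.linear_3 = nn.Linear(hidden, inter, bias=False)  # up projection
        self.linear_2 = nn.Linear(inter, hidden, bias=False)  # down projection

    def forward(self, x):
        return self.linear_2(F.silu(self.linear_1(x)) * self.linear_3(x))


class FusedMLP(nn.Module):
    """Fused layout: gate and up projections stored as one weight, split at runtime."""
    def __init__(self, hidden: int, inter: int):
        super().__init__()
        self.linear_gate_up = nn.Linear(hidden, 2 * inter, bias=False)
        self.linear_2 = nn.Linear(inter, hidden, bias=False)

    def forward(self, x):
        gate, up = self.linear_gate_up(x).chunk(2, dim=-1)
        return self.linear_2(F.silu(gate) * up)


# The two layouts are numerically equivalent when the fused weight is the
# concatenation of the two separate weights, so conversion is mechanical --
# but checkpoints and downstream loaders (e.g. llama.cpp) have to agree on it.
sep, fused = SeparateMLP(8, 16), FusedMLP(8, 16)
fused.linear_gate_up.weight.data = torch.cat(
    [sep.linear_1.weight.data, sep.linear_3.weight.data], dim=0)
fused.linear_2.weight.data = sep.linear_2.weight.data
x = torch.randn(2, 8)
assert torch.allclose(sep(x), fused(x), atol=1e-6)
```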