inference icon indicating copy to clipboard operation
inference copied to clipboard

FEAT: Support RWKV Pile

Open Bojun-Feng opened this issue 1 year ago • 2 comments

Resolve #533

Add support for rwkv-4-pile models. Successfully tested the 169m variant on a local Mac setup.

I would like some guidance on setting the model size. Historically, we've rounded up the sizes for models with parameters fewer than 1 billion. However, with rwkv-4-pile, we encounter two smaller models: 169m and 430m.

I am leaning towards simply excluding the 169m variant and setting the smallest size threshold at 430m (for 1b). However, alternative suggestions are welcomed.

Bojun-Feng avatar Oct 13 '23 08:10 Bojun-Feng

It seems that all the tests are actually passing, but somehow there is a segmentation fault from Stable Diffusion test. Not sure if this is just a probabilistic error or do I need to fix something.

Bojun-Feng avatar Oct 18 '23 13:10 Bojun-Feng

Hi, could you please also update the docs? Here's an example: https://github.com/xorbitsai/inference/pull/587

UranusSeven avatar Oct 30 '23 07:10 UranusSeven