RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable), so it combines the best of the RNN and the transformer: great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

109 RWKV-LM issues, sorted by recently updated

Hey BlinkDL, thank you very much for such a great model... I've been wondering whether the RNN part of the network could be much more...

Hi, is it possible to use something like GPTQ to get a 4-bit quantized version of the latest 7B Instruct model? That's got to be one of the fastest and...

# What to achieve
- Define a requirements.txt file to clearly specify the necessary packages.
- Enable setting up the training environment using Docker.
- Manage training environments based...

In the [first formula](https://github.com/BlinkDL/RWKV-LM/blob/main/RWKV-formula.png) in the README, RWKV is rewritten into recurrent form by letting $W_n=(n-1)w$. Is there a particular reason for using $n-1$ instead of $n$? The latter is more...

I have 5 old (< 3 years old) gaming rigs that are fairly serviceable. Can this be run with the load distributed across multiple machines? I can set up an internal network...

Is it possible to train RWKV on TPU? If not, what would be needed?

I was looking into the new Mojo AI programming language and the significant performance gains it might offer RWKV-LM. Has anyone considered investigating this for RWKV-LM?...

Target: I am new to Hugging Face and just want to convert my fine-tuned model (.pth) into the Hugging Face format, so it can be directly downloaded and used. Done: I've seen issue #113,...