nanjunye
Results
2
issues of
nanjunye
I ran this code below and: wkv_cuda = load(name="wkv", sources=["cuda/wkv_op.cpp", "cuda/wkv_cuda.cu"], verbose=True, extra_cuda_cflags=['-res-usage', '--maxrregcount 60', '--use_fast_math', '-O3', '-Xptxas -O3', f'-DTmax={T_MAX}']) got this: Traceback (most recent call last): File "d:\GitHub\S_GPT\src\model.py", line...
你好。论文里提到了将多个WFAs合并成一个,具体是怎么操作的呢,比如对于V_embedding词嵌入向量,shape为(V,r),那是把r分配到每个子WFA吗。此外,对于D1,D2,我的猜想是将根据子WFA的状态states数对应的赋值到r*K的矩阵里面,我的猜想对吗?