PENG Bo comments

Results: 211 comments of PENG Bo

[REQUEST] When training a FP16 model, the ability to set some of the layers to FP32

I can do first and second manually. So now it's about DeepSpeed support :)

asm.js and weblas performance for example_policy_network

With GUI [https://withablink.coding.me/goPolicyNet/](https://withablink.coding.me/goPolicyNet/) : ![image](https://user-images.githubusercontent.com/33809201/33018072-db52dd4e-ce2f-11e7-84e7-c20428e2ba8b.png)

I would like to use your project to build a Nebulas Dapp

I want to submit one too, so please modify yours a bit rather than copying it completely, haha.

Can this MXNet implementation of the Go program visualize the training process with something like TensorBoard, as the TensorFlow version in your Zhihu post does?

Hello, you can use wandb.

File of format ".pyc" can be ignored

Thanks. Will do.

CUDA compilation error with Ctx Length>2000

> Hello, I am trying out RWKV with audio modality and when I set T_MAX>>1000, it throws this error:

Reduce B_GROUP_FORWARD and B_GROUP_BACKWARD.

CUDA compilation error with Ctx Length>2000

> Also, it seems FP16 doesn't work out-of-the-box. Could you suggest changes to make it work?

You can move the FFN to FP16 first :)

CUDA compilation error with Ctx Length>2000

> I did that, but now it gives `Illegal memory accessed` at `k.contiguous()` in the forward of the TimeMix. Works fine in fp32.

The CUDA code assumes a tensor element...

CUDA compilation error with Ctx Length>2000

> @BlinkDL Can you please point out where we need to make a change to the code to reduce the tensor element from 4 bytes to 2 bytes? Thanks a...
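
The 4-byte versus 2-byte element size raised in this question can be illustrated without CUDA. The following is only a sketch using Python's standard `struct` module, not the RWKV kernel. The helper name `reinterpret` and the sample values are hypothetical. It shows that a buffer holding four FP16 values is half the size that code expecting four FP32 values would index into, which is one plausible way an out-of-bounds access could arise.

```python
import struct


def reinterpret(buf: bytes, fmt: str) -> list:
    """Decode a raw little-endian buffer as a flat list of `fmt` elements.

    Hypothetical helper for illustration; `fmt` is a struct format
    character such as "f" (float32) or "e" (float16).
    """
    size = struct.calcsize("<" + fmt)
    count = len(buf) // size
    return list(struct.unpack(f"<{count}{fmt}", buf[: count * size]))


values = [1.0, 2.0, 3.0, 4.0]  # sample data, chosen arbitrarily

fp32_buf = struct.pack("<4f", *values)  # 4 elements * 4 bytes
fp16_buf = struct.pack("<4e", *values)  # 4 elements * 2 bytes

# Bytes that code assuming 4-byte elements would touch for 4 elements.
bytes_assumed = len(values) * struct.calcsize("<f")

print("fp32 buffer bytes:", len(fp32_buf))
print("fp16 buffer bytes:", len(fp16_buf))
print("would read past the fp16 buffer:", bytes_assumed > len(fp16_buf))

# Decoding with the right element size recovers the data; decoding the
# same bytes as float32 yields fewer, unrelated values.
print("as fp16:", reinterpret(fp16_buf, "e"))
print("as fp32:", reinterpret(fp16_buf, "f"))
```

In a real kernel the equivalent change would be in the element type used for pointers and indexing, which this sketch does not attempt to show.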

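About using the model for classification tasks

Hello, you can try the conventional approach, but there is another option: RWKV's hidden state is small (see .xx .aa .bb in https://github.com/BlinkDL/RWKV-v2-RNN-Pile/blob/main/src/model.py ), so you can try adding a linear layer directly on top of it to produce the output. Try using .xx and .aa / .bb as the input to the linear layer.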
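
A minimal sketch of that suggestion follows, in plain Python so that it runs without any dependencies. Several things here are assumptions rather than the repository's actual layout: the state is modeled as three dicts (`xx`, `aa`, `bb`) mapping a layer name to a list of floats, ".aa / .bb" is read as an elementwise ratio, and the names `build_features` and `linear`, the layer names, and all sizes are hypothetical. In practice the same idea would more likely be a `torch.nn.Linear` trained on tensors taken from the model.

```python
import random


def build_features(xx, aa, bb, eps=1e-9):
    """Flatten the recurrent state into one feature vector.

    Assumption: xx, aa, bb are dicts of per-layer float lists, and
    ".aa / .bb" means the elementwise ratio aa / bb.
    """
    feats = []
    for name in sorted(xx):
        feats.extend(xx[name])
    for name in sorted(aa):
        feats.extend(a / (b + eps) for a, b in zip(aa[name], bb[name]))
    return feats


def linear(features, weight, bias):
    """Plain linear layer: one logit per row of `weight`."""
    return [
        sum(w * f for w, f in zip(row, features)) + b
        for row, b in zip(weight, bias)
    ]


# Toy state: 2 layers, embedding size 4 (all values are placeholders).
rng = random.Random(0)
n_embd, n_classes = 4, 3
layers = ["layer0", "layer1"]
xx = {n: [rng.uniform(-1, 1) for _ in range(n_embd)] for n in layers}
aa = {n: [rng.uniform(-1, 1) for _ in range(n_embd)] for n in layers}
bb = {n: [rng.uniform(0.5, 1.5) for _ in range(n_embd)] for n in layers}

features = build_features(xx, aa, bb)

# Untrained random weights; a real head would be fitted on labeled data.
weight = [[rng.gauss(0, 0.1) for _ in features] for _ in range(n_classes)]
bias = [0.0] * n_classes

logits = linear(features, weight, bias)
predicted = max(range(n_classes), key=lambda i: logits[i])
print("feature size:", len(features))
print("predicted class index:", predicted)
```

Because the state is small, the head adds few parameters, so it could be trained while the language model itself stays frozen.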