LiYu Lu comments

Repositories
Issues
Comments

Results 6 comments of


                                            LiYu Lu

Exception: unknown storage type: -1 when no OHEM

@HaydenFaulkner I have the same problem. Have you solved it?

Some questions about data accuracy ？

@leimao

Fix spelling error in function GetMaxTokenLength()

@microsoft-github-policy-service agree

模型词表相关疑问

2000万文本要训练多长时间啊？自己复现感觉用bpe要跑好久QAQ

[Feature Request] GEMM benchmarks and FP8 Support

I provided a [simple GEMM implementation](https://github.com/HazyResearch/ThunderKittens/pull/28), but a more optimized GEMM implementation requires support for ldmatrix and pipeline, which I haven't implemented yet.

Load with ldmatrix

ldmatrix can refer to loading a 16x16 matrix with a single instruction, while LDS.32 requires 4 instructions, and ldmatrix also offers a transposition function.