苏剑林(Jianlin Su) issues

Repositories
Issues
Comments

Results 14 issues of


                                            苏剑林(Jianlin Su)

where to max the PriorDiscriminator?

the prior term of deep infomax is a minimax game, like GAN. But I can not find any code about it in train.py. Do I have a wrong understand or...

paper link has been redirected

https://www.aclweb.org/anthology/D17-1112 this link can not download the GNR paper now.

Non-Linearized Position Embedding可以展开介绍一下吗

> “为此，智源团队创新提出NLPE（Non-Linearized Position Embedding，非线性位置编码）方法，在 RoPE 方法的基础上，通过调整相对位置编码、约束最大相对长度来提升模型外延能力。” 来自 https://mp.weixin.qq.com/s/ZQF4Y-kJaPKn5q69WoxmzQ 的介绍，对NLPE部分比较感兴趣。我看hf上的代码也好像没发现相关内容。

再搞个根据概率分割的？

这样就可以取代jieba了哈哈哈。应该就是在原来基础上加个动态规划。