Tianwen

Results 4 comments of Tianwen

The SkyMath method mainly consists of two parts: instruction boosting and self-compare. 1 Instruction boosting primarily draws inspiration from wizardLM and MetaMath. We integrate and improve their methods to enhance...

现在7B的开源模型已经很多了,我们在考虑出开源一个3B的版本。

我们的实现是正确的。孤立的“A”在BPE中指的是非词首的“A”,例如"helloA"里面的A。词首"A",即"_A"才是选项对应。