Daya Guo

Results 76 comments of Daya Guo

We design different regex patterns according to the grammar of each language. For example, for C we use `#include` to find dependencies: ``` def extract_includes(file_content): includes = set()...
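As a rough illustration of the regex idea for C, here is a minimal sketch. It is not the code from the comment above, which is truncated; the function name `find_c_includes` and the pattern itself are assumptions made for this example.

```python
import re

# Hypothetical pattern (an assumption, not the original one): matches
# `#include <x.h>` and `#include "x.h"` at the start of a line.
INCLUDE_RE = re.compile(r'^[ \t]*#[ \t]*include[ \t]*[<"]([^>"\n]+)[>"]', re.MULTILINE)


def find_c_includes(file_content):
    """Return the set of header names referenced by #include directives."""
    return set(INCLUDE_RE.findall(file_content))


if __name__ == "__main__":
    sample = '#include <stdio.h>\n#include "util.h"\nint main(void) { return 0; }\n'
    print(sorted(find_c_includes(sample)))
```

Each matched header name can then be resolved against the other files in the repository to build a dependency graph; how that resolution is done is not specified in the comment.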

Yes, this model is mainly intended to compensate for the chat model's weaker natural-language ability.

> Hey folks! To double-check, does that mean that both v1.5 models, instruct and base, no longer support FIM?

Yes.

> Why can't chat capability and FIM capability be supported at the same time?

They can be, but v1.5 was not pretrained with FIM. Only v1 supports it.

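> > > Why can't chat capability and FIM capability be supported at the same time?
> > >
> > > They can be, but v1.5 was not pretrained with FIM. Only v1 supports it.
> >
> > ? The vocabulary currently does not support the FIM identifiers 、、; by default they are split into [', 'a', '', 'a', '']. Do they need to be added with add_special_token, or is this just the default LlamaTokenizerFast tokenization?
> >
> > Could you help answer this question?...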

Someone has already converted it here: https://huggingface.co/TheBloke/deepseek-coder-6.7B-instruct-GGUF

We employ a fuzzy deduplication approach, utilizing the MinHash algorithm. However, when this method is applied at the file level, it may lead to the removal of certain files in...
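To make the MinHash idea concrete, here is a minimal sketch using only the standard library. It is not the pipeline described in the comment: the word-level shingle size, the 64 hash functions, the salted BLAKE2b hashing, and all function names are assumptions chosen for illustration.

```python
import hashlib


def shingles(text, k=5):
    """Split text into overlapping k-word shingles (k is an assumed setting)."""
    tokens = text.split()
    if len(tokens) < k:
        return {" ".join(tokens)}
    return {" ".join(tokens[i:i + k]) for i in range(len(tokens) - k + 1)}


def minhash_signature(items, num_perm=64):
    """For each of num_perm salted hash functions, keep the minimum hash value."""
    signature = []
    for seed in range(num_perm):
        salt = seed.to_bytes(8, "little")  # a distinct salt acts as a distinct hash function
        signature.append(min(
            int.from_bytes(
                hashlib.blake2b(item.encode("utf-8"), digest_size=8, salt=salt).digest(),
                "big",
            )
            for item in items
        ))
    return signature


def estimated_jaccard(sig_a, sig_b):
    """The fraction of matching signature slots estimates Jaccard similarity."""
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


if __name__ == "__main__":
    a = minhash_signature(shingles("int main ( ) { return 0 ; } // first copy of the file"))
    b = minhash_signature(shingles("int main ( ) { return 0 ; } // second copy of the file"))
    print(estimated_jaccard(a, b))
```

Two files whose estimated similarity exceeds a chosen threshold would be treated as near-duplicates. The threshold, and the unit at which signatures are compared, are design choices this sketch leaves open.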