rainred

Results: 7 comments by rainred

> This should be easy to support. Could you give us a specific example or model name that we can run the test on?

Maybe you can try https://huggingface.co/openbmb/cpm-bee-2b,...

> Sounds good. We will look into this later. If you have bandwidth, contributions are welcome!

I have filed https://github.com/sgl-project/sglang/pull/959. Could you take a look?

`self.model = AutoModel.from_pretrained(model_path, trust_remote_code=True).to(dtype=torch.bfloat16)`

An int4 model should not be cast to bf16, should it?

I tried llama.cpp today, and it is still not supported:

`python3 convert_hf_to_gguf.py model_path`

`ERROR:hf-to-gguf:Model InternVLChatModel is not supported`

Any update, by any chance?

> Yes. We will update our doc to cover the supported and unsupported keywords soon.

Has the doc been updated? I'd like to evaluate whether xgrammar can meet my required json...

> Hi [@gryffindor-rr](https://github.com/gryffindor-rr), thanks for asking about the C++ API! XGrammar's Python API is a simple wrapper around the C++ API, so you can refer to our [Python documentation](https://xgrammar.mlc.ai/docs/api/python/index.html) to...

> > > Hi [@gryffindor-rr](https://github.com/gryffindor-rr), thanks for asking about the C++ API! XGrammar's Python API is a simple wrapper around the C++ API, so you can refer to our [Python...