fxmarty
fxmarty
Support has been added officially for Instinct MI210 and MI250: https://github.com/huggingface/text-generation-inference/pull/1243 Closing in favor of https://github.com/huggingface/text-generation-inference/issues/223
cc @echarlaix
Hi @L1-M1ng, how is `Qwen_models/Qwen-7B-onnx/qwen_model.onnx` exported? Could you share the model on e.g. the HF Hub? I am not surprised that this fails, as qwen is not natively supported in...
@xenova Can you also rebase/merge main?
@xenova Could you either push an empty commit or merge main (there was an issue with pytest that is now pinned)? Can you also confirm that the exported model behaves...
Hi @KaifAhmad1, thank you for the report. Olmo appears to be a model not natively supported in Transformers, rather using custom modeling code: https://huggingface.co/allenai/OLMo-7B/tree/main. As for text generation, BetterTransformer is...
@smile2game Thank you. Qwen is not natively supported in Transformers (but Qwen2 is https://github.com/huggingface/transformers/pull/28436). I tried running the export for Qwen-7B and we get: ``` Traceback (most recent call last):...
@AnonymUnsichtbar @Gabriel-Kissin To be fair the TFLite support is currently quite minimal, only a few simple architectures are supported ('albert', 'bert', 'camembert', 'convbert', 'deberta', 'deberta_v2', 'distilbert', 'electra', 'flaubert', 'mobilebert', 'mpnet',...
Hi, export of Transformers Whisper to ONNX for audio-classification is merged in https://github.com/huggingface/optimum/pull/1727, for example: ```python optimum-cli export onnx --model shhossain/whisper-tiny-bn-emo whisper_onnx ``` For TFLite, there are no short term...