Ella Charlaix issues

Results 20 issues of


                                            Ella Charlaix

Enable past_key_values for ORTModelForCausalLM

In this PR we allow `ORTModelForCausalLM` class to take advantage of the pre-computed key and value `past_key_values` in order to speed up decoding, by setting `use_cache` to `True`. ## Before...

add onnx export for VITS architecture

Need https://github.com/huggingface/transformers/pull/28141 to be merged and part of the release before we can merge

Add ONNX model support

Enable loading of ONNX models + ONNX Runtime inference using Optimum Some updates might follow from https://github.com/huggingface/moon-landing/pull/7320 (WIP)

Add support for neural compressor models

docs now deleted automatically after 30 days https://github.com/huggingface/doc-builder/blob/main/.github/workflows/delete_old_pr_documentations.yml As done in optimum : https://github.com/huggingface/optimum/pull/1565 cc @regisss

Add link leaderboard

Add custom model export test

Needs https://github.com/huggingface/optimum/pull/1832 to be merged

Ella Charlaix

Enable past_key_values for ORTModelForCausalLM

ORT optimizer refactorization

add onnx export for VITS architecture

Add ONNX model support

Add support for neural compressor models

Remove workflow deleting doc

Add link leaderboard

Add custom model export test

Add test for INC examples

Export the decoder only once for seq2seq models