fxmarty comments

Results: 316 comments of fxmarty

Implement ORTModelForZeroShotObjectDetection

@solomonmanuelraj Could you explain what you would like to be supported? ``` optimum-cli export onnx -m google/owlvit-base-patch32 owlvit_onnx ``` & e.g. ``` optimum-cli onnxruntime quantize --onnx_model owlvit_onnx --output owlvit_onnx_quantized --avx512...

Implement ORTModelForZeroShotObjectDetection

Hi @solomonmanuelraj, First, while investigating this issue I found a problem in the ONNX export of owlvit, due to the usage of numpy in the modeling code and...

PyPI wheel

Yes - not very important, but it can be useful to host wheels on the PyPI index.

SDPA gives nans/infs during sampling on ROCM w/ float16

@cjekel there is a bug in the current SDPA + FA2 backend using aotriton (https://github.com/ROCm/aotriton) that is being investigated and fixed. For https://github.com/ROCm/flash-attention, this is supported using the argument `attn_implementation="flash_attention_2"` when...

[RoBERTa-based] Add support for sdpa

@michaelshekasta Approval from @ArthurZucker or @amyeroberts.

[RoBERTa-based] Add support for sdpa

Thanks @kiszk, missed it when reordering the lists.

[RoBERTa-based] Add support for sdpa

gentle ping @ArthurZucker @amyeroberts

[RoBERTa-based] Add support for sdpa

@ArthurZucker @amyeroberts

Allow parsing of E4M3FN models using scales manipulation

@umangyadav I am curious whether there is a conversion from E4M3FN + scale to E4M3FNUZ + scale implemented anywhere?
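
For illustration only, here is a minimal pure-Python sketch of what such a conversion could look like. It is not taken from any particular library. It relies on the two formats' published bit layouts: E4M3FN uses exponent bias 7 with NaN at `0x7F`/`0xFF`, while E4M3FNUZ uses bias 8 with its single NaN at `0x80` and no negative zero. Under those layouts the same bit pattern decodes to half the value in E4M3FNUZ, so the bytes can be kept and the scale doubled, with only the special patterns remapped. The helper names `decode_e4m3fn`, `decode_e4m3fnuz` and `fn_to_fnuz` are hypothetical.

```python
import math


def decode_e4m3fn(b: int) -> float:
    """Decode one E4M3FN byte (bias 7, NaN at 0x7F/0xFF, no infinities)."""
    sign = -1.0 if b & 0x80 else 1.0
    exp, man = (b >> 3) & 0xF, b & 0x7
    if exp == 0xF and man == 0x7:
        return math.nan
    if exp == 0:  # subnormal range
        return sign * (man / 8) * 2.0 ** -6
    return sign * (1 + man / 8) * 2.0 ** (exp - 7)


def decode_e4m3fnuz(b: int) -> float:
    """Decode one E4M3FNUZ byte (bias 8, single NaN at 0x80, no negative zero)."""
    if b == 0x80:
        return math.nan
    sign = -1.0 if b & 0x80 else 1.0
    exp, man = (b >> 3) & 0xF, b & 0x7
    if exp == 0:  # subnormal range
        return sign * (man / 8) * 2.0 ** -7
    return sign * (1 + man / 8) * 2.0 ** (exp - 8)


def fn_to_fnuz(b: int, scale: float) -> tuple[int, float]:
    """Map an (E4M3FN byte, scale) pair to an (E4M3FNUZ byte, scale) pair.

    The scale is doubled because identical bits decode to half the value
    in E4M3FNUZ. Special patterns are remapped:
      - 0x7F / 0xFF (NaN in E4M3FN)  -> 0x80 (NaN in E4M3FNUZ)
      - 0x80 (-0.0 in E4M3FN)        -> 0x00 (would otherwise become NaN)
    """
    if b in (0x7F, 0xFF):
        return 0x80, scale * 2
    if b == 0x80:
        return 0x00, scale * 2
    return b, scale * 2
```

With this mapping, `decode_e4m3fnuz(new_byte) * new_scale` reproduces `decode_e4m3fn(byte) * scale` for every finite input byte. A tensor-level version would apply the same idea to the raw int8 view of the weights.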

Can't be installed using "uv" due to an issue in the setup script

It's wild that https://github.com/pypa/pip/issues/8437 is locked.