Results 2 comments of Owen Zhang

The issue seems to be this line: https://github.com/huggingface/optimum/blob/main/optimum/utils/input_generators.py#L657. It uses `self.num_attention_heads` instead of `num_key_value_heads`.
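To show why the choice of attribute matters, here is a minimal sketch of how the cached key/value shape changes when a model has fewer key/value heads than attention heads (grouped-query attention). This is not the Optimum code: `AttentionConfig`, `past_kv_shape`, and the numeric values are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class AttentionConfig:
    # Hypothetical values for illustration only; not taken from any specific model.
    hidden_size: int = 3072
    num_attention_heads: int = 24
    num_key_value_heads: int = 8


def past_kv_shape(config, batch_size, past_sequence_length, use_kv_heads):
    """Return the shape of one cached key (or value) tensor:
    (batch, heads, past_sequence_length, head_dim)."""
    # head_dim is derived from the number of query heads in this sketch.
    head_dim = config.hidden_size // config.num_attention_heads
    if use_kv_heads:
        num_heads = config.num_key_value_heads
    else:
        num_heads = config.num_attention_heads
    return (batch_size, num_heads, past_sequence_length, head_dim)


config = AttentionConfig()
# Shape produced when the generator reads num_attention_heads.
wrong_shape = past_kv_shape(config, batch_size=2, past_sequence_length=16, use_kv_heads=False)
# Shape produced when the generator reads num_key_value_heads.
right_shape = past_kv_shape(config, batch_size=2, past_sequence_length=16, use_kv_heads=True)

print("with num_attention_heads:", wrong_shape)
print("with num_key_value_heads:", right_shape)
```

With these assumed values, the two shapes differ only in the heads dimension (24 versus 8), which is the kind of mismatch a dummy past key/value input would carry into the export if it were built from the wrong attribute.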

Hi @jl749, thanks for the response. That seems to work for exporting. However, when I try to use onnxruntime-genai for model inference, I run into this error: ``` genai\examples\csharp\HelloPhi\bin\x64\Debug_DirectML\net6.0\runtimes\win-x64\native\2025-03-16...