Arup De
Results
2
issues of
Arup De
Co-authored with Qingquan Song (@qingquansong) and Ziang Li (@zianglih ) **Multi-item scoring** 1. concatenate multiple candidates of a same member with all ranking candidates with delimiter separation. + + +...
### What does this PR do? #### Problem CriticWorker was extracting \`attn_implementation\` from \`override_config\` but not passing it to \`load_valuehead_model()\`, causing models to always default to \`flash_attention_2\` regardless of configuration....