llm-foundry
Adding temperature tuning in attention
Adds temperature tuning to attention, similar to the Llama 4 implementation in Hugging Face Transformers: https://github.com/huggingface/transformers/blob/9a4ce6477019358abc3ebd72d435da56f4c0ab7c/src/transformers/models/llama4/modeling_llama4.py#L332-L337
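
For reviewers unfamiliar with the technique, here is a rough sketch of how position-dependent attention temperature scaling can work, based on my reading of the linked Llama 4 code. It is not the code in this PR. The function names `attn_temperature_scales` and `scale_queries` are hypothetical, and the defaults `floor_scale=8192.0` and `attn_scale=0.1` are assumptions that should be checked against the linked file and its config. The sketch uses plain Python lists instead of tensors to stay dependency-free.

```python
import math


def attn_temperature_scales(positions, floor_scale=8192.0, attn_scale=0.1):
    """Return one multiplicative query scale per token position.

    scale(pos) = log(floor((pos + 1) / floor_scale) + 1) * attn_scale + 1

    The scale stays at 1.0 for the first `floor_scale - 1` positions and then
    grows logarithmically in steps. Default values here are assumptions.
    """
    return [
        math.log(math.floor((pos + 1.0) / floor_scale) + 1.0) * attn_scale + 1.0
        for pos in positions
    ]


def scale_queries(queries, scales):
    """Multiply each per-position query vector by its scale.

    `queries` is a list of vectors (lists of floats), one per position.
    In a real model this would be a broadcasted tensor multiply applied
    to the query states before the attention scores are computed.
    """
    return [[q * s for q in vec] for vec, s in zip(queries, scales)]


if __name__ == "__main__":
    positions = [0, 8190, 8191, 16383]
    for pos, scale in zip(positions, attn_temperature_scales(positions)):
        print(pos, scale)
```

Scaling the queries is equivalent to scaling the attention logits, so longer contexts get sharper softmax distributions. As I understand the referenced code, the scaling is applied only in layers that do not use RoPE; whether this PR follows the same condition should be confirmed from the diff.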