
[QUESTION] How to set `--rotary-seq-len-interpolation-factor` for rope scaling?

Open eagle705 opened this issue 1 year ago • 2 comments

Hi,

Is it possible to set `--rotary-seq-len-interpolation-factor` in Megatron-LM so that it matches Hugging Face's `rope_scaling` setting (`{"type": "dynamic", "factor": 2.0}`)?

Can you share any information on how the `--rotary-seq-len-interpolation-factor` option is used?

eagle705, Jan 26 '24 14:01

https://github.com/NVIDIA/Megatron-LM/blob/4bd4e7426c8def582eb80ae53f296a803643d2f5/megatron/core/models/common/embeddings/rotary_pos_embedding.py#L52
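
For readers who cannot open the link, here is a minimal sketch of what the option appears to do, assuming the linked code applies linear position interpolation (each position index is divided by the factor before the rotary angles are computed). The function names below are hypothetical and are not Megatron-LM or Hugging Face APIs. The `dynamic_ntk_base` helper reflects my understanding of Hugging Face's "dynamic" NTK scaling, which rescales the rotary base once the sequence exceeds the trained length; treat that formula as an assumption to check against the `transformers` source.

```python
def inv_freq(dim, base=10000.0):
    """Standard RoPE inverse frequencies for an even head dimension `dim`."""
    return [1.0 / (base ** (i / dim)) for i in range(0, dim, 2)]


def linear_interpolated_angles(seq_len, dim, factor=None, base=10000.0):
    """Rotary angles with linear position interpolation (hypothetical helper).

    Assumption: this mirrors what --rotary-seq-len-interpolation-factor does,
    i.e. positions are multiplied by 1 / factor before the outer product.
    """
    freqs = inv_freq(dim, base)
    scale = 1.0 if factor is None else 1.0 / factor
    return [[pos * scale * f for f in freqs] for pos in range(seq_len)]


def dynamic_ntk_base(seq_len, max_pos, dim, factor, base=10000.0):
    """Rotary base under "dynamic" NTK scaling (assumed formula, hypothetical helper).

    The base is left unchanged up to the trained length `max_pos` and grows
    with `seq_len` beyond it; positions themselves are not rescaled.
    """
    if seq_len <= max_pos:
        return base
    return base * ((factor * seq_len / max_pos) - (factor - 1)) ** (dim / (dim - 2))


plain = linear_interpolated_angles(8, dim=4)
scaled = linear_interpolated_angles(8, dim=4, factor=2.0)

# With factor 2.0, position 4 receives the angles that position 2 had unscaled.
print(scaled[4] == plain[2])

# Dynamic NTK scaling only changes the base once seq_len exceeds max_pos.
print(dynamic_ntk_base(2048, 2048, dim=128, factor=2.0))
print(dynamic_ntk_base(4096, 2048, dim=128, factor=2.0))
```

If that reading is right, the flag behaves like Hugging Face's `"linear"` scaling type rather than `"dynamic"`, so a single fixed factor would not reproduce the `{"type": "dynamic", "factor": 2.0}` setting exactly.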

ethanhe42, Feb 01 '24 19:02

Marking as stale. No activity in 60 days.

github-actions[bot], Apr 02 '24 18:04