Nicholas Davidson
Results
2
comments of
Nicholas Davidson
@ycros Mistral-v0.2 uses a Rope Theta value of 1e6 and removed sliding window attention should be easy to fix within the model config parameters. @vgel I'm interested in getting this...
@Gunnar-Stunnar I am running into a similar error with the conversion script when trying to convert a lora from the StableLM arch derived model. I'll update if I can find...