starcoder2
starcoder2 copied to clipboard
Home of StarCoder2!
Added arxiv link
I get the following error after finetuning this model on the R dataset following the example in the README. ```console Some weights of the model checkpoint at finetune_starcoder2/final_checkpoint were not...
I noticed that even though _bigcode/starcoder(2)_ is much opener than code llama and deepseekcoder, eg. open-sourced datasets, clearly described data processing and training, and so on, it is still not...
Official Support for GGUF Quantization in BigCode Starcoder2 to Enhance Accessibility and Efficiency
Dear BigCode team, what a wonderful project! I am writing this feature request for official implementation of GGUF quantization for Starcoder2 to enhance its adoption with coding platforms and APIs...
starcoder's format for inference in code completion is PSM, + prefix + + suffix + what's that for starcoder2? from the paper, we could only see that 
what is the sft template? when i try to use this model, i dont know what is the sft template. please help me, thankyou
Hello @loubnabnl is it possible to get starcoder2 to learn TCL? It was not part of the 30 languages so was curious if it's worth pursuing with SFT? Also, is...
#Facing the following error while trying to finetune Starcoder2 with the given script. ### Description: For `transformers.AutoModelForCausalLM` to recognize Starcoder2 `transformers>4.39.0` is required. But `trl` is still using `transformers==4.38.2`. Even...
There is no lm_head.weight in the parameters of Starcoder2-3B and Starcoder2-7B. Is it because of tied embedding?
The readme of this repo reads the following: > StarCoder2 is a family of code generation models (3B, 7B, and 15B), trained on 600+ programming languages from The Stack v2...