starcoder2 issues

Save Model and Adapters locally.

model.save_pretrained saves only the model but not the trained adapters.

Megatron model weights for StarCoder2-15B

A year ago, the raw Megatron weights for StarCoder [were released](https://github.com/bigcode-project/starcoder/issues/25). Would it be possible to release the Megatron weights for StarCoder2, especially the 15B variant? Also, publishing a script...

christiancosgrove

support SPM mode for FIM prompts

from fim paper (https://arxiv.org/pdf/2207.14255.pdf) section 3.1: SPM mode can be used to reuse kv cache across completion requests. SPM modes can enable further latency optimization (which is very important in...

erfanium

Better inference based on starcode2-3b model

1

I am new to starcode. when I run the follow demo: ``` import torch from transformers import AutoTokenizer, AutoModelForCausalLM checkpoint = "./starcoder2-3b" tokenizer = AutoTokenizer.from_pretrained(checkpoint) model = AutoModelForCausalLM.from_pretrained(checkpoint, device_map="auto", torch_dtype=torch.bfloat16)...

HeroSong666

[Q] Does this support fill-in-the-middle completion?

Does this support fill-in-the-middle completion?

NightMachinery

CrossCodeEval Results for StarCoder 2

Hi, currently I'm researching the impact of different retrieval-augmented generation (RAG) techniques on the LLM effect. We are attempting to replicate the CrossCodeEval from the "StarCoder 2 and The Stack...

Azure-Tang

Inquiry about Fine-Tuning Using Custom Code

1

Hi there, I hope this message finds you well. I am currently exploring the process of fine-tuning models using my own codebase, and I was hoping to seek some guidance...

tclxmeng-jia

Question about the future work

Is there any progress on the future work on the research project? Want to know whether has chance to participate in the future work. Or do we have any community...

yufansong

Is it possible to use Starcoder2 in llmstudio?

I loaded startcoder2 in llmstudio and asked it to write a python script that solves the Fibonacci series, but it outputs a bunch of weird results. ![Image](https://github.com/user-attachments/assets/c1f3bbdc-05d5-49da-9ae9-7d02c6ef825e) ![Image](https://github.com/user-attachments/assets/26926b29-bb23-400e-bf18-a3d02f4f4213)

SilentZhang

wtf?

2

``` C:\Users\reinz>ollama run starcoder2:7b >>> thistuple = ("apple", "banana", "cherry") ... print(thistuple) ... >>> ? why dont you generate text ! ? because I can't. ? what do you mean?...

reinzsal

starcoder2
starcoder2 copied to clipboard

Metadata

Save Model and Adapters locally.

Megatron model weights for StarCoder2-15B

support SPM mode for FIM prompts

Better inference based on starcode2-3b model

[Q] Does this support fill-in-the-middle completion?

CrossCodeEval Results for StarCoder 2

Inquiry about Fine-Tuning Using Custom Code

Question about the future work

Is it possible to use Starcoder2 in llmstudio?

wtf?

← Metadata

Owner

Metadata

starcoder2 starcoder2 copied to clipboard

Metadata

← Metadata

Owner

Metadata

starcoder2
starcoder2 copied to clipboard