starcoder issues

8bit model output <endoftext>

2

Hi, I'm using the 8bit version, and tried the demo case. However, I got an output . This is my code: from transformers import AutoModelForCausalLM, AutoTokenizer checkpoint = "bigcode/starcoder" device...

para-lost

RuntimeError: RuntimeError: IndexError: list index out of range - multiple GPUs

3

Trying to fine tune bigcode/starcoderbase model on compute A100 with 2 GPUs , 40 GBx2 so 80GB. Finetune.py is slightly modified and loaded the model with 4bit, adopt Qlora and...

Kushalamummigatti

code refactoring

2

I want to fine tune star coder for code refactoring tasks and I was thinking if it is possible and in this context, how can I get a dataset and...

Muntahabintealam

KeyError: 'response '

4

Hi,I am trying to run the fine-tuning code on my computer, but I got KeyError: 'response'，the environment is installed according to the README. Traceback (most recent call last): File "/home/starcoder/finetune/finetune.py",...

camhfwang

TypeError: expected str, bytes or os.PathLike object, not NoneType

6

I tried to fine-tune using the commands provided in the README and encountered the aforementioned error. For specific details, please refer to my [wandb log](https://wandb.ai/hansbug/huggingface/runs/9wdymnye/overview?workspace=).

HansBug

jsuper

Training Loss vs Evaluation Loss during Fine Tuning Star Coder

10

Hi @ArmelRandy and @loubnabnl I am fine-tuning star coder on my custom dataset and was monitoring the training and validation loss. The training loss seems to decrease however in case...

ruchaa0112

starcoder
starcoder copied to clipboard

Metadata

8bit model output <endoftext>

RuntimeError: RuntimeError: IndexError: list index out of range - multiple GPUs

code refactoring

KeyError: 'response '

TypeError: expected str, bytes or os.PathLike object, not NoneType

Is there a script for evaluating against eleutherAI’s language model evaluation harness?

Failure Modes?

I wonder why starchat model size is half of starcoder? It is that save in FP16?

How to train a instruction code generated model based on starcoder and ta-prompt?

Training Loss vs Evaluation Loss during Fine Tuning Star Coder

← Metadata

Owner

Metadata

starcoder starcoder copied to clipboard

Metadata

← Metadata

Owner

Metadata

starcoder
starcoder copied to clipboard