starcoder
Home of StarCoder: fine-tuning & inference!
Looks like GPU usage almost doubles during saving (`save_pretrained`, specifically the `get_peft_model_state_dict` function). Is there a way to avoid this? Stack trace: ```Traceback (most recent call last): File "finetune_starcoder.py", line 343,...
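One workaround that is sometimes suggested (not an official fix) is to pull only the adapter tensors out of the PEFT-wrapped model and copy them to CPU before serializing, so the checkpointing step does not allocate extra memory on the GPU. A minimal sketch, assuming a PEFT-wrapped `model` and an output directory `output_dir` (both hypothetical names):

```
import os
import torch
from peft import get_peft_model_state_dict

def save_adapter_on_cpu(model, output_dir):
    """Save only the (small) LoRA adapter weights, moved to CPU first,
    so the save step does not duplicate tensors in GPU memory."""
    os.makedirs(output_dir, exist_ok=True)
    adapter_state = get_peft_model_state_dict(model)  # adapter tensors only
    cpu_state = {k: v.detach().cpu() for k, v in adapter_state.items()}
    torch.save(cpu_state, os.path.join(output_dir, "adapter_model.bin"))
```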
Hello, it is really exciting to see your work! May I know if the code for fine-tuning on other programming languages will be released in the near future? Up to...
When I was training with ```chat/train.py```, I got the following error after training: ``` Traceback (most recent call last): File "train.py", line 345, in main() File "train.py", line 313, in...
Using an A800 80GB, how long does it take to fine-tune? I am stuck...
Hi all, I've set up StarCoder as follows: ``` gen_checkpoint = "bigcode/starcoder" gen_device = "cuda" gen_tokenizer, gen_model = setup_model_tokenizer( gen_checkpoint, bit_4=False, device=gen_device, bnb_config=None ) ``` ``` def setup_model_tokenizer( path, device=None,...
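For context, the body of `setup_model_tokenizer` is the poster's own helper and is truncated above; a minimal sketch of what such a helper typically does with `transformers`, assuming the same parameter names:

```
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

def setup_model_tokenizer(path, device=None, bit_4=False, bnb_config=None):
    """Load tokenizer and model; optionally pass a bitsandbytes config for 4-bit loading."""
    tokenizer = AutoTokenizer.from_pretrained(path)
    model = AutoModelForCausalLM.from_pretrained(
        path,
        torch_dtype=torch.float16,
        quantization_config=bnb_config if bit_4 else None,
    )
    if device is not None and bnb_config is None:
        model = model.to(device)  # quantized models are placed by bitsandbytes/accelerate
    return tokenizer, model
```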
I'd just like you to know that code with permissive licensing that carries attribution requirements **is possibly unsuitable for training set inclusion.** I'm bringing this to your attention not as a...
Hi! Curious to know some more details about FIM and its effect on the pre-trained model. Here's a paragraph from the SantaCoder paper: > FIM for cheap We observe a...
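For anyone experimenting with FIM at inference time, the StarCoder tokenizer ships the FIM sentinel tokens `<fim_prefix>`, `<fim_suffix>`, and `<fim_middle>`; a minimal prefix-suffix-middle prompting sketch (generation settings are illustrative only):

```
from transformers import AutoTokenizer, AutoModelForCausalLM

checkpoint = "bigcode/starcoder"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint, device_map="auto")

# Prefix-suffix-middle (PSM) formatting: the model generates the missing middle.
prefix = "def print_hello():\n    "
suffix = "\n    return\n"
prompt = f"<fim_prefix>{prefix}<fim_suffix>{suffix}<fim_middle>"

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(inputs.input_ids, max_new_tokens=32)
print(tokenizer.decode(outputs[0][inputs.input_ids.shape[1]:]))
```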
When I tried to load StarCoder following the provided tutorial, a RuntimeError emerged because of a CUDA error, even though torch.cuda.is_available() returns True. The GPU that I run this on is listed below,...
See title. As a temporary workaround, adding "peft==0.9.0" to requirements.txt and ignoring the readme.md instructions to install from git avoids this issue. However, it would be better if huggingface/bigcode could coordinate between...
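In concrete terms, the workaround described amounts to pinning the released package in requirements.txt instead of following the git install, e.g.:

```
# requirements.txt (temporary workaround)
peft==0.9.0
```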