starcoder
Home of StarCoder: fine-tuning & inference!
I am trying to recreate this, but when I use the masking id -100 it raises a "device-side assert triggered" error.
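For context, -100 is the default `ignore_index` of `torch.nn.CrossEntropyLoss`, which `transformers` uses for the causal-LM loss, so it must only ever appear in `labels`, never in `input_ids`; an out-of-range id in the embedding or loss kernels is what usually fires the device-side assert. A minimal sketch of the intended masking, with an illustrative checkpoint and a hypothetical `prompt_len`:

```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

checkpoint = "bigcode/starcoderbase-1b"  # example checkpoint
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint)

text = "SELECT * FROM users WHERE id = 1"
enc = tokenizer(text, return_tensors="pt")

# Labels start as a copy of the input ids; only the labels get -100.
labels = enc["input_ids"].clone()

# Mask the first `prompt_len` tokens so they are ignored by the loss.
# Putting -100 into input_ids instead would index outside the vocabulary
# and trigger the device-side assert.
prompt_len = 4  # hypothetical prompt length
labels[:, :prompt_len] = -100

out = model(**enc, labels=labels)
print(out.loss)
```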
Hi, while running on a Colab A100 instance I noticed that the VRAM consumed by finetune.py was only about 5 GB for starcoderbase-1b, so I attempted it on my local...
When running StarCoder full-parameter tuning on multiple GPUs:
```
File "starcoder-git/finetune.py", line 44, in on_save
    kwargs["model"].save_pretrained(checkpoint_folder)
  File "/miniconda3/envs/sqlcode/lib/python3.9/site-packages/transformers/modeling_utils.py", line 2480, in save_pretrained
    os.remove(full_filename)
FileNotFoundError: [Errno 2] No such file...
```
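One common cause is that every rank calls `save_pretrained` on the same folder and they race on the `os.remove` cleanup inside it. A hedged sketch of guarding the save so only the main process writes; the callback name is illustrative, not the exact one in finetune.py:

```python
import os
from transformers import TrainerCallback

class SaveModelCallback(TrainerCallback):
    def on_save(self, args, state, control, **kwargs):
        # state.is_world_process_zero is True only on rank 0,
        # so the checkpoint folder is written by a single process.
        if state.is_world_process_zero:
            checkpoint_folder = os.path.join(
                args.output_dir, f"checkpoint-{state.global_step}"
            )
            kwargs["model"].save_pretrained(checkpoint_folder)
        return control
```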
I am new to StarCoder. When I run the following demo:
```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

checkpoint = "./starcoder2-3b"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint, device_map="auto", torch_dtype=torch.bfloat16)
...
```
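The demo above is cut off, so for reference here is a minimal sketch of how such a script is usually completed, assuming the local `./starcoder2-3b` checkpoint loads correctly; the prompt text is just an example:

```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

checkpoint = "./starcoder2-3b"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(
    checkpoint, device_map="auto", torch_dtype=torch.bfloat16
)

# Tokenize a code prompt, move it to the model's device, and generate.
inputs = tokenizer("def fibonacci(n):", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```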
The [paper](https://arxiv.org/html/2305.06161v2) (Section E.3) shows that we can put prompt prefixes with the `` token. My question is: how do we handle this when the prompt we are...
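The specific prefix token did not survive rendering above, so as a hedged illustration here is how metadata-style prefixes can be prepended using special tokens that do exist in the StarCoder tokenizer (`<reponame>`, `<filename>`); whether these are the exact tokens the question refers to is an assumption, and the repo/file names are placeholders:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bigcode/starcoder")

# Prepend repository and file metadata before the actual code prompt.
prompt = (
    "<reponame>my-org/my-repo"      # hypothetical repository name
    "<filename>utils/helpers.py\n"  # hypothetical file path
    "def mean(values):"
)
input_ids = tokenizer(prompt, return_tensors="pt").input_ids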
Hello, I wish to reproduce the StarChat training for educational purposes, but I see the dataset (HuggingFaceH4/oasst1_en) has been removed. Is there any way to download it? If not, any...
FutureWarning: The `use_auth_token` argument is deprecated and will be removed in v5 of Transformers. Please use `token` instead.
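The swap is one-for-one; `token` takes the same Hugging Face access token string that `use_auth_token` used to (the checkpoint name and placeholder token below are examples):

```python
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "bigcode/starcoder",
    token="hf_...",  # formerly use_auth_token="hf_..."
)
```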
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 288.00 MiB. GPU 0 has a total capacty of 21.99 GiB of which 59.00 MiB is free. Process 42083 has 21.92 GiB...
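A minimal sketch of two common mitigations for this kind of OOM on a ~22 GiB GPU, loading in bfloat16 or quantizing to 8-bit; the checkpoint name is an example and the 8-bit path assumes bitsandbytes is installed:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

checkpoint = "bigcode/starcoderbase"  # example checkpoint

# Option 1: half precision, spread across available devices.
model = AutoModelForCausalLM.from_pretrained(
    checkpoint, torch_dtype=torch.bfloat16, device_map="auto"
)

# Option 2: 8-bit quantization to roughly halve the weight memory again.
model_8bit = AutoModelForCausalLM.from_pretrained(
    checkpoint,
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    device_map="auto",
)
```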
As titled, I have thousands of SQL files, and I wish to fine-tune the base model on them for the FIM (fill-in-the-middle) task.
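A hedged sketch of turning one SQL file into a FIM training sample using the StarCoder FIM special tokens (`<fim_prefix>`, `<fim_suffix>`, `<fim_middle>`) in prefix-suffix-middle order; the random split and the SQL text are illustrative only:

```python
import random

def make_fim_sample(sql_text: str) -> str:
    """Split the file into prefix/middle/suffix and arrange it in PSM order."""
    a, b = sorted(random.sample(range(len(sql_text)), 2))
    prefix, middle, suffix = sql_text[:a], sql_text[a:b], sql_text[b:]
    return (
        "<fim_prefix>" + prefix
        + "<fim_suffix>" + suffix
        + "<fim_middle>" + middle
    )

print(make_fim_sample("SELECT name, age FROM users WHERE age > 21 ORDER BY age;"))
```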
I'm new to this area of language models. In my use case I want to fine-tune the SQLCoder model with the Spider dataset using this code base, as this repo...