starcoder
Home of StarCoder: fine-tuning & inference!
I installed all the dependencies by following the instructions from the repo. Following that, I am running the following code: ``` import torch from transformers import AutoModelForCausalLM, AutoTokenizer checkpoint =...
I'm attempting to run the Starcoder model on a Mac M2 with 32GB of memory using the Transformers library in a CPU environment. Despite setting load_in_8bit=True, I'm encountering an error...
Thank you for your valuable open-source contribution! In the instruction-tuning stage, it seems that only the answer aligned with the instruction participates in backpropagation. And [this code](https://github.com/bigcode-project/starcoder/blob/main/finetune/finetune.py#L190) seems...
While performing inference with starcoder, I encountered the following errors: ``` bin [ANONYMIZED_PATH]/miniconda3/envs/starcoder/lib/python3.10/site-packages/bitsandbytes/libbitsandbytes_cpu.so [ANONYMIZED_PATH]/miniconda3/envs/starcoder/lib/python3.10/site-packages/bitsandbytes/cextension.py:34: UserWarning: The installed version of bitsandbytes was compiled without GPU support. 8-bit optimizers, 8-bit multiplication, and...
Dear authors of Starcoder, while using the framework, I noticed that mask_user_labels sometimes does not function properly. Upon investigation, I found that there might be an issue with the function...
I'm interested in running StarCoder for inference on my macOS machine, but I have some questions. **Questions:** **Library Recommendations:** I've come across OpenLLM. Should I be considering OpenLLM for this,...
```python
def mask_user_labels(tokenizer, dialogue_template, labels):
    """Masks the user turns of a dialogue from the loss"""
    user_token_id = tokenizer.convert_tokens_to_ids(dialogue_template.user_token)
    assistant_token_id = tokenizer.convert_tokens_to_ids(dialogue_template.assistant_token)
    for idx, label_id in enumerate(labels):
```
The labels parameter...
I have a dataset that has two columns called: Question Answer Questions like: "Write a python code to reverse a list". Answer: code for that question. I have looked at...
I am trying to further finetune `Starchat-Beta`, save my progress, load my progress, and continue training. But whatever I do, it doesn't come together. Whenever I load my progress and...
1. In `chat/dialogues.py:239`, should `while labels[current_idx] != assistant_token_id and current_idx < len(labels):` be `while current_idx < len(labels) and labels[current_idx] != assistant_token_id:`? 2. In `chat/train.py:204`, should `mask_user_labels(tokenizer, dialogue_template, labels)` be: for...
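
To illustrate the first question in the last issue: Python's `and` evaluates its operands left to right and stops at the first false one, so the order of the two conditions decides whether the index is checked before or after it is used. The sketch below is not the repository's code. The helper names `scan_index_first` and `scan_bounds_first` and the token ids are made up for illustration, and the only assumption is that `labels` is a plain list of token ids.

```python
def scan_index_first(labels, assistant_token_id, start=0):
    """Hypothetical helper using the condition order quoted from the issue."""
    current_idx = start
    # labels[current_idx] is read before the bounds check, so this raises
    # IndexError once current_idx reaches len(labels) without a match.
    while labels[current_idx] != assistant_token_id and current_idx < len(labels):
        current_idx += 1
    return current_idx


def scan_bounds_first(labels, assistant_token_id, start=0):
    """Hypothetical helper using the reordered condition proposed in the issue."""
    current_idx = start
    # The bounds check runs first; `and` short-circuits, so labels[current_idx]
    # is never read when current_idx is out of range.
    while current_idx < len(labels) and labels[current_idx] != assistant_token_id:
        current_idx += 1
    return current_idx
```

Both versions return the same index when the assistant token is present. They differ only when it is absent: the bounds-first version returns `len(labels)`, while the index-first version raises `IndexError`.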