starcoder2
starcoder2 copied to clipboard
Home of StarCoder2!
Starting from _transformers_>=4.39.0, the _top_k_top_p_filtering_ method is DEPRECATED. transformers
When initiating the model, come to this problem: Traceback (most recent call last): File "/home/user/documents/checkpoint/download/down_bigcode2.py", line 12, in model = AutoModelForCausalLM.from_pretrained(checkpoint, device_map="auto", torch_dtype=torch.bfloat16) File "/home/user/miniconda3/envs/lib/python3.7/site-packages/transformers/models/auto/auto_factory.py", line 461, in from_pretrained **kwargs,...
hi there! How is the middle part of FIM data determined? Random selection? Is it single or block or mixed in some proportion?