Tuan Pham

8 comments by Tuan Pham

Seems to be related to bitsandbytes; turning off `load_in_4bit` or `load_in_8bit` seems to make it work correctly.
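For reference, a minimal sketch of loading without the quantized path, assuming a standard Hugging Face transformers setup (the model id here is a placeholder):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder checkpoint id; substitute the model you are actually loading.
model_id = "your/model-id"

tokenizer = AutoTokenizer.from_pretrained(model_id)

# Load without load_in_4bit / load_in_8bit so bitsandbytes is not involved.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",
    # load_in_4bit=True,  # leave these off to bypass the bitsandbytes path
    # load_in_8bit=True,
)
```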

> I found the related bug #1909 and the solution there: [#1909 (comment)](https://github.com/microsoft/DeepSpeed/issues/1909#issuecomment-1225113348)
>
> Basically:
>
> ```
> rm deepspeed/ops/{csrc,op_builder}
> rm deepspeed/accelerator
> cp -R csrc op_builder...
> ```

So does that mean that if I want to eval every epoch, I would have to merge the LoRA adapter and then run `model.generate` at every epoch?
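For context, a minimal sketch of the merge-then-generate flow described in the question, assuming a PEFT LoRA setup (the checkpoint paths and generation arguments are placeholders):

```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "your/base-model-id"          # placeholder base checkpoint
adapter_dir = "path/to/lora_adapter"    # placeholder adapter saved for this epoch

tokenizer = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(base_id)

# Attach the LoRA adapter and merge its weights into the base model.
model = PeftModel.from_pretrained(base, adapter_dir).merge_and_unload()

inputs = tokenizer("Evaluation prompt goes here", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```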

> > So does that mean that if I want to eval every epoch, I would have to merge the LoRA adapter and then run `model.generate` at every epoch?...

Friendly cc @casperdcl! I tried to hotfix by force-installing `4.66.1`, and it worked for another 4 months before the error appeared again.

```
File /opt/conda/lib/python3.10/site-packages/tqdm/notebook.py:156, in tqdm_notebook.display(self, msg, pos, close, bar_style, check_delay)...
```

Hey @danielhanchen, just to let you know the bug is gone in `transformers==4.41.2`. Might help narrow down the bug, as I saw a push relating to caching in [4.42.1](https://github.com/huggingface/transformers/releases/tag/v4.42.1).

@thusinh1969 It should be; you can lower the alpha of the instruction adapter if you find it to be too overpowering. Or just target the attention layers and leave out...
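As a rough illustration of the two suggestions above (the values here are assumptions, not recommendations), a PEFT LoRA config with a lowered alpha that targets only the attention projections might look like:

```python
from peft import LoraConfig

# Illustrative config: a smaller lora_alpha reduces the adapter's contribution,
# and target_modules restricted to the attention projections leaves out the MLP layers.
lora_config = LoraConfig(
    r=16,
    lora_alpha=8,  # lower alpha -> weaker adapter influence
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # attention only
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
```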

Hi @mfazrinizar, just FYI, I might not be able to review this until this weekend. Very much appreciate the contribution!