abhishek thakur
yes, hopefully!
seems like a transformers issue. could you merge the model and then try the same code? here is the code for merging: ``` def merge_adapter(base_model_path, target_model_path, adapter_path): logger.info("Loading adapter...") model =...
can you remove the old adapter files when you reload?
```
model = AutoModelForCausalLM.from_pretrained(
    base_model_path,
    torch_dtype=torch.float16,
    low_cpu_mem_usage=True,
    trust_remote_code=True,
    device_map="auto",
)
```
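fwiw, here's a minimal end-to-end sketch of the merge, assuming the adapter is a peft LoRA adapter — the paths and the `merge_adapter` name are placeholders, not the exact AutoTrain code:

```python
def merge_adapter(base_model_path, target_model_path, adapter_path):
    """Sketch: merge a LoRA adapter into its base model and save the result."""
    # imported lazily so the function can be defined without peft/transformers installed
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    # load the base model in fp16
    model = AutoModelForCausalLM.from_pretrained(
        base_model_path,
        torch_dtype=torch.float16,
        low_cpu_mem_usage=True,
        trust_remote_code=True,
        device_map="auto",
    )
    # attach the adapter, then fold its weights into the base model
    model = PeftModel.from_pretrained(model, adapter_path)
    model = model.merge_and_unload()

    # save the merged model plus the tokenizer so the target dir is self-contained
    model.save_pretrained(target_model_path)
    tokenizer = AutoTokenizer.from_pretrained(
        base_model_path, trust_remote_code=True
    )
    tokenizer.save_pretrained(target_model_path)
```

after this, loading `target_model_path` with plain `AutoModelForCausalLM.from_pretrained` should work without any peft adapter files around.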
glad to hear it worked. is there still an issue?
it's a bit difficult to answer that question without looking at the training data and the training parameters used. does it happen with all datasets?
hi, if you have more upcoming fixes like this, please add them in this pr
let's have a talk about this before we start so that we are all on the same page. :)
please see this example, convert your code to the new Model class, and get rid of the warning: https://github.com/abhishekkrthakur/tez/blob/main/examples/image/digit_recognizer.py :)
fwiw, i ended up reverting my username :D