Nghi D. Q. Bui comments

Results 29 comments of


                                            Nghi D. Q. Bui

trafficstars

Add RWKV models

thank you. The RWKV model family looks very nice. However, they are not really code learning models, do you have the checkpoints related to coding tasks?

Cannot load "starcoder-15.5B" with weight_sharding=True

yes, the starcoder model has been sharded into multiple files already, so I recommend do not use weight_sharding for starcoder.

AttributeError: 'function' object has no attribute 'func'

we will release a new stable version very soon to fix all of these bugs, thanks for updating us the issues !!

Fix some errors

hello, can you sign the salesforce-cla before making the pull request, thanks !!

Guidance on finetuning with PEFT/LoRA

you can use CodeTF to fine-tune with PEFT/LORA: https://github.com/salesforce/CodeTF

I want to fine tuning codetf to generate json rule code.

i'm not very sure, can your dataset be converted into some sequence format?

MPS support

this is great, can you help to propose any pull request that i can merge?

Participant %9: Singapore Management University

thanks!!! I will update my result later

[main.py --training] accuracy max is 72.9%

hi, not sure if you are still interested, but i have just updated the code with clearer instructions and results