superduper [TEST-USE] Fine Tuning

[TEST-USE] Fine Tuning

Open guerra2fernando opened this issue 10 months ago • 2 comments

Apr 12 '24 09:04 guerra2fernando

Environment (with MongoDB + 1 type of data + 1 model):

[x] local cluster
[x] development mode

DBs (development mode + 1 non-trivial type of data + 1 model):

[x] MongoDB
[x] SQLite
[x] Postgres

Data Format (MongoDB + 1 model + development mode)

[x] Chat
[x] Prompt-Response
[x] Prompt

Training options (MongoDB + 1 type of data + development mode)

[x] ray
[x] deepspeed
[x] lora
[x] qlora
[x] Multi-GPUS

Load Options

[x] Directly
[x] Use checkpoint

Apr 23 '24 15:04 jieguangzhou

At the functional level, the test has been completed (running a small max_steps test). Because it involves model training, work at the detection effect level is required. I think I need to continue to run the real finetune task on data in different formats to test the effect after the model training is completed, Each training takes a long time, but other work can be done simultaneously

Data Format (MongoDB + 1 model + development mode)

[x] Chat
[x] Prompt-Response
[x] Prompt

Apr 23 '24 15:04 jieguangzhou

superduper superduper copied to clipboard

[TEST-USE] Fine Tuning

superduper
superduper copied to clipboard