salman

41 issues

We're working on better surfacing documentation for the finetuning techniques we support. This issue tracks the different recipes we still need to add documentation for. If you're interested in helping...

documentation
good first issue
community help wanted

Currently, if we specify multiple tasks for the eval recipe, and one of the tasks is a generation task which uses KV-caching, then the cache is still enabled for non-generation...

bug
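
A minimal sketch of one possible fix, toggling the cache per task. The `setup_caches`/`reset_caches` names mirror methods on torchtune's `TransformerDecoder` but are assumptions here, and `Task`/`run_task` are hypothetical stand-ins for the eval harness:

```python
from dataclasses import dataclass

@dataclass
class Task:
    name: str
    is_generation: bool  # only generation tasks benefit from KV-caching

def run_task(model, task: Task) -> None:
    # Hypothetical placeholder for the actual eval-harness call.
    ...

def evaluate(model, tasks: list[Task]) -> None:
    for task in tasks:
        if task.is_generation:
            # Enable the cache just for this task, then tear it down so
            # later scoring tasks see ordinary full-sequence attention.
            model.setup_caches(batch_size=1, dtype=next(model.parameters()).dtype)
            run_task(model, task)
            model.reset_caches()
        else:
            run_task(model, task)
```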

Following the instructions in `/root/torchtune/recipes/configs/llama3/8B_qlora_single_device.yaml`:

```bash
$ tune download meta-llama/Meta-Llama-3-8B-Instruct --output-dir /tmp/Meta-Llama-3-8B-Instruct --hf-token
$ tune run lora_finetune_single_device --config llama3/8B_qlora_single_device checkpointer.checkpoint_dir=/tmp/Meta-Llama-3-8B-Instruct
```

```
File "/usr/local/lib/python3.11/dist-packages/torchtune/training/checkpointing/_utils.py", line 161, in get_path
    raise ValueError(f"No file with...
```
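
The `ValueError` is raised when the checkpointer cannot find an expected weight file in `checkpoint_dir`. As a quick sanity check (plain Python, no torchtune assumptions), one can list what the download actually produced and compare it against the `checkpointer.checkpoint_files` entries in the config; a mismatch such as safetensors being downloaded while the config names `.pth` files, or weights nested in a subfolder, would trigger exactly this error:

```python
from pathlib import Path

ckpt_dir = Path("/tmp/Meta-Llama-3-8B-Instruct")

# Print every file the download produced, with sizes, so it can be
# compared against the filenames the config's checkpointer expects.
for f in sorted(ckpt_dir.rglob("*")):
    if f.is_file():
        print(f.relative_to(ckpt_dir), f"{f.stat().st_size / 1e9:.2f} GB")
```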

cc @RdoubleA we'll have to re-benchmark here

#### Context
What is the purpose of this PR? Is it to
- [ ] add a new feature
- [ ] fix a bug
- [x] update tests and/or...

CLA Signed

This should be straightforward. The main issue I see coming up is with compile, similar to how we attempt to compile the reference and policy models in our single...

enhancement
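
For reference, the compile concern boils down to calling `torch.compile` on both models. A minimal, self-contained sketch, with placeholder `nn.Linear` modules standing in for the actual policy and reference models:

```python
import torch
import torch.nn as nn

# Placeholder modules; in the recipe these would be the policy and the
# frozen reference models.
policy_model = nn.Linear(16, 16)
ref_model = nn.Linear(16, 16)

# Compile both up front; the first forward pass of each triggers
# compilation, and recompiles fire if input shapes change between steps.
policy_model = torch.compile(policy_model)
ref_model = torch.compile(ref_model)

x = torch.randn(4, 16)
with torch.no_grad():
    ref_out = ref_model(x)  # reference model is only ever run for scoring
out = policy_model(x)
```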

Closes #1425

This PR provides various performance improvements to our PPO single device recipe.

| Branch | Total training time (hours)* | Peak memory allocated (GB) |
| --------------- |...

CLA Signed
rlhf

community help wanted
better engineering

```[tasklist]
### Tasks
- [ ] https://github.com/pytorch/torchtune/issues/2082
- [ ] Full-finetune distributed DPO recipe #1966
- [ ] #1262
- [ ] PPO tutorial/deep dive
- [ ] DPO tutorial/deep...
```

enhancement
community help wanted