torchtune
PyTorch native post-training library
Summary: add a flag that performs validation when `load_checkpoint` is passed an unexpected `meta_to_tune` flag, i.e. if this flag is true but the checkpoint is not in meta format, or if this flag...
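A minimal sketch of what such a validation could look like. The helper names `is_meta_format` and `validate_meta_flag` and the filename heuristic are hypothetical, not torchtune API:

```python
def is_meta_format(checkpoint_files) -> bool:
    """Heuristic sketch: Meta-format checkpoints typically ship as
    consolidated ``.pth`` shards (assumption for illustration only)."""
    return any(
        str(f).endswith(".pth") and "consolidated" in str(f)
        for f in checkpoint_files
    )


def validate_meta_flag(checkpoint_files, meta_flag: bool) -> None:
    """Fail fast when the flag disagrees with the checkpoint format."""
    looks_meta = is_meta_format(checkpoint_files)
    if meta_flag and not looks_meta:
        raise ValueError("flag is set, but checkpoint is not in meta format")
    if not meta_flag and looks_meta:
        raise ValueError("checkpoint looks like meta format, but flag is not set")
```

The idea is simply to raise at load time rather than failing later with a confusing key-mismatch error.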
Currently, we only support converting text and images from the OpenAI format to the torchtune Messages format. We should incorporate tool calling, as it is supported by the OpenAI format and...
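A rough sketch of how OpenAI-format tool-call messages might map onto a simple role/content structure. The `to_message` helper and its output shape are illustrative only and do not reflect torchtune's actual Messages API:

```python
import json


def to_message(openai_msg: dict) -> dict:
    """Map one OpenAI-format chat message to a plain role/content dict.

    Assistant tool calls are serialized into the content; tool results are
    tagged with an 'ipython' role, mirroring how some chat templates label
    tool output. Purely illustrative, not torchtune's converter.
    """
    role = openai_msg["role"]
    if role == "assistant" and openai_msg.get("tool_calls"):
        calls = [
            {"name": c["function"]["name"], "arguments": c["function"]["arguments"]}
            for c in openai_msg["tool_calls"]
        ]
        return {"role": "assistant", "content": json.dumps(calls)}
    if role == "tool":
        return {"role": "ipython", "content": openai_msg["content"]}
    return {"role": role, "content": openai_msg.get("content", "")}
```

The open question in the issue is exactly this mapping: which torchtune role tool results should land on, and how call arguments should be rendered into message content.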
Torchtune provides several configs for fine-tuning LLMs (such as [llama3_2/3B_full.yaml](https://github.com/pytorch/torchtune/blob/main/recipes/configs/llama3_2/3B_full.yaml)), and they often use the Alpaca dataset. Could you suggest some evaluation benchmarks for checking whether an LLM has been trained properly with Alpaca (or...
Hi, I'd like to express my gratitude for torchtune as it provides me with a high level of abstraction when trying to experiment with various post-training strategies. However, in my...
### Goal

Add a recipe entitled `fft_knowledge_distillation_distributed.py` that largely mirrors [knowledge_distillation_distributed.py](./recipes/knowledge_distillation_distributed.py) but uses full-weight finetuning instead of the LoRA method of weight updating.

### Artifacts

* One recipe...
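The distillation objective itself would be unchanged; only the set of trainable weights differs. A minimal sketch of the standard forward-KL distillation loss on logits, written in plain PyTorch as an assumption about the shape of the objective rather than a copy of torchtune's loss class:

```python
import torch
import torch.nn.functional as F


def forward_kl_loss(
    student_logits: torch.Tensor,
    teacher_logits: torch.Tensor,
    temperature: float = 1.0,
) -> torch.Tensor:
    """Forward KL(teacher || student) over the vocabulary dimension.

    In a full-finetune KD recipe this loss would backpropagate into all
    student weights, not just LoRA adapters.
    """
    t = temperature
    teacher_probs = F.softmax(teacher_logits / t, dim=-1)
    student_log_probs = F.log_softmax(student_logits / t, dim=-1)
    # batchmean averages over the leading dim; t^2 rescaling is the usual
    # correction when distilling with a softened temperature
    return F.kl_div(student_log_probs, teacher_probs, reduction="batchmean") * (t**2)
```

In practice this term is typically mixed with the ordinary cross-entropy loss via a weighting coefficient, as the existing KD recipe does.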
Currently, when resuming from a previous run that utilizes a learning rate scheduler, we do NOT load a state dict from the scheduler. **But wait, does that mean our code...
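For reference, restoring a scheduler correctly means checkpointing and reloading its `state_dict()` alongside the optimizer's. A minimal sketch with a plain PyTorch `StepLR` (not torchtune's recipe code):

```python
import torch

model = torch.nn.Linear(4, 4)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=10, gamma=0.5)

# Train for a while, then checkpoint optimizer AND scheduler state
for _ in range(15):
    optimizer.step()
    scheduler.step()
ckpt = {"opt": optimizer.state_dict(), "sched": scheduler.state_dict()}

# On resume: a fresh scheduler would restart the LR schedule from step 0
# unless its state dict is loaded back
optimizer2 = torch.optim.AdamW(model.parameters(), lr=1e-3)
scheduler2 = torch.optim.lr_scheduler.StepLR(optimizer2, step_size=10, gamma=0.5)
optimizer2.load_state_dict(ckpt["opt"])
scheduler2.load_state_dict(ckpt["sched"])
# scheduler2 now resumes at step 15 with the already-decayed LR
```

Skipping the `load_state_dict` call silently resets the decay schedule, which is exactly the resume bug the issue is asking about.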
Reference: https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct
Today, users have to do manual conversions between `.pth` and `.safetensors` formats before/after fine-tuning with torchtune. **Example 1: torchtitan -> torchtune -> HF transformers.** torchtitan outputs `.dcp`, which can be...
We previously didn't do this for **_reasons_**. Not sure what other folks do, but my typical flow for launching a finetune is currently: 1) open the config 2) copy-paste the...
Initial implementation of context parallelism in torchtune.

### Initial test

```
tune run --nproc_per_node 8 full_finetune_distributed --config llama3/8B_full \
  context_parallel_dim=4 metric_logger=torchtune.training.metric_logging.WandBLogger \
  metric_logger.project=context-parallel metric_logger.name=llama3-8b-cp4-dp2
```

Also confirmed that we can run...