llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a...
### 🚀 The feature, motivation and pitch Config dataclasses like `llama_recipes.configs.training.train_config` should be named using PascalCase, in this case `TrainConfig`. The current naming violates a widely accepted Python convention. It...
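The rename proposed in this issue could be sketched roughly as follows. Only the names `train_config` and `TrainConfig` come from the issue text. The fields shown and the deprecated-alias approach are assumptions made for illustration, not the repository's actual config or an agreed migration plan.

```python
import warnings
from dataclasses import dataclass


@dataclass
class TrainConfig:
    """PascalCase name, as the issue proposes."""

    # Hypothetical fields for illustration only; the real config differs.
    model_name: str = "example-model"
    batch_size: int = 4


class train_config(TrainConfig):
    """Old snake_case name kept as a deprecated alias (an assumption)."""

    def __init__(self, *args, **kwargs):
        # Warn at the caller's location, then defer to the dataclass __init__.
        warnings.warn(
            "train_config is deprecated; use TrainConfig instead",
            DeprecationWarning,
            stacklevel=2,
        )
        super().__init__(*args, **kwargs)
```

With an alias like this, existing code that instantiates `train_config(...)` would keep working and receive a `DeprecationWarning`, while new code uses `TrainConfig` directly.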
What does this PR do? This PR introduces multi-modal inference using the Gradio UI for Llama 3.2 vision models. The Gradio UI allows users to upload images and generate descriptive...
# What does this PR do? This PR adds a new section in 3p_integrations for deploying Llama solutions on [Crusoe Cloud](https://crusoe.ai/). ## Feature/Issue validation/testing For easy reproducibility, Crusoe recipes use...
### 🚀 The feature, motivation and pitch Is fine-tuning the Vision models at a lower resolution supported? If so, can you please add a recipe for that (or add a note...
# What does this PR do? Updates the script to support converting a fine-tuned Llama 3.2 vision model to HF format so that it works with multimodal inference. ## Feature/Issue validation/testing Tested...
### 🚀 The feature, motivation and pitch I don't see any option to set up a learning rate scheduler in the fine-tuning input arguments. Is there a way to implement it?...
LoRA fine-tuning worked normally, but the following issue arose during full fine-tuning. I use the following script for full fine-tuning: ```shell #!/bin/bash NNODES=1 NPROC_PER_NODE=4...
This update adds support for multiple input formats, improves documentation, and updates dependencies. ### Major Enhancements: #### Support for Multiple Input Formats: 1. **Ingestion Module**: Added an ingestion module...
### System Info PyTorch version: 2.2.0 Is debug build: False CUDA used to build PyTorch: 11.8 GPU: A100 PCIe * 4 transformers: 4.45.2 ### Information - [X] The official example...
Hey, first of all thanks for your wonderful project! I've noticed this repo lacks a LICENSE file, which is the usual way of specifying the project's license. I also see...