llama-recipes icon indicating copy to clipboard operation
llama-recipes copied to clipboard

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a...

Results 272 llama-recipes issues
Sort by recently updated
recently updated
newest added

### 🚀 The feature, motivation and pitch Anthropic directly states that their models prefer context for longer prompts (like the usual RAG applications) to be inserted in XML tags. Some...

# What does this PR do? **Description of Changes**: This update introduces an integrated pipeline using Retrieval-Augmented Generation (RAG) with MongoDB and Hugging Face's open-source Llama3 model for advanced question...

cla signed

### System Info transformers==4.40.0, peft==0.10.0 ### Information - [X] The official example scripts - [ ] My own modified scripts ### 🐛 Describe the bug when i running this command:...

# What does this PR do? Adds a recipe (`FMBench`) for benchmarking Llama models (including Llama3) on AWS platforms (SageMaker, Bedrock). `FMBench` is an open-source Python package for benchmarking foundation...

cla signed

# What does this PR do? update LLaMa family ![image](https://github.com/meta-llama/llama-recipes/assets/20925537/fd496d4c-7ac7-421b-b1f4-15b769a24625) Fixes # (issue) ## Feature/Issue validation/testing Please describe the tests that you ran to verify your changes and relevant result...

cla signed

# What does this PR do? This PR solve an old quesiton. prepare_model_for_int8_training has been deprecated for quite some time, with PEFT v0.10.0, it has been removed. Please use prepare_model_for_kbit_training...

cla signed

# What does this PR do? Fixes # (issue) ## Feature/Issue validation/testing Please describe the tests that you ran to verify your changes and relevant result summary. Provide instructions so...

# What does this PR do? VideoSummary: show how to ask Llama 3 to generate a summary of a long (almost 3 hours) youtube video (Yann LeCun vs Lex Fridman)...

cla signed

## Main Goal Building an e2e recipe for building chatbots where we need to fine-tune a model and wont be able to rely only on RAG. This is just a...

cla signed