llama-recipes
llama-recipes copied to clipboard
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a...
### 🚀 The feature, motivation and pitch Anthropic directly states that their models prefer context for longer prompts (like the usual RAG applications) to be inserted in XML tags. Some...
# What does this PR do? **Description of Changes**: This update introduces an integrated pipeline using Retrieval-Augmented Generation (RAG) with MongoDB and Hugging Face's open-source Llama3 model for advanced question...
### System Info transformers==4.40.0, peft==0.10.0 ### Information - [X] The official example scripts - [ ] My own modified scripts ### 🐛 Describe the bug when i running this command:...
# What does this PR do? Adds a recipe (`FMBench`) for benchmarking Llama models (including Llama3) on AWS platforms (SageMaker, Bedrock). `FMBench` is an open-source Python package for benchmarking foundation...
# What does this PR do? update LLaMa family data:image/s3,"s3://crabby-images/069ab/069abf8604a5317d6e98c0772ac9aac73209ba07" alt="image" Fixes # (issue) ## Feature/Issue validation/testing Please describe the tests that you ran to verify your changes and relevant result...
# What does this PR do? This PR solve an old quesiton. prepare_model_for_int8_training has been deprecated for quite some time, with PEFT v0.10.0, it has been removed. Please use prepare_model_for_kbit_training...
Dev
# What does this PR do? Fixes # (issue) ## Feature/Issue validation/testing Please describe the tests that you ran to verify your changes and relevant result summary. Provide instructions so...
# What does this PR do? VideoSummary: show how to ask Llama 3 to generate a summary of a long (almost 3 hours) youtube video (Yann LeCun vs Lex Fridman)...
## Main Goal Building an e2e recipe for building chatbots where we need to fine-tune a model and wont be able to rely only on RAG. This is just a...