Aflah issues

Results 28 issues of


                                            Aflah

Added Helpful Information for Recreating Results

Added Possible Error Fixes

Adding GPT-NeoX

I followed along the instructions [here ](https://github.com/sgl-project/sglang/issues/81#issuecomment-1917086172) to add GPT-NeoX support which would bring support for the Pythia model family and other similar architecture models. Reference: https://github.com/sgl-project/sglang/issues/157#issue-2122338478 FIXED (Keeping Logs...

Slow weight loading

Whenever I try to load the Mixtral models it takes very long and at the end instead of actually starting the server I get a similar error as the one...

help wanted

[Usage]: What's the minimum VRAM needed to use entire context length for Llama 3.1 70B and 405B

### Your current environment Libraries Installed - ``` "vllm==0.5.5", "torch==2.4.0", "transformers==4.44.2", "ray", "hf-transfer", "huggingface_hub" ``` ### How would you like to use vllm Hi I want to run Llama 3.1...

usage

Expected Data Format

### ❓ The question I was looking at the config files and noticed that the config files sometimes point to `.npy` files for the dataset. Is there any script to...

type/question

Allow pointing to a revision for models

### Feature Request Several models like Pythia, OLMo etc. use revisions to store different checkpoints. It would be great if supplying a revision arg is supported for these models. ###...

feature request

Pretraining an OLMo model on the SlimPajama dataset

Hi! I am planning to test pretraining OLMo 1B model on the slim pajama dataset. I was trying to follow the tutorial for tinyllama but one of the steps for...

help wanted

question

[Example] OpenRLHF Integration

This PR adds examples for running DPO, SFT and RM Training using OpenRLHF on SkyPilot. Verified the training runs on GCP