lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
marlin
# What does this PR do? Fixes # (issue) ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks...
### Feature request **Description** I want to make it easier for new people to use Lorax, especially those coming from other tools. Right now, they have to set max-input-length and...
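A minimal sketch of how such defaults might be derived, assuming a hypothetical helper that reads the model's config.json; the helper name and fallback values are illustrative, not Lorax's actual startup logic:

```python
import json

def derive_length_defaults(config_path: str) -> tuple[int, int]:
    """Hypothetical helper: pick max-input-length / max-total-tokens
    from the model's own context window instead of requiring flags."""
    with open(config_path) as f:
        cfg = json.load(f)
    # most HF configs expose the context window as max_position_embeddings
    ctx = cfg.get("max_position_embeddings", 2048)
    max_input_length = ctx - 1   # leave room for at least one generated token
    max_total_tokens = ctx
    return max_input_length, max_total_tokens
```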
marlin
### Feature request https://github.com/IST-DASLab/marlin ### Motivation Faster inference. ### Your contribution I will open a PR tomorrow; opening this issue for tracking, or in case someone else gets to it first.
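For context, Marlin is a fused FP16×INT4 GEMM kernel. A naive reference sketch of the computation it accelerates, i.e. groupwise dequantization followed by a matmul; the packed weight layout and the fusion are what Marlin actually optimizes, and this sketch attempts neither:

```python
import torch

def fp16_int4_matmul_reference(a, q, scales, group_size=128):
    """What a fused FP16 x INT4 kernel computes, spelled out naively.

    a:      [m, k] fp16 activations
    q:      [k, n] int4 weights, here already unpacked to ints in [-8, 7]
    scales: [k // group_size, n] fp16 per-group scales
    """
    # each group of `group_size` rows shares one scale per output column
    w = q.to(a.dtype) * scales.repeat_interleave(group_size, dim=0)
    return a @ w  # Marlin fuses the dequantization above into the GEMM itself
```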
The download process [here](https://github.com/predibase/lorax/blob/main/launcher/src/main.rs#L800) is essentially a no-op if the model weights are already present, but it can still add several seconds of latency to startup. We can make a quick...
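The launcher itself is Rust, but the check is simple; a Python sketch of the idea using huggingface_hub's offline mode, assuming the standard HF cache layout:

```python
from huggingface_hub import snapshot_download
from huggingface_hub.utils import LocalEntryNotFoundError

def weights_already_cached(model_id: str) -> bool:
    """True if a complete snapshot is already in the local HF cache,
    in which case the launcher could skip the download step entirely."""
    try:
        # local_files_only=True never touches the network; it raises
        # if any requested file is missing from the cache
        snapshot_download(model_id, local_files_only=True)
        return True
    except LocalEntryNotFoundError:
        return False
```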
LongLM
### Feature request https://arxiv.org/pdf/2401.01325.pdf Abstract: This work elicits LLMs’ inherent ability to handle long contexts without fine-tuning. The limited length of training sequences may limit the application...
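The core trick in this paper (SelfExtend) is remapping relative positions at inference time: nearby tokens keep their exact positions, while distant tokens are merged into coarse groups by a floor division. A simplified sketch of that mapping as I read the paper, with the boundary shift chosen to keep positions roughly continuous; the window and group sizes are illustrative:

```python
def self_extend_rel_pos(q_pos: int, k_pos: int,
                        neighbor_window: int = 512,
                        group_size: int = 4) -> int:
    """Remapped relative position between a query and an earlier key."""
    rel = q_pos - k_pos
    if rel < neighbor_window:
        return rel  # nearby tokens: ordinary positions, attention unchanged
    # distant tokens: floor-divide positions into groups, then shift so the
    # grouped range starts where the neighbor window ends
    shift = neighbor_window - neighbor_window // group_size
    return q_pos // group_size - k_pos // group_size + shift
```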
### System Info 2024-01-10T09:14:20.356771Z INFO lorax_launcher: Args { model_id: "/data/Llama-2-7b-chat-hf", adapter_id: "/data/llama2-lora", source: "hub", adapter_source: "hub", revision: None, validation_workers: 2, sharded: None, num_shard: None, quantize: None, compile: false, dtype: None,...
### Feature request The developments in the robotics community around RT-2 show a lot of potential for VLMs, but the hardware constraints small developers face make it difficult to deploy...
### Feature request I have downloaded the model and want to run it from the local files. The sample is: docker run --gpus all --shm-size 1g -p 8080:80 -v /data/model/:/data/...
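Once the container is serving the locally mounted weights, requests can be sent over the REST API. A sketch against the /generate endpoint, assuming the container from the docker command above is listening on port 8080 and the response shape matches the standard generate endpoint; the prompt and parameter values are placeholders:

```python
import requests

resp = requests.post(
    "http://127.0.0.1:8080/generate",
    json={
        "inputs": "What is LoRA?",
        "parameters": {"max_new_tokens": 64},
    },
)
resp.raise_for_status()
print(resp.json()["generated_text"])
```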
### System Info Lorax version: 0.4.1 Lorax_launcher: 0.1.0 Model: mistralai/Mixtral-8x7B-Instruct-v0.1 GPUs: 3090 (24 GB), 3060 (12 GB) ### Information - [X] Docker - [ ] The CLI directly ### Tasks...