worker-vllm
The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
vLLM 0.4.1 introduced a `model_loader` module and removed a function that the model downloader depends on, so the model downloader module fails to import that function during the Docker build.
2024-05-23T09:58:01.432712734Z CUDA Version 12.1.0
2024-05-23T09:58:01.433425080Z
2024-05-23T09:58:01.433427258Z Container image Copyright (c) 2016-2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
2024-05-23T09:58:01.434084437Z
2024-05-23T09:58:01.434087212Z This container image and its contents are governed by the...
Hi there, the current version of the `download_model.py` script does not work due to the empty `TENSORIZE_MODEL` environment-variable check on line 50. Once that is fixed, the `weight_utils` file in...
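The report above does not show the failing check itself, but a common pitfall with flag-style environment variables is treating an empty string as set. A minimal sketch of a more robust check (the `env_flag` helper name is an illustration, not the script's actual code):

```python
import os

def env_flag(name: str, default: bool = False) -> bool:
    """Read a boolean-style env var, treating unset or empty as the default."""
    value = os.environ.get(name, "")
    if value.strip() == "":
        return default
    return value.strip().lower() in ("1", "true", "yes")

# An unset or empty TENSORIZE_MODEL falls back to False instead of
# being mistaken for an enabled flag.
tensorize_model = env_flag("TENSORIZE_MODEL")
```

With this pattern, `TENSORIZE_MODEL=""` and an unset variable behave the same way, which avoids the empty-string case described in the issue.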
I'm running a RunPod serverless vLLM template with Llama 3 70B on a 40 GB GPU. One of the requests failed, and I'm not completely sure what happened, but the message asked...
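The error message is truncated above, so the cause is unknown, but a 70B model is a tight fit for a single 40 GB card. A back-of-envelope, weights-only estimate (assuming 2 bytes per parameter for fp16/bf16; activations and KV cache add more on top):

```python
# Weights-only VRAM estimate for a 70B-parameter model in fp16/bf16.
params = 70e9            # 70 billion parameters
bytes_per_param = 2      # fp16/bf16
weights_gib = params * bytes_per_param / 1024**3
print(f"~{weights_gib:.0f} GiB of weights")  # well above a single 40 GB GPU
```

Unquantized, the weights alone are roughly 130 GiB, so serving Llama 3 70B on 40 GB generally requires quantization (e.g. 4-bit) or sharding across multiple GPUs.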
Greetings! I just wanted to make a quick note that the documentation for worker-vllm and RunPod both don't seem to mention anything about vLLM supporting guided generation via JSON schemas...
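For context, recent vLLM versions expose guided decoding through extra request parameters such as `guided_json` on the OpenAI-compatible server. A sketch of what such a request body might look like (the model name and field values are placeholders, and parameter availability depends on the vLLM version the worker bundles):

```python
import json

# JSON schema the output should conform to.
schema = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "age": {"type": "integer"},
    },
    "required": ["name", "age"],
}

# Hypothetical request body for vLLM's OpenAI-compatible completions endpoint,
# assuming guided_json is supported by the deployed vLLM version.
payload = {
    "model": "meta-llama/Meta-Llama-3-70B-Instruct",
    "prompt": "Return a JSON object describing a person.",
    "max_tokens": 128,
    "guided_json": schema,  # constrain generation to match the schema
}

body = json.dumps(payload)
```

If the bundled vLLM supports it, the server constrains token sampling so the completion parses as JSON matching the schema, rather than relying on prompt instructions alone.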