djl
An Engine-Agnostic Deep Learning Framework in Java
I have two model folders for Llama 3, one with the original weights and one with the fine-tuned weights. How do I configure djl-serving to serve both model folders?
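For context, one common way to arrange this (a hedged sketch, not a confirmed answer for this setup) is to point djl-serving's model store at a directory in which each subfolder is a separate model with its own serving.properties; the folder names and paths below are hypothetical.

```
model_store/
├── llama3-base/
│   ├── serving.properties      # e.g. option.model_id pointing at the original weights
│   └── (model files)
└── llama3-finetuned/
    ├── serving.properties      # e.g. option.model_id pointing at the fine-tuned weights
    └── (model files)
```

With the model store pointed at `model_store/` (the exact flag or environment variable depends on how the server is launched), each subfolder is typically loaded as its own model and served under its folder name; models can usually also be registered at runtime through the management API if the deployment exposes it.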
## Description
I am using version 0.29.0 with JDK 17 on Windows 11. The libtorch dependencies are loaded properly,...
### Environment Info
Container: Docker with no GPU
OS: AlmaLinux
CUDA installed: 12.2
cuDNN installed: 8.9.0
djl version: 0.29.0
onnxruntime_gpu version: 1.8.0
### Error Message
```
[root@r100048367-91051506-l5wvj powerop]# cat /tmp/hs_err_pid1062.log...
```
Model conversion process failed when deploying Mixtral 8x22B AWQ with djl-tensorrtllm to SageMaker
## Description
The model conversion process failed with djl-tensorrtllm and the serving.properties below:
```
image_uri = image_uris.retrieve(
    framework="djl-tensorrtllm",
    region=sess.boto_session.region_name,
    version="0.28.0"
)

%%writefile serving.properties
engine=MPI
option.model_id=MaziyarPanahi/Mixtral-8x22B-Instruct-v0.1-AWQ
option.tensor_parallel_degree=4
option.quantize=awq
option.max_num_tokens=8192
option.max_rolling_batch_size=8
```
### Expected...
## Description
Hello DJL Team, I am currently using the SAM2 model from the DJL ModelZoo for inference. However, I have encountered a couple of limitations that I would...
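For context, model-zoo models are normally consumed through DJL's Criteria API. The sketch below shows that pattern only; the zoo URL, the plain `Image` input, and the `CategoryMask` output are placeholders rather than the confirmed SAM2 contract (the real SAM2 pipeline also takes prompt points, which this sketch omits).

```java
import ai.djl.inference.Predictor;
import ai.djl.modality.cv.Image;
import ai.djl.modality.cv.ImageFactory;
import ai.djl.modality.cv.output.CategoryMask;
import ai.djl.repository.zoo.Criteria;
import ai.djl.repository.zoo.ZooModel;

import java.nio.file.Paths;

public class Sam2Sketch {

    public static void main(String[] args) throws Exception {
        // Hypothetical zoo URL; the actual SAM2 artifact group/name may differ.
        Criteria<Image, CategoryMask> criteria =
                Criteria.builder()
                        .setTypes(Image.class, CategoryMask.class)
                        .optModelUrls("djl://ai.djl.pytorch/sam2-hiera-tiny")
                        .optEngine("PyTorch")
                        .build();

        try (ZooModel<Image, CategoryMask> model = criteria.loadModel();
                Predictor<Image, CategoryMask> predictor = model.newPredictor()) {
            Image img = ImageFactory.getInstance().fromFile(Paths.get("input.jpg"));
            CategoryMask mask = predictor.predict(img);
            System.out.println(mask);
        }
    }
}
```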
## Description
@frankfliu asked me to create an issue to track this; it was originally reported by me on Slack: https://deepjavalibrary.slack.com/archives/C01AURG857U/p1727308663498229. Can we please add `intfloat/multilingual-e5-large-instruct` to the model zoo? It...
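To illustrate what is being asked for: once a Hugging Face embedding model is published to the zoo, it can typically be loaded like the sketch below. The `djl://` URL for this particular model is hypothetical until the zoo entry actually exists; the rest follows the usual text-embedding pattern.

```java
import ai.djl.huggingface.translator.TextEmbeddingTranslatorFactory;
import ai.djl.inference.Predictor;
import ai.djl.repository.zoo.Criteria;
import ai.djl.repository.zoo.ZooModel;

public class E5EmbeddingSketch {

    public static void main(String[] args) throws Exception {
        // Hypothetical URL: valid only once the model has been added to the zoo.
        Criteria<String, float[]> criteria =
                Criteria.builder()
                        .setTypes(String.class, float[].class)
                        .optModelUrls(
                                "djl://ai.djl.huggingface.pytorch/intfloat/multilingual-e5-large-instruct")
                        .optEngine("PyTorch")
                        .optTranslatorFactory(new TextEmbeddingTranslatorFactory())
                        .build();

        try (ZooModel<String, float[]> model = criteria.loadModel();
                Predictor<String, float[]> predictor = model.newPredictor()) {
            float[] embedding = predictor.predict("query: how do I serve two llama3 models?");
            System.out.println("embedding dimension = " + embedding.length);
        }
    }
}
```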
## Description
vLLM sampling parameters include a [richer set of values](https://github.com/vllm-project/vllm/blob/c9b45adeeb0e5b2f597d1687e0b8f24167602395/vllm/sampling_params.py), among which `logprobs` is particularly useful. When testing by adding the `logprobs` option to the request payload, the...
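For reference, the kind of payload being described looks roughly like the following (a sketch of the common `inputs`/`parameters` request shape; whether the handler actually forwards `logprobs` through to vLLM's sampling parameters is exactly what this issue is about):

```json
{
  "inputs": "What is Deep Java Library?",
  "parameters": {
    "max_new_tokens": 128,
    "temperature": 0.7,
    "logprobs": 5
  }
}
```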
## Description
When deploying Mistral 7B Instruct v0.2 on a SageMaker endpoint (ml.g5.12xlarge) using the TensorRT-LLM backend (just-in-time compilation), I noticed that some of the serving parameters get overwritten....
## Description
When running the examples on 0.30.0-SNAPSHOT I receive an UnsatisfiedLinkError.
### Expected Behavior
I expect the examples to run.
### Error Message
```
Failed to load PyTorch native library...
```