
The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!

258 BentoML issues, sorted by most recently updated

### Feature request We are using diffusers SDXL + LoRA. We will potentially have lots of LoRA files that we want to manage, and we would like to be able to...

enhancement
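For context, a minimal sketch of the workflow this request refers to, using the public diffusers API; the model ID is the standard SDXL base checkpoint and the LoRA path is a placeholder:

```python
import torch
from diffusers import StableDiffusionXLPipeline

# load the SDXL base pipeline
pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",
    torch_dtype=torch.float16,
).to("cuda")

# attach a single LoRA; managing many such files is what the request asks about
pipe.load_lora_weights("loras/my_style.safetensors")  # placeholder path

image = pipe("a photo of an astronaut riding a horse").images[0]
image.save("out.png")
```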

### Describe the bug Hello. Currently, the Bento REST API handles inference issues, such as IO errors or GPU out-of-memory errors, by returning a 500 internal server error to the...

bug
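One workaround while this is open: BentoML 1.x API functions can accept a `bentoml.Context` argument and set the response status explicitly. A minimal sketch, assuming we want GPU out-of-memory surfaced as 503 instead of 500; `run_model` is a stand-in for the real runner call:

```python
import numpy as np
import torch
import bentoml
from bentoml.io import NumpyNdarray

svc = bentoml.Service("error_mapping_demo")

def run_model(arr: np.ndarray) -> np.ndarray:
    # stand-in for the real inference call, e.g. runner.run(arr)
    return arr * 2

@svc.api(input=NumpyNdarray(), output=NumpyNdarray())
def predict(arr: np.ndarray, ctx: bentoml.Context) -> np.ndarray:
    try:
        return run_model(arr)
    except torch.cuda.OutOfMemoryError:
        # map GPU OOM to 503 Service Unavailable rather than a bare 500
        ctx.response.status_code = 503
        return np.array([], dtype=np.float32)
```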

## What does this PR address?

Supports limiting maximum memory usage when pushing models.

![image](https://github.com/bentoml/BentoML/assets/141706136/42c364fb-fc5b-495e-8973-b867244d109e)

```
bentoml push facebook--opt-2.7b-service:905a4b602cda5c501f1b3a2650a4152680238254 --maxmemory 2
```

Test case 1: pushing `bento google--flan-t5-large-service`, model size...

### Describe the bug Hi, I use BentoML; the Bento service is a simple BERT model. I saw there was a bentoml.ray feature and tried it, but got an error:...

bug

Hi. Is BentoML planning to support on-demand loading? My use case is that I have multiple models which can't all be loaded at the same time, so I want to...
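Until something official exists, one common workaround is a custom runnable that defers loading until the first request. A minimal sketch against the BentoML 1.x runner API; the model tag `my_model:latest` and the `bentoml.pytorch` framework are placeholders, and true on-demand serving would also need eviction logic to unload idle models:

```python
import bentoml

class LazyModelRunnable(bentoml.Runnable):
    SUPPORTED_RESOURCES = ("cpu",)
    SUPPORTS_CPU_MULTI_THREADING = True

    def __init__(self):
        self._model = None  # nothing loaded at startup

    @bentoml.Runnable.method(batchable=False)
    def predict(self, x):
        if self._model is None:
            # load lazily on the first call
            self._model = bentoml.pytorch.load_model("my_model:latest")
        return self._model(x)

lazy_runner = bentoml.Runner(LazyModelRunnable, name="lazy_model")
```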

### Describe the bug When using a runner in a service, the service emits OpenTelemetry traces even for requests to URLs included in `excluded_urls` in the config. This is most...

bug
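For reference, the setting in question is the tracing `excluded_urls` option in the BentoML configuration file. Key placement varies across BentoML versions, so this is only a sketch assuming a top-level `tracing` section:

```yaml
# bentoml_configuration.yaml (sketch; exact nesting depends on version)
tracing:
  exporter_type: otlp
  excluded_urls: "readyz,livez,healthz"
```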

### Describe the bug When I retrieve a float number as output, it appears to be inaccurately rounded.

```py
@svc.api(input=NumpyNdarray(), output=NumpyNdarray(dtype=np.float32))
async def predict(arr: np.ndarray) -> float:
    return np.round(25.1234, 2)
```
...

bug
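Worth noting when triaging: with `output=NumpyNdarray(dtype=np.float32)` the result is cast to float32, and 25.12 is not exactly representable in binary floating point, so the extra digits are float32 precision rather than a rounding bug. A standalone demonstration:

```python
import numpy as np

x = np.round(25.1234, 2)  # float64; prints as 25.12
print(repr(x))

y = np.float32(x)         # the cast the IO descriptor performs
print(repr(y))

print(float(y))           # converting back to float64 exposes the
                          # nearest-representable float32 value
```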

### Describe the bug The docker health check also needs root privilege, but `_internal/container/docker.py` just runs

```python
[client, "version", "--format", "{{json .Server.Version}}"]
```

sudo is needed (but I don't know how to...

bug
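To reproduce outside BentoML, the health check boils down to shelling out to the docker CLI; on a host where the daemon socket is root-only (no docker group membership), it fails with a permission error. A minimal sketch:

```python
import subprocess

# roughly the command _internal/container/docker.py issues for its check
result = subprocess.run(
    ["docker", "version", "--format", "{{json .Server.Version}}"],
    capture_output=True,
    text=True,
)

# a non-zero returncode (e.g. permission denied on /var/run/docker.sock)
# means the current user cannot talk to the Docker daemon without sudo
print(result.returncode, result.stdout or result.stderr)
```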

### Describe the bug I recently experienced some dtype mismatch errors when using model.run() with numpy.float16 input while the PyTorch model's dtype is torch.float16. After inspection, I found that the...

bug
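A standalone reproduction of this class of error, outside BentoML; any float16 PyTorch model fed a float32 tensor will do:

```python
import numpy as np
import torch

# float16 weights; older CPU builds of torch lack float16 matmul,
# so prefer a GPU when one is available
device = "cuda" if torch.cuda.is_available() else "cpu"
model = torch.nn.Linear(4, 2).half().to(device)

x32 = torch.from_numpy(np.random.rand(1, 4).astype(np.float32)).to(device)
try:
    model(x32)  # float32 input vs float16 weights: dtype mismatch
except RuntimeError as e:
    print(e)

x16 = x32.half()          # casting the input to the model's dtype fixes it
print(model(x16).dtype)   # torch.float16
```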

Always prepend the current bentofile directory to the system path to avoid unwanted behavior when other bentofiles are on the system PATH. This is especially evident when trying to use the bentoml cli...
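A sketch of the general technique being proposed, in plain Python; the function name and path resolution here are illustrative, not the actual BentoML implementation:

```python
import os
import sys

def prepend_build_dir(bentofile_path: str) -> None:
    """Put the bentofile's directory first on sys.path so its modules
    shadow same-named modules from other build contexts."""
    build_dir = os.path.dirname(os.path.abspath(bentofile_path))
    if build_dir in sys.path:
        sys.path.remove(build_dir)  # avoid duplicates before re-inserting
    sys.path.insert(0, build_dir)

prepend_build_dir("bentofile.yaml")  # hypothetical usage
```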