Yaron Rosenbaum
Hi, I'm exploring running your Docker setup in a cross-datacenter cluster. Would it be possible to expose the following parameters as `-e` environment variables for the Docker container? `cluster_name: ''`...
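A minimal sketch of what such support might look like, assuming a hypothetical entrypoint that renders `storm.yaml` from container environment variables (the variable name `CLUSTER_NAME` and the output format are illustrative assumptions, not the image's actual interface):

```python
# Hypothetical entrypoint helper: render cluster settings passed via
# `docker run -e CLUSTER_NAME=...` into a storm.yaml fragment at startup.
import os


def render_storm_yaml(env: dict) -> str:
    """Build a storm.yaml fragment from environment variables.

    Only cluster_name is shown; other keys would follow the same pattern.
    """
    cluster_name = env.get("CLUSTER_NAME", "")
    return f'cluster_name: "{cluster_name}"\n'


if __name__ == "__main__":
    # In a real entrypoint this would read os.environ and write the file.
    print(render_storm_yaml(dict(os.environ)), end="")
```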
Logviewer?
Hi, I would like to access the Storm logviewer on one of the supervisors. How do I do that? (AFAIK the supervisors are started by Mesos.) Thanks
InnoDB error
Hi buddy, first of all, thanks for taking the effort to put this repository in place. I'm looking for a quick way to test out VoIP for my home. I ran...
### What happened? Hi, I followed the instructions here: https://docs.litellm.ai/docs/providers/vllm My relevant config is:

```yaml
- model_name: Mistral-7B-Instruct-v0.2
  litellm_params:
    model: vllm/mistralai/Mistral-7B-Instruct-v0.2
    api_base: http://Mistral-7B-Instruct-v0.2.mycloud.local:8000
    api_key: fake-key
```

Queries fail. "No module named...
## Description Running djl-inference:0.27.0-neuronx-sdk2.18.1 with the Hugging Face model google/gemma-7b-it fails. ### Error Message

```
WARN PyProcess W-93-model-stderr: --- Logging error ---
WARN PyProcess W-93-model-stderr: Traceback (most recent call last):
WARN PyProcess W-93-model-stderr:...
```
### 🚀 The feature, motivation and pitch It seems like the current Docker images don't support Neuron (Inferentia). It would be very helpful if there were a tested, managed Neuron...
### Your current environment

```
root@9c92d584ab5f:/app# python3 ./collect_env.py
Collecting environment information...
WARNING 05-15 15:13:52 ray_utils.py:46] Failed to import Ray with ModuleNotFoundError("No module named 'ray'"). For multi-node inference, please install Ray with...
```
Running the benchmark script against llama-3-8b-inst on Inferentia 2 (djl-serving) results in:

```
python3.10 token_benchmark_ray.py \
  --model "openai/llama3-8b-inst" \
  --mean-input-tokens 550 \
  --stddev-input-tokens 150 \
  --mean-output-tokens 150 \
  --stddev-output-tokens...
```
### Your current environment Docker image: vllm/vllm-openai:v0.4.3 as well as 0.5.0.post1. Params:

```
--model=microsoft/Phi-3-medium-4k-instruct
--tensor-parallel-size=2
--disable-log-requests
--trust-remote-code
--max-model-len=2048
--gpu-memory-utilization=0.9
```

The container freezes (does nothing) after presenting the following...
## Description Unable to use the OpenAI endpoint; getting the error below. ### Error Message PyProcess W-100-model-stdout: The following parameters are not supported by neuron with rolling batch: {'frequency_penalty'}. ## How...
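One possible client-side workaround, sketched under the assumption that the unsupported parameters can simply be dropped before the request is sent (the set of parameter names below is illustrative; only `frequency_penalty` appears in the error above):

```python
# Hypothetical workaround: strip sampling parameters that the Neuron
# rolling-batch backend rejects from an OpenAI-style request payload
# before sending it. Parameter names beyond frequency_penalty are assumed.
UNSUPPORTED_NEURON_PARAMS = {"frequency_penalty"}


def filter_payload(payload: dict) -> dict:
    """Return a copy of the request payload without unsupported params."""
    return {k: v for k, v in payload.items()
            if k not in UNSUPPORTED_NEURON_PARAMS}


request = {
    "model": "my-model",
    "prompt": "Hello",
    "frequency_penalty": 0.5,
}
print(filter_payload(request))
```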