vidur
A large-scale simulation framework for LLM inference
Since the 4090 really has an edge on inference with smaller models, is it possible to add data for 4090 cards? Thanks!
I am trying to study the scheduling algorithms used in the simulator. Some scheduling algorithms, like Orca, just reserve the maximum memory for requests even though it is not actually used....
The paper mentions that Vidur-Search can be used to find the best deployment method, but the README does not mention how to use it.
Found a CPU example and want to test this. It needs sarathi, but the sarathi I found has no implementation of LLMEngine. What are the requirements for CPU usage?
Has the Splitwise branch been fully updated? I noticed some logic changes in the SplitwiseGlobalScheduler, but it seems that no adjustments were made for PD separation in other areas,...
Hi, could you please help resolve the issue below with the IPython.core.display module? Setup: mamba virtual env at /home/idps/vidur/vidur-venv. I configured wandb and set the variable WANDB_BASE_URL to a local web server with...
Hello! There is a statement in the README file: "The simulator supports a plethora of parameters for the simulation description which can be found [here](https://github.com/microsoft/vidur/blob/main/docs/launch_parameters.md)." However, the link doesn't work:...
I found that the `_get_block_execution_time()` function in `vidur/entities/execution_time.py` computes `add_time` only once:

```python
def _get_block_execution_time(self) -> float:
    return (
        self._get_attention_layer_execution_time()
        + self._get_mlp_layer_execution_time()
        + self._add_time
    )
```

But in other...
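The concern in the issue above can be illustrated with a small standalone sketch. The timing values here are hypothetical and this is not the simulator's actual code; it only shows how the per-block total diverges when `add_time` is charged once per block versus once after each sub-layer:

```python
# Hypothetical per-component timings (milliseconds), for illustration only.
attention_time = 2.0
mlp_time = 3.0
add_time = 0.5  # residual-add overhead

# Variant 1: add_time charged once per transformer block
# (matches the structure of _get_block_execution_time above).
block_time_once = attention_time + mlp_time + add_time

# Variant 2: add_time charged after each sub-layer (attention and MLP),
# i.e. twice per block, as the issue suggests other code paths may assume.
block_time_per_sublayer = attention_time + mlp_time + 2 * add_time

print(block_time_once)         # 5.5
print(block_time_per_sublayer)  # 6.0
```

Over a model with many layers, the gap compounds by the layer count, so which convention the simulator intends matters for accuracy.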
At the point mentioned below, the link for GPTModel is broken, so I can't get the exact config for the YAML file and hence am not able to add a new model (llama2-13b) as...