vcuda-controller issues

Results 19 vcuda-controller issues

Sort by recently updated

some issues on cuda-11.4

I want to ask some question, and look forward to your reply. Cugetprocaddress is added in cuda-11.4, which can prevent our hijacking code from being executed. Have you solved this...

M201972777

进入到 pod 里面执行 nvidia-smi ，显示的显存和 node 节点是一致， pod 里面应该显示给pod 分配的资源吧

khw934

Problems caused by launching multiple pods at the same time

Why do I get an error when I start multiple GPU-resource pods simultaneously (concurrently) using vcuda? In vcuda loader.c, I add `ferror` to print `errno` related error message, I get...

pidb

some questions about initialization

1. why call cuInit() in initialization() and call initialization() in cuInit() ? 2. why call initialization() many times while only one call is need? (even use g_init_once to ensure it)

matinjugou

why is nvmlDeviceSetComputeMode disabled when vcuda is enabled?

```c nvmlReturn_t nvmlDeviceSetComputeMode(nvmlDevice_t device, nvmlComputeMode_t mode) { if (g_anycuda_config.enable) { return NVML_ERROR_NOT_SUPPORTED; } return NVML_ENTRY_CALL(nvml_library_entry, nvmlDeviceSetComputeMode, device, mode); } ```

matinjugou

read to EOL for pod and container id in cgroup

fix containerd cgroupfs path use `:` refactor the reading logic to simplify it.

zwpaper

need update for cuda11.4？

I tried to use vcuda on Driver Version: 470.57.02, the program may fail without warning. Does it need to be updated for cuda11.4？Thanks!

difenbei

untraceable GPU memory allocation

**Describe the bug** When I was testing triton inference server 19.10, GPU memory usage increases when the following two functions are called: 1. cuCtxGetCurrent 2. cuModuleGetFunction It seems when loading...

zw0610

Doubts about cuArray3DCreate_helper calculating memory usage

I see the function https://github.com/tkestack/vcuda-controller/blob/5ec3fcbc58679bbbbd82ef9e21d647d8d7383876/src/hijack_call.c#L713-L731 so, should it be change to ```c request_size = base_size * pAllocateArray->NumChannels * pAllocateArray->Height * pAllocateArray->Width * pAllocateArray->Depth; ```

pidb

How to use the vcuda by docker run

I tried to build vcuda by running `IMAGE_FILE={xx} ./build-img.sh` and get the /usr/bin/nvml-monitor以及/usr/lib64/libcuda-control.so. Now I want to run the container by docker run xx, so what should I do and...

MC17

vcuda-controller
vcuda-controller copied to clipboard

Metadata

some issues on cuda-11.4

进入到 pod 里面执行 nvidia-smi ，显示的显存和 node 节点是一致， pod 里面应该显示给pod 分配的资源吧

Problems caused by launching multiple pods at the same time

some questions about initialization

why is nvmlDeviceSetComputeMode disabled when vcuda is enabled?

read to EOL for pod and container id in cgroup

need update for cuda11.4？

untraceable GPU memory allocation

Doubts about cuArray3DCreate_helper calculating memory usage

How to use the vcuda by docker run

← Metadata

Owner

Metadata

vcuda-controller vcuda-controller copied to clipboard

Metadata

← Metadata

Owner

Metadata

vcuda-controller
vcuda-controller copied to clipboard