[Question] vkCreateFence failed with ERROR_OUT_OF_HOST_MEMORY
Question
π§© Issue Summary
vkCreateFence failed with ERROR_OUT_OF_HOST_MEMORY occurs during runtime of an Isaac Simβbased Python simulation (carb.graphics-vulkan.plugin) even though system memory usage is low (< 10 GB of 64 GB available) and GPU memory is not fully utilized. The issue persists even after Vulkan ICD cleanup and verification.
π₯οΈ System Configuration
| Component | Details |
|---|---|
| GPU | NVIDIA GeForce RTX 3080 Ti |
| Driver Version | 550.163.01 (verified via nvidia-smi) |
| Vulkan API Version | 1.3.277 (vulkaninfo) |
| CUDA Version | 12.x (inferred from driver package) |
| CPU | Intel Core i9-12900K (includes integrated UHD 770) |
| OS | Ubuntu 22.04 LTS (x86-64, kernel 5.15+) |
| RAM | 64 GB DDR5 |
| Swap | 2 GB |
| Python Environment | conda env: unitree_sim_env, Python 3.10, PyTorch 2.3+, Isaac Lab/Sim 2025 |
| Display Server | X11 |
| ICD configuration | /usr/share/vulkan/icd.d/nvidia_icd.json only (all Mesa/LLVM/AMDGPU ICDs removed) |
| Layers visible | nvidia_layers.json, VkLayer_MESA_device_select.json (disabled with NODEVICE_SELECT=1) |
βοΈ Vulkan Diagnostic Output
GPU id = 0 (NVIDIA GeForce RTX 3080 Ti)
apiVersion = 1.3.277
driverVersion = 0x89a8c040
deviceType = PHYSICAL_DEVICE_TYPE_DISCRETE_GPU
All ICDs and layers verified via:
VK_LOADER_DEBUG=all vulkaninfo
Only NVIDIA ICD active.
π§ͺ Error Log Excerpt
[carb.graphics-vulkan.plugin] VkResult: ERROR_OUT_OF_HOST_MEMORY
[carb.graphics-vulkan.plugin] vkCreateFence failed with ERROR_OUT_OF_HOST_MEMORY (file descriptors: 304, soft limit: 2450, hard limit: 1048576)
[omni.kit.renderer.plugin] acquireNextFrameBufferNoWait: Failed to begin frame command list
Despite these, free -h shows:
Mem: 62Gi total, 7.5Gi used, 47Gi free, 7.8Gi buff/cache, 49β55Gi available
Swap: 2.0Gi total, 0 used
No spike in process RSS or mapped memory (VmSize ~100 GB, VmRSS ~3β4 GB constant).
π Troubleshooting Already Performed
-
β Verified correct ICD in
/usr/share/vulkan/icd.d/nvidia_icd.json. -
β Removed/disabled all Mesa & LLVM ICDs (
lvp_icd,radv_icd,intel_icd,VkLayer_MESA_device_select). -
β Verified Vulkan runtime (
libvulkan.so.1) points to/lib/x86_64-linux-gnu/libvulkan.so.1. -
β No memory exhaustion (RAM, swap, GPU).
-
β ulimit increased (
ulimit -n 65536,ulimit -v unlimited). -
β Tested both headless (
--no_render) and rendering modes β same error. -
β Vulkan validation layers show no
vkAllocateMemoryfailure before the crash. -
β Kernel/dmesg logs clean (no OOM killer events).
-
β NVIDIA driver modules (
nvidia_uvm,nvidia_drm,nvidia_modeset) all loaded normally. -
β οΈ
resource_tracker: There appear to be leaked shared_memory objectswarnings from Python multiprocessing at shutdown (unclear relation).
Here is full log [run_full.log](https://github.com/user-attachments/files/22759541/run_full.log)
Thank you for posting this. It seems you may be hitting a hardware limitation, as the minimum spec. for Isaac Sim is GeForce RTX 4080. See this doc for ref..
Thank you for posting this. It seems you may be hitting a hardware limitation, as the minimum spec. for Isaac Sim is GeForce RTX 4080. See this doc for ref..
Thank you for your reply. However, it is Isaac Sim 4.5..
Thank you for posting this. It seems you may be hitting a hardware limitation, as the minimum spec. for Isaac Sim is GeForce RTX 4080. See this doc for ref..
Hi team β could you please take another look at this issue? It doesnβt seem to be a hardware limitation. For reference, version 4.5 runs fine on an RTX 3080 Ti, so the current problem is likely unrelated to insufficient specs. Thanks!
Hi @LiliviaWu did you solve this issue?