Chulki Lee
If it needs to be reported to https://github.com/awslabs/amazon-eks-ami/issues then please let me know.
```
ENV NVIDIA_VISIBLE_DEVICES all
ENV NVIDIA_DRIVER_CAPABILITIES compute,utility
```

Those were set. I'm using the `nvidia/cuda:11.8.0-base-ubuntu22.04` image, but it is still failing.

Update

```
declare -x CUDA_VERSION="11.8.0"
declare -x NVIDIA_REQUIRE_CUDA="cuda>=11.8 brand=tesla,driver>=450,driver=470,driver=470,driver=470,driver=470,driver=470,driver=470,driver=470,driver=470,driver=470,driver=470,driver=510,driver=510,driver=510,driver=510,driver=510,driver=510,driver=510,driver=510,driver=510,driver=510,driver=515,driver=515,driver=515,driver=515,driver=515,driver=515,driver=515,driver=515,driver=515,driver=515,driver
```
I tested the `g5g.xlarge` instance with the `BOTTLEROCKET_ARM_64_NVIDIA` image using the `nvcr.io/nvidia/pytorch:24.03-py3` container. I observed that CUDA is only detected when the GPU resource is explicitly specified. According to the...
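For reference, a minimal sketch of what "explicitly specified" could look like in a pod spec. It assumes the GPU is exposed under the resource name `nvidia.com/gpu` (the name used by the NVIDIA device plugin); the pod name `cuda-check` and the helper `gpu_pod_manifest` are hypothetical and only for illustration.

```python
import json


def gpu_pod_manifest(name, image, gpus=1):
    """Build a minimal Pod manifest that explicitly requests GPUs.

    Assumption: the cluster exposes GPUs as the extended resource
    "nvidia.com/gpu". The pod name and the check command are illustrative.
    """
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "restartPolicy": "Never",
            "containers": [
                {
                    "name": name,
                    "image": image,
                    # Print whether PyTorch can see a CUDA device.
                    "command": [
                        "python",
                        "-c",
                        "import torch; print(torch.cuda.is_available())",
                    ],
                    # The explicit GPU request discussed above.
                    "resources": {"limits": {"nvidia.com/gpu": gpus}},
                }
            ],
        },
    }


if __name__ == "__main__":
    manifest = gpu_pod_manifest("cuda-check", "nvcr.io/nvidia/pytorch:24.03-py3")
    # kubectl accepts JSON manifests, so this can be piped to `kubectl apply -f -`.
    print(json.dumps(manifest, indent=2))
```

Dropping the `resources.limits` entry from the manifest gives the other case to compare against.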
Thanks for the confirmation! I agree that allocating a GPU only when requested is the better behavior, but changing AL is not easy. I hope it happens on a major AL version bump.
Hitting a similar error:

```
/Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX.sdk/usr/include/unistd.h:525:10: fatal error: cannot open file 'aarch64-apple-darwin24.0.0/opt/jit/_ctermid.h': Too many open files
  525 | #include
      | ^
1 error generated.
make[4]: *** [obj/aarch64-apple-darwin24.0.0/opt/jit/beam_asm_module.o] Error 1
```

Environment...
For format changes - please see #755
Probably this is blocked by https://github.com/awslabs/amazon-eks-ami/issues/853 ? See also https://docs.aws.amazon.com/eks/latest/APIReference/API_Nodegroup.html
Can we make the Biome Zed plugin a no-op when the condition is met, instead of shutting it down with an error?