InternLM-XComposer
How much VRAM is required for Share Captioner?
Hi
- How much VRAM is required for Share Captioner?
- Is there a way to use multiple GPUs for loading Share Captioner?
- Are there quantization methods (4-bit, 8-bit) available for Share Captioner?
I tried to run Share Captioner but got a CUDA out-of-memory error on an RTX 3090:
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 172.00 MiB. GPU 0 has a total capacty of 24.00 GiB of which 0 bytes is free. Including non-PyTorch memory, this process has 17179869184.00 GiB memory in use. Of the allocated memory 23.10 GiB is allocated by PyTorch, and 3.24 MiB is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
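For reference, the allocator hint mentioned at the end of the error message can be set as shown here. This is only a minimal sketch: the value 128 is an arbitrary assumption, not a recommendation from the project. Note that the log reports just 3.24 MiB reserved but unallocated, so fragmentation is probably not the cause here and this setting alone is unlikely to fix the error.

```python
import os

# Assumption: 128 is an illustrative value only; tune it for your workload.
# Set this before PyTorch makes its first CUDA allocation; doing it before
# "import torch" is the conservative choice.
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = "max_split_size_mb:128"

print(os.environ["PYTORCH_CUDA_ALLOC_CONF"])
```

The same variable can also be exported in the shell before launching the script.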
I have the same question. Does anybody have an answer?
I am able to run Share Captioner on an A6000 (48 GB). A 4090 (24 GB) is not enough.
@Han230104 So how much VRAM is actually being used in your A6000?
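One way to answer this is to read PyTorch's peak-memory counters after a captioning run. A minimal sketch, where `report_peak_vram` is a hypothetical helper name and the captioning call itself is left out because it depends on your script:

```python
def report_peak_vram(device=0):
    """Return peak CUDA memory in GiB for `device`, or None if CUDA is unavailable.

    `report_peak_vram` is a hypothetical helper, not part of Share Captioner.
    """
    try:
        import torch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    gib = 2 ** 30
    return {
        # Peak memory occupied by tensors since the start of the process
        "allocated_gib": torch.cuda.max_memory_allocated(device) / gib,
        # Peak memory held by PyTorch's caching allocator (always >= allocated)
        "reserved_gib": torch.cuda.max_memory_reserved(device) / gib,
    }

# Call this after running the captioner in the same process.
stats = report_peak_vram()
print(stats)
```

These counters cover only memory managed by PyTorch, so the figure shown by nvidia-smi will usually be somewhat higher.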