PoseGPT
python main_chat.py <> CUDA out of memory
Hello,
- When running `main_chat.py` with bf16 precision, I get a CUDA out-of-memory (OOM) error on a 24 GB GPU. Is this expected? I can't find any information about the minimum GPU requirement.
- If I enable fp16 precision instead, I get `AttributeError: 'LlamaAttention' object has no attribute 'rope_theta'`. I think this is related to `deepspeed`, which is not listed in the requirements. Should I install a specific version?
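In case it helps with debugging, here is a minimal sketch for reporting the environment and getting a rough lower bound on memory. The package list (`torch`, `transformers`, `deepspeed`) is my guess at what matters here, the helper names are made up for illustration, and the `7e9` parameter count is only a placeholder, not the actual model size. The estimate covers the weights alone (2 bytes per parameter in both bf16 and fp16), so real usage will be higher once activations and the CUDA context are included.

```python
from importlib import metadata


def installed_versions(packages=("torch", "transformers", "deepspeed")):
    """Return {package: version string, or None if the package is not installed}."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


def weight_memory_gib(num_params, bytes_per_param=2):
    """Memory for the weights alone; bf16 and fp16 both use 2 bytes per parameter."""
    return num_params * bytes_per_param / 2**30


if __name__ == "__main__":
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'not installed'}")
    # 7e9 is a placeholder parameter count, not the actual model size.
    print(f"weights only: {weight_memory_gib(7e9):.1f} GiB")
```

I can post the output of this script if that would be useful.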
Thanks!