hujunchao
Hi, when I use CppAD I run into this problem. Could anyone help me? Thank you! EXIT: Optimal Solution Found. Tried to set Option: print_level. It is not a...
### 🐛 Describe the bug When I use 2 nodes (16 GPUs) for roberta/pretraining, I get this error. Can you help me? ### Environment CUDA 11.6, Python 3.10, PyTorch 1.12.1
When I use two A100 nodes (8 × 80 GB each), I find that training on two nodes is slower than on one node. I launch with torchrun xxx. Has anyone else run into this?
Thanks for the amazing job! I have a question: after training, can I convert ckpt.pt to a Hugging Face model? And can I use LlamaForCausalLM for inference?
Sometimes there is no way to use ChatGPT or GPT-4. Can we use a smaller model than GPT, such as LLaMA, to get the layout? If we can use a small model to get...
I tried this project. It's amazing and interesting. But now I have a question. It's hard for me to get a good image from the text "a man rides a...
YOLOv4
Hi! Thanks for this great work! Can you provide some examples for YOLOv4? Thank you!
File "lib/python3.9/site-packages/transformers/models/idefics2/modeling_idefics2.py", line 190, in forward
    position_ids[batch_idx][p_attn_mask.view(-1).cpu()] = pos_ids
RuntimeError: shape mismatch: value tensor of shape [1037] cannot be broadcast to indexing result of shape [1036]
Congratulations on training such a great model! Is there any plan to use M2-Encoder for better text-video retrieval?