hujunchao issues

Results 10 issues of


                                            hujunchao

Tried to set Option: xx. It is not a valid option. Please check the list of available options

Hi, When I use cppad， I meet this problem, can any kind person help me? Thank you! EXIT: Optimal Solution Found. Tried to set Option: print_level. It is not a...

[BUG]: torch.distributed.elastic.rendezvous.dynamic_rendezvous

### 🐛 Describe the bug When i use 2 nodes(16gpus) to do roberta/pretraining，i meet this error, can you help me? ### Environment cuda 11.6 python 3.10 pytorch 1.12.1

bug

train using 2 nodes is slower than 1 node

when I use two A100 nodes, each node is (80GX8). I found two nodes train is slower than one node. I use torchrun xxx. can any one meet this?

convert ckpt.pt to huggingface model

Thanks for the amazing job! I have a question that after training, can I convert ckpt.pt to hugging face model? And can I use the LlamaForCausalLM to inference?

Can we use small model like LLAMA to get layout?

Sometimes, there is no way to use Chatgpt or GPT4. Can we use smaller model than GPT like llama to get layout? If we can use small model to get...

How to get the image a man rides a horse?

I try this project. It's amzing and interesting. But now, I meet a question. It's hard for me to get a good image by the text "a man rides a...

YOLOV4

HI! Thanks for this great work! Can you provide some examples for yolov4 ? Thank you!

RuntimeError: shape mismatch: value tensor of shape [1037] cannot be broadcast to indexing result of shape [1036]

File "lib/python3.9/site-packages/transformers/models/idefics2/modeling_idefics2.py", line 190, in forward position_ids[batch_idx][p_attn_mask.view(-1).cpu()] = pos_ids RuntimeError: shape mismatch: value tensor of shape [1037] cannot be broadcast to indexing result of shape [1036]

Any plan to use M2-Encoder to make better text-video retrieval?

Congratulations on training such a great model！Is there any plan to use M2-Encoder to make better text-video retrieval?