Jongho Park
Jongho Park
Yes, at least I think so? I downloaded 30b running `wget http://parl.ai/downloads/_models/bb3/bb3_30B/consolidated.pt `. Then resharded according to ``` CONSOLIDATED=/path/to/bb3_30B/consolidated/ RESHARD=/save/path/to/bb3_30B/resharded/ MP=2 python -m metaseq.scripts.reshard_model_parallel $CONSOLIDATED/consolidated $MP --save-prefix $RESHARD/reshard ``` so...
Do you mean the following? Then yes... ``` cd /path/to/resharded-weights wget https://github.com/facebookresearch/metaseq/raw/main/projects/OPT/assets/gpt2-merges.txt wget https://github.com/facebookresearch/metaseq/raw/main/projects/OPT/assets/gpt2-vocab.json ```
I would like to think it's not a dict problem since after saying "hello", I receive the following. ``` Enter Your Message: hello 05:46:35 | ['Person 1: hello\nSearch Decision:'] 05:46:35...
Hmm so I should be on the `fairseq_v3` branch instead of `fairseq_v2` for Megatron? If so, that could possibly be it so I'll give it a shot. I did run...