Shibo Hao

Results 4 issues of Shibo Hao

Hi, thanks for the great work. From the paper, the only objective function is the distance between predicted and encoded representations of target patches. Why does the model not converge...

Hi, thanks for this great work! Do you plan to release the evaluation script of image generation (ms-coco)?

Hi, thanks for the cool project. I am testing [Llama-2-70B-GPTQ](https://huggingface.co/TheBloke/Llama-2-70B-GPTQ) with 1 * A100 40G, the speed is around 9 t/s Is this the expected speed? I noticed in some...

It seems each sample in the deita dataset consists of a lot of turns and is super long (>10k tokens). Your paper mentioned the max length of input is 2048...