matrix issues

Results 10 issues of


                                            matrix

Add tpu models

创建预训练的数据时， article['input_ids'].append(0)，最后时0，但是在demo里eos确是102，这是什么原因

创建预训练的数据时， article['input_ids'].append(0)，最后时0，但是在demo里eos确是102，如果在预训练数据中没有102，那结束字符很难会出现102（[sep])字符吧，但是在运行demo时确实有sep，想问下，是不是article['input_ids'].append(0)应该是article['input_ids'].append(102)？

can we train by Parallel Computing or Multithreading or multi-Progress

can we train by Parallel Computing or Multithreading or multi-Progress? Speed up training thank you

feature request

When I use the following code on tpuvm and use model.generate() to infer, the speed is very slow. It seems that the tpu is not used. What is the problem?

### System Info When I use the following code on tpuvm and use model.generate() to infer, the speed is very slow. It seems that the tpu is not used. What...

from cyg_conversation import default_conversation ModuleNotFoundError: No module named 'cyg_conversation'

### System Info from cyg_conversation import default_conversation ModuleNotFoundError: No module named 'cyg_conversation' ### Information - [ ] The official example scripts - [ ] My own modified scripts ### Tasks...

bug

For 30B LLama model, can server be supported by configuring mesh_dims on tpu v3-8 (128g)? I tried 8,1 and 4,1 but they don't seem to work.

I would like to ask whether the 100k data of sharegpt4v is brushed with the openai gpt4v interface or the azure gpt4v interface, and what is the version of the interface model?

I would like to ask whether the 100k data of sharegpt4v is brushed with the openai gpt4v interface or the azure gpt4v interface, and what is the version of the...

matrix