Yan Tao issues

Results 14 issues of


                                            Yan Tao

On android 4.4 avformat_open_input failed found invalid data where processing input?

On Android 4.4 it was OK for audio call. But for video call the logcat show these errors: QCMediaPlayer mediaplayer not present avformat_open_input failed, err:invalid data found when processing input...

Why the loss become NaN?

It is a great project. I am very interested in Transformer in Transformer model. I had use your model to train on Vehicle-1M dataset. Vehicle-1M is a fine graied visual...

pip install mayavi blocked at Building wheel for mayavi (setup.py)?

I installed mayavi on ubuntu docker used this command: pip install mayavi It gave me these logs: Collecting mayavi Using cached mayavi-4.7.3.tar.gz (7.1 MB) Requirement already satisfied: apptools in /usr/local/lib/python3.8/dist-packages...

How to use deep speed inference start MiniGPT-4 on low GPU memory hardware?

I had run the MiniGPT-4 on Nvidia T4 which has 16G memory. I could upload picture. But when I asked question about this picture it reported CUDA out of memory....

怎样在web_demo_hf.py中加载QLoRA微调后的权重？

我可以成功跑通QLoRA微调，使用python cli_demo.py --from_pretrained ... 形式也可以跑起微调后的模型。我想通过网络提供该服务，但是发现cli_demo.py中调用模型的方法，和web_demo_hf.py中的方法不一样，不能直接使用。进一步发现，cli_demo_hf.py中使用的方法，与cli_demo.py中的方法也是不一样的，直接套用也是会失败的。我们想问一下，怎样在web_demo_hf.py中加密微调后的模型？

支持采用ZeRO-Infinity技术使用内存和NVME硬盘来训练模型吗？

我现在手头只有一台A100 40G、128G内存、1T的NVME硬盘，官方说可以在8块A100 80G上训练，如果采用ZeRO-Infinity技术，我的这个机器应该也可以训练，请问我的这个硬件可以支持全参数训练吗？另外，想问一下，支持LoRA等PEFT微调方法吗？

Does it support deep speed ZeRO to offload parameters to CPU and NVMe ssd?

I had used ChatGLM-6B. I could use deep speed tech to offload the parameters to CPU and NVMe ssd. So I could finetune the model on a machine with only...

在预训练Pretrain时报no valid `self._rcvd_idx` is found错误

我根据Pretrain部分，尝试进行预训练，首先准备一个数据集文本文件，放在./datasets/zjkj/zjkj_wd.jsonl文件中，内容为： {"id": 1, "dataType": "百科", "title": "转睛科技的业务", "content": "上海转睛科技有限公司，是一家以视频分析为核心、专业从事自动驾驶车路协同及其关联技术的公司。"} 然后在data_utils/corpora.py中，定义新的语料库： ``` class ZjkjCorpus(PromptReader): PATH = "./datasets/zjkj" is_json = False reserve_punct = True split_row = True def process_line(self, item, tokenizer, tokenize):...

怎样使用DeepSpeed的offload功能降低显存占用？

我运行VisualGLM-6B的LoRA finetune时，由于显卡为16G显存，所以会报CUDA Out of Memory错误。我在命令行加入DeepSpeed配置文件： ``` gpt_options=" \ --experiment-name finetune-$MODEL_TYPE \ --model-parallel-size ${MP_SIZE} \ --mode finetune \ --train-iters 300 \ --resume-dataloader \ $MODEL_ARGS \ --train-data ${train_data} \ --valid-data ${eval_data} \...

The results are different from hellokan.ipynb

Hi. I had followed the instruction exactly. When I run the hellokan.ipynb it worked. But the result is different from the one in the original version. For example the prouned...