deep_shf comments

Results 51 comments of


                                            deep_shf

mobileface test in MFR

> 做了一下调整可以跑通MFR测试，但是onnx_helper.py会报 > > ``` > 2023-03-31 18:13:30.296547356 [W:onnxruntime:, execution_frame.cc:835 VerifyOutputSizes] Expected shape from model of {1,512} does not match actual shape of {32,512} for output 1333 > err: max...

mobileface test in MFR

> > 做了一下调整可以跑通MFR测试，但是onnx_helper.py会报 > > ``` > > 2023-03-31 18:13:30.296547356 [W:onnxruntime:, execution_frame.cc:835 VerifyOutputSizes] Expected shape from model of {1,512} does not match actual shape of {32,512} for output 1333 >...

[BUG] <当我运行微调bash train.sh时报错>

> 请提供完整的报错信息和运行脚本这是运行脚本： PRE_SEQ_LEN=128 LR=2e-2 NUM_GPUS=1 torchrun --standalone --nnodes=1 --nproc-per-node=$NUM_GPUS main.py \ --do_train \ --train_file AdvertiseGen/train.json \ --validation_file AdvertiseGen/dev.json \ --preprocessing_num_workers 10 \ --prompt_column content \ --response_column summary \ --overwrite_cache...

你好老师请教您一个问题

> 老师这个代码怎么改成可以实时显示可以交流一下。

details

> 好的，这周日我整理一下自述文件。感谢大佬,期待开源细节,我会快速用自己的数据进行实验,到时候再来请教您

pt转onnx 的问题

> I've used a ready-made model by [Xiang Shin Wuu](https://github.com/xiang-wuu). You might take a look [here](https://blog.csdn.net/weixin_40293999/article/details/130977968) thanks, I got it done

huggingface的代码示例报错

import torch from modelscope import snapshot_download, AutoModel, AutoTokenizer torch.set_grad_enabled(False) # init model and tokenizer # model_dir = snapshot_download('Shanghai_AI_Laboratory/internlm-xcomposer2-vl-7b-4bit') model_dir = 'modelscope/internlm-xcomposer2-vl-7b' print(model_dir) # model_dir = 'Shanghai_AI_Laboratory/internlm-xcomposer2-vl-7b-4bit' print(model_dir) # ssss model...

huggingface的代码示例报错

> > 用这个modelscope的代码时，用int4版本会出现缺少配置文件的错误 OSError: /home/shf/.cache/modelscope/hub/Shanghai_AI_Laboratory/internlm-xcomposer2-vl-7b-4bit does not appear to have a file named config.json. Checkout 'https://huggingface.co//home/shf/.cache/modelscope/hub/Shanghai_AI_Laboratory/internlm-xcomposer2-vl-7b-4bit/None' for available files. > > @panzhang0212 作者能解答一下吗稀里糊涂的就成功了我用了transformer的推理也用了modelscope的推理都成了

Complete model

> Thanks for sharing! I'd like to do some research based on this library. Compared with the original paper, what modules do we need to complete this model, in addition...