Daxiong

Results 12 issues of Daxiong

刚开始使用,下面的代码模型训练好像有问题 import m3tl from m3tl.preproc_decorator import preprocessing_fn from m3tl.params import Params from m3tl.special_tokens import TRAIN from m3tl.predefined_problems.ner_data import get_weibo_ner_fn params = m3tl.params.Params() for problem_type in params.list_available_problem_types(): print('`{problem_type}`: {desc}'.format( desc=params.problem_type_desc[problem_type], problem_type=problem_type))...

您好、是否也考虑支持一下bloom、结构上应该和llama差不多、但是bloom有比较多不同size的模型,更适合移动端的场景,可能能让这个项目更丰富

加载模型的时候报错:

### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction accelerate launch src/train_bash.py \ --stage pt \ --model_name_or_path $model_name_or_path \ --do_train \ --dataset $dataset...

pending

when i use the prune.sh and save model, i meet this problem:

i use your provided data and meet above problems: