过拟合

Results 15 issues of 过拟合

there are embedding.npy , uid2index.pkl, word_dict.pkl, nrms.yaml in MINDSMALL_utils.zip , Can you share the code to generate these files?

bug

在 convert_examples_to_features 函数中,代码对每个词进行token分词处理对时候,并没有让label更新 使得label来对应每个分词过后对token 只是在最后通过while循环来对齐 但是实际上token和label 已经不是一一对应的关系了 感觉这里应该是有问题的

when I run example code: ```python accelerate launch imdb_sentiment_classification.py ``` after run some epoch get error info ```python INFO:tez.callbacks.early_stopping:EarlyStopping counter: 4/5 [train] accuracy=0.9915, loss=0.0269 [valid] accuracy=0.8953, loss=0.4287 [e=5 steps=2112] 30%|████████████████████████████████▍...

### System Info transformers version 4.7 , pytorch2.0, python3.9 run the example code in document of transformers ```shell rm -r /tmp/test-clm; CUDA_VISIBLE_DEVICES=0,1 \ python -m torch.distributed.launch --nproc_per_node 2 examples/pytorch/language-modeling/run_clm.py \...

when I run code bash training_scripts/single_node/run_1.3b.sh , meet error ```shell ninja: no work to do. Loading extension module fused_adam... Time to load fused_adam op: 0.0961456298828125 seconds Loading extension module fused_adam......

bug
deespeed chat

**Describe the bug** run example code bash training_scripts/single_node/run_1.3b.sh get error **To Reproduce** bash training_scripts/single_node/run_1.3b.sh **Expected behavior** A clear and concise description of what you expected to happen. **ds_report output** Please...

bug
deepspeed-chat

### System Info ```Shell ubuntu 20.04 cuda 11.7 torch 2.0 python 3.8 accelerate 0.19.0.dev0 deepspeed 0.9.2 ``` ### Information - [X] The official example scripts - [ ] My own...

你好 想知道这批数据的幽默度等级是如何标注的?

hi, Thanks for your open source code. I want load albert model, but it seems there are. not vaocab.txt and merges.txt in albert model. how can I load it use...