carbonz

Results 2 issues of carbonz

I found `gradient_accumulation_batch_size` exists in several scibert conf, such as https://github.com/allenai/scibert/blob/8562a120e6788dcbadbe05ef7fd4463dee17ee59/allennlp_config/ner.json but allennlp trainer doesn't have this param, https://github.com/allenai/allennlp/blob/master/allennlp/training/trainer.py

类似这样的 https://github.com/tatsu-lab/stanford_alpaca/blob/main/alpaca_data.json