misska1

Results 3 issues of misska1

While checking the code, I found that _get_feed_dict() is not implemented.

Does BigScience also provide the original BLOOM checkpoints (without conversion to Hugging Face 🤗)? I am working on fine-tuning BLOOM (6.3B, 2.5B, 1.3B) and I need those checkpoint files. [issues/315](https://github.com/bigscience-workshop/Megatron-DeepSpeed/issues/315) In [https://github.com/bigscience-workshop/bigscience/tree/master/train/tr1-13B-base](https://github.com/bigscience-workshop/bigscience/tree/master/train/tr1-13B-base), I...

When I ran this [https://github.com/bigscience-workshop/bigscience/blob/master/train/tr11-176B-ml/smaller_models/tr11f-6B3-ml-continuation.slurm](https://github.com/bigscience-workshop/bigscience/blob/master/train/tr11-176B-ml/smaller_models/tr11f-6B3-ml-continuation.slurm) script to continue training the model, I got a strange grad norm and a huge loss after the loss scale was automatically reduced following an overflow. Just like I am...