poet Evaluation of PoET in distributed training mode

Evaluation of PoET in distributed training mode

Open tgjantos opened this issue 1 year ago • 0 comments

Reported by @3bsamad in #10

Currently when training PoET in distributed training mode, it seems that the evaluation is only based on the data used by GPU 1, i.e. 1/n of the dataset. Possible solution might be using Hugging Face Accelerate.

Jul 04 '23 11:07 tgjantos

poet poet copied to clipboard

Evaluation of PoET in distributed training mode

poet
poet copied to clipboard