Dongfang Li

Results 4 issues of Dongfang Li

In other words, how many possibilities to convert this package code into java? THANK YOU.

Good work~ I wonder if there is a release plan. thx.

I run the code, but only got 90+ tflops. INFO train.py:317 in record_current_batch_training_metrics -- tflops=93.48098385143103,step=9,loss=7.502509117126465,tgs (tokens/gpu/second)=2104.89,lr=2.2e-06,loss_scale=65536.0,grad_norm=20.60409540743281,micro_num=4,num_consumed_tokens=2621440,inf_nan_skip_batches=0,num_samples_in_batch=13,largest_length=2048,largest_batch=4,smallest_batch=3,adam_beta2=0.95,fwd_bwd_time=6.15