[TorchAcc][Experimental] Integrate more models in TorchAcc

Open Zhikaiiii opened this issue 2 months ago • 0 comments

PR type

Previous PR: https://github.com/modelscope/swift/pull/647

Integrate patch functions for more models in TorchAcc.
Support reporting speed metrics measured after a number of warmup steps (TorchAcc spends time on compilation at the beginning of training, which lowers the overall average).
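
To illustrate the second item, here is a minimal sketch of how throughput can be measured both over the whole run and after warmup only. It is not the code in this PR: the class name `WarmupThroughput`, its parameters, and the metric keys are hypothetical and chosen only for this example.

```python
import time


class WarmupThroughput:
    """Hypothetical helper: tracks samples/s overall and after warmup steps."""

    def __init__(self, warmup_steps, clock=time.perf_counter):
        self.warmup_steps = warmup_steps
        self.clock = clock
        self.start = clock()
        # With no warmup, the post-warmup window is the whole run.
        self.warm_start = self.start if warmup_steps == 0 else None
        self.steps = 0
        self.samples = 0
        self.warm_samples = 0

    def step(self, batch_size):
        """Call once after each training step has finished."""
        self.steps += 1
        self.samples += batch_size
        if self.steps > self.warmup_steps:
            # Only steps after warmup count toward the warm throughput.
            self.warm_samples += batch_size
        elif self.steps == self.warmup_steps:
            # The last warmup step just ended: start the post-warmup timer.
            self.warm_start = self.clock()

    def metrics(self):
        now = self.clock()
        out = {"train_samples_per_second": self.samples / (now - self.start)}
        if self.warm_start is not None and self.warm_samples > 0:
            out["train_samples_per_second_after_warmup"] = (
                self.warm_samples / (now - self.warm_start)
            )
        return out
```

In a training loop, `step(batch_size)` would be called after every step and `metrics()` at the end. With a lazily executing backend, the timestamps are only meaningful if they are taken after the device work for the step has actually completed.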

Paste your experiment results here (if needed).

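We have tested several models with TorchAcc and Swift. In the tables below, the multiplier in parentheses is the speedup relative to the Swift baseline in the same table.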

method	train_sample/s	train_sample/s after warmup
torchacc + 2fsdp	9.859(1.82x)	11.896(2.19x)
swift + 2ddp	5.431	-

method	train_sample/s	train_sample/s after warmup
torchacc + 4fsdp	2.349	2.978(1.24x)
swift + 2ddp + 2mp	2.411	2.411

method	train_sample/s	train_sample/s after warmup
torchacc + 2ddp	9.216(1.13x)	10.243(1.26x)
swift + 2ddp	8.126	-

method	train_sample/s	train_sample/s after warmup
torchacc + 2ddp	5.076(1.03x)	5.376(1.08x)
swift + 2ddp	4.944	-

Apr 11 '24 06:04 Zhikaiiii