pengwa

Results 13 comments of pengwa

> Can you explain what config the user needs to set to enable this feature? From the code I gather user needs to set env variable ORTMODULE_ENABLE_MEMORY_ALLEVIATION to "Dropout:0,Gelu:1,Tile:0" is...

Thanks a lot @askhade @baijumeswani for the efforts to review!!!

Just as a FYI, I had an old PR (closed) here. https://github.com/microsoft/onnxruntime/pull/10980

> You can check the PR description, there are few reasons.

> @pengwa do you have a moment to comment on this change? Asking you since this issue occurred as a result of a CPU AdamW implementation commit. sorry I somehow...

As mentioned in some comments, inlined containers are not used in orttraining/orttraining/test/training_api/common/synthetic_data_loader.cc/.h and orttraining/orttraining/test/training_api/trainer/trainer.cc , for two reasons: 1. those tests are independent of ORT internal libs, so we should...

Thank @edgchen1 , @yuslepukhin a lot for the suggestions. Please let me know if you have more comments.

@edgchen1 @yuslepukhin I updated as suggested, let me know if there is more comment. Thanks!

Thanks everyone! @edgchen1 @yuslepukhin @baijumeswani. 💯

Will create a new specialized shape optimizer for this, to avoid any backward incompatibility.