pengwa
pengwa
> Can you explain what config the user needs to set to enable this feature? From the code I gather user needs to set env variable ORTMODULE_ENABLE_MEMORY_ALLEVIATION to "Dropout:0,Gelu:1,Tile:0" is...
Thanks a lot @askhade @baijumeswani for the efforts to review!!!
Just as a FYI, I had an old PR (closed) here. https://github.com/microsoft/onnxruntime/pull/10980
> You can check the PR description, there are few reasons.
> @pengwa do you have a moment to comment on this change? Asking you since this issue occurred as a result of a CPU AdamW implementation commit. sorry I somehow...
As mentioned in some comments, inlined containers are not used in orttraining/orttraining/test/training_api/common/synthetic_data_loader.cc/.h and orttraining/orttraining/test/training_api/trainer/trainer.cc , for two reasons: 1. those tests are independent of ORT internal libs, so we should...
Thank @edgchen1 , @yuslepukhin a lot for the suggestions. Please let me know if you have more comments.
@edgchen1 @yuslepukhin I updated as suggested, let me know if there is more comment. Thanks!
Thanks everyone! @edgchen1 @yuslepukhin @baijumeswani. 💯
Will create a new specialized shape optimizer for this, to avoid any backward incompatibility.