ProxylessNAS-Pytorch
ProxylessNAS-Pytorch copied to clipboard
Implement Latency Loss
@kairos03 Now I have full time on it , and i notice the LatencyLoss Class already implement a part of Latencyloss. I have some point confused?
- how to get the latency of every op, I need to implement it, right?
- I need define λ2 hyper-parameters in LatencyLoss Class, right?
- Finally, I need make the Class Differentiable.
- latency of every op. use
https://github.com/kairos03/ProxylessNAS-Pytorch/blob/master/proxylessnas/latency.csv
file. The file is official latency result inhttps://hanlab.mit.edu/files/proxylessNAS/LatencyTools/
. - no, I think we add new loss and λ in below code in
train_search.py
. https://github.com/kairos03/ProxylessNAS-Pytorch/blob/c87b233aaffb9e38329cbb7d4fc5f5398b1312a8/proxylessnas/train_search.py#L84 - no, You dont need that, just calculate loss. thx.
Thank you for your prompt reply @kairos03 From your answer, i got some point:
- The latency of every op is fixed.
- Add the weight decay term and LatencyLoss to CrossEntropyLoss
- I mean the Latency loss can be differentiable, we will not make it differentiable manual.
Also Confused:
- This Latency implement is not complete? https://github.com/kairos03/ProxylessNAS-Pytorch/blob/master/proxylessnas/latencyloss.py#L70
Yes, not completed. You can change whole class.
Thx
what is the difference of pixel _trim and mobile_trim? @kairos03
I dont know that.. So, I Just choose mobile_trim. @xieydd
@kairos03 I have writed one version of latency loss, and i am training(WIP) the model. But it is very slow, it is normal?
yes training is very slow
@kairos03 One Question: Darts need Augment, and the Cell`s node number is fixed, and the Graph is stacked by the Cell. And ProxylessNAS no need Augment, What are your thoughts on this?
I didn't under stand your question. you mean why ProxylessNAS are not using stacking the Cells?
@kairos03 Now, i am clear; Now the problem is accuracy is very low and train very slowly.