ProxylessNAS-Pytorch Implement Latency Loss

Aug 23 '19 05:08 xieydd

@kairos03 Now I have full time on it , and i notice the LatencyLoss Class already implement a part of Latencyloss. I have some point confused?

how to get the latency of every op, I need to implement it, right?
I need define λ2 hyper-parameters in LatencyLoss Class, right?
Finally, I need make the Class Differentiable.

Sep 04 '19 02:09 xieydd

latency of every op. use https://github.com/kairos03/ProxylessNAS-Pytorch/blob/master/proxylessnas/latency.csv file. The file is official latency result in https://hanlab.mit.edu/files/proxylessNAS/LatencyTools/.
no, I think we add new loss and λ in below code in train_search.py. https://github.com/kairos03/ProxylessNAS-Pytorch/blob/c87b233aaffb9e38329cbb7d4fc5f5398b1312a8/proxylessnas/train_search.py#L84
no, You dont need that, just calculate loss. thx.

Sep 04 '19 06:09 kairos03

Thank you for your prompt reply @kairos03 From your answer, i got some point:

The latency of every op is fixed.
Add the weight decay term and LatencyLoss to CrossEntropyLoss
I mean the Latency loss can be differentiable, we will not make it differentiable manual.

Also Confused:

This Latency implement is not complete? https://github.com/kairos03/ProxylessNAS-Pytorch/blob/master/proxylessnas/latencyloss.py#L70

Sep 04 '19 08:09 xieydd

Yes, not completed. You can change whole class.

Sep 04 '19 08:09 kairos03

Thx

Sep 04 '19 08:09 xieydd

what is the difference of pixel _trim and mobile_trim? @kairos03

Sep 05 '19 02:09 xieydd

I dont know that.. So, I Just choose mobile_trim. @xieydd

Sep 06 '19 01:09 kairos03

@kairos03 I have writed one version of latency loss, and i am training(WIP) the model. But it is very slow, it is normal?

Sep 07 '19 13:09 xieydd

yes training is very slow

Sep 08 '19 03:09 kairos03

@kairos03 One Question: Darts need Augment, and the Cell`s node number is fixed, and the Graph is stacked by the Cell. And ProxylessNAS no need Augment, What are your thoughts on this？

Sep 09 '19 03:09 xieydd

I didn't under stand your question. you mean why ProxylessNAS are not using stacking the Cells?

Sep 30 '19 01:09 kairos03

@kairos03 Now, i am clear; Now the problem is accuracy is very low and train very slowly.

Sep 30 '19 03:09 xieydd

ProxylessNAS-Pytorch ProxylessNAS-Pytorch copied to clipboard

Implement Latency Loss

ProxylessNAS-Pytorch
ProxylessNAS-Pytorch copied to clipboard