Varun Parthasarathy
@douxiaotian what kind of results did you get? I'm currently trying out a cyclical learning rate with SGD, but it'll take a while to finish training. I'm planning to try...
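For reference, the cyclical schedule I'm trying is basically Leslie Smith's triangular policy. A minimal sketch (the `base_lr`/`max_lr` bounds here are placeholders; in practice I'd pick them from a range test):

```python
def triangular_lr(step, step_size=2000, base_lr=1e-4, max_lr=1e-1):
    # One full cycle lasts 2 * step_size steps: the learning rate ramps
    # linearly from base_lr up to max_lr, then back down to base_lr.
    cycle = step // (2 * step_size)
    x = abs(step / step_size - 2 * cycle - 1)  # goes 1 -> 0 -> 1 within a cycle
    return base_lr + (max_lr - base_lr) * max(0.0, 1 - x)
```

Each training step then just sets the optimizer's learning rate to `triangular_lr(step)` before the update.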
Yeah, I think so too. I'm currently downloading the Deepglint dataset (Cleaned MS-Celeb + Asian Celeb; ~7 million images, ~180,000 identities) - my previous experiment with SGD failed miserably. I'll...
@xlphs your results seem promising! Just to clarify, what dataset did you finetune on? Also, have you tried training from scratch at any point?
@kifaw the idea of validation is to see how well the model generalizes on data it hasn't seen before, so if the overlap is still present, then there will be...
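One quick sanity check before training is to just intersect the identity labels of the two splits. A sketch, assuming `train_ids` and `val_ids` are iterables of identity labels (e.g. folder names in an MS-Celeb-style directory layout):

```python
def identity_overlap(train_ids, val_ids):
    # Any identity present in both splits inflates validation numbers,
    # since the model has effectively seen those faces during training.
    overlap = set(train_ids) & set(val_ids)
    if overlap:
        print(f"{len(overlap)} identities leak into validation, "
              f"e.g. {sorted(overlap)[:5]}")
    return overlap
```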
I ran a learning rate range test a while back; the results are interesting. Does this mean larger learning rates would perform well? Can someone clarify this? This...
@kifaw that's something I unfortunately don't understand myself. I'm running some more range tests right now using the FaceNet triplet selection method, but I find it strange that the learning...
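For anyone curious, my reading of the FaceNet-style semi-hard selection is roughly the sketch below (NumPy, not the repo's actual code; `embeddings` and `labels` are assumed to be arrays for one batch, and the paper selects negatives with d(a,p) < d(a,n) < d(a,p) + margin):

```python
import numpy as np

def semi_hard_triplets(embeddings, labels, margin=0.2):
    # Pairwise Euclidean distances between all embeddings in the batch.
    dists = np.linalg.norm(embeddings[:, None] - embeddings[None, :], axis=-1)
    triplets = []
    for a in range(len(labels)):
        positives = np.where(labels == labels[a])[0]
        negatives = np.where(labels != labels[a])[0]
        for p in positives:
            if p == a:
                continue
            d_ap = dists[a, p]
            # Semi-hard: farther than the positive, but within the margin.
            mask = (dists[a, negatives] > d_ap) & (dists[a, negatives] < d_ap + margin)
            candidates = negatives[mask]
            if len(candidates):
                n = candidates[np.argmin(dists[a, candidates])]  # hardest of them
                triplets.append((a, p, n))
    return triplets
```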
I guess there were some issues with the range test (I didn't run it for long enough). I ran it for about 20000 steps and got a more reasonable range...
@neklom the range test essentially involves slowly _increasing_ the learning rate over time, while tracking loss vs. learning rate. At a certain value of the learning rate, loss falls drastically...
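Concretely, the loop looks something like this (a sketch; `train_step` is a hypothetical function that runs one training batch at the given learning rate and returns the loss):

```python
import math

def lr_range_test(train_step, lr_min=1e-6, lr_max=1.0, num_steps=20000):
    # Sweep the learning rate geometrically from lr_min to lr_max,
    # recording (lr, loss) pairs. The usable range runs from where the
    # loss first starts dropping steeply to just before it blows up.
    growth = (lr_max / lr_min) ** (1.0 / num_steps)
    lr, best, history = lr_min, math.inf, []
    for _ in range(num_steps):
        loss = train_step(lr)
        history.append((lr, loss))
        best = min(best, loss)
        if loss > 4 * best:  # loss diverged; stop early
            break
        lr *= growth
    return history
```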
@xlphs From my experience, training seems to become unstable once accuracy crosses 0.9 - the validation rate starts fluctuating wildly between 0.2 and 0.5. I generally stop training at this...
Training from scratch with triplet loss gives an accuracy of about 92.5% (similar to OpenFace), while validation tends to vary between 35% and 40%, even after 800k iterations. I guess...