DilatedRNN icon indicating copy to clipboard operation
DilatedRNN copied to clipboard

Classification to regression

Open cgebe opened this issue 7 years ago • 3 comments

I have trained the DRNN on a classification task which overfits after about 12k steps with a hidden size of 512 and batch size of 1024. As soon as i adjust it to a regression task (basically changing the loss function to mse, 1 output dim), it takes very long to converge. Did someone make similiar experiences on regression tasks?

cgebe avatar Oct 30 '18 18:10 cgebe

I have not tried on a regression task. Maybe you want to first to identify whether the problem comes from the model or the data? I mean if other models can easily overfit your regression data? If yes, then maybe you want to adjust the learning rate of DRNN and see if the problem comes from there. If other models cannot easily overfit the data, then maybe you might need to consider to further process the data or reformulate your task. Thanks.

code-terminator avatar Oct 30 '18 18:10 code-terminator

Yes, i keep investigating, probably give transformer/multimodel a try - will report back. Thank you

cgebe avatar Oct 30 '18 18:10 cgebe

Hi @cgebe , have you made some progress using dilated rnn for regression. I'm trying migrating the code to tensorflow 2.0 but has not succeeded yet. Hope to hear from you

RidhwanAmin avatar Jun 23 '22 02:06 RidhwanAmin