siren icon indicating copy to clipboard operation
siren copied to clipboard

[Inconsistency with paper] I reproduced audio signal by RELU + MLP.

Open sh-lee-prml opened this issue 2 years ago • 1 comments

To compare SIREN layer with RELU +MLP, we implement two models.

  1. audio signal (B, T, 1) --> Linear(B, T, 128) +RELU --> Linear(B, T, 128) +RELU --> Linear(B, T, 128) +RELU --> Linear(B, T, 1) --> reproduced signal (B, T, 1)

  2. audio signal (B, T, 1) -->SIREN layer --> SIREN layer -->SIREN layer --> Linear(B, T, 1) --> reproduced signal (B, T, 1)

In your paper, RELU+MLP is not able to reproduce the audio signal, However, First model can reproduce audio signal even better than SIREN...

SIREN is also very instability so i used lower learning rate. But the loss fluctuated.

Could you explain why the siren is better than others in audio reproduction domain?

sh-lee-prml avatar Oct 08 '21 03:10 sh-lee-prml

Hi, I also doubt about this. Do you figure why this happen now?

1999kevin avatar Jun 13 '22 01:06 1999kevin