RNN-Time-series-Anomaly-Detection
RNN-Time-series-Anomaly-Detection copied to clipboard
training loss function
Hi Could you explain the three training loss you defined, free running, teacher forcing, and simplified professor force? or point out related papers? Thanks
A useful blog post about 'free running' (using outputs as inputs) & 'teacher forcing' : post
Professor forcing paper: NIPS 2016
Simplified professor forcing is my idea which is a simplified version of professor forcing. While processor forcing requires a discriminator, simplified processor forcing does not require it.