uaggan Using mask for D

Using mask for D

Open mohammadshahabuddin opened this issue 4 years ago • 0 comments

In paper, authors have mentioned that they have not used mask till 30 epochs for D. Also, they have not trained attention portion after 30 epochs. Have you used these conditions in codes?

Jun 02 '20 04:06 mohammadshahabuddin

uaggan uaggan copied to clipboard

Using mask for D

uaggan
uaggan copied to clipboard