trans-inr
trans-inr copied to clipboard
test with SIREN
Nice job! I notice that you still use ReLU MLP with PE. A MLP architecture named SIREN, which replaces the ReLU activation to sine activation, has a better reprenentation ability. Have you even tested SIREN as your hyponet?