Matrix-Capsules-EM-Tensorflow
Matrix-Capsules-EM-Tensorflow copied to clipboard
Softmax as logistic function
Hi, I have two questions:
- After training mnist, the testing it gets me average accurage of 0.3, any ideas why?
- Activations are updated by calculating the cost and applying a logistic function, why is a softmax used instead of sigmoid?
Hi,
- Basically, you can use the hyper parameters given in the readme file for similar performance on mnist. Other parameters may lead to failure in training.
- Sigmoid function can give a similar performance according to our experiments. Softmax here just theoretically ensures a distribution over capsules.