DLeeeeeee
Results
2
comments of
DLeeeeeee
I found that there are several BN layers whose variance contain negative number after converting. I am quite sure it is the reason to cause NaN output but I don't...
I guess I found the reason. That is because SKA is not a standard conv. it takes in kernel weights in shape (B, C, K**2, H, W), normal conv kernel...