Cuauhtémoc Daniel Suárez Ramírez
Results
2
comments of
Cuauhtémoc Daniel Suárez Ramírez
Just wild guessing here but I think that changing the dtype of the tensor should do the work.
The purpose is that the "forward" value is going to be the binarized weight (binary_weights_no_grad) while the value for obtaining the gradient (the "backward" value) is the clamped weight (cliped_weights)....