Cuauhtémoc Daniel Suárez Ramírez

Results 2 comments of Cuauhtémoc Daniel Suárez Ramírez

Just wild guessing here but I think that changing the dtype of the tensor should do the work.

The purpose is that the "forward" value is going to be the binarized weight (binary_weights_no_grad) while the value for obtaining the gradient (the "backward" value) is the clamped weight (cliped_weights)....