Results: 2 comments by Bar
The prioritization argument makes total sense. However, the error should be clearer to the user, probably NotImplementedError('cpu to gpu imports are not supported')... Also, consider the following scenario: resuming (weights...
@guyjacob Can we add a check on the output quantized model that verifies all tensors are indeed quantized as expected? If the check finds unexpected non-quantized tensors, the user should be warned.
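To make the request concrete, here is a rough sketch of what such a check might look like. It is not taken from the project: the function names `find_unquantized` and `warn_if_unquantized`, the `allowed_float` parameter, and the set of dtype names treated as "quantized" are all assumptions for illustration. It works on plain (name, dtype-name) pairs so it does not depend on any particular quantizer.

```python
import warnings
from typing import Iterable, List, Tuple

# Assumption: dtype names that count as "quantized". Adjust to the scheme in use.
QUANTIZED_DTYPES = frozenset({
    "torch.qint8", "torch.quint8", "torch.qint32",
    "torch.int8", "torch.uint8",
})


def find_unquantized(
    tensors: Iterable[Tuple[str, str]],
    allowed_float: Iterable[str] = (),
) -> List[str]:
    """Return names of tensors whose dtype name is not in QUANTIZED_DTYPES.

    `tensors` yields (tensor_name, dtype_name) pairs. Names listed in
    `allowed_float` (hypothetical parameter) are intentionally left
    unquantized and are skipped.
    """
    skip = set(allowed_float)
    return [
        name
        for name, dtype in tensors
        if name not in skip and dtype not in QUANTIZED_DTYPES
    ]


def warn_if_unquantized(
    tensors: Iterable[Tuple[str, str]],
    allowed_float: Iterable[str] = (),
) -> List[str]:
    """Warn the user if any unexpected non-quantized tensors are found."""
    offenders = find_unquantized(tensors, allowed_float)
    if offenders:
        warnings.warn(
            "Unexpected non-quantized tensors: " + ", ".join(offenders),
            UserWarning,
        )
    return offenders
```

With a PyTorch model, the pairs could be built with something like `((name, str(t.dtype)) for name, t in model.state_dict().items())`, assuming every entry in the state dict is a tensor. Warning instead of raising keeps the export usable while still flagging the problem, which matches the "user should be warned" wording above.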