Yibin Li

Results 27 comments of Yibin Li

@Maxung the root cause is indeed the mismatch of your input data type (fp32) and onnx model input type (fp16). If the input data is a numpy array, polygraphy checks...

Need to update internal_cutlass_kernel libs.

> > Need to update internal_cutlass_kernel libs. > > @yibinl-nvidia is there mr for updating internal_cutlass_kernels? Yes, I will post a MR soon. I am still familiarizing myself with the...

@mikeiovine could you re-approve this PR? This is a mirror of the internal MR, with the minor changes on the internal_cutlass_kernel lib files. Thanks!

> Sorry for the delay! I've missed this in the move to Github. Looks good to me assuming there are only trivial changes compared to what I reviewed internally. Yes...