Jiangtao lv

Results 1 issues of Jiangtao lv

# issue When testing the linear API provided by NVIDIA's transformer engine (with FP8 precision) on an L20 device, I found that its speed is significantly slower than PyTorch's built-in...