magic_zhang

Results 12 issues of magic_zhang

Fix https://github.com/NVIDIA/FasterTransformer/issues/790. Hi @byshiue, could you help me review it? Thank you.

### Branch/Tag/Commit
main

### Docker Image Version
nvcr.io/nvidia/pytorch:22.12-py3

### GPU name
A10

### CUDA Driver
535.54.03

### Reproduced Steps
```shell
1. docker run -ti --gpus all --rm nvcr.io/nvidia/pytorch:22.12-py3 bash
2....
```

bug