xFasterTransformer icon indicating copy to clipboard operation
xFasterTransformer copied to clipboard

Results 47 xFasterTransformer issues
Sort by recently updated
recently updated
newest added

Master and slaves should both run according to the following workflow: ```Python while True: model.set_input_cb() model.forward_cb() model.free_seqs() ```

enhancement
continuous batching

```bash # weight only FP16 (input FP32, weight FP16, output FP32) [INFO] First token time: 148.062 ms [INFO] Second token time: 48.3581 ms [INFO] Final output is: ============================================== Once upon...

enhancement

![image](https://github.com/intel/xFasterTransformer/assets/52310069/cd1493de-8a6c-4a9f-83a4-c0ca83166f11)

build