GGBond8488
GGBond8488
### Is there an existing issue for this? - [X] I have searched the existing issues and checked the recent builds/commits ### What happened? when I run webui --api to...
I have a batch processing task that will read batch dot files. Reading the first dot file is normal. When reading the second file, an error will be reported. However,...
### PR types Others ### PR changes Others ### Describe ...
### PR Category Others ### PR Types Others ### Description pcard-81841 A template demonstrating how to test higher-order autodiff.
### System Info NVIDIA-H100 ### Who can help? @kaiyux ### Information - [X] The official example scripts - [ ] My own modified scripts ### Tasks - [X] An officially...
like the https://github.com/microsoft/mscclpp
such as https://github.com/vllm-project/vllm/pull/2809 and https://github.com/LLMServe/DistServe that had done reference:https://arxiv.org/pdf/2311.18677
deploy with the IFB, when user input the paylod as follow: ``` { "text_input": str(question), "max_tokens": 512, "bad_words": "", "stop_words": stop_words, "pad_id": pad_id, "end_id": end_id, "top_p": 1, "id": "ggbond_test", "temperature":...
### PR types ### PR changes ### Description
**Is your feature request related to a problem? Please describe.** I have some special feature want to apply in triton and I don't want to modify the origin code directly,...