Jun-Howie

Results 3 issues of Jun-Howie

python -m vllm.entrypoints.openai.api_server --host 0.0.0.0 --port 8000 --model Qwen1.5-0.5B-Chat --dtype=half
INFO 05-16 14:53:28 api_server.py:149] vLLM API server version 0.4.0.post1
INFO 05-16 14:53:28 api_server.py:150] args: Namespace(host='0.0.0.0', port=8000, uvicorn_log_level='info', allow_credentials=False, allowed_origins=['*'], allowed_methods=['*'],...

log
(computing) (aqlm) root@f9f90a551b02:~/xinglin-data/AQLM# bash train.sh
wandb: Using wandb-core as the SDK backend. Please refer to https://wandb.me/wandb-core for more information.
wandb: (1) Create a W&B account
wandb: (2) Use an...

Specifically, when vLLM is running, the format and values of the positions passed into the Qwen attention module differ between eager and CUDA execution. The Qwen2Attention module of the qwen2.py file...