FastChat: Unable to launch the OpenAI API [Vicuna-7B]. Error log: Using pad_token, but it is not set yet.

Could you suggest some ways to debug this? By the way, inference with this model works fine through the FastChat CLI.

Error Log:
2023-06-13 15:19:44 | INFO | model_worker | Loading the model vicuna-7b on worker 005f53 ...
Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s]
Loading checkpoint shards: 50%|█████     | 1/2 [00:08<00:08, 8.92s/it]
Loading checkpoint shards: 100%|██████████| 2/2 [00:12<00:00, 5.54s/it]
Loading checkpoint shards: 100%|██████████| 2/2 [00:12<00:00, 6.05s/it]
2023-06-13 15:19:57 | ERROR | stderr | Using pad_token, but it is not set yet.
2023-06-13 15:20:03 | INFO | model_worker | Register to controller
2023-06-13 15:20:03 | ERROR | stderr | ╭──────────────── Traceback (most recent call last) ────────────────╮
2023-06-13 15:20:03 | ERROR | stderr | │ /mnt/lustre/duanhaodong/anaconda3/envs/mm2/lib/python3.8/runpy.py:194 in _run_module_as_main │
2023-06-13 15:20:03 | ERROR | stderr | │ │
2023-06-13 15:20:03 | ERROR | stderr | │ 191 │ main_globals = sys.modules["__main__"].__dict__ │
2023-06-13 15:20:03 | ERROR | stderr | │ 192 │ if alter_argv: │
2023-06-13 15:20:03 | ERROR | stderr | │ 193 │ │ sys.argv[0] = mod_spec.origin │
2023-06-13 15:20:03 | ERROR | stderr | │ ❱ 194 │ return _run_code(code, main_globals, None, │
2023-06-13 15:20:03 | ERROR | stderr | │ 195 │ │ │ │ │ "__main__", mod_spec) │
2023-06-13 15:20:03 | ERROR | stderr | │ 196 │
2023-06-13 15:20:03 | ERROR | stderr | │ 197 def run_module(mod_name, init_globals=None, │
2023-06-13 15:20:03 | ERROR | stderr | │ │
2023-06-13 15:20:03 | ERROR | stderr | │ /mnt/lustre/duanhaodong/anaconda3/envs/mm2/lib/python3.8/runpy.py:87 in _run_code │
2023-06-13 15:20:03 | ERROR | stderr | │ │
2023-06-13 15:20:03 | ERROR | stderr | │ 84 │ │ │ │ │ loader = loader, │
2023-06-13 15:20:03 | ERROR | stderr | │ 85 │ │ │ │ │ package = pkg_name, │
2023-06-13 15:20:03 | ERROR | stderr | │ 86 │ │ │ │ │ spec = mod_spec) │
2023-06-13 15:20:03 | ERROR | stderr | │ ❱ 87 │ exec(code, run_globals) │
2023-06-13 15:20:03 | ERROR | stderr | │ 88 │ return run_globals │
2023-06-13 15:20:03 | ERROR | stderr | │ 89 │
2023-06-13 15:20:03 | ERROR | stderr | │ 90 def _run_module_code(code, init_globals=None, │
2023-06-13 15:20:03 | ERROR | stderr | │ │
2023-06-13 15:20:03 | ERROR | stderr | │ /mnt/lustre/duanhaodong/anaconda3/envs/mm2/lib/python3.8/site-packages/fastchat/serve/model_work │
2023-06-13 15:20:03 | ERROR | stderr | │ er.py:414 in <module> │
2023-06-13 15:20:03 | ERROR | stderr | │ │
2023-06-13 15:20:03 | ERROR | stderr | │ 411 │ │ │ ) │
2023-06-13 15:20:03 | ERROR | stderr | │ 412 │ │ os.environ["CUDA_VISIBLE_DEVICES"] = args.gpus │
2023-06-13 15:20:03 | ERROR | stderr | │ 413 │
2023-06-13 15:20:03 | ERROR | stderr | │ ❱ 414 │ worker = ModelWorker( │
2023-06-13 15:20:03 | ERROR | stderr | │ 415 │ │ args.controller_address, │
2023-06-13 15:20:03 | ERROR | stderr | │ 416 │ │ args.worker_address, │
2023-06-13 15:20:03 | ERROR | stderr | │ 417 │ │ worker_id, │
2023-06-13 15:20:03 | ERROR | stderr | │ │
2023-06-13 15:20:03 | ERROR | stderr | │ /mnt/lustre/duanhaodong/anaconda3/envs/mm2/lib/python3.8/site-packages/fastchat/serve/model_work │
2023-06-13 15:20:03 | ERROR | stderr | │ er.py:103 in __init__ │
2023-06-13 15:20:03 | ERROR | stderr | │ │
2023-06-13 15:20:03 | ERROR | stderr | │ 100 │ │ │ self.generate_stream_func = generate_stream │
2023-06-13 15:20:03 | ERROR | stderr | │ 101 │
2023-06-13 15:20:03 | ERROR | stderr | │ 102 │ │ if not no_register: │
2023-06-13 15:20:03 | ERROR | stderr | │ ❱ 103 │ │ │ self.register_to_controller() │
2023-06-13 15:20:03 | ERROR | stderr | │ 104 │ │ │ self.heart_beat_thread = threading.Thread( │
2023-06-13 15:20:03 | ERROR | stderr | │ 105 │ │ │ │ target=heart_beat_worker, args=(self,) │
2023-06-13 15:20:03 | ERROR | stderr | │ 106 │ │ │ ) │
2023-06-13 15:20:03 | ERROR | stderr | │ │
2023-06-13 15:20:03 | ERROR | stderr | │ /mnt/lustre/duanhaodong/anaconda3/envs/mm2/lib/python3.8/site-packages/fastchat/serve/model_work │
2023-06-13 15:20:03 | ERROR | stderr | │ er.py:119 in register_to_controller │
2023-06-13 15:20:03 | ERROR | stderr | │ │
2023-06-13 15:20:03 | ERROR | stderr | │ 116 │ │ │ "worker_status": self.get_status(), │
2023-06-13 15:20:03 | ERROR | stderr | │ 117 │ │ } │
2023-06-13 15:20:03 | ERROR | stderr | │ 118 │ │ r = requests.post(url, json=data) │
2023-06-13 15:20:03 | ERROR | stderr | │ ❱ 119 │ │ assert r.status_code == 200 │
2023-06-13 15:20:03 | ERROR | stderr | │ 120 │
2023-06-13 15:20:03 | ERROR | stderr | │ 121 │ def send_heart_beat(self): │
2023-06-13 15:20:03 | ERROR | stderr | │ 122 │ │ logger.info( │
2023-06-13 15:20:03 | ERROR | stderr | ╰───────────────────────────────────────────────────────────────────╯
2023-06-13 15:20:03 | ERROR | stderr | AssertionError

Jun 13 '23 07:06 kennymckormick

@kennymckormick Please download the v1.1 weights here. The old weights had no eos_token.

Jun 14 '23 12:06 andy-yang-1
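For anyone who wants to check a local copy of the weights before re-downloading, here is a minimal sketch. It assumes the weights directory follows the usual Hugging Face layout with a `special_tokens_map.json` file; the function name `missing_special_tokens` and the list of expected token names are illustrative choices, not something taken from FastChat.

```python
import json
from pathlib import Path

# Token names to look for; this particular list is an assumption for illustration.
EXPECTED_TOKENS = ("bos_token", "eos_token", "unk_token", "pad_token")


def missing_special_tokens(model_dir, expected=EXPECTED_TOKENS):
    """Return the expected special-token names that are absent or empty
    in <model_dir>/special_tokens_map.json."""
    path = Path(model_dir) / "special_tokens_map.json"
    if not path.is_file():
        # No map file at all: report every expected token as missing.
        return list(expected)
    with path.open(encoding="utf-8") as f:
        mapping = json.load(f)
    missing = []
    for name in expected:
        value = mapping.get(name)
        # An entry can be a plain string or a dict with a "content" field.
        if isinstance(value, dict):
            value = value.get("content")
        if not value:
            missing.append(name)
    return missing
```

Calling `missing_special_tokens("/path/to/vicuna-7b")` (path is a placeholder) lists what the file does not define. This only inspects the file on disk; it does not tell you whether a missing entry is what stops the worker from starting, since the traceback above ends in the registration assertion rather than in the tokenizer.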

> @kennymckormick Please download the v1.1 weights here. The old weights had no eos_token.

I have the same problem even though I downloaded the v1.1 weights.

Jun 15 '23 02:06 Haifengtao

> I have the same problem even though I downloaded the v1.1 weights.

The v1.1 weights have changed several times. You can delete your local copy and download them again.

Jun 15 '23 02:06 andy-yang-1

@Haifengtao @kennymckormick I encountered the same issue. Could you please tell me how you resolved it? Thank you! :)

Jun 20 '23 01:06 Jacob-yen

I'm going to try the latest v1.1 weights and will post the result once it's ready. One more thing: with the v1.3 weights I still get an error. In register_to_controller, the POST request returns a 503.

Jun 21 '23 04:06 kennymckormick
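The bare `assert r.status_code == 200` in the traceback hides which status came back, so the log shows only `AssertionError`. A minimal debugging sketch that surfaces the status and response body instead is below. The helper names `post_json` and `ensure_ok` are hypothetical, and it uses only the standard library rather than `requests`; the URL and payload you pass in are whatever your worker would send, which is not shown in full in the log.

```python
import json
import urllib.error
import urllib.request


def post_json(url, payload, timeout=5.0):
    """POST payload as JSON and return (status_code, body_text).

    HTTP error responses such as 503 are returned, not raised.
    """
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status, response.read().decode("utf-8", "replace")
    except urllib.error.HTTPError as err:
        # urllib raises on 4xx/5xx; keep the code and body for inspection.
        return err.code, err.read().decode("utf-8", "replace")


def ensure_ok(status_code, body, url):
    """Raise an informative error unless the status code is 200."""
    if status_code != 200:
        raise RuntimeError(
            f"POST {url} returned HTTP {status_code}: {body[:200]!r}"
        )
```

Running `ensure_ok(*post_json(url, data), url)` against the controller address should show whether the 503 comes with a response body that hints at where it originated.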

@kennymckormick Thank you for your prompt reply. After replacing every localhost with my real IP, I could successfully run the server and the controller and connect to them with HTTP requests (using Postman). However, I am still encountering some issues when using the OpenAI API.

Jun 21 '23 05:06 Jacob-yen
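The workaround described above (swapping localhost for the machine's real IP) can be sketched as a small URL rewrite. The helper name `replace_localhost`, the example IP, and the port are made up for illustration; only URLs whose host is literally `localhost` are changed, and any user:password part of the URL is not preserved.

```python
from urllib.parse import urlsplit, urlunsplit


def replace_localhost(url, host_ip):
    """Return url with a 'localhost' host swapped for host_ip.

    URLs with any other host are returned unchanged.
    """
    parts = urlsplit(url)
    if parts.hostname != "localhost":
        return url
    # Keep the port if one was given.
    netloc = host_ip if parts.port is None else f"{host_ip}:{parts.port}"
    return urlunsplit(parts._replace(netloc=netloc))


# Example with a placeholder address (192.0.2.10 is a documentation-only IP).
print(replace_localhost("http://localhost:21001", "192.0.2.10"))
```

The rewritten values would then go wherever the addresses are configured, for example the `args.controller_address` and `args.worker_address` arguments visible in the traceback.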

@Jacob-yen what is the issue you see?

Nov 03 '23 12:11 surak
