Startup error after deploying the model: main: couldn't bind HTTP server socket, hostname: 0.0.0.0, port: 8080
Hi, I deployed the model following your tutorial, but running the script file fails with this error:

ggml_cuda_init: GGML_CUDA_FORCE_MMQ: no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no
ggml_cuda_init: found 1 CUDA devices:
Device 0: NVIDIA GeForce RTX 4080, compute capability 8.9, VMM: yes
build: 3923 (becfd387) with MSVC 19.29.30154.0 for x64
system info: n_threads = 14, n_threads_batch = 14, total_threads = 20
system_info: n_threads = 14 (n_threads_batch = 14) / 20 | AVX = 1 | AVX_VNNI = 0 | AVX2 = 1 | AVX512 = 0 | AVX512_VBMI = 0 | AVX512_VNNI = 0 | AVX512_BF16 = 0 | FMA = 1 | NEON = 0 | SVE = 0 | ARM_FMA = 0 | F16C = 1 | FP16_VA = 0 | RISCV_VECT = 0 | WASM_SIMD = 0 | BLAS = 1 | SSE3 = 1 | SSSE3 = 1 | VSX = 0 | MATMUL_INT8 = 0 | LLAMAFILE = 1 |
main: couldn't bind HTTP server socket, hostname: 127.0.0.1, port: 8080

Is this a GPU problem, a network problem, or something else? My network goes through a Clash proxy. Do you know how to fix this?
Solved: after changing the port to 7980, the server starts successfully.
Right, the port was already in use by another application.
For anyone unsure where to specify the port: in 00_Core.bat, add --port <port number> after .\llama\llama-server.exe. This argument is not passed by default.
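
For anyone who wants to confirm the "port already in use" diagnosis before editing the .bat file, here is a minimal Python sketch. It is not part of the tutorial or of llama.cpp. The helper names port_is_free and first_free_port are made up for illustration, and the candidate ports are just the two mentioned in this thread. It tries to bind a TCP socket to the address, which is roughly what the server does at startup, and treats a failed bind as "in use" (a bind can also fail for other reasons, such as missing permissions).

```python
import socket


def port_is_free(host: str, port: int) -> bool:
    """Return True if a TCP socket can be bound to (host, port) right now.

    Hypothetical helper for illustration; any OSError from bind() is
    reported as "not free".
    """
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            sock.bind((host, port))
        except OSError:
            return False
    return True


def first_free_port(host: str, candidates):
    """Return the first candidate port that can be bound, or None."""
    for port in candidates:
        if port_is_free(host, port):
            return port
    return None


if __name__ == "__main__":
    host = "127.0.0.1"
    # 8080 is the port from the error message, 7980 the one that worked above.
    for port in (8080, 7980):
        state = "free" if port_is_free(host, port) else "in use"
        print(f"{host}:{port} is {state}")
```

If 8080 shows as in use, pick a port that shows as free and pass it with --port as described above. The check is only a snapshot: another program could still take the port between the check and the server start.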