houliangxue

4 comments by houliangxue

> Setting torch_dtype to bfloat16 fixes it.

The config.json for Qwen1.5 already defaults to bfloat16, and it still runs out of GPU memory when the prompt is long.
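One likely reason bfloat16 alone does not help with long prompts is that the key/value cache grows linearly with the number of tokens, whatever dtype the weights use. Here is a rough sketch of that arithmetic. The function name `kv_cache_bytes` and every model dimension in it (32 layers, 32 KV heads, head size 128) are illustrative assumptions, not values read from any specific config.json.

```python
# Back-of-the-envelope KV-cache size for a decoder-only transformer.
# All model dimensions below are illustrative assumptions; substitute
# the values from the config.json of the model you actually load.

BYTES_PER_ELEMENT = {"float32": 4, "float16": 2, "bfloat16": 2}


def kv_cache_bytes(seq_len, num_layers=32, num_kv_heads=32, head_dim=128,
                   dtype="bfloat16", batch_size=1):
    """Estimate the bytes needed to cache keys and values for seq_len tokens."""
    # Each layer keeps one key tensor and one value tensor (the factor of 2).
    per_token = 2 * num_layers * num_kv_heads * head_dim * BYTES_PER_ELEMENT[dtype]
    return batch_size * seq_len * per_token


if __name__ == "__main__":
    for n_tokens in (512, 2048, 8192, 32768):
        gib = kv_cache_bytes(n_tokens) / 1024**3
        print(f"{n_tokens:>6} tokens -> {gib:.2f} GiB of KV cache in bfloat16")
```

Under these assumed dimensions the cache costs 0.5 MiB per token, so an 8192-token prompt needs about 4 GiB on top of the weights. This counts only the cache; activations and attention buffers add to it, so actual usage may be higher.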

![Image](https://github.com/user-attachments/assets/0dbe828c-fe50-407b-8723-c023060b1055)

The prompt is not the same as in 2; it needs 'You are a helpful assistant.'

    import sys
    import markdown
    from bs4 import BeautifulSoup
    from fastapi import FastAPI, HTTPException
    from fastapi.responses import StreamingResponse, Response
    import uvicorn
    from pydantic import BaseModel
    import torchaudio
    import io
    import os
    ...