squirrelfish

Results: 6 comments by squirrelfish

I think this PR can also be applied to deepseek-r1 if we append "{% if ns.is_last_user %}{% if enable_thinking is defined and enable_thinking is false %}{{'\n\n\n\n'}}{% endif %}{% endif %}"...

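I noticed that the coordinate normalization in the sample code divides by 1000 in some places and by 999 in others. Which one is correct, or do different scenarios call for different divisors?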

It seems that data is returned in developer mode but not on the web page. I don't know what the difference is.

Since this model is less than 200GB in size, using 8 GPU cards would be a waste. How can we run it with only 4 cards?

> Yes since docker image 20250723:
>
> # For A100/A800/H100/H800/H20/H200 (80G x 8):
> docker run -it --rm --ipc=host --net=host --shm-size=8g --ulimit memlock=-1 \
> --ulimit stack=67108864 --gpus=all -v...

> Got it. Please re-pull the image to skip the downloading procedure:
>
> docker pull tutelgroup/deepseek-671b:a100x8-chat-20250723

Thank you. It works very well, but there are still some minor issues....