SQ comments

Results 5 comments of

SQ

wx group for share dataset

@TpBlair just mailed you

Will exo support public internet access and service provider payment systems in the future?

hi @dengbuqi @larson-carter, I am also interested in unified idle GPUs into a public network and am just transferred from Petals to exo. May I ask if exo supports GPU...

复现了，但DeepSeek-R1-Q4_K_M跑起来速度非常慢，只有约1.5token/s，请问是我配置的原因么？

> 我2块A800 80G gpu ，72核cpu，可以把cpu用满，但是显卡利用利只有10%，都是用cpu在计算了，怎么把显卡性能用起来，显卡还有90%性能没用上，只用了10G显存，怎调？各位有办法吗？ > > 用这个参数会快一些: --optimize_rule_path ./ktransformers/optimize/optimize_rules/DeepSeek-V3-Chat-multi-gpu.yaml > > prompt eval count: 60 token(s) prompt eval duration: 3.1178791522979736s prompt eval rate: 19.243850408948063 tokens/s eval count: 3673...

Request failed. - Benjamin

I also met this issue. I started a private swarm with two GPU servers and built this web chat on another CPU node.

Request failed. - Benjamin

@lipere123 Hi, may I ask if you have figured this out?