Ycros
Results
2
issues of
Ycros
# Behavior 65B models running with CUBLAS fully offloaded to the gpu break as prompts approach the model's max context size (the models I've tested with are 2048), they start...
Everything seems to work fine via the embedded klite interface, but when I pointed horde at it, it started throwing these: It seems to kinda sorta maybe still serve horde...