alpaca-lora
alpaca-lora copied to clipboard
simple playground share
trafficstars
not sure when I am going to shutdown, but I will leave this for few hours at least (running on RTX5000). maybe I will put things up in GKE later for tester purpose
NOTE: didn't do anything about maintaining the context yet
https://notebooksa.jarvislabs.ai/P1lDk5ziArYf6hVUkcne1vVlbwica44Ux7zNWyAeq-c69p-j0D1_ktPMmBKniGk8/
any tips to speed up the inference speed?
@deep-diver which model is this playground running? also what hardware?
it runs the model shared by @tloen on 7 core CPU / 32GB with a single RTX5000. I am hosting it in jarvislabs.ai