exo

Update device_capabilities.py: Add GTX 1070, 1080; main.py: timeout 90->900

Open FFAMax opened this issue 4 months ago • 2 comments

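  1. Added a few GPUs (GTX 1070 and GTX 1080) to device_capabilities.py.
  2. Raised the timeout in main.py from 90 to 900. On slow setups (~1 token per second), an average response of ~600-1000 tokens takes about 600-1000 seconds to generate, so with the old value most requests end in a timeout that is reported as a network error even though the network is fine. The higher value reduces these exceptions. Anyone who wants better performance and knows what they are doing can lower it, as long as they understand how that will affect slow setups. The new default should work for most cases.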
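
To make the arithmetic behind the timeout change concrete, here is a minimal sketch. It assumes the timeout is measured in seconds and that generation speed is constant; the function names are hypothetical and are not taken from main.py.

```python
# Rough estimate of whether a response finishes before the timeout.
# Assumptions (not from the exo code base): the timeout is in seconds,
# and tokens are generated at a constant rate.

def estimated_response_seconds(tokens: int, tokens_per_second: float) -> float:
    """Return the approximate time needed to generate `tokens` tokens."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return tokens / tokens_per_second


def would_time_out(tokens: int, tokens_per_second: float, timeout_seconds: float) -> bool:
    """Return True if the estimated generation time exceeds the timeout."""
    return estimated_response_seconds(tokens, tokens_per_second) > timeout_seconds


OLD_TIMEOUT = 90   # value before this change
NEW_TIMEOUT = 900  # value after this change

if __name__ == "__main__":
    for tokens in (600, 1000):
        for timeout in (OLD_TIMEOUT, NEW_TIMEOUT):
            status = "times out" if would_time_out(tokens, 1.0, timeout) else "completes"
            print(f"{tokens} tokens at 1 tok/s, timeout {timeout}: {status}")
```

Under these assumptions a 600-token response fails with the old value and completes with the new one, while a 1000-token response at 1 token per second would still exceed 900, so very slow setups may need an even larger value.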

FFAMax, Oct 28 '24 04:10