exo
exo copied to clipboard
Update device_capabilities.py: Add GTX 1070, 1080; main.py: timeout 90->900
- Added few GPUs.
- Tuned timeout. On slow setups (~1 token per second) average response may take ~600-1000 tokens. In most cases it will lead to timeout (network error which is not). Fixing to reduce exceptions. Who looking for better performance and know what to do need adjust with a knowledge how it will impact. By default making it will work for most cases.