tabby
tabby copied to clipboard
Consultation on code completion performance optimization issues.
The binary service I deployed in a Linux environment has a GPU utilization consistently below 30%, but there are frequent instances of particularly slow code completion. Does anyone have any methods to improve efficiency?
My machine configuration: V100 * 8.