petals

Published 20 hours ago •

bigscience-workshop

Report average queue size in tokens (per last 10 min) for routing

Open borzunov opened this issue 11 months ago • 0 comments

This would make it possible to take server load into account when planning a route for inference and fine-tuning.
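One possible way to compute such a metric is sketched below. This is only an illustration, not existing Petals code: the class `QueueSizeTracker`, its methods, and the idea of sampling the queue periodically are all assumptions made for the example. It keeps the samples from the last 10 minutes and reports their plain mean.

```python
import time
from collections import deque
from typing import Callable, Deque, Tuple


class QueueSizeTracker:
    """Hypothetical helper: mean queue size (in tokens) over a sliding window."""

    def __init__(
        self,
        window_seconds: float = 600.0,  # 10 minutes, as in the issue title
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.window_seconds = window_seconds
        self._clock = clock
        self._samples: Deque[Tuple[float, int]] = deque()  # (timestamp, queued tokens)
        self._total = 0  # running sum of the samples currently in the window

    def record(self, queued_tokens: int) -> None:
        """Store the current queue size, measured in tokens."""
        now = self._clock()
        self._samples.append((now, queued_tokens))
        self._total += queued_tokens
        self._evict(now)

    def average(self) -> float:
        """Return the mean of the samples in the window, or 0.0 if there are none."""
        self._evict(self._clock())
        if not self._samples:
            return 0.0
        return self._total / len(self._samples)

    def _evict(self, now: float) -> None:
        # Drop samples that are older than the window.
        cutoff = now - self.window_seconds
        while self._samples and self._samples[0][0] < cutoff:
            _, tokens = self._samples.popleft()
            self._total -= tokens
```

A server could call `record()` at a fixed interval and publish `average()` alongside whatever else it announces, so that a client can penalize heavily loaded servers when choosing a route. How the value would actually be announced and weighted is left open here.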

Aug 07 '23 16:08 borzunov