aici
have aici_init() return max number of forks
vllm uses the max number of possible forks in a sequence group for scheduling
also, that max should be capped
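
rough sketch of the capping idea, in Python since the scheduling side is vllm. Everything here is an assumption for illustration: `HOST_MAX_FORKS`, `clamp_max_forks` and `seqs_to_reserve` are hypothetical names, not existing vllm or aici APIs, and the value 16 is arbitrary. It only shows clamping whatever fork count a controller reports from init before the scheduler budgets for it.

```python
# Hypothetical host-side cap; not a real vllm or aici setting.
HOST_MAX_FORKS = 16


def clamp_max_forks(reported, host_limit=HOST_MAX_FORKS):
    """Clamp the fork count a controller reports from its init call."""
    if host_limit < 1:
        raise ValueError("host_limit must be at least 1")
    # A controller reporting less than 1 still occupies one sequence.
    return max(1, min(reported, host_limit))


def seqs_to_reserve(reported_forks, host_limit=HOST_MAX_FORKS):
    """Worst-case number of sequences to budget for a batch of groups."""
    return sum(clamp_max_forks(n, host_limit) for n in reported_forks)
```

with the cap in place, a controller asking for an unreasonable number of forks cannot inflate the scheduler's worst-case budget beyond the host limit.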