badrisnps
Results
2
comments of
badrisnps
I'm not sure how common it is, absolutely not as common as single target classification. People tend to work around it by making a model per target, but that results...
THe automatic inference of max-batch-prefill-tokens during the warmup phase is exceeding the VRAM. There seems to be no easy way to control the automatic estimation of that.