badrisnps

Results 2 comments of badrisnps

I'm not sure how common it is, but it is certainly not as common as single-target classification. People tend to work around it by making a model per target, but that results...

The automatic inference of max-batch-prefill-tokens during the warmup phase exceeds the available VRAM. There seems to be no easy way to control that automatic estimation.