client
client copied to clipboard
[WIP] LLaVA support
The goal of this MR is to enable measuring VLM throughput and latency where input includes images.