VLM-Captioning-Tools icon indicating copy to clipboard operation
VLM-Captioning-Tools copied to clipboard

How fast should I expect this to be?

Open brandostrong opened this issue 1 year ago • 1 comments

Using the default settings and the cogvlm script, processing is quite slow. It's running on my 4090,taking up about 14.4gb, and taking about 10 seconds per image.

brandostrong avatar Aug 15 '24 02:08 brandostrong

@brandostrong Unfortunately you are limited by the VLM model being used. For the original CogVLM model your speed seems similar to what I got on a 3090, which was 8-9 seconds per image.

ProGamerGov avatar Aug 20 '24 15:08 ProGamerGov