Candle vs. PyTorch performance
I'm comparing the Candle LLaVA example (https://github.com/huggingface/candle/tree/main/candle-examples/examples/llava) against https://github.com/fpgaminer/joycaption/blob/main/scripts/batch-caption.py on an M1 Mac.
I'm seeing a significant performance difference: Candle seems much slower, even with the accelerate and metal features enabled.
I would love some pointers on how to improve it.