candle
feat: add flash_forward and flag to toggle
This PR adds flash attention to the whisper example. It can be run with the following command:
cargo run --example whisper --release --features flash-attn -- --use-flash-attn
*Note: to enable flash attention, candle must be built with the flash-attn feature and the --use-flash-attn flag must be set.
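The two conditions in the note (a compile-time feature plus a runtime flag) can be illustrated with a minimal sketch. This is not the code from this PR: AttentionKind and select_attention are hypothetical names introduced only for illustration, and returning an error when the flag is set without the feature is an assumption about reasonable behavior.

```rust
/// Hypothetical enum for illustration: which attention path gets used.
#[derive(Debug, PartialEq)]
enum AttentionKind {
    Standard,
    Flash,
}

/// Hypothetical helper: combine the runtime flag with the compile-time feature.
/// `use_flash_attn` stands for the --use-flash-attn flag.
/// `compiled_with_flash` stands for whether the flash-attn feature was enabled
/// at build time (for example, the value of `cfg!(feature = "flash-attn")`).
fn select_attention(
    use_flash_attn: bool,
    compiled_with_flash: bool,
) -> Result<AttentionKind, String> {
    match (use_flash_attn, compiled_with_flash) {
        // Flag not set: always use the standard path, whatever the build.
        (false, _) => Ok(AttentionKind::Standard),
        // Flag set and feature compiled in: flash attention is available.
        (true, true) => Ok(AttentionKind::Flash),
        // Flag set but feature missing: assumed here to be an error.
        (true, false) => Err(
            "--use-flash-attn requires building with --features flash-attn".to_string(),
        ),
    }
}
```

A caller would pass the parsed flag as the first argument and cfg!(feature = "flash-attn") as the second, so that both conditions are checked in one place.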
@LaurentMazare what is missing to merge this PR?