transformers
transformers copied to clipboard
Add Wav2Vec2BertProcessorWithLM
Feature request
Wav2Vec2-Bert was open sourced and integrated with Transformers in the end of last year. However, it is missing an easy integration with pyctcdecode similar to Wav2Vec2ProcessorWithLM. This should be quite trivial to implement, since Wav2Vec2Processor is very similar to Wav2Vec2BertProcessor, the only difference being that they use different feature extractors.
Motivation
Having a Wav2Vec2BertProcessorWithLM class would make it possible to use Wav2Vec2-Bert with a kenlm model in a Transformers ASR pipeline.
Your contribution
I can submit a PR.
cc @sanchit-gandhi @ylacombe
Hey @FredHaa, #28706 should fix this, I'm reopening it! Note that you would have to use Wav2Vec2ProcessorWithLM
and not Wav2Vec2BertProcessorWithLM
!
#28706 has been merged, I'm closing the issue for now, feel free to ask questions