ctc_decoders icon indicating copy to clipboard operation
ctc_decoders copied to clipboard

Baidu's CTC Decoders, including Greedy, Beam Search and Beam Search with KenLM Language Model

This beam search decoder implementation was originally taken from Baidu DeepSpech project: https://github.com/PaddlePaddle/DeepSpeech/tree/master/decoders/swig

Git commit: https://github.com/PaddlePaddle/DeepSpeech/commit/a76fc692123e97e5f183c3a00e30b7f2e2d3f07c

Copyright 2019 Baidu, Inc. Copyright 2019 NVIDIA Corporation

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Changes: modified: decoder_utils.cpp modified: setup.py modified: setup.sh modified: ctc_beam_search_decoder.cpp modified: ctc_beam_search_decoder.h renamed: swig_decoders.py -> ctc_decoders.py modified: ctc_decoders.py

Dependencies

# Ubuntu
sudo apt install libboost-all-dev swig sox
# Arch
sudo pacman -Syu boost swig sox

Install

# Run
./setup.sh

Usage

See example/decode.py for usage.

Beam Search LM Batch Beam Search LM