piper-phonemize
piper-phonemize copied to clipboard
Why does BOS have padding?
src/phoneme_ids.cpp
I have been reviewing the phonemes_to_ids function in src/phoneme_ids.cpp and noticed that padding (PAD) is inserted after the Beginning of Sentence (BOS) symbol (^). I'm curious about the reasoning behind this implementation choice.
If this decision was intentional, could you kindly clarify why padding is included after BOS? I'm interested to understand the rationale behind this practice.
Additionally, I noticed that padding is not inserted after the End of Sentence (EOS) symbol ($). I'm wondering if there's a specific reason why EOS does not have padding, whereas BOS does.
Thank you in advance for taking the time to provide clarification on this matter. I appreciate your insights into the design decisions behind the code.