morphodict
morphodict copied to clipboard
Show information about a word-form in pop-ups per each cell in paradigm
We might want to present multiple types of information for each paradigm layout cell. For instance, the following:
${wâpamêw}+V+TA+Ind+1Sg+2SgO:
- [x] kiwâpamitin: (1) surface word-form without morpheme boundaries
- [x] ki·wâpam·iti·n: (2) surface word-form with morpheme boundaries
- [ ] kit2·wâpam·i2ti·n: (3) underlying word-form with original morphemes and boundaries
- [ ] I see me, I witness me: (4) generated English translation of cell word-form
- [ ] 4: (5) corpus-frequency
- [x] (6) human recording
- [x] (7) generated robot recording
Originally posted by @aarppe in https://github.com/UAlbertaALTLab/morphodict/issues/397#issuecomment-667319602
For generating the underlying morphotactic representation, we would need to use the specially compiled FST:
hfst-lookup -q src/fst/lexicon.hfst
wâpamêw+V+TA+Ind+1Sg+2SgO kit2<wâpam>i2tin 0.000000
This underlying form would need to be mapped against the surface form - then the meaning/labeling of the morpheme can be unambiguously retrieved (from a relabeling file):
| 1 | 2 | 3 | 4 |
|---|---|---|---|
| kit2< | wâpam | >i2ti | >n |
| kit2 | wâpam | i2ti | n |
| ki< | wâpam | >iti | >n |
This expands from #1093 to cover information not only on morphemes.