Show information about a word-form in a pop-up for each paradigm cell

Open aarppe opened this issue 3 years ago • 2 comments

We might want to present multiple types of information for each paradigm layout cell. For instance, the following:

${wâpamêw}+V+TA+Ind+1Sg+2SgO:

  • [x] kiwâpamitin: (1) surface word-form without morpheme boundaries
  • [x] ki·wâpam·iti·n: (2) surface word-form with morpheme boundaries
  • [ ] kit2·wâpam·i2ti·n: (3) underlying word-form with original morphemes and boundaries
  • [ ] I see you, I witness you: (4) generated English translation of cell word-form
  • [ ] 4: (5) corpus-frequency
  • [x] (6) human recording
  • [x] (7) generated robot recording
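
One way to picture the list above is as a single record per paradigm cell, with the items that are not yet available left empty. The following is only a minimal sketch: the class name `CellInfo` and all of its field names are hypothetical and do not come from the morphodict codebase.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class CellInfo:
    """Hypothetical container for what a paradigm-cell pop-up could show."""

    analysis: str                                   # e.g. wâpamêw+V+TA+Ind+1Sg+2SgO
    surface: str                                    # (1) without morpheme boundaries
    surface_segmented: str                          # (2) with morpheme boundaries
    underlying_segmented: Optional[str] = None      # (3) original morphemes
    translation: Optional[str] = None               # (4) generated English
    corpus_frequency: Optional[int] = None          # (5)
    human_recording_url: Optional[str] = None       # (6)
    synthetic_recording_url: Optional[str] = None   # (7)

    def available_fields(self) -> List[str]:
        """Names of the fields that currently hold a value, in declaration order."""
        return [name for name, value in vars(self).items() if value is not None]


cell = CellInfo(
    analysis="wâpamêw+V+TA+Ind+1Sg+2SgO",
    surface="kiwâpamitin",
    surface_segmented="ki·wâpam·iti·n",
)
print(cell.available_fields())
```

Optional fields let the pop-up show whatever is already available and skip the rest.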

Originally posted by @aarppe in https://github.com/UAlbertaALTLab/morphodict/issues/397#issuecomment-667319602

aarppe, May 30 '22 22:05

For generating the underlying morphotactic representation, we would need to use the specially compiled FST:

hfst-lookup -q src/fst/lexicon.hfst
wâpamêw+V+TA+Ind+1Sg+2SgO	kit2<wâpam>i2tin	0.000000
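
If this lookup were scripted, the tab-separated output could be read back roughly as follows. This is a sketch only: the names `Lookup` and `parse_hfst_lookup` are made up for illustration, and it assumes the three-column format shown above (input, output, weight), with unrecognized inputs reported with a trailing `+?`.

```python
from typing import List, NamedTuple


class Lookup(NamedTuple):
    analysis: str   # the string that was looked up
    form: str       # the form returned by the FST
    weight: float


def parse_hfst_lookup(output: str) -> List[Lookup]:
    """Parse tab-separated hfst-lookup output into (analysis, form, weight) records."""
    results = []
    for line in output.splitlines():
        parts = line.strip().split("\t")
        if len(parts) != 3:
            continue  # skip blank lines and anything not in three columns
        analysis, form, weight = parts
        if form.endswith("+?"):
            continue  # assumption: this suffix marks input the FST did not recognize
        results.append(Lookup(analysis, form, float(weight)))
    return results


sample = "wâpamêw+V+TA+Ind+1Sg+2SgO\tkit2<wâpam>i2tin\t0.000000\n"
for record in parse_hfst_lookup(sample):
    print(record.analysis, "->", record.form)
```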

This underlying form would then need to be aligned with the surface form, morpheme by morpheme, so that the meaning/label of each morpheme can be retrieved unambiguously (from a relabeling file):

1      2      3      4
kit2<  wâpam  >i2ti  >n
kit2   wâpam  i2ti   n
ki<    wâpam  >iti   >n
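
The position-by-position mapping in the table could be sketched like this. It assumes both forms arrive already segmented with the same number of boundaries, and that `<`, `>` and `·` are the only boundary markers; the function names are hypothetical, and the relabeling-file lookup itself is not shown.

```python
import re
from typing import List, Tuple

# Assumption: these are the only morpheme-boundary markers in either form.
BOUNDARY = re.compile(r"[<>·]")


def split_morphemes(segmented: str) -> List[str]:
    """Split a segmented form into its morphemes, dropping boundary markers."""
    return [m for m in BOUNDARY.split(segmented.replace(" ", "")) if m]


def align(underlying: str, surface: str) -> List[Tuple[int, str, str]]:
    """Pair underlying and surface morphemes by position (1-based)."""
    u = split_morphemes(underlying)
    s = split_morphemes(surface)
    if len(u) != len(s):
        raise ValueError(f"morpheme counts differ: {u} vs {s}")
    return [(i, a, b) for i, (a, b) in enumerate(zip(u, s), start=1)]


for position, under, surf in align("kit2< wâpam >i2ti >n", "ki< wâpam >iti >n"):
    print(position, under, surf)
```

Each underlying morpheme (e.g. `kit2`, `i2ti`) then serves as the key for the relabeling file, while the paired surface morpheme is what gets displayed. Raising an error on a count mismatch is deliberate: a silent misalignment would attach the wrong label to a morpheme.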

aarppe, Jun 01 '22 03:06

This expands on #1093 to cover more than just information on morphemes.

aarppe, Jun 01 '22 22:06