dev icon indicating copy to clipboard operation
dev copied to clipboard

Revisit marginal status

Open bambooforest opened this issue 6 years ago • 1 comments

I'd like to revisit marginal status for each source. For example, we do have info on borrowing in SPA, although we don't add it:

https://github.com/phoible/dev/blob/master/scripts/aggregate-raw-data.R#L143-L144

and hence we could potentially infer marginal/borrowed segments in UPSID when the sources overlap (but we don't have to). See also issue #230 .

For UPSID we denote marginal with the anomalous flag in their raw data:

https://github.com/phoible/dev/blob/master/scripts/aggregate-raw-data.R#L158

but my understanding is that anomalous is a flag for when a segment occurred only once in the database. By that account I don't think we should mark them marginal.

In the other sources, we might also revisit whether an inventory should be marked all FALSE, e.g.

  • in EA some inventories have segments in parentheses, like (z) in Italian, and this means borrowed/marginal, but this is very rare
  • in AA and PH some languages have marginal marked, but then for the rest we just FALSE entire inventories (without knowing whether the source explicitly said that the language has no marginals)
  • in ER everything is marginal false, but we have data in SegBo that we could use to add the marginal phones to the ER inventories

bambooforest avatar Aug 18 '19 09:08 bambooforest

Browsing through the raw AA, I noticed that there are cases noted as marginal for segments only found in borrowed words

https://github.com/phoible/dev/blob/master/raw-data/AA/AA_inventories.tsv#L465

and less clear cases, such as:

https://github.com/phoible/dev/blob/master/raw-data/AA/AA_inventories.tsv#L4510

The issue of marginality is a gradient one, for example:

Jelaska, Z. and Machata, M. G. (2005). Prototypicality and the Concept Phoneme. Glossos, 6:1–13.

Should we consider marking known borrowings in addition to marginality? This could be a long term goal.

bambooforest avatar Jun 11 '20 12:06 bambooforest