Tomaž Erjavec

Results 19 issues of Tomaž Erjavec

A user of our corpora v2.1 has noticed that the GB metadata for all years, except 2015, gives all parties as Opposition. I had a look at the GB root...

bug

For PoS tagged corpora (no)SkE requires the positional attribute [tag](https://www.sketchengine.eu/my_keywords/tag/). But ParlaMint has the attributes (UD) "pos" and "feats", so, e.g. trying to compute the frequency of preset "KWIC tags"...

bug

ParlaMint-PT.xml (and ParlaMint-PT.ana) has: https://github.com/clarin-eric/ParlaMint/blob/5d55e6a0a63e6b6ce5047142df6236dbfd2ecae4/Samples/ParlaMint-PT/ParlaMint-PT.xml#L111 we now have common taxonomies, so it should be `````` @matyaskopp, strange that your factorise script didn't change that. Or maybe it was never...

bug

This issue discusses the unresolved problems from #202 and #204. The current encoding of USAS in TEI is given in the [guidelines](https://clarin-eric.github.io/ParlaMint/#sec-semantics), which is arguably ok, even though other possibilities...

bug
enhancement

@tungland, in preparing the release and modifying some scripts I noticed the situation where you distinguish meetings of the lower, upper, and unicameral "houses", which has some problems. First,...

bug

The current CZ corpus has about 25 persons without the `` element, even though in Czech it is easy to determine based on the person's forename. Examples: AndrejDanko, DanielHavlik, EvaZamrazilova,...

bug

While doing MT, Taja noticed that quite a lot of the text in the FI transcriptions is in fact in Swedish but is not marked as such. This is esp....

bug

In the BE corpus there are 723 paragraphs (segments) that have `@xml:lang="en"`, even though they - at least the ones I've checked - are not in fact in English - they...

bug

In ParlaMint I the GB corpus had ministers marked up, but they disappeared in ParlaMint II (i.e. ParlaMint-GB in 4.1 does not mark ministers). However, there is the TSV file...

bug