David Chiang

Results: 88 comments by David Chiang

I think I hear a consensus about (3) Don't copy errors from the PDF into the XML. Note that this can occasionally be a tough call: for example, I had...

@nschneid we previously discussed first initials at length in #245; @mjpost sorry to bring it up again. In the current situation (LREC and other conferences that use START), the full...

Do you want to further discuss how to get people to change their names in START? If not, we can close this issue.

I think we can pretty reliably restore accents now by scraping them from PDFs. What's the best way to use this -- to identify people to ask to update their...

I agree, it would be annoying for everyone involved to email individual people. So, we have an author-name scraper (https://github.com/acl-org/acl-anthology/blob/auto_accents/bin/auto_authors.py) that could be incorporated into `normalize_anth.py` and run as part...

I don’t have a CLSP account (I don’t think). But a local copy might be a good idea if we can figure out a way.

Can this file (as well as the mirroring script) become part of the repo?

There are also broken external PDF links for LREC.

I think papers should always have an explicit `` field and the absence of it should indicate that there isn't one. As for the whole-volume PDFs...maybe they should be automatically...

Unlike #590, I think this is more important to fix, because our BibTeX styles do not change case in author names. But getting the heuristics right could be tricky. I...