Matt Post
Matt Post
The `joint.yaml` file for EMNLP 2021 [lists the workshops in one order](https://github.com/acl-org/acl-anthology/blob/5eb632d5eccb3cf380b9ede08c60e4dadc414c6b/data/yaml/joint.yaml#L363), but they are [displayed in another order](https://aclanthology.org/events/emnlp-2021/). Where this matters is that it would be nice if Findings...
We are missing [FSMNLP proceedings](https://www.aclweb.org/anthology/venues/fsmnlp/) #2 through #8 (1999–2010). https://www.aclweb.org/anthology/venues/fsmnlp/ It would be great to find physical proceedings and scan them for ingestion.
Instantiating the `Anthology` class in the Python API takes 30 seconds or so, while all the files are loaded and parsed. This is inconvenient when testing. It would be nice...
We currently have no author index: https://aclanthology.org/people/ It would be nice to build a nice index, say paged by first initial of last name.
The [check-build workflow](https://github.com/acl-org/acl-anthology/blob/master/.github/workflows/check-build.yml) should only run `make check site` for remove PRs. For local PRs, we could just run `make check`, since the complete check will come from the preview.
We should surface the ingestion date on volume and paper pages, e.g., https://www.aclweb.org/anthology/volumes/2021.eacl-main/
The primary purpose of the Anthology to this point has been as an archive for the research output of the ACL community and related venues who have decided to contribute...
Builds complain with the following errors. Can we: 1. Manually fix them 2. (Time-permitting) update the ingestion code to handle them better? (This is less important and I don't want...
The scan quality is pretty poor (e.g., https://www.aclweb.org/anthology/W89-0218.pdf). It applies to [most of the papers in the proceedings](https://www.aclweb.org/anthology/W89-02).