acl-anthology
Bug report: DeepLo 2022 abstract in BibTeX and webpage
Issue description
All the abstracts in the DeepLo 2022 BibTeX files and paper landing pages are simply the string "t". This was true for every one of the handful of bib files I checked in DeepLo, and for none of the files I checked from other NAACL 2022 conference/workshop volumes. I did not check exhaustively.
Additionally, the last two authors are combined in every paper in a way that makes them appear as a single author. E.g.
title = "Pre-training Data Quality and Quantity for a Low-Resource Language: New Corpus and {BERT} Models for {M}altese",
author = "Micallef, Kurt and
Gatt, Albert and
Tanti, Marc and
van der Plas and Claudia Borg, Lonneke",
booktitle = "Proceedings of the Third Workshop on Deep Learning for Low-Resource Natural Language Processing",
month = jul,
year = "2022",
address = "Hybrid",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2022.deeplo-1.10",
pages = "90--101",
abstract = "t",
}
This creates a final author with the last name "van der Plas and Claudia Borg" and the first name "Lonneke". That in turn creates a new author page for this hybrid person: https://aclanthology.org/people/l/lonneke-van-der-plas-and-claudia-borg/
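To illustrate how a fused name ends up in the generated BibTeX, here is a minimal sketch. It assumes authors are stored as (first, last) pairs and serialized as "Last, First" joined by "and", which matches the entry above. The function names `format_bibtex_authors` and `find_fused_authors` are hypothetical and are not part of the Anthology code base.

```python
def format_bibtex_authors(authors):
    """Serialize (first, last) pairs as a BibTeX author string.

    Assumption: this mirrors the "Last, First and ..." layout seen in the
    entry above; it is not the Anthology's actual export code.
    """
    return " and\n".join(f"{last}, {first}" for first, last in authors)


def find_fused_authors(authors):
    """Return the (first, last) pairs whose name parts contain ' and '.

    A name part containing the BibTeX separator is a strong hint that two
    people were merged into a single author record.
    """
    return [
        (first, last)
        for first, last in authors
        if " and " in first or " and " in last
    ]


if __name__ == "__main__":
    authors = [
        ("Kurt", "Micallef"),
        ("Albert", "Gatt"),
        ("Marc", "Tanti"),
        ("Lonneke", "van der Plas and Claudia Borg"),
    ]
    print(format_bibtex_authors(authors))
    print(find_fused_authors(authors))
```

A check like this only catches names fused with a literal " and "; other kinds of mangling would need a different heuristic.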
Steps to reproduce the issue
Open https://aclanthology.org/2022.deeplo-1.8.bib or https://aclanthology.org/2022.deeplo-1.8/ and compare to the actual abstract in https://aclanthology.org/2022.deeplo-1.8.pdf (can be repeated with other similar papers)
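The first half of this check can be partly automated once a .bib file has been saved locally. The sketch below flags entries whose abstract is suspiciously short. It assumes each entry ends with a line containing only `}` and that fields are written as `name = "value"`, as in the entry above. The function name `find_placeholder_abstracts` and the `min_chars` threshold are illustrative choices, not anything defined by the Anthology.

```python
import re

# Simple field patterns; they assume double-quoted values with no embedded quotes.
URL_RE = re.compile(r'\burl\s*=\s*"([^"]*)"')
ABSTRACT_RE = re.compile(r'\babstract\s*=\s*"([^"]*)"')


def find_placeholder_abstracts(bibtex_text, min_chars=20):
    """Return (url, abstract) pairs for entries with very short abstracts."""
    flagged = []
    # Split on lines that contain only a closing brace, i.e. the end of an entry.
    for entry in re.split(r"^\}\s*$", bibtex_text, flags=re.MULTILINE):
        abstract = ABSTRACT_RE.search(entry)
        if abstract is None:
            continue  # entries without an abstract field are not flagged
        text = abstract.group(1).strip()
        if len(text) < min_chars:
            url = URL_RE.search(entry)
            flagged.append((url.group(1) if url else None, text))
    return flagged
```

This does not replace comparing against the PDF; it only narrows down which entries are worth opening.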
What's the expected result?
Abstract text should match that in the PDF. The fused authors should be separated into individual authors.
What's the actual result?
Abstract text is simply "t". Two-headed authors have been created.
@jonmay Thanks for the bug report. We are aware of the abstract issue and are tracking down the abstracts for DeepLo papers.
For the author names, I believe they are fixed in #2037 and #2049. I don't see other author names meshed together.
> I don't see other author names meshed together.
Here's an example: https://aclanthology.org/2022.deeplo-1.14/
The mangled author profile: https://aclanthology.org/people/k/kelechi-ogueji-and-jimmy-lin/
to echo jimmy, all of the deeplo 2022 papers with 2+ authors have this problem:
https://aclanthology.org/2022.deeplo-1.11/ <=> https://aclanthology.org/people/j/jonathan-may-and-heng-ji/
https://aclanthology.org/2022.deeplo-1.17/ <=> https://aclanthology.org/people/z/zhou-yu-and-samuel-r-bowman/
https://aclanthology.org/2022.deeplo-1.22/ <=> https://aclanthology.org/people/i/iman-jundi-and-gabriella-lapesa/
etc.
Who was the DeepLo pub chair? This is the data that was delivered to us in the papers.yml file. Fix your metadata and we'll happily reingest.
good thing three of the organizers are institutional colleagues! I'll bug them...
Any updates here? This really just needs someone to bang this out manually, I bet it would take 20 minutes.
In the meantime, @xinru, for November corrections, can you please remove all <abstract> lines for 2022.deeplo-1?
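As a rough sketch of what removing the abstracts could look like, the function below strips every `<abstract>` element from an XML string using Python's standard library. The element layout used in the demo (`<volume>`, `<paper>`, `<title>`) is an assumption for illustration, and `strip_abstracts` is a hypothetical helper. The real Anthology files and tooling may differ, and ElementTree does not preserve the original formatting or comments.

```python
import xml.etree.ElementTree as ET


def strip_abstracts(xml_text):
    """Return xml_text with all <abstract> elements removed."""
    root = ET.fromstring(xml_text)
    # ElementTree has no parent pointers, so visit each element and
    # remove any direct <abstract> children from it.
    for parent in root.iter():
        for child in list(parent.findall("abstract")):
            parent.remove(child)
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    demo = (
        "<volume>"
        "<paper id='1'><title>A</title><abstract>t</abstract></paper>"
        "<paper id='2'><title>B</title><abstract>t</abstract></paper>"
        "</volume>"
    )
    print(strip_abstracts(demo))
```

For a handful of files, deleting the lines by hand in an editor achieves the same thing and keeps the diff minimal.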
I had contacted some organizers back in July, but they didn't respond and then I forgot. It now looks like the authors are fixed, but the abstract problem is still there.
On Tue, Nov 8, 2022 at 1:20 PM Matt Post wrote:
> Any updates here? This really just needs someone to bang this out manually, I bet it would take 20 minutes.
> In the meantime, @xinru, for November corrections, can you please remove all <abstract> lines for 2022.deeplo-1?
I just did it manually.