acl-anthology icon indicating copy to clipboard operation
acl-anthology copied to clipboard

Bug report: DeepLo 2022 abstract in BibTeX and webpage

Open jonmay opened this issue 2 years ago • 5 comments

Issue description

All the abstracts in DeepLo 2022 BibTeX files and paper landing pages are simply the string "t". This was true for all of the handful of bib files I checked in deepLo and true for none of the conference/workshop files I checked in naacl 2022 but not in deeplo. I did not check exhaustively.

Additionally, the last two authors are combined in every paper in a way that makes them appear as a single author. E.g.

    title = "Pre-training Data Quality and Quantity for a Low-Resource Language: New Corpus and {BERT} Models for {M}altese",
    author = "Micallef, Kurt  and
      Gatt, Albert  and
      Tanti, Marc  and
      van der Plas and Claudia Borg, Lonneke",
    booktitle = "Proceedings of the Third Workshop on Deep Learning for Low-Resource Natural Language Processing",
    month = jul,
    year = "2022",
    address = "Hybrid",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.deeplo-1.10",
    pages = "90--101",
    abstract = "t",
}

This creates a final author with the last name van der Plas and Claudia Borg and the first name Lonneke. That in turn creates a new author page for this hybrid person: https://aclanthology.org/people/l/lonneke-van-der-plas-and-claudia-borg/

Steps to reproduce the issue

Open https://aclanthology.org/2022.deeplo-1.8.bib or https://aclanthology.org/2022.deeplo-1.8/ and compare to the actual abstract in https://aclanthology.org/2022.deeplo-1.8.pdf (can be repeated with other similar papers)

What's the expected result?

Abstract text should match that in the pdf. Author corrections should be separated.

What's the actual result?

Abstract text is simply "t". Two-headed authors have been created.

jonmay avatar Jul 20 '22 20:07 jonmay

@jonmay Thanks for the bug report. We are aware of the abstract issue and are tracking down the abstracts for DeepLo papers.

For the author names, I believe they are fixed in #2037 and #2049. I don't see other author names meshed together.

xinru1414 avatar Jul 21 '22 01:07 xinru1414

I don't see other author names meshed together.

Here's an example: https://aclanthology.org/2022.deeplo-1.14/

The mangled author profile: https://aclanthology.org/people/k/kelechi-ogueji-and-jimmy-lin/

lintool avatar Jul 25 '22 17:07 lintool

to echo jimmy, all of the deeplo 2022 papers with 2+ authors have this problem:

https://aclanthology.org/2022.deeplo-1.11/ <=> https://aclanthology.org/people/j/jonathan-may-and-heng-ji/ https://aclanthology.org/2022.deeplo-1.17/ <=> https://aclanthology.org/people/z/zhou-yu-and-samuel-r-bowman/ https://aclanthology.org/2022.deeplo-1.22/ <=> https://aclanthology.org/people/i/iman-jundi-and-gabriella-lapesa/

etc.

jonmay avatar Jul 25 '22 18:07 jonmay

Who was the DeepLo pub chair? This is the data that was delivered to us in the papers.yml file. Fix your metadata and we'll happily reingest.

mjpost avatar Jul 25 '22 22:07 mjpost

good thing three of the organizers are institutional colleagues! I'll bug them...

jonmay avatar Jul 25 '22 23:07 jonmay

Any updates here? This really just needs someone to bang this out manually, I bet it would take 20 minutes.

In the meantime, @xinru, for November corrections, can you please remove all <abstract> lines for 2022.deeplo-1?

mjpost avatar Nov 08 '22 21:11 mjpost

I had contacted some organizers back in july and they didn't respond and then i forgot. but now it looks like the authors are fixed. abstract problem is still there.

On Tue, Nov 8, 2022 at 1:20 PM Matt Post @.***> wrote:

Any updates here? This really just needs someone to bang this out manually, I bet it would take 20 minutes.

In the meantime, @xinru https://github.com/xinru, for November corrections, can you please remove all lines for 2022.deeplo-1?

— Reply to this email directly, view it on GitHub https://github.com/acl-org/acl-anthology/issues/2058#issuecomment-1307839998, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAFIMNIAMKXJZT7XKZ3TIB3WHK7ZZANCNFSM54FAIKZQ . You are receiving this because you were mentioned.Message ID: @.***>

-- "Je n’ai fait celle-ci plus longue que parce que je n’ai pas eu le loisir de la faire plus courte." -- Pascal

jonmay avatar Nov 08 '22 22:11 jonmay

I just did it manually.

mjpost avatar Nov 09 '22 01:11 mjpost