Marcel Bollmann
Marcel Bollmann
I feel a quick, site-internal metadata search is somewhat complementary to a dense retrieval system like you're describing. I'm really mainly talking about the former here, as I think the...
Ping @GuyAglionby, wrapping URLs in `` tags might be another good addition if you have the time :)
Thank you @brunoenten! Installing and importing the library works like a charm on my current system, but both my hardware and OS have changed since I last tried this, and...
Minimal working example: ```python from lxml import etree filename = "/home/bollmann/repos/acl-anthology-py/tests/toy_anthology/xml/2022.acl.xml" for event, element in etree.iterparse(filename): if element.tag == "paper": pass ``` Inspecting an attribute of the returned `element` seems...
The easiest way to do this would be by volume. Make an attachment in the volume's meta block, and it should show up on the same page as the full...
Generating a tar-ball "on demand" would require some dynamic server-side component, if I'm not mistaken; and the build chain is agnostic to the actual PDFs on the server. So doing...
> 2. Only the first three papers have a "bib" link. Expected in the preview. > Also, the acronym of the conference is "PROPOR", and not "PROPER" as in the...
We can fix (2) in our XML (since there's a mismatch with the PDF) and add (1) as a name variant to (3), so that all papers will show up...
@nnkennard But just to double-check: (1) would have "Nayak" as the _last_ name, (3) would have "Nayak" as a _first_ name part. Is that correct? I.e. it should be "Kennard,...
Thanks for the suggestion! I'm still on vacation, so will check this out later. Can you add an example URL that this would link to? And maybe a brief explanation...