paper-qa
paper-qa copied to clipboard
Should indexing process remove `\n` from titles?
Seen in logs:
2024-09-23 20:14:19,845 - paperqa.agents.search - INFO - New file to index: Reduction of fibroblast size-mechanical force do_abc123.pdf...
2024-09-23 20:14:26,425 - paperqa.agents.search - INFO - Complete (Reduction of fibroblast size/mechanical force down‐regulates
<scp>TGF</scp>
‐β type
<scp>II</scp>
receptor: implications for human skin aging).
It looks like this title has \n in it. Should we be removing the \ns?