SearchWorks icon indicating copy to clipboard operation
SearchWorks copied to clipboard

line break and whitespace character encoding visible in Abstracts

Open saseestone opened this issue 3 months ago • 0 comments

Reported via SW feedback (and passed around the PURL feedback too). SW-4547 (Note that Jeanette instructed Adan to fix the Abstract in Argo, and he has. Now it displays as a blob of text.)

In the abstract of SDR records, we're displaying the encoding for line breaks and whitespace characters in 4.0. I checked -morison, and we had a better display in 3.3.

Example record w/screenshot: https://searchworks.stanford.edu/view/zs304tj5371

Image

vs https://searchworks-morison.stanford.edu/view/zs304tj5371 (same record in SW3.3)

Image

Another example: https://searchworks.stanford.edu/view/cz789ms7413 https://searchworks-morison.stanford.edu/view/cz789ms7413 (same record in SW3.3)

This seems to be happening when the depositor has copy and pasted from a PDF into the Abstract field. "Real" line breaks are displaying fine, but we're seeing encoding if there is markup that came along in that copy/paste action.

SDR folks will try to educate depositors that they should be careful of copy/paste, but it's likely it will continue to happen. I'm hopeful that we have code already 🤞 to remove the encodings from displaying.

saseestone avatar Sep 18 '25 20:09 saseestone