line break and whitespace character encoding visible in Abstracts #6153

@saseestone

Reported via SW feedback (and passed around the PURL feedback too). SW-4547
(Note that Jeanette instructed Adan to fix the Abstract in Argo, and he has. Now it displays as a blob of text.)

In the abstracts of SDR records, SW 4.0 is displaying the encoding for line breaks and whitespace characters. I checked searchworks-morison, and the display was better in SW 3.3.

Example record w/screenshot:
https://searchworks.stanford.edu/view/zs304tj5371

Image

vs
https://searchworks-morison.stanford.edu/view/zs304tj5371 (same record in SW3.3)

Image

Another example:
https://searchworks.stanford.edu/view/cz789ms7413
https://searchworks-morison.stanford.edu/view/cz789ms7413 (same record in SW3.3)

This seems to happen when the depositor has copied and pasted from a PDF into the Abstract field. "Real" line breaks display fine, but we see the encoding when markup came along in that copy/paste action.

SDR folks will try to educate depositors that they should be careful of copy/paste, but it's likely it will continue to happen. I'm hopeful that we have code already 🤞 to remove the encodings from displaying.
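
For illustration only, here is a minimal sketch of the kind of cleanup described above. It assumes the stray encodings are character references such as `&#13;`, `&#10;`, or `&#160;`, which this issue does not confirm. The function name `clean_abstract` is hypothetical, and this is not existing SearchWorks code. Python is used only to show the idea.

```python
import html
import re


def clean_abstract(text: str) -> str:
    """Decode character references and normalize whitespace in an abstract.

    Hypothetical helper: assumes the visible "encoding" is made of character
    references like "&#13;" or "&#xA;" left over from a copy/paste.
    """
    # Turn references such as "&#13;" or "&#160;" into the characters they encode.
    decoded = html.unescape(text)
    # Treat CRLF and bare CR as a plain newline.
    decoded = decoded.replace("\r\n", "\n").replace("\r", "\n")
    # Collapse runs of spaces, tabs, and non-breaking spaces into one space.
    decoded = re.sub(r"[ \t\u00a0]+", " ", decoded)
    # Drop spaces around newlines, then cap consecutive newlines at two
    # so paragraph breaks survive but large gaps do not.
    decoded = re.sub(r" *\n *", "\n", decoded)
    decoded = re.sub(r"\n{3,}", "\n\n", decoded)
    return decoded.strip()


print(clean_abstract("First line.&#13;&#10;Second&#160;&#160;line."))
```

Because `html.unescape` also decodes references like `&amp;`, the cleaned text would still need normal HTML escaping wherever it is rendered.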
