Description
On our FTP site there is a Reuters citation index file, e.g.:
ftp://ftp.wormbase.org/pub/wormbase/releases/WS280/species/c_elegans/PRJNA13758/annotation/c_elegans.PRJNA13758.WS280.reuters_citation_index.xml.gz
We have been advised that:
"Please note that the xml file contains a few un-encoded ampersand characters, and would recommend that ampersands in text be encoded as &amp;amp;. We've worked around it, but the XML specification says they should not be used as literals."
https://www.w3.org/TR/2004/REC-xml-20040204/REC-xml-20040204.xml
"The ampersand character (&) and the left angle bracket (<) must not appear in their literal form, except when used as markup delimiters, or within a comment, a processing instruction, or a CDATA section. If they are needed elsewhere, they must be escaped using either numeric character references or the strings "&amp;amp;" and "&amp;lt;" respectively. The right angle bracket (>) may be represented using the string "&amp;gt;", and must, for compatibility, be escaped using either "&amp;gt;" or a character reference when it appears in the string "]]>" in content, when that string is not marking the end of a CDATA section."
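
A minimal sketch of the two possible fixes, in Python for illustration only (the script that actually writes this file is not shown here and may be in another language). The function names `escape_text` and `escape_bare_ampersands` are hypothetical. The first escapes text at write time, which is the preferable fix; the second patches an already written file by encoding only those ampersands that do not begin an entity or character reference. The second assumes the file has no CDATA sections or comments containing ampersands, since a plain regex cannot tell those apart from ordinary text.

```python
import re
from xml.sax.saxutils import escape

# An ampersand that is NOT the start of a named entity reference (&name;),
# a decimal character reference (&#38;) or a hex character reference (&#x26;).
BARE_AMP = re.compile(
    r"&(?!(?:[A-Za-z_:][A-Za-z0-9_.:\-]*|#[0-9]+|#x[0-9A-Fa-f]+);)"
)


def escape_text(value: str) -> str:
    """Escape &, < and > in raw text before it is written into an element."""
    return escape(value)


def escape_bare_ampersands(xml_text: str) -> str:
    """Encode literal ampersands in existing XML, leaving references intact.

    Assumption: no CDATA sections or comments contain ampersands.
    """
    return BARE_AMP.sub("&amp;", xml_text)
```

With this sketch, `escape_bare_ampersands("<title>Genes & Development</title>")` yields `<title>Genes &amp; Development</title>`, while text that is already encoded, such as `A &amp; B`, is left unchanged. A string like `AT&T;` is syntactically an entity reference and would not be touched, so escaping at write time remains the safer option.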