Email addresses in HTML content are removed when sanitizing text coming from a plaintext email

When the string to sanitize comes from a plaintext email, the original content contains items such as:

```
blah blah

From: Mark <mailto:mark@mail.com>
Sent: Wednesday, August 16, 2017 19:47
To: John <john@mail.com>
Subject: Re: Document Test

Hello John
```

If the email was an HTML email, the `<` and `>` around `<john@mail.com>` are already escaped as `&lt;` and `&gt;`, but if the email was plaintext, they are not.

In this specific case, the part `<john@mail.com>` is treated as an invalid HTML tag and is removed, along with all the content that follows it.

If the option "Keep child nodes of removed elements" is chosen, then only the email "tags" themselves are lost.

It would be great if, after testing a tag against the whitelist, an additional test was made to match it against these two standard and safe forms (`<mailto:mark@mail.com>` and `<john@mail.com>`) so that they are kept.
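As a possible workaround until such a test exists, the two forms could be escaped before the text reaches the sanitizer. The following is only a minimal sketch in Python, not the sanitizer's own code: the name `escape_bracketed_emails` and the regular expression are hypothetical, and the pattern deliberately accepts only simple addresses, with or without a `mailto:` prefix.

```python
import re

# Hypothetical pattern (an assumption, not part of the sanitizer):
# an optional "mailto:" prefix followed by a simple email address,
# wrapped in literal angle brackets.
BRACKETED_EMAIL = re.compile(
    r"<((?:mailto:)?[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,})>"
)


def escape_bracketed_emails(text: str) -> str:
    """Escape the angle brackets around bare email addresses.

    Anything that does not look like <address> or <mailto:address>
    is left untouched, so real tags still go through the whitelist.
    """
    return BRACKETED_EMAIL.sub(r"&lt;\1&gt;", text)
```

Running the plaintext content through this step first would turn `To: John <john@mail.com>` into `To: John &lt;john@mail.com&gt;`, which matches what an HTML email would already contain.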

Email addresses in HTML content are removed when sanitizing text coming from a plaintext email #126
