Context:
Uniform Resource Names (URNs) are URIs that use the urn scheme without any prefix.
Currently we hash the whole URI and replace the fragment with the hash, retaining the original prefix. This poses two issues:
- We need to extract the prefix, which fails when URNs are in the input.
- Sensitive data may be present in the prefix, and it should be removed.
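
The current behaviour and both issues can be reproduced with a minimal sketch. It is not the project's actual code: the function names `split_prefix` and `pseudonymize_current`, the use of SHA-256, and the rule of splitting at the last `#` or `/` are all assumptions made for illustration.

```python
import hashlib
from typing import Tuple


def split_prefix(uri: str) -> Tuple[str, str]:
    """Split a URI into (prefix, local part) at the last '#' or '/'.

    Hypothetical helper: the real prefix extraction may differ.
    """
    for sep in ("#", "/"):
        head, found, tail = uri.rpartition(sep)
        if found:
            return head + sep, tail
    # URNs such as "urn:isbn:0451450523" contain neither separator.
    raise ValueError(f"no prefix found in {uri!r}")


def pseudonymize_current(uri: str) -> str:
    """Hash the whole URI and put the hash behind the original prefix."""
    prefix, _ = split_prefix(uri)
    # SHA-256 is an assumption; the issue does not name the hash function.
    digest = hashlib.sha256(uri.encode("utf-8")).hexdigest()
    return prefix + digest
```

With this sketch, `pseudonymize_current("http://example.org/people#alice")` still starts with `http://example.org/people#`, so the original domain leaks into the output, and `pseudonymize_current("urn:isbn:0451450523")` raises `ValueError` because there is no prefix to extract.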
Proposed solution:
Using URNs (format: <urn:hash>) as output would solve both issues, since the pseudonymized data would not have a namespace.
Objective: original domain is absent from pseudonymized URIs.
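
A minimal sketch of the proposed output format follows. The function name `pseudonymize_to_urn` and the choice of SHA-256 are assumptions for illustration; only the `urn:` + hash shape comes from the proposal.

```python
import hashlib


def pseudonymize_to_urn(uri: str) -> str:
    """Return 'urn:<hash>' for any input URI, with no prefix extraction."""
    # SHA-256 is an assumption; any stable hash would fit the proposal.
    digest = hashlib.sha256(uri.encode("utf-8")).hexdigest()
    return f"urn:{digest}"


# Works the same for prefixed URIs and for URNs, and drops the domain.
for original in ("http://example.org/people#alice", "urn:isbn:0451450523"):
    print(pseudonymize_to_urn(original))
```

Because the whole input is hashed and nothing from it is copied into the result, no prefix needs to be extracted and the original domain cannot appear in the pseudonymized URI.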
Requirements:
Hash for rdf types -> urn: