Skip to content

Add *hashed* file version to .gitallowed? So future changes get caught containing secrets again #246

Open
@vergenzt

Description

Would you be open to adding / accepting a PR for some way to record git-secrets exceptions to a repo (a la .gitallowed), except rather than just matching by filename and line contents, it can be whitelisted by hash? (i.e. the output of git hash-object)

We find ourselves wanting to remove some exceptions that have been made in the past to some IPython notebooks (which are ultimately JSON blobs with newlines). These contain lines that match some custom secrets regexes we've defined (9-digit numbers which could be SSNs 🙂), but we know they're not actually SSNs.

Up to this point we've just listed these files by name in .gitallowed per https://github.com/awslabs/git-secrets#ignoring-false-positives, but we've developed an interest in reducing the risk that somebody will modify those files and accidentally introduce secrets later on (which will not be flagged as containing secret data because it's whitelisted by filename, so they might not think to double check). We'd prefer that somebody have to manually re-whitelist a file that's flagged as potentially containing a secret any time they're committing changes to it.

Any thoughts?


I don't see a way to do this backwards compatibly within the .gitallowed file itself, so the first approach that comes to mind is checking for another file called .gitallowed-hashes or something like that... and if a tree or blob SHA appears on a line in that file then it gets ignored by git-secrets. We could then (maybe) add a command to git-secrets to add a directory/blob to the list based on its current contents, with the path to the directory/blob listed after the hash in a comment string.

So e.g. what I envision:

$ git secrets --add "[0-9]{9}"

$ echo 123456789 > my-scary-file

$ git secrets --scan
# comes up with error

$ echo "$(git hash-object my-scary-file) # my-scary-file" >> .gitallowed-hashes

$ git secrets --scan
# comes up clean

$ echo "new evil secret for real this time 089291203" >> my-scary-file

$ git secrets --scan
# comes up with error

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions