Sensitive validator branch #1364

rchache · 2022-08-18T18:20:25Z

Issue #, if available:

Description of changes:
Based on various existing services for sensitive data detection and current AWS sensitive trait usage, generated a default list of words and phrases that likely indicate the data stored inside is sensitive. It is configurable the way the non inclusive terms validator is.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

smithy-linters/src/main/java/software/amazon/smithy/linters/MissingSensitiveTraitValidator.java

docs/source-2.0/guides/model-linters.rst

smithy-linters/src/main/java/software/amazon/smithy/linters/MissingSensitiveTraitValidator.java

mtdowling · 2022-10-22T00:21:20Z

I created a PR here that is used by ReservedWords to support word boundary based term matching. I made an abstraction for this so that it could be used by this linter as well. Let me know what you think. #1461

This commit introduces a new syntax for matching words with the ReservedWords linter and is intended to be used with the upcoming sensitive words linter defined in #1364. In addition to supporting wildcard searches ("*" prefix, suffix, and contains), we now support matching based on word boundaries. This commit introduces the "terms" keyword for word boundary searches and adds dedicated abstractions for word boundary and wildcard matching. For example, "access key id" will match "AccessKeyId", "access_key_id", "accessKeyID", "access_key_id100", "AccesKeyIDValue". It will also match when all the words are concatenated together: "accesskeyid". However, it will not match "accesskey_id" because it only has two word boundaries ("accesskey" and "id").

smithy-linters/src/main/java/software/amazon/smithy/linters/WordBoundaryMatcher.java

smithy-linters/src/main/java/software/amazon/smithy/linters/MissingSensitiveTraitValidator.java

docs/source-2.0/guides/model-linters.rst

rchache requested a review from a team as a code owner August 18, 2022 18:20

rchache mentioned this pull request Aug 18, 2022

Validator for missing sensitive trait #1333

Closed

mtdowling requested changes Oct 20, 2022

View reviewed changes

mtdowling mentioned this pull request Oct 22, 2022

Add ability to lint based on word boundaries #1461

Merged

rchache added 2 commits October 25, 2022 22:51

Validator for missing sensitive trait

f23826c

minor refactors and improvements to MissingSensitiveTraitValidator

8c7c494

rchache force-pushed the SensitiveValidatorBranch branch from d3e810b to ae30062 Compare October 27, 2022 01:13

Update MissingSensitiveTrait to use word boundary matching

d3d2bb5

rchache force-pushed the SensitiveValidatorBranch branch from ae30062 to d3d2bb5 Compare October 27, 2022 15:23

mtdowling requested changes Nov 9, 2022

View reviewed changes

updated list of sensitive terms, and refactors of code

4e255af

rchache requested a review from mtdowling November 10, 2022 23:06

mtdowling approved these changes Nov 11, 2022

View reviewed changes

mtdowling merged commit 6b7e154 into smithy-lang:main Nov 15, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sensitive validator branch #1364

Sensitive validator branch #1364

rchache commented Aug 18, 2022

mtdowling commented Oct 22, 2022

Sensitive validator branch #1364

Sensitive validator branch #1364

Conversation

rchache commented Aug 18, 2022

mtdowling commented Oct 22, 2022