iam-manager/docs/aws-security.md at master · keikoproj/iam-manager

This document explains the security measurements in place with iam-manager solution in AWS use case.

Security:

To manage IAM role lifecycle independently, controller(Pod) needs to have AWS IAM access to create/update/delete role which is a security concern if solution is not designed properly but thanks to AWS IAM Permission Boundaries which got released in Q3 2018 and we can restrict the permissions on what user/role can do even if they have iam.* access. To have more confidence in terms of security, any good design should consider implementing Proactive and Reactive security measurements.

Proactive measurements:

Kubernetes validation web hook implementation

Kubernetes web hook validation comes in very handy if we want to validate user input before inserting into persistence system(etcd). This allows us to implement following actions and reject the request if it violates the defined policy.
i. Allow IAM Role creation only with "pre-defined IAM whitelisted policies"
ii. Allow only ONE role per namespace

AWS IAM Permission Boundary

AWS IAM Permission Boundary is the core security concept used in the iam-manager. Permission boundaries are AWS IAM policy objects that establish the maximum permissions an IAM entity can have, regardless of what permissions are granted by the attached policies.

In the context of iam-manager, permission boundaries provide a critical security control by limiting what permissions can be delegated, even if the IAM role policy appears to grant broader access.

How Permission Boundaries Work

The permission boundary acts as a guard rail for all roles created by iam-manager. The actual permissions granted to a role are the intersection of:

Permissions specified in the role's policy
Permissions allowed by the permission boundary

For example, if an IAM role is created with "AdministratorAccess" policy but has a permission boundary that only allows "s3:Get*" operations, the effective permissions would be limited to just "s3:Get*" operations, even though the role policy technically grants much broader access.

This ensures that even if a user specifies overly permissive policies in their IAMRole resource, the permission boundary will restrict the actual capabilities to only those explicitly allowed by the cluster administrator.

Real-World Example

Consider this scenario:

An IAMRole resource includes "ec2:*" permissions in its policy
The permission boundary only allows "ec2:Describe*" actions
The effective permissions for the role will be limited to only "ec2:Describe*"

This prevents escalation of privileges and ensures that roles created through iam-manager cannot be used to gain unauthorized access to AWS resources, protecting both the cluster and the broader AWS environment.

Another important security concern is having an aws iam write access to the controller itself. This is important for many reasons where an developer/hacker gets an access to controller pod (which is very unlikely, if we say this is possible than we have a bigger thing to worry about where developers having an access to resources in a different namespace. We are not talking about cluster admins here. well, cluster admin can delete the entire cluster) and start creating/deleting the roles which are not part of the Kubernetes environment (For ex: Administrator). This is where IAM Permission Boundaries, Controlling Access Using Tags comes into picture.

In brief, If we define a permission boundary with "s3.Get*" access, any role created by controller pod can get only s3.Get access even if new role has an "Administrative" policy with full access attached. For more details, please refer the IAM Permission Boundaries.

That being said, iam role attached to controller can do only following actions i. Can create roles only with pre-defined syntax. ii. Can not create a role with out providing pre-defined permission boundary name. iii. Can not delete any role which doesn't have a pre-defined TAG. (We will attach the tag only to the roles created by controller pod)

Do not provide an option to users to provide IAM role name

Role name can be constructed based on the namespace where custom resource is being created. This allows us to create IAM role with consistent naming conventions.

Custom resource controller deployed in its own namespace.

This is the recommended approach to deploy a CRD in Kubernetes. This allows us to restrict the access only to iam-manager pods.

Reactive measurements:

Remediate action triggered by AWS cloud watch rule

For some reason, if any role got created by controller pod with malicious intent(Having a different IAM policies than the pre-defined whitelisted IAM policies) we want to make sure remediate action plan is in place. Cloud watch rule which can trigger a lambda function if it detects any action(create/update/delete and even attaching a policy api call) taken by controller pod IAM role, lambda function verifies that action was taken by "within the known limits" and if it detects any anomaly it simply attaches "Deny" all access so that role can not be used for anything.

For more details: Please refer https://github.intuit.com/keikoproj/iam-manager-monitor/ repo for sample app

Finally, with all the measurements in place controller pod can do only do limited actions which can be totally pre-defined

Pros:

Solution is completely de-centralized and there are no outbound calls from the cluster.
More secure with the Permission Boundary.
Not customized solution for Intuit which means this can be distributed as open source project.
Auditing information is available with CloudTrail.
If there is any breach, ONLY clusters in this particular account gets compromised compared to ALL the clusters if iam is managed in central place.

Cons:

Solution must be carefully implemented.
If there is any breach, clusters in this particular account gets compromised.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Security:

Proactive measurements:

Kubernetes validation web hook implementation

AWS IAM Permission Boundary

How Permission Boundaries Work

Real-World Example

Do not provide an option to users to provide IAM role name

Custom resource controller deployed in its own namespace.

Reactive measurements:

Remediate action triggered by AWS cloud watch rule

Pros:

Cons:

FilesExpand file tree

aws-security.md

Latest commit

History

aws-security.md

File metadata and controls

Security:

Proactive measurements:

Kubernetes validation web hook implementation

AWS IAM Permission Boundary

How Permission Boundaries Work

Real-World Example

Do not provide an option to users to provide IAM role name

Custom resource controller deployed in its own namespace.

Reactive measurements:

Remediate action triggered by AWS cloud watch rule

Pros:

Cons: