Skip to content

ops(dev): enable VPC CNI NetworkPolicy on tb-client-dev-templates + audit EKS fleets [#102 prerequisite] #249

@saadqbal

Description

@saadqbal

Context

tb-client-dev-templates (EKS, arn:aws:eks:eu-central-1:872515254613:cluster/tb-client-dev-templates) runs a self-managed VPC CNI with NetworkPolicy disabled (aws-eks-nodeagent --enable-network-policy=false). Confirmed empirically: a DNS-only egress NetworkPolicy does not block https://example.com from a pod. So the §8.2 egress lockdown (client-runtime#102) cannot enforce on this cluster today.

Flipping the DaemonSet flags alone is not enough — the policyendpoints.networking.k8s.aws CRD is present but there's no PolicyEndpoint controller running (it ships with the EKS managed vpc-cni addon, which this cluster doesn't use). A bare DS patch was tried and reverted (no effect).

What

  1. Enable VPC CNI NetworkPolicy via the managed addon, in a maintenance window:
    aws eks create-addon --cluster-name tb-client-dev-templates --region eu-central-1 \
      --addon-name vpc-cni --configuration-values '{"enableNetworkPolicy":"true"}' \
      --resolve-conflicts OVERWRITE
    
    (adopts/rolls the self-managed CNI as a managed addon + activates the control-plane PolicyEndpoint controller).
  2. Re-run the pre-flight egress probe — it must now go BLOCKED.
  3. Audit the other EKS fleets for the same gap (self-managed VPC CNI / NetworkPolicy off) so the lockdown rollout isn't silently cosmetic anywhere.

Acceptance criteria

  • Egress NetworkPolicy is enforced on tb-client-dev-templates (probe → BLOCKED).
  • EKS fleets audited; any with NetworkPolicy disabled are listed for remediation.

Refs tracebloc/client-runtime#102. Owner: whoever manages the EKS clusters (infra).

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions