Skip to content

Sloop OOMKilled  #258

Description

@ravvereddy

We are evaluating sloop by deploying it to one of our heavily used K8S Cluster where I played with multiple config options to reduce the memory consumption as per the issues and sloop config memory tweaking guid but none of them worked as expected the pod after 3 to 6 days goes into restart loop causing CrashLoopBackOff.

Here is the sts with configMap that is deployed to cluster I set the memory Limits 1.5 and requests 1.3 GB

Sloop STS:

`
apiVersion: v1
items:

  • apiVersion: apps/v1
    kind: StatefulSet
    metadata:
    creationTimestamp: "2023-04-03T08:55:58Z"
    generation: 8
    labels:
    app.kubernetes.io/instance: release-name
    app.kubernetes.io/managed-by: Helm
    app.kubernetes.io/name: sloop
    helm.sh/chart: sloop-0.2.0
    kustomize.toolkit.fluxcd.io/name: sloop
    kustomize.toolkit.fluxcd.io/namespace: flux-system
    name: sloop
    namespace: sloop
    resourceVersion: "21457846"
    uid: ffe38f52-271f-408c-9794-3a9ca8fdc3a4
    spec:
    podManagementPolicy: OrderedReady
    replicas: 1
    revisionHistoryLimit: 10
    selector:
    matchLabels:
    app.kubernetes.io/name: sloop
    serviceName: sloop
    template:
    metadata:
    creationTimestamp: null
    labels:
    app.kubernetes.io/instance: release-name
    app.kubernetes.io/name: sloop
    spec:
    containers:
    - args:
    - --config=/sloopconfig/sloop.json
    command:
    - /sloop
    image: ghcr.io/salesforce/sloop:latest
    imagePullPolicy: IfNotPresent
    livenessProbe:
    failureThreshold: 3
    httpGet:
    path: /healthz
    port: 8080
    scheme: HTTP
    initialDelaySeconds: 30
    periodSeconds: 10
    successThreshold: 1
    timeoutSeconds: 5
    name: sloop
    ports:
    - containerPort: 8080
    protocol: TCP
    readinessProbe:
    failureThreshold: 3
    httpGet:
    path: /healthz
    port: 8080
    scheme: HTTP
    initialDelaySeconds: 5
    periodSeconds: 10
    successThreshold: 1
    timeoutSeconds: 5
    resources:
    limits:
    memory: 1536Mi
    requests:
    memory: 1350Mi
    terminationMessagePath: /dev/termination-log
    terminationMessagePolicy: File
    volumeMounts:
    - mountPath: /data/
    name: sloop-data
    - mountPath: /sloopconfig/
    name: sloopconfig
    dnsPolicy: ClusterFirst
    restartPolicy: Always
    schedulerName: default-scheduler
    securityContext: {}
    serviceAccount: sloop
    serviceAccountName: sloop
    terminationGracePeriodSeconds: 30
    volumes:
    - emptyDir:
    sizeLimit: 20Gi
    name: sloop-data
    - configMap:
    defaultMode: 420
    name: sloop
    name: sloopconfig
    updateStrategy:
    rollingUpdate:
    partition: 0
    type: RollingUpdate
    volumeClaimTemplates:
    • apiVersion: v1
      kind: PersistentVolumeClaim
      metadata:
      creationTimestamp: null
      name: sloop-data
      spec:
      accessModes:
      • ReadWriteOnce
        resources:
        requests:
        storage: 20Gi
        volumeMode: Filesystem
        status:
        phase: Pending
        status:
        availableReplicas: 1
        collisionCount: 0
        currentReplicas: 1
        currentRevision: sloop-68bc7fc588
        observedGeneration: 8
        readyReplicas: 1
        replicas: 1
        updateRevision: sloop-68bc7fc588
        updatedReplicas: 1
        kind: List
        metadata:
        resourceVersion: ""
        `

Sloop CM:

image

@sana-jawad @kartik-sareen @jarifibrahim any recommendation Thank You.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions