MongoDB in Kubernetes: timeouts when inserting large amounts of data

google-cloud-platform, google-kubernetes-engine, kubernetes, mongodb

We have an API running which, once a day, receives multiple batches of large data that are inserted into a MongoDB.
We use the cvallance/mongo-k8s-sidecar for the replica set configuration.

This works perfectly against a local MongoDB database.

There is also no production traffic on the database that could cause race conditions or the like.

Now we have deployed it to Google Container Engine. There the import works in general too,
but from time to time we get timeout exceptions like this:

Cannot run replSetReconfig because the node is currently updating its configuration

or

MongoDB.Driver.MongoCommandException: Command insert failed: BSONObj
size: 16793637 (0x1004025) is invalid. Size must be between 0 and
16793600(16MB) First element: insert:
"LandingPageConnectionSet_Stage".

or

Error in workloop { MongoError: connection 0 to 127.0.0.1:27017 timed
out at Function.MongoError.create
(/opt/cvallance/mongo-k8s-sidecar/node_modules/mongodb-core/lib/error.js:29:11)
at Socket.
(/opt/cvallance/mongo-k8s-sidecar/node_modules/mongodb-core/lib/connection/connection.js:198:20)
at Object.onceWrapper (events.js:254:19) at Socket.emit
(events.js:159:13) at Socket._onTimeout (net.js:411:8) at ontimeout
(timers.js:478:11) at tryOnTimeout (timers.js:302:5) at
Timer.listOnTimeout (timers.js:262:5)

I can see that the CPU does not seem to be at its limits.

Kubernetes configuration for MongoDB

---
kind: StorageClass
apiVersion: storage.k8s.io/v1
metadata:
  name: fast
provisioner: kubernetes.io/gce-pd
parameters:
  type: pd-ssd
---
apiVersion: v1
kind: Service
metadata:
  name: mongo
  labels:
    name: mongo
spec:
  ports:
  - port: 27017
    targetPort: 27017
  clusterIP: None
  selector:
    role: mongo
---
apiVersion: apps/v1beta1
kind: StatefulSet
metadata:
  name: mongo
spec:
  serviceName: "mongo"
  replicas: 3
  template:
    metadata:
      labels:
        role: mongo
        environment: test
    spec:
      terminationGracePeriodSeconds: 10
      containers:
        - name: mongo
          image: mongo:3.6
          command:
            - mongod
            - "--replSet"
            - rs0
            - "--bind_ip"
            - 0.0.0.0
            - "--smallfiles"
            - "--noprealloc"
          ports:
            - containerPort: 27017
          volumeMounts:
            - name: mongo-persistent-storage
              mountPath: /data/db
        - name: mongo-sidecar
          image: cvallance/mongo-k8s-sidecar
          env:
            - name: MONGO_SIDECAR_POD_LABELS
              value: "role=mongo,environment=test"
  volumeClaimTemplates:
  - metadata:
      name: mongo-persistent-storage
      annotations:
        volume.beta.kubernetes.io/storage-class: "fast"
    spec:
      accessModes: [ "ReadWriteOnce" ]
      resources:
        requests:
          storage: 32Gi

We also changed the config slightly by limiting the WiredTiger cache size and removing the smallfiles option, so that part of the config looked like this:

    - mongod
    - "--replSet"
    - rs0
    - "--bind_ip"
    - 0.0.0.0
    - "--noprealloc"
    - "--wiredTigerCacheSizeGB"
    - "1.5"

Best Answer

I checked the logs and the Kubernetes Dashboard together with Boas Enkler.

In the Kubernetes dashboard, the status of the Pods showed the following hints:

Pod Name: kube-lego-*****-***     
Status: Evicted 
Reason: The node was low on resource: memory.

You could have retrieved the very same information with kubectl describe pod [podname].

Note that, quoting the documentation: "If the kubelet is unable to reclaim sufficient resources on the node, kubelet begins evicting Pods."

Therefore I believed that the error was not caused by MongoDB itself, since it was working on premises without any issue. To double-check, we went through the kernel logs shown by the console serial output and found:

Memory cgroup out of memory: Kill process 4**7 (mongod) score 1494 or sacrifice child
...
Memory cgroup out of memory: Kill process 1**8 (mongod) score 1538 or sacrifice child

We also noticed that there was no memory request field in the YAML of the deployment. This is an issue because, even with three nodes carrying no workload, it can happen that all the Pods are scheduled on the very same node, since without requests they theoretically fit anywhere.
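For reference, the missing piece would sit on the mongo container of the StatefulSet above; a minimal sketch, with purely illustrative values that you would size to your own workload:

resources:
  requests:
    memory: "2Gi"   # example value: what the scheduler reserves per replica
    cpu: "500m"     # example value

With a request in place the scheduler accounts for each replica's footprint and no longer packs all of them onto the same node just because they theoretically fit.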

To mitigate this behaviour there are some possible solutions:

  • Scale the cluster vertically and introduce memory request values

  • Instruct the mongod process to consume less memory than the requested amount.

  • Introducing a memory limit is essential if you have more containers running on the same node and you want to avoid them being killed by the OOM killer. Keep in mind that with a limit a container will sometimes be killed even if there is still memory available on the node (a sketch combining these points follows below).
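Combining the last two points, a sketch of how the mongo container could look with both values declared; the numbers are again only an example, not a recommendation. The intent is that the explicit WiredTiger cache size keeps mongod well below the container limit (mongod uses memory on top of the cache, and inside a container it may otherwise size the cache from the node's total RAM), so the memory cgroup has no reason to kill it:

- name: mongo
  image: mongo:3.6
  command:
    - mongod
    - "--replSet"
    - rs0
    - "--bind_ip"
    - 0.0.0.0
    - "--noprealloc"
    - "--wiredTigerCacheSizeGB"
    - "1.5"               # keep the cache well below the memory limit
  resources:
    requests:
      memory: "2Gi"       # reserved by the scheduler
    limits:
      memory: "3Gi"       # hard ceiling enforced by the memory cgroup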
