Troubleshooting

There are a few potential issues that can prevent proper startup. The notes below all assume that you have started up the Kubernetes Dashboard.

Storage Provisioning

The most common reason for components not starting up correctly is incorrect storage provisioning. When the Helm chart is set up, a number of Persistent Volume Claims (PVC) are set up. The claims are requests of the underlying storage provisioning for a volume with a specific size, access model and storage class.

Single Node

In a single node, all except one PVC will provisioned using OpenEBS Local LVM from the LVM volume group defined by storage.local.vg.

So ensure that this Volume Group has enough free space. If you look at the PVC list in the dashboard you should be able to spot the issue.

The remaining PVC (for Flink job manager state) will provisioned using OpenEBS NFS storage. This type of storage is used to allow the storage to be shared between multiple PODS ‘ReadWriteMany’. The storage will be provisioned from the root file system.

Multiple Nodes

In a cluster, two types of storage class are used:

OpenEBS NFS storage on OpenEBS cStor : Storage that is available from any node in the cluster
OpenEBS Local-LVM : Storage on a specific node.

Again you should ensure that there is enough of each type

TLS

If your HTTPS certificate is not working correctly, you can look at the Ingress logs for possible issues with the certificate.

Image Downloads

There are a few ‘containerd’ commands that can be useful when figuring out what is happening with image downloads:

microk8s ctr images check

Shows what images are being retrieved (which might include multiple image layers).

microk8s ctr content active

Shows active content (images/manifests) transfers.