Cannot start a stopped kind cluster

Open awk opened this issue 5 years ago • 8 comments

If I create a cluster with kind:

vctl system start
Preparing storage...
Container storage has been prepared successfully under /Users/awk/.vctl/storage
Launching container runtime...
Container runtime has been started.
vctl kind
vctl-based KIND is ready now. KIND will run local Kubernetes clusters by using vctl containers as "nodes"
* All Docker commands has been aliased to vctl in the current terminal. Docker commands performed in current window would be executed through vctl. If you need to use regular Docker commands, please use a separate terminal window.
bash-3.2$ kind create cluster --name test
Creating cluster "test" ...
 ✓ Ensuring node image (kindest/node:v1.18.2) 🖼
 ✓ Preparing nodes 📦
 ✓ Writing configuration 📜
 ✓ Starting control-plane 🕹️
 ✓ Installing CNI 🔌
 ✓ Installing StorageClass 💾
Set kubectl context to "kind-test"
You can now use your cluster with:

kubectl cluster-info --context kind-test

Not sure what to do next? 😅  Check out https://kind.sigs.k8s.io/docs/user/quick-start/

bash-3.2$ vctl ps
────                 ─────                                                                                  ───────                   ──                ─────            ──────    ─────────────               
NAME                 IMAGE                                                                                  COMMAND                   IP                PORTS            STATUS    CREATION TIME               
────                 ─────                                                                                  ───────                   ──                ─────            ──────    ─────────────               
test-control-plane   kindest/node@sha256:7b27a6d0f2517ff88ba444025beae41491b016bc6af573ba467b70c5e8e0d85f   /usr/local/bin/entry...   192.168.244.132   62541:6443/tcp   running   2020-11-14T12:08:38-05:00   
bash-3.2$ kubectl cluster-info --context kind-test
Kubernetes master is running at https://127.0.0.1:62541
KubeDNS is running at https://127.0.0.1:62541/api/v1/namespaces/kube-system/services/kube-dns:dns/proxy

To further debug and diagnose cluster problems, use 'kubectl cluster-info dump'.

Everything looks great, now if I stop the node/container:

bash-3.2$ vctl stop test-control-plane
WARNING unknown stop signal: SIGRTMIN+3, using default 
WARNING container test-control-plane did not stop within 10 second, force kill it... 
INFO container test-control-plane has been stopped 
bash-3.2$ vctl ps
────   ─────   ───────   ──   ─────   ──────   ─────────────   
NAME   IMAGE   COMMAND   IP   PORTS   STATUS   CREATION TIME   
────   ─────   ───────   ──   ─────   ──────   ─────────────   

And then attempt to restart it - it fails:

bash-3.2$ vctl start test-control-plane
ERROR error checking volume mapping config: mkdir /lib: read-only file system 

Is it not intended that clusters can be restarted? That they must be thrown away and new ones created if there's a power failure or some other system restart?
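
The question above implies one fallback: throw the cluster away and create a new one. A minimal sketch of that, assuming `kind` is on the PATH and that `vctl system start` and `vctl kind` have already been run in the current terminal as in the session above. The helper name `recreate_cluster` is hypothetical, and nothing in this thread confirms that this is a supported workaround. Anything deployed to the old cluster is lost.

```shell
#!/usr/bin/env bash
# Sketch only: discard and recreate a kind cluster instead of restarting its node.
# Assumption: `kind` is on PATH and the vctl-based KIND environment is already set up.

# recreate_cluster is a hypothetical helper name, not part of kind or vctl.
recreate_cluster() {
  local name="${1:?usage: recreate_cluster <cluster-name>}"

  # Remove whatever is left of the old cluster; keep going if this step fails.
  kind delete cluster --name "$name" || true

  # Create a fresh cluster with the same name.
  kind create cluster --name "$name"
}
```

With the function sourced into the shell, `recreate_cluster test` would rebuild the `test` cluster from the example above.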

awk avatar Nov 14 '20 17:11 awk

Thanks for reporting this; it is a valid issue. We have fixed it in house and will publish the fix in a future release. Once it has been released, we will mark this issue closed.

jeffwubj avatar Nov 18 '20 05:11 jeffwubj

Great News!

Happy to beta test something if you need someone :-) Check with @mikeroySoft - I'm no stranger to beta builds of Fusion :-)

awk avatar Nov 18 '20 12:11 awk

wow, great to have your help! :-)

jeffwubj avatar Nov 19 '20 00:11 jeffwubj

Hello Friends,

I have this issue on Fusion 12.1.2. What is the suggested way to properly stop and start a cluster until the fix is released?

I asked the same question on VMware Communities as well: https://communities.vmware.com/t5/VMware-Fusion-Discussions/How-to-start-the-kind-cluster-after-vctl-system-stop-command/m-p/2858944/highlight/false#M174923

Thanks and regards, Gökalp

agergec avatar Jul 23 '21 10:07 agergec

@jeffwubj "We have fixed it in house and will publish it in a future release"

-- can I haz the fix now, plz? It's the future. ;)

jpapejr avatar Aug 26 '21 19:08 jpapejr

@jpapejr Hi, it seems minikube is a good alternative. It has a VMware driver and can join the vmnet8 network. You may want to give it a try.

agergec avatar Aug 26 '21 21:08 agergec

If I wanted to run minikube I wouldn't be here messing with kind and vctl, right? ;)

jpapejr avatar Aug 28 '21 22:08 jpapejr

I spent so much time trying to find a solution before switching to minikube. I was just trying to save you some time :)

agergec avatar Aug 29 '21 17:08 agergec