agones icon indicating copy to clipboard operation
agones copied to clipboard

Failed to create fleet

Open laivu266 opened this issue 2 years ago • 10 comments

I installed agones with helm chart. I got the error when creating fleet as below:

What happened:

Error from server (InternalError): error when creating "fleet.yaml": Internal error occurred: failed calling webhook "mutations.agones.dev": failed to call webhook: Post "https://agones-controller-service.agones-system.svc:443/mutate?timeout=10s": Address is not allowed

What you expected to happen:

How to reproduce it (as minimally and precisely as possible):

Anything else we need to know?:

Environment:

  • Agones version: 1.26.0
  • Kubernetes version (use kubectl version):

Client Version: version.Info{Major:"1", Minor:"24", GitVersion:"v1.24.0", GitCommit:"4ce5a8954017644c5420bae81d72b09b735c21f0", GitTreeState:"clean", BuildDate:"2022-05-03T13:36:49Z", GoVersion:"go1.18.1", Compiler:"gc", Platform:"darwin/arm64"} Kustomize Version: v4.5.4 Server Version: version.Info{Major:"1", Minor:"22+", GitVersion:"v1.22.13-eks-15b7512", GitCommit:"94138dfbea757d7aaf3b205419578ef186dd5efb", GitTreeState:"clean", BuildDate:"2022-08-31T19:15:48Z", GoVersion:"go1.16.15", Compiler:"gc", Platform:"linux/amd64"}

  • Cloud provider or hardware configuration: EKS
  • Install method (yaml/helm): helm
  • Troubleshooting guide log(s):
  • Others:

laivu266 avatar Sep 26 '22 14:09 laivu266

If you helm ls in the namespace you installed it in, what do you see?

If you do a kubectl get pods and/or kubectl get services in the same namespace what do you see?

markmandel avatar Sep 26 '22 17:09 markmandel

On a working cluster, when I run kubectl get svc -n agones-system I see

...
agones-controller-service          ClusterIP      10.5.129.186   <none>           443/TCP,8080/TCP   68d
...

Also, have you followed the steps to Confirm installation of Agones? If so (and you are still seeing this error), then we can look at adding further instructions to that page.

roberthbailey avatar Sep 26 '22 20:09 roberthbailey

Hi @roberthbailey and @markmandel , This is result to confirm installation of agones. As you can see, it's still getting error.

➜  ~ helm ls -nagones-system
NAME  	NAMESPACE    	REVISION	UPDATED                            	STATUS  	CHART        	APP VERSION
agones	agones-system	1       	2022-09-27 08:45:34.66353 +0700 +07	deployed	agones-1.26.0	1.26.0
➜  ~ kubectl get pods -nagones-system
NAME                                   READY   STATUS      RESTARTS   AGE
agones-allocator-584c9f6d69-d9ct8      1/1     Running     0          2m9s
agones-allocator-584c9f6d69-s6smz      1/1     Running     0          2m8s
agones-allocator-584c9f6d69-vz2zv      1/1     Running     0          2m8s
agones-controller-55d4845b7d-pgs7d     1/1     Running     0          2m8s
agones-delete-agones-resources-6wjq2   0/1     Completed   0          4m24s
agones-ping-c6d4f7747-8dqhz            1/1     Running     0          2m8s
agones-ping-c6d4f7747-kq9lb            1/1     Running     0          2m8s
➜  ~ kubectl get svc -nagones-system
NAME                               TYPE           CLUSTER-IP       EXTERNAL-IP                                                                    PORT(S)            AGE
agones-allocator                   LoadBalancer   172.20.90.71     <my-elb>   443:30308/TCP      2m16s
agones-allocator-metrics-service   ClusterIP      172.20.173.152   <none>                                                                         8080/TCP           2m16s
agones-controller-service          ClusterIP      172.20.228.85    <none>                                                                         443/TCP,8080/TCP   2m16s
agones-ping-http-service           LoadBalancer   172.20.138.39    <my-elb>  80:32066/TCP       2m16s
agones-ping-udp-service            LoadBalancer   172.20.162.69    <pending>                                                                      50000:31140/UDP    2m16s
➜  ~ kubectl apply -f https://raw.githubusercontent.com/googleforgames/agones/release-1.26.0/examples/simple-game-server/fleet.yaml

Error from server (InternalError): error when creating "https://raw.githubusercontent.com/googleforgames/agones/release-1.26.0/examples/simple-game-server/fleet.yaml": Internal error occurred: failed calling webhook "mutations.agones.dev": failed to call webhook: Post "https://agones-controller-service.agones-system.svc:443/mutate?timeout=10s": Address is not allowed

laivu266 avatar Sep 27 '22 01:09 laivu266

Hi @markmandel and @roberthbailey , I already fixed this issue. Because of my EKS cluster is using cilium as CNI so I need to change hostNetwork and dnsPolicy of agones-controller as below:

 hostNetwork: true
 dnsPolicy: ClusterFirstWithHostNet

The issue is gone. I think you need to add this guide to agones document if another user using other CNI instead of AWS default VPC CNI.

Thanks.

laivu266 avatar Sep 27 '22 02:09 laivu266

@laivu266 if you could file a PR on the documentation, that would be appreciated.

I have no way of testing on AWS, so appreciate the input from the community.

markmandel avatar Sep 27 '22 17:09 markmandel

Hi @markmandel and @roberthbailey , I already fixed this issue. Because of my EKS cluster is using cilium as CNI so I need to change hostNetwork and dnsPolicy of agones-controller as below:

 hostNetwork: true
 dnsPolicy: ClusterFirstWithHostNet

The issue is gone. I think you need to add this guide to agones document if another user using other CNI instead of AWS default VPC CNI.

Thanks.

i am getting same in GKE and where to change this ?

VINNUSAURUS avatar Apr 23 '23 20:04 VINNUSAURUS

Why would agones-controller need to run as hostNetwork? I'm not following why that change makes sense.

@VINNUSAURUS What are you seeing in GKE? What Agones version, GKE version?

zmerlynn avatar May 05 '23 22:05 zmerlynn

The issue looks like a cluster network malfunction, more logs should be available in the kube-apiserver

aimuz avatar May 06 '23 10:05 aimuz

'This issue is marked as Stale due to inactivity for more than 30 days. To avoid being marked as 'stale' please add 'awaiting-maintainer' label or add a comment. Thank you for your contributions '

github-actions[bot] avatar Jan 15 '24 10:01 github-actions[bot]

@kylesomers tagging you in EKS related issues, although this is an older one.

markmandel avatar Jan 26 '24 17:01 markmandel

'This issue is marked as Stale due to inactivity for more than 30 days. To avoid being marked as 'stale' please add 'awaiting-maintainer' label or add a comment. Thank you for your contributions '

github-actions[bot] avatar Mar 15 '24 10:03 github-actions[bot]

This issue is marked as obsolete due to inactivity for last 60 days. To avoid issue getting closed in next 30 days, please add a comment or add 'awaiting-maintainer' label. Thank you for your contributions

github-actions[bot] avatar Apr 15 '24 03:04 github-actions[bot]