jina Dynamic k8s namespace for generating kubernetes yaml

Describe the feature The current implementation for to_kubernetes_yaml flow takes k8s_namespace to be explicitly specified somewhere otherwise it outputs as default namespace. The namespace need not be explicitly set and can be dynamically injected through metadata, hence improving the reusability of the generated templates.

For example:

kubectl apply -f "file.yaml" --namespace=prod
kubectl apply -f "file.yaml" --namespace=dev

Your proposal

We may accept that the Optional field can be None and propagate that when generating the yaml, in which case the generated yaml will include this for env injection:

        - name: K8S_NAMESPACE_NAME
          valueFrom:
            fieldRef:
              fieldPath: metadata.namespace

And of course we may remove the injection of namespace for all the generated objects in this case.

More info from official docs

https://kubernetes.io/docs/tasks/inject-data-application/environment-variable-expose-pod-information/
https://kubernetes.io/docs/tasks/inject-data-application/downward-api-volume-expose-pod-information/

Jul 30 '23 11:07 sansmoraxz

Hey @sansmoraxz,

This looks like a great feature. Since I see thwat you have digged in the code a bit, would you be up to implement it?

Jul 30 '23 11:07 JoanFM

Sure

Jul 30 '23 11:07 sansmoraxz

Please get in touch if you need any help, we will be welcoming your PR.

Jul 30 '23 11:07 JoanFM

For deployment address (viz. injected to gateway) I see that services are being referred to by their full DNS name. Is it ok to open a sh shell, to invoke the jina cli? That way we can refer to the injected env K8S_NAMESPACE_NAME to refer to the executors.

Alternatively we can use with just the service name rather than the full DNS name of the service.

Aug 06 '23 14:08 sansmoraxz

The namespace argument is propagated in the call to the to_kubernetes_yaml to every Deplpyment in the Flow, including the gateway one.

I am not sure I fully understand the question though

Aug 07 '23 00:08 JoanFM

I meant assume gateway has the following generated command invoked:

containers:
      - args:
        - gateway
        - --k8s-namespace
        - default
        - --expose-endpoints
        - '{}'
        - --host
        - '1'
        - --uses
        - GRPCGateway
        - --graph-description
        - '{"executor0": ["executor1"], "start-gateway": ["executor0"], "executor1":
          ["end-gateway"]}'
        - --deployments-addresses
        - '{"executor0": ["grpc://executor0.default.svc:8080"], "executor1": ["grpc://executor1.default.svc:8080"]}'
        - --port
        - '8080'
        - --port-monitoring
        - '9090'
        command:
        - jina

The deployment addresses are not really dynamic

Aug 07 '23 06:08 sansmoraxz

We can probably try to use something like this:

command:
        - /bin/bash
        - -c
        - jina
        - gateway
        - --k8s-namespace
        - $K8S_NAMESPACE_NAME

We invoke the bash first because directly passing it to the args doesn't seem to expand to the env value.

Same with the deployments-addresses. Although passing it the full service name is not really required.

This works just as well

'{"executor0": ["grpc://executor0:8080"], "executor1": ["grpc://executor1:8080"]}'

Aug 07 '23 06:08 sansmoraxz

Although passing it the full service name is not really required. This works just as well

@sansmoraxz you're right. The complete FQDN is not required in deployment addresses, as both gateway & the executors are in the same namespace. We rely on service meshes (like Linkerd) that recommend using FQDNs rather than shorter names to avoid ambiguity during routing, hence we follow the best practices.

Aug 07 '23 06:08 deepankarm

I see the problem of passing the namespace by env var to the entrypoint. Perhaps passing with ENV var syntax may work but I am not sure.

Otherwise I will take a look after holiday, thanks for your deep investigation.

Aug 07 '23 07:08 JoanFM

Alternatively the jina cli can be told to look for specific environment variable names (and files) and we don't need to override the command explicitly.

Aug 19 '23 16:08 sansmoraxz

For instance, look for the presence of /var/run/secrets/kubernetes.io or .dockerenv, to detect appropriate environments at startup. Since those are fixed name environment variable they can be directly passed to be loaded. This will also make the generated manifests a lot cleaner.

Aug 19 '23 16:08 sansmoraxz

I will try to look at it in more detail this incoming week and discuss some proposals about it

Aug 20 '23 10:08 JoanFM

Hey @sansmoraxz .

I will take a look at this soon.

Thanks for the patience

Aug 23 '23 07:08 JoanFM

Hey @sansmoraxz,

What do u think about this solution:

We allow the k8s-namespace to be Optional with default as default value.
If we see a None, we do as u say, we inject the NAMESPACE as u suggest
We pass --k8s-namespace $K8S_NAMESPACE in the args (I think this should work without needing to explicitly invoke /bin/bash in the entrypoint.
If we see k8s-namespace as None, we rmove the namespace from the executor addresses.

I believe this should enable this feature, what do you say?

Actually, I think this is exactly what you have been meaning all this time right? Here I just summarized all in the same message?

Aug 23 '23 08:08 JoanFM

My concern was that args didn't resolve for env.

Testing this small code on kubernetes 1.27:

apiVersion: v1
kind: Pod
metadata:
  name: command-demo
  labels:
    purpose: demonstrate-command
spec:
  containers:
  - name: command-demo-container
    image: alpine
    command: ["echo"]
    args: ["$ENV"]
    env:
    - name: ENV
      value: "Hello World!"
  restartPolicy: OnFailure

The output is $ENV

Aug 23 '23 13:08 sansmoraxz

My concern was that args didn't resolve for env.

Testing this small code on kubernetes 1.27:

apiVersion: v1
kind: Pod
metadata:
  name: command-demo
  labels:
    purpose: demonstrate-command
spec:
  containers:
  - name: command-demo-container
    image: alpine
    command: ["echo"]
    args: ["$ENV"]
    env:
    - name: ENV
      value: "Hello World!"
  restartPolicy: OnFailure

The output is $ENV

Looking at this issue:

https://github.com/kubernetes/kubernetes/issues/57726

It seems that using downards API as u suggest should work however

Aug 23 '23 13:08 JoanFM

OK so wrapping in $() does the trick. Interesting! this does tend to generally imply nested shell execution.

Aug 23 '23 14:08 sansmoraxz

OK so wrapping in $() does the trick. Interesting! this does tend to generally imply nested shell execution.

I am not sure honestly

Aug 23 '23 14:08 JoanFM

So then, we are good with this proposal?

Aug 23 '23 16:08 JoanFM

Yes. One thing I was considering was using jinja to dynamically make the resources. We can offload the code templating to it, which in turn would make the code a lot cleaner. This may go under a future proposal. Too many changes to be made.

Aug 23 '23 17:08 sansmoraxz

this can be another improvement, I agree, but I would not mix the actual feature with the nice refactoring

Aug 23 '23 17:08 JoanFM

@jina-ai/product This issue is stale because it has been open 90 days with no activity. Remove stale label or comment or this will be closed in 14 days

Nov 22 '23 00:11 jina-bot

@jina-ai/product This issue is stale because it has been open 90 days with no activity. Remove stale label or comment or this will be closed in 14 days

Feb 21 '24 00:02 jina-bot

@jina-ai/product This issue is stale because it has been open 90 days with no activity. Remove stale label or comment or this will be closed in 14 days

May 22 '24 00:05 jina-bot

Forgot that I had this in my backlog.

There was some blocker on my side (which I don't remember properly). Give me a week to find what it was and see if I can resolve it. Otherwise you may assign this to someone else.

May 22 '24 04:05 sansmoraxz

jina jina copied to clipboard

Dynamic k8s namespace for generating kubernetes yaml

jina
jina copied to clipboard