Why does kilo get cluster config using kubeconfig (or API server URL flag) when it has a service account? #49

Closed
adamkpickering opened this issue Mar 20, 2020 · 16 comments · Fixed by #212

@adamkpickering

While setting up kilo on a k3s cluster I noticed that it uses --kubeconfig or --master to get the config that is used when interfacing with the cluster. This code can be seen here.

This seems like a security problem - why should kilo require access to my kubeconfig, which contains credentials that have the power to do anything to the cluster? Moreover, it seems redundant: I looked through kilo-k3s-flannel.yaml (which is what I used to get it working) and noticed that a service account is created for kilo with all of the permissions it should need.

This example (see main.go) uses this function to get the config. Can kilo not use this function instead?

I'm new to interfacing applications with Kubernetes clusters, so my apologies if I'm missing something. If it'd be welcome, I'd be happy to submit a pull request for this.

@squat
Owner

squat commented May 11, 2020

Hi Adam, let's continue the conversation from #27 here. Thanks for investing time digging into the code.

in the event that we need to run kilo before we have a network fabric running. If we do have a network fabric running, the user doesn't have to worry about where the api server is

It's a little bit tricky. In order to establish a connection to the Kubernetes API via a service IP, Kubernetes requires four things:

0. kube-proxy is running (or an equivalent, e.g. kube-router) and can create rules to map service IPs to real IPs;
1. the IP address backing the service IP is routable;
2. kube-proxy (or equivalent) has a special kubeconfig with a non-service IP address for the API (see the sketch after this list); and
3. the non-service IP address is reachable from the given node.
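
A minimal sketch of such a kubeconfig, purely for illustration (the server address, file paths, and user name below are hypothetical, not values Kilo or any particular installer uses):

apiVersion: v1
kind: Config
clusters:
- name: default
  cluster:
    # a node or load-balancer address, not a service IP
    server: https://10.0.0.10:6443
    certificate-authority: /etc/kubernetes/pki/ca.crt
users:
- name: kube-proxy
  user:
    tokenFile: /var/run/secrets/kubernetes.io/serviceaccount/token
contexts:
- name: default
  context:
    cluster: default
    user: kube-proxy
current-context: default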

In Kilo's case, because we want to be able to build clusters without a shared private network (e.g. multi-cloud), these four requirements are not always guaranteed.

3 and 2 are generally OK: most Kubernetes installers are smart and provision kubeconfigs with a DNS name that resolves to the private IP when inside the cloud's VPC and to a public IP when outside, e.g. from another cloud.

0 and 1, on the other hand, we cannot know for sure in multi-cloud environments. The problem is that most of the time, the IP address backing a service IP is the master node's private IP address, which will not be routable from nodes in other data centers. This means that even if kube-proxy is installed, we cannot guarantee that service IPs will work until Kilo is running and makes the private IPs routable.

So the two ways forward are:

0. always use a special Kubeconfig, which we can expect to have the correct address in order to reach the API from anywhere; or
1. tell the user to edit the Kilo manifest before deploying it to write the publicly accessible address for the API.

Each has its upsides and downsides:

  • 0 evidently doesn't work out of the box on k3s, since k3s isn't provisioned by a traditional installer and worker nodes are therefore not populated with a valid kubeconfig; and where it does work, Kilo uses a privileged kubeconfig with more power than it requires.
  • 1 requires extra user intervention even if the kubeconfig would have worked.

What do you think is the best way forward?

@fire
Contributor

fire commented May 23, 2020

Can you provide option 1 (the one requiring extra user intervention) as an alternative?

@squat
Copy link
Owner

squat commented May 24, 2020

Hi @fire, option 1 should already be doable without any new code. Kilo accepts a --master flag to set the URL of the Kubernetes API, so users can simply add that flag to any Kilo manifest and remove the --kubeconfig flag and the volume mount for the in-cluster kubeconfig in order to use only the service account. Do you think this should be an additional manifest in the manifests directory? If so, I'm very happy to merge that PR. The nice thing about it is that it is not installer-specific, so we would only need one rather than one each for kubeadm, bootkube, etc.
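
For illustration, a minimal sketch of what the container args could look like with that option (the address https://1.2.3.4:6443 is a placeholder for your publicly reachable API endpoint):

      containers:
      - name: kilo
        image: squat/kilo
        args:
        # point Kilo directly at the API server; with no --kubeconfig flag
        # and no kubeconfig volume mount, only the service account is used
        - --master=https://1.2.3.4:6443
        - --hostname=$(NODE_NAME)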

@fire
Contributor

fire commented May 24, 2020

I would like that, but this codebase is unfamiliar to me.

@eddiewang
Contributor

Would love to see a more permanent fix for this. k3s seems to not work great with kilo atm.

@jbrinksmeier

jbrinksmeier commented Jul 31, 2020

Hi @squat
I would like to continue with the solution you proposed earlier:

1. tell the user to edit the Kilo manifest before deploying it to write the publicly accessible address for the API.

I gave this a try, but ran into the rather obvious point that my cluster runs on self-signed certificates, and therefore the Kilo pods refuse to communicate with the endpoint given via the --master flag.
I solved this by building my own squat/kilo image with the ca-certificates package added and then mounting the kube-ca.pem into the Kilo pods. I also had to edit the entrypoint to rebuild the CA bundle on startup.
Here is the Dockerfile I came up with:

FROM squat/kilo
# Install the ca-certificates package so additional CAs can be added to the trust store.
RUN apk update && apk add ca-certificates
# Rebuild the CA bundle (picking up any mounted certificates) before starting Kilo.
ENTRYPOINT ["/bin/sh", "-c", "update-ca-certificates && /opt/bin/kg"]
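
And a sketch of how the CA could be fed into that image (the hostPath below is an assumption, adjust it to wherever your kube-ca.pem actually lives; on Alpine, update-ca-certificates picks up certificates from /usr/local/share/ca-certificates):

        volumeMounts:
        - name: kube-ca
          # mounted with a .crt extension so update-ca-certificates picks it up
          mountPath: /usr/local/share/ca-certificates/kube-ca.crt
          readOnly: true
[...]
      volumes:
      - name: kube-ca
        hostPath:
          path: /etc/kubernetes/ssl/kube-ca.pem
          type: File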

As I think this is a use case that would interest others too, would you accept a PR for this?

@jbrinksmeier

jbrinksmeier commented Jul 31, 2020

I just realized that since applying the changes outlined above, the Kilo pods no longer respect the --subnet flag and build a CIDR of 10.4.0.0/24 (which is the default, I guess). Any idea how to dig deeper there?

@baurmatt
Contributor

Also interested in seeing this fixed! :)

@unixfox

unixfox commented Feb 11, 2021

Meanwhile, I developed an init container that inserts a kubeconfig for Kilo.

Here is the DaemonSet YAML file:

apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: kilo
  namespace: kube-system
  labels:
    app.kubernetes.io/name: kilo
spec:
  selector:
    matchLabels:
      app.kubernetes.io/name: kilo
  template:
    metadata:
      labels:
        app.kubernetes.io/name: kilo
    spec:
      serviceAccountName: kilo
      hostNetwork: true
      containers:
      - name: kilo
        image: squat/kilo
        args:
        - --kubeconfig=/etc/kubernetes/kubeconfig
        - --hostname=$(NODE_NAME)
        env:
        - name: NODE_NAME
          valueFrom:
            fieldRef:
              fieldPath: spec.nodeName
        securityContext:
          privileged: true
        volumeMounts:
        - name: cni-conf-dir
          mountPath: /etc/cni/net.d
        - name: kilo-dir
          mountPath: /var/lib/kilo
        - name: kubeconfig
          mountPath: /etc/kubernetes
          readOnly: true
        - name: lib-modules
          mountPath: /lib/modules
          readOnly: true
        - name: xtables-lock
          mountPath: /run/xtables.lock
          readOnly: false
      initContainers:
      # generates a kubeconfig pointing at MASTER_URL and writes it to the
      # shared kubeconfig volume consumed by the main Kilo container
      - name: generate-kubeconfig
        image: unixfox/kilo-kubeconfig
        imagePullPolicy: Always
        volumeMounts:
        - name: kubeconfig
          mountPath: /etc/kubernetes
        env:
        - name: MASTER_URL
          value: "your.kube.api:6443"
      - name: install-cni
        image: squat/kilo
        command:
        - /bin/sh
        - -c
        - set -e -x;
          cp /opt/cni/bin/* /host/opt/cni/bin/;
          TMP_CONF="$CNI_CONF_NAME".tmp;
          echo "$CNI_NETWORK_CONFIG" > $TMP_CONF;
          rm -f /host/etc/cni/net.d/*;
          mv $TMP_CONF /host/etc/cni/net.d/$CNI_CONF_NAME
        env:
        - name: CNI_CONF_NAME
          value: 10-kilo.conflist
        - name: CNI_NETWORK_CONFIG
          valueFrom:
            configMapKeyRef:
              name: kilo
              key: cni-conf.json
        volumeMounts:
        - name: cni-bin-dir
          mountPath: /host/opt/cni/bin
        - name: cni-conf-dir
          mountPath: /host/etc/cni/net.d
      tolerations:
      - effect: NoSchedule
        operator: Exists
      - effect: NoExecute
        operator: Exists
      volumes:
      - name: cni-bin-dir
        hostPath:
          path: /opt/cni/bin
      - name: cni-conf-dir
        hostPath:
          path: /etc/cni/net.d
      - name: kilo-dir
        hostPath:
          path: /var/lib/kilo
      - name: kubeconfig
        hostPath:
          path: /etc/kilo-kubeconfig
      - name: lib-modules
        hostPath:
          path: /lib/modules
      - name: xtables-lock
        hostPath:
          path: /run/xtables.lock
          type: FileOrCreate

As for the service account, I don't know what RBAC permissions Kilo requires, so the current ones may not be enough. If you have any idea about that, please let me know!

The source code of the project is located here: https://bitbucket.org/unixfox/kilo-kubeconfig/src/master/

@jawabuu

jawabuu commented May 10, 2021

@unixfox This is a great solution to the issue and should probably be merged into kilo.
I have tested it in k3s.
Is it possible to make the namespace (defaulting to kube-system) configurable?

@unixfox

unixfox commented May 10, 2021

@unixfox This is a great solution to the issue and should probably be merged into kilo.
I have tested it in k3s.
Is it possible to make the namespace (defaulting to kube-system) configurable?

As you can see in the YAML I don't specify any namespace because you can deploy it to whatever namespace you want.

@jawabuu

jawabuu commented May 10, 2021

@unixfox
In your entrypoint script, I would like to understand the significance of the namespace declaration
https://bitbucket.org/unixfox/kilo-kubeconfig/src/d91d836fbf08f07a47f86f9f3458bf6410fdd62a/ENTRYPOINT.sh#lines-20,21,22,23,24,25,26,27

contexts:
- context:
    cluster: kilo
    namespace: kube-system
    user: kilo
  name: kilo

@unixfox

unixfox commented May 10, 2021

@unixfox
In your entrypoint script, I would like to understand the significance of the namespace declaration
https://bitbucket.org/unixfox/kilo-kubeconfig/src/d91d836fbf08f07a47f86f9f3458bf6410fdd62a/ENTRYPOINT.sh#lines-20,21,22,23,24,25,26,27

contexts:
- context:
    cluster: kilo
    namespace: kube-system
    user: kilo
  name: kilo

It's just the default namespace that will be used when no namespace is provided, for example in a kubectl command. Just deploy my YAML file into the kube-system namespace and you will be fine (I just edited it to make that more obvious).

@jawabuu

jawabuu commented May 10, 2021

@unixfox Thanks for the clarification.
My only concern was whether it would still function if I deployed Kilo in another namespace, e.g. kilo.

@unixfox

unixfox commented May 10, 2021

@unixfox Thanks for the clarification.
My only concern was whether it would still function if I deployed Kilo in another namespace, e.g. kilo.

Well, I'm not sure whether that will work; I haven't tested it, though.
If it doesn't work, get back to me and I'll see if I can do something about it.

@stv0g
Contributor

stv0g commented Jul 13, 2021

Here is an iteration of @unixfox's approach which:

  • gets rid of the dedicated Docker image
  • works for any namespace in which Kilo might run
  • autodetects the API server URL, as more recent k3s versions use a dynamic port number
  • writes the generated kubeconfig to a temporary emptyDir volume
---
apiVersion: v1
kind: ConfigMap
metadata:
  name: kilo-scripts
  namespace: kube-system
data:
  init.sh: |
    #!/bin/sh

    cat > /etc/kubernetes/kubeconfig <<EOF
        apiVersion: v1
        kind: Config
        name: kilo
        clusters:
        - cluster:
            server: $(sed -n 's/.*server: \(.*\)/\1/p' /var/lib/rancher/k3s/agent/kubelet.kubeconfig)
            certificate-authority: /var/lib/rancher/k3s/agent/server-ca.crt
        users:
        - name: kilo
          user:
            token: $(cat /var/run/secrets/kubernetes.io/serviceaccount/token)
        contexts:
        - name: kilo
          context:
            cluster: kilo
            namespace: ${NAMESPACE}
            user: kilo
        current-context: kilo
    EOF

Add the following initContainer to the Kilo DaemonSet:

[...]
      initContainers:
      - name: generate-kubeconfig
        image: busybox
        command:
        - /bin/sh
        args:
        - /scripts/init.sh
        imagePullPolicy: Always
        volumeMounts:
        - name: kubeconfig
          mountPath: /etc/kubernetes
        - name: scripts
          mountPath: /scripts/
          readOnly: true
        - name: k3s-agent
          mountPath: /var/lib/rancher/k3s/agent/
          readOnly: true
        env:
        - name: NAMESPACE
          valueFrom:
            fieldRef:
              fieldPath: metadata.namespace

And add the following volumes as well:

[...]
      volumes:
      - name: scripts
        configMap:
          name: kilo-scripts
      - name: kubeconfig
        emptyDir: {}
      - name: k3s-agent
        hostPath:
          path: /var/lib/rancher/k3s/agent
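
The Kilo container itself then consumes the generated file in the same way as in the DaemonSet above, i.e. something like:

        args:
        - --kubeconfig=/etc/kubernetes/kubeconfig
        - --hostname=$(NODE_NAME)
        volumeMounts:
        - name: kubeconfig
          mountPath: /etc/kubernetes
          readOnly: true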

stv0g added a commit to stv0g/kilo that referenced this issue Jul 14, 2021
…r address & cacert from kubelet kubeconfig (closes squat#49)
stv0g added a commit to stv0g/kilo that referenced this issue Jul 15, 2021
…r address & cacert from kubelet kubeconfig (closes squat#49)