Proposal: Provide a flag to disable mTLS for agones-allocator #1590

pooneh-m · 2020-05-27T22:35:28Z

Is your feature request related to a problem? Please describe.
The agones-allocator service has mTLS authentication built in, which is used when it is exposed as a LoadBalancer service and to enable the multi-cluster allocation solution.

However, a NodePort service fronted by an Ingress does not need an authentication mechanism built into the service. The Ingress can provide mTLS along with other forms of authentication, and Ingress offerings from cloud providers handle authentication themselves. Sidecar proxies such as Envoy can also provide mTLS support.

Therefore, having mTLS built into the service prevents the adoption of these other solutions.

Describe the solution you'd like
Provide a flag to disable mTLS for agones-allocator.
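As a rough illustration of what such a flag could control, here is a minimal Go sketch. The flag name `disable-mtls` and the helpers `parseArgs`, `newServerTLSConfig` and `clientCertRequired` are hypothetical names invented for this example, not the allocator's actual code. It also assumes the server keeps serving plain TLS when mTLS is off, which the proposal does not specify.

```go
package main

import (
	"crypto/tls"
	"crypto/x509"
	"flag"
	"fmt"
	"os"
)

// parseArgs reads the hypothetical -disable-mtls flag from args.
func parseArgs(args []string) (bool, error) {
	fs := flag.NewFlagSet("allocator-sketch", flag.ContinueOnError)
	disable := fs.Bool("disable-mtls", false, "do not request or verify client certificates")
	if err := fs.Parse(args); err != nil {
		return false, err
	}
	return *disable, nil
}

// newServerTLSConfig builds the server-side TLS settings.
// With disableMTLS set, no client certificate is requested (assumption: the
// endpoint still serves TLS). Otherwise a client certificate signed by a CA
// in clientCAs is required.
func newServerTLSConfig(disableMTLS bool, clientCAs *x509.CertPool) *tls.Config {
	cfg := &tls.Config{MinVersion: tls.VersionTLS12}
	if disableMTLS {
		cfg.ClientAuth = tls.NoClientCert
		return cfg
	}
	cfg.ClientAuth = tls.RequireAndVerifyClientCert
	cfg.ClientCAs = clientCAs
	return cfg
}

// clientCertRequired reports whether cfg enforces mTLS.
func clientCertRequired(cfg *tls.Config) bool {
	return cfg.ClientAuth == tls.RequireAndVerifyClientCert
}

func main() {
	disableMTLS, err := parseArgs(os.Args[1:])
	if err != nil {
		os.Exit(2)
	}
	cfg := newServerTLSConfig(disableMTLS, x509.NewCertPool())
	fmt.Println("client certificate required:", clientCertRequired(cfg))
}
```

With mTLS turned off this way, authentication has to happen in front of the service, for example at an Ingress or a sidecar proxy.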

Describe alternatives you've considered

The service only enables mTLS if it is of type LoadBalancer.
Disable mTLS by default, change the service type to NodePort, and document how to set up an Ingress resource. Currently, because the IP address is not allocated before Agones is installed, Agones cannot issue a certificate that the service can use immediately after installation, so an additional step is required.


markmandel · 2020-05-27T23:36:40Z

This sounds interesting - how does this impact cross-cluster communication (for allocation failover)? Would that also be unencrypted?

Or would there be multiple services - one with mTLS (for inter-cluster comms) and one without (for external system interaction through a load balancer)?

pooneh-m · 2020-05-27T23:44:59Z

For multi-cluster allocation, mTLS is still used to secure the connections. However, if mTLS authentication is disabled on agones-allocator, mTLS support should be provided by other means, such as a cloud provider, an Ingress, or sidecar proxies.

luna-duclos · 2020-05-28T05:37:59Z

I would very much be a fan of this; IAP or a similar load-balancer-level solution can do the job of securing the communication!
This does mean, however, that this isn't useful without the ability to tell the Agones allocator how to authenticate.

In that light, perhaps we should go for a flag that specifies how to authenticate instead? Options could be disabled, mTLS, gcpserviceaccount, etc.
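A flag like that could be modelled as a small enumerated type. The sketch below is only an illustration: the type `AuthMode`, the function `ParseAuthMode` and the exact accepted strings are assumptions based on the options listed above, not an existing Agones API.

```go
package main

import (
	"fmt"
	"strings"
)

// AuthMode names one way the allocator could authenticate callers.
// The values mirror the options suggested in the comment above.
type AuthMode string

const (
	AuthDisabled          AuthMode = "disabled"
	AuthMTLS              AuthMode = "mtls"
	AuthGCPServiceAccount AuthMode = "gcpserviceaccount"
)

// ParseAuthMode validates a flag value, ignoring case and surrounding spaces.
func ParseAuthMode(s string) (AuthMode, error) {
	switch m := AuthMode(strings.ToLower(strings.TrimSpace(s))); m {
	case AuthDisabled, AuthMTLS, AuthGCPServiceAccount:
		return m, nil
	default:
		return "", fmt.Errorf("unknown auth mode %q (want disabled, mtls or gcpserviceaccount)", s)
	}
}

func main() {
	for _, in := range []string{"mTLS", "disabled", "basic"} {
		mode, err := ParseAuthMode(in)
		if err != nil {
			fmt.Println("error:", err)
			continue
		}
		fmt.Println("auth mode:", mode)
	}
}
```

Rejecting unknown values at startup avoids silently falling back to an unauthenticated mode when the flag is mistyped.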

markmandel · 2020-05-28T16:59:00Z

This sounds like a good idea to me!

TBBle · 2020-06-23T14:50:06Z

If we're able to use an Ingress for this service with mTLS disabled on the service, it should be possible to have CertManager take care of issuing the mTLS certificates for the server side, and have the Ingress validate the mTLS connection, assuming your Ingress implementation supports mTLS.

Note that I haven't actually tested mTLS as below, it's put together from the docs, and looking at our existing CertManager-based TLS-secured Ingresses. I do have pending use-cases for mTLS, but have not implemented any of them locally.

Similar to the current approach, you'd create your CA Certificate, and then load it into a CA Issuer, and then annotate your Ingress, and CertManager generates a certificate for you. (Note that this won't work with the ACME Issuer used with e.g., LetsEncrypt as the ca.crt is not populated, so (I think but haven't tested) you can't do client-certificate validation).

You can either issue client certs by hand yourself, or use kubectl create to create Certificate resources with the right key usages, which will populate a Secret you can extract and use elsewhere. This part is similar to the mTLS server-side setup flow, although that uses a self-signed certificate as the TLS certificate, and bears no relationship to the client-presented self-signed certificate.

In this flow, NGINX Ingress appears to lack any way to limit which certificates to accept beyond the depth of the path from your CA, a feature of the allocation service's own mTLS support. On the other hand, NGINX Ingress doesn't require you to list trusted certificates to accept, a cost of the allocation service's own mTLS support.
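To make the trade-off concrete, here is a hedged Go sketch of how CA-based verification and an explicit accept-list could be combined in one server using `crypto/tls`. The helpers `fingerprint` and `allowListVerifier` are hypothetical, and keying the accept-list on a SHA-256 fingerprint is an assumption for this example; it is not a description of how the allocation service or NGINX Ingress work internally.

```go
package main

import (
	"crypto/sha256"
	"crypto/tls"
	"crypto/x509"
	"encoding/hex"
	"errors"
	"fmt"
)

// fingerprint returns the hex SHA-256 digest of a DER-encoded certificate.
func fingerprint(der []byte) string {
	sum := sha256.Sum256(der)
	return hex.EncodeToString(sum[:])
}

// allowListVerifier returns a callback for tls.Config.VerifyPeerCertificate
// that accepts only leaf certificates whose fingerprint is in allowed.
func allowListVerifier(allowed map[string]bool) func([][]byte, [][]*x509.Certificate) error {
	return func(rawCerts [][]byte, _ [][]*x509.Certificate) error {
		if len(rawCerts) == 0 {
			return errors.New("no client certificate presented")
		}
		// rawCerts[0] is the leaf certificate sent by the client.
		if !allowed[fingerprint(rawCerts[0])] {
			return errors.New("client certificate is not on the accept-list")
		}
		return nil
	}
}

func main() {
	// Placeholder bytes stand in for a real DER-encoded client certificate.
	leaf := []byte("placeholder DER bytes")
	verify := allowListVerifier(map[string]bool{fingerprint(leaf): true})

	cfg := &tls.Config{
		ClientAuth:            tls.RequireAndVerifyClientCert, // chain must verify against ClientCAs
		ClientCAs:             x509.NewCertPool(),             // would hold the CA certificate
		VerifyPeerCertificate: verify,                         // extra accept-list check
	}
	fmt.Println("verifier installed:", cfg.VerifyPeerCertificate != nil)
	fmt.Println("listed cert:", verify([][]byte{leaf}, nil))
	fmt.Println("unlisted cert:", verify([][]byte{[]byte("other")}, nil))
}
```

In this arrangement the CA check runs first as part of the handshake, and the accept-list narrows the result further, at the cost of having to maintain the list.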

To block compromised certificates, a CRL given to NGINX Ingress, or a CRL Distribution Point in the certificates you create with CertManager would work, although compared to allocator-service's accept-list, this is a reject-list. And apart from embedding the CRL Distribution Point URL, CertManager doesn't currently appear to support revocation in any way, so you have to run the SSL commands yourself. It also doesn't yet support things like OCSP Stapling or the OCSP server URL, so if you need to manage who can access your server more carefully than certificate expiry dates allow, this side will prove painful.

I'm curious if it would be hard to make the allocator service's mTLS support work with CAs and existing Secret layouts, rather than having specialised lists of trusted self-signed certificates in both directions. Or is that just the example?

TBBle · 2020-06-23T15:00:54Z

More on-topic, NodePort seems like the wrong default if mTLS is disabled by default, since that's potentially open-to-the-world if you're not careful. It should default to ClusterIP, although even having it there by default with mTLS disabled allows bypassing the RBAC permissions for create GameServerAllocation to anyone on your cluster who can discover the Service/Endpoint. (Also true of NodePort and Ingress-based setups, so perhaps that's not something we're worried about, if we disable the built-in mTLS?)

Personally, I would have the allocation service off-by-default, while its primary purpose is serving multi-cluster allocation requests from the outside world, since any valid setup requires some configuration effort anyway, even if it's just a bunch of annotations in Helm. I've turned it off on my clusters as we're not using it.

devloop0 · 2020-06-29T18:07:36Z

#1645 partially addresses this issue. Note that we still need to get testing working for this feature and document the Helm configuration parameter.

markmandel · 2020-12-16T23:42:25Z

Doing some cleanup - has this been completed? @pooneh-m @devloop0 ?

pooneh-m added the kind/feature (New features for Agones) and help wanted (We would love help on these issues. Please come help us!) labels May 27, 2020

markmandel added the area/operations (Installation, updating, metrics etc) label May 28, 2020

TBBle mentioned this issue Jun 23, 2020

Document best practices for GameServer Allocation #1594

Closed

devloop0 mentioned this issue Jun 24, 2020

Conditionally enable mtls for the allocator. #1645

Merged

pooneh-m closed this as completed Dec 17, 2020

markmandel added this to the 1.11.0 milestone Dec 17, 2020
