Fail to init aws cluster with the message "could not init cloud provider "aws": error finding instance ... timeout #2359

arrcher · 2020-12-03T16:35:41Z

What keywords did you search in kubeadm issues before filing this one?

could not init cloud provider

kubeadm init --cloud-provider aws

Is this a BUG REPORT or FEATURE REQUEST?

BUG REPORT

Versions

kubeadm version: v1.19.4

Environment:

Kubernetes version: v1.19.4
Cloud provider or hardware configuration: aws
OS: Red Hat Enterprise Linux, 8.3 (Ootpa)
Kernel: Linux ip-10-83-62-10.ec2.internal 4.18.0-240.1.1.el8_3.x86_64 kubeadm join on slave node fails preflight checks #1 SMP Fri Oct 16 13:36:46 EDT 2020 x86_64 x86_64 x86_64 GNU/Linux
Others:
config file:

cat ./kubeadm.yaml
---
apiVersion: kubeadm.k8s.io/v1beta2
kind: ClusterConfiguration
apiServer:
  extraArgs:
    cloud-provider: aws
clusterName: cdspidr
controlPlaneEndpoint: ip-10-83-62-10.ec2.internal
controllerManager:
  extraArgs:
    cloud-provider: aws
    configure-cloud-routes: "false"
kubernetesVersion: stable
networking:
  dnsDomain: cluster.local
  podSubnet: 10.83.62.0/24
---
apiVersion: kubeadm.k8s.io/v1beta2
kind: InitConfiguration
nodeRegistration:
  kubeletExtraArgs:
    cloud-provider: aws

What happened?

sudo kubeadm init --config=kubeadm.yaml -v=5 > ./kubeadm-run.txt 2>&1

kubeadm will never fully initialize. the output shows

[kubelet-check] Initial timeout of 40s passed.
[kubelet-check] It seems like the kubelet isn't running or healthy.
[kubelet-check] The HTTP call equal to 'curl -sSL http://localhost:10248/healthz' failed with error: Get "http://localhost:10248/healthz": dial tcp [::1]:10248: connect: connection refused.
[kubelet-check] It seems like the kubelet isn't running or healthy.
[kubelet-check] The HTTP call equal to 'curl -sSL http://localhost:10248/healthz' failed with error: Get "http://localhost:10248/healthz": dial tcp [::1]:10248: connect: connection refused.

and journalctl -xeu kubelet shows

12025 aws.go:1235] Building AWS cloudprovider
Dec 03 14:34:29 ip-10-83-62-10.ec2.internal kubelet[12025]: I1203 14:34:29.111272   12025 aws.go:1195] Zone not specified in configuration file; querying AWS metadata service
Dec 03 14:36:29 ip-10-83-62-10.ec2.internal kubelet[12025]: F1203 14:36:29.464611   12025 server.go:265] failed to run Kubelet: could not init cloud provider "aws": error finding instance : \"RequestError: send request failed\\ncaused by: Post \\\"https://ec2.us-east-1.amazonaws.com/\\\": dial tcp 10.83.60.25:443: i/o timeout\""
Dec 03 14:36:29 ip-10-83-62-10.ec2.internal kubelet[12025]: goroutine 1 [running]:

configs in /etc/manifests/ contains env settings with proxy variables.

While from terminal with same proxy settings the
curl -v https://ec2.us-east-1.amazonaws.com/ does not timeout and returns data.

What you expected to happen?

I would expect kubeadm init --cloud-provider aws to successfully complete.

How to reproduce it (as minimally and precisely as possible)?

Anything else we need to know?

It's entirely private VPC

The text was updated successfully, but these errors were encountered:

neolit123 · 2020-12-03T16:53:06Z

hello,

kubeadm seems to be doing its job to pass the flags to the components that need them, so this doesn't look like a kubeadm issue. also note that the kubeadm team does not have e2e signal for any of the legacy (non-external) cloud providers in kubernetes and we do not know if they are working as expected.

this guide on the web seem to suggest that what you are doing is valid for AWS:
https://blog.scottlowe.org/2019/08/14/setting-up-aws-integrated-kubernetes-115-cluster-kubeadm/

Dec 03 14:36:29 ip-10-83-62-10.ec2.internal kubelet[12025]: F1203 14:36:29.464611 12025 server.go:265] failed to run Kubelet: could not init cloud provider "aws": error finding instance : "RequestError: send request failed\ncaused by: Post \"https://ec2.us-east-1.amazonaws.com/\\\": dial tcp 10.83.60.25:443: i/o timeout""

this looks like a connectivity issue on the side of the kubelet.

before logging a ticket in kubernetes/kubernetes about this and tagging it with /sig node cloud-provider i'd encourage you to try to get feedback by other users on our support channels like slack, stackoverflow, discuss:
https://github.com/kubernetes/kubernetes/blob/master/SUPPORT.md

thanks
/kind support
/close

k8s-ci-robot · 2020-12-03T16:53:21Z

@neolit123: Closing this issue.

In response to this:

hello,

kubeadm seems to be doing its job to pass the flags to the components that need them, so this doesn't look like a kubeadm issue. also note that the kubeadm team does not have e2e signal for any of the legacy (non-external) cloud providers in kubernetes and we do not know if they are working as expected.

this guide on the web seem to suggest that what you are doing is valid for AWS:
https://blog.scottlowe.org/2019/08/14/setting-up-aws-integrated-kubernetes-115-cluster-kubeadm/

Dec 03 14:36:29 ip-10-83-62-10.ec2.internal kubelet[12025]: F1203 14:36:29.464611 12025 server.go:265] failed to run Kubelet: could not init cloud provider "aws": error finding instance : "RequestError: send request failed\ncaused by: Post \"https://ec2.us-east-1.amazonaws.com/\\\": dial tcp 10.83.60.25:443: i/o timeout""

this like a connectivity issue on the side of the kubelet.

before logging a ticket in kubernetes/kubernetes about this and tagging it with /sig node cloud-provider i'd encourage you to try to get feedback by other users on our support channels like slack, stackoverflow, discuss:
https://github.com/kubernetes/kubernetes/blob/master/SUPPORT.md

thanks
/kind support
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

k8s-ci-robot added the kind/support Categorizes issue or PR as a support question. label Dec 3, 2020

k8s-ci-robot closed this as completed Dec 3, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fail to init aws cluster with the message "could not init cloud provider "aws": error finding instance ... timeout #2359

Fail to init aws cluster with the message "could not init cloud provider "aws": error finding instance ... timeout #2359

arrcher commented Dec 3, 2020

neolit123 commented Dec 3, 2020 •

edited

Loading

k8s-ci-robot commented Dec 3, 2020

Fail to init aws cluster with the message "could not init cloud provider "aws": error finding instance ... timeout #2359

Fail to init aws cluster with the message "could not init cloud provider "aws": error finding instance ... timeout #2359

Comments

arrcher commented Dec 3, 2020

What keywords did you search in kubeadm issues before filing this one?

Is this a BUG REPORT or FEATURE REQUEST?

Versions

What happened?

What you expected to happen?

How to reproduce it (as minimally and precisely as possible)?

Anything else we need to know?

neolit123 commented Dec 3, 2020 • edited Loading

k8s-ci-robot commented Dec 3, 2020

neolit123 commented Dec 3, 2020 •

edited

Loading