Growing sshd_config after restarting multiple times prevents SSH access #2581

dan-osterrath · 2018-11-23T22:33:10Z

RancherOS Version: (ros os version)
seen on:
rancherOS Base-1.1.0.93-dab52c1 (ami-0655e569)
rancherOS Base-1.1.0.94-49689c2 (ami-c50a86aa)

Where are you running RancherOS? (docker-machine, AWS, GCE, baremetal, etc.)
AWS

We start and stop our EC2 development instances frequently. It seems that at every startup the following 4 lines are appended to /etc/ssh/sshd_config. This went unnoticed for more than a year, until we suddenly could no longer SSH into our machines even though we had not changed anything. After a long investigation we found that the sshd daemon no longer starts because the sshd_config file contains the following 4 lines about 280 times:

UseDNS no
PermitRootLogin no
ServerKeyBits 2048
AllowGroups docker

It seems that this patch introduced this behaviour.

So we see 2 problems here:

Why are these 4 lines appended to sshd_config on every startup? They should be replaced, or appending should be skipped.
Why does sshd fail on startup when these parameters are appended too often? There are no error messages at all in /var/log/syslog. As long as sshd has not failed, the only line we see is:

sshd[907]: Server listening on 0.0.0.0 port 22.

We also could not determine a fixed threshold for the number of repetitions at which sshd fails. One of our instances failed at about 260 repetitions of these 4 lines.
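
The "appending should be skipped" option from the list above could look roughly like the following. This is a minimal sketch in Python, not the RancherOS code; the function name `ensure_lines` and the choice to work on the file contents as a string are assumptions made for illustration.

```python
# The four directives quoted in this issue.
REQUIRED_LINES = [
    "UseDNS no",
    "PermitRootLogin no",
    "ServerKeyBits 2048",
    "AllowGroups docker",
]


def ensure_lines(config_text, required=REQUIRED_LINES):
    """Return config_text with every required line present at least once.

    A line is appended only if no existing line equals it after stripping
    surrounding whitespace, so repeated calls do not grow the file.
    (Hypothetical helper, not part of RancherOS.)
    """
    existing = {line.strip() for line in config_text.splitlines()}
    missing = [line for line in required if line not in existing]
    if not missing:
        return config_text
    # Make sure the appended block starts on its own line.
    if config_text and not config_text.endswith("\n"):
        config_text += "\n"
    return config_text + "\n".join(missing) + "\n"
```

Calling `ensure_lines` on its own output returns the text unchanged, which is the property the boot-time code would need.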


dan-osterrath · 2018-11-23T22:38:35Z

We compared the 2 most recent overlays for /etc/ssh/sshd_config. See sshd_config.diff for details of how the config file changed over time.

dan-osterrath · 2018-11-23T22:41:31Z

A temporary workaround is to detach the volume from the EC2 instance, attach it as a secondary volume to another EC2 instance, manually remove the duplicate lines from the sshd_config file in the overlay, and then reattach the volume to the original EC2 instance.
Of course, this only works for the next X restarts.
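
The manual cleanup step can be scripted once the volume is mounted on the second instance. The sketch below is an illustration only: `collapse_repeats` is a hypothetical helper, and reading and writing the file in the mounted overlay is left to the caller because the overlay path is not given in this thread. It touches only the listed lines, so other directives that may legitimately appear more than once are left alone.

```python
def collapse_repeats(config_text, targets):
    """Keep the first occurrence of each target line and drop later copies.

    Only lines listed in `targets` are affected; all other lines, including
    comments, blank lines, and directives that may repeat on purpose, are
    passed through unchanged.
    """
    wanted = {t.strip() for t in targets}
    seen = set()
    kept = []
    for line in config_text.splitlines(keepends=True):
        key = line.strip()
        if key in wanted:
            if key in seen:
                continue  # later copy of an appended line: drop it
            seen.add(key)
        kept.append(line)
    return "".join(kept)
```

Passing the four lines quoted in the first comment as `targets` reduces the roughly 280 copies to one copy of each.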

niusmallnan · 2018-11-24T08:35:06Z

That PR should not be the cause: it was introduced in 1.3.0, but you are using 1.1.0.

It seems that you are not using the default console.
The default console will be rebuilt every boot, so it will not be set repeatedly.
Other console data is persistent, so these lines are constantly increasing.

This looks like a bug; we will fix it.
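
The difference between the two cases described above can be illustrated with a small simulation: a file that is recreated on every boot keeps the same size, while a persistent file that is appended to grows by four lines per boot. This is an illustrative Python sketch only, not RancherOS code; `BASE_CONFIG`, `boot_append`, `boot_regenerate`, and `line_count_after` are hypothetical names.

```python
BASE_CONFIG = "Port 22\n"  # hypothetical stand-in for the console's stock sshd_config
EXTRA_LINES = (
    "UseDNS no\n"
    "PermitRootLogin no\n"
    "ServerKeyBits 2048\n"
    "AllowGroups docker\n"
)


def boot_append(persistent_config):
    """Persistent console: the previous file survives and is appended to."""
    return persistent_config + EXTRA_LINES


def boot_regenerate(_previous_config):
    """Rebuilt console: the file is recreated from scratch on every boot."""
    return BASE_CONFIG + EXTRA_LINES


def line_count_after(boots, boot_step):
    """Simulate `boots` startups and return the resulting number of lines."""
    config = BASE_CONFIG
    for _ in range(boots):
        config = boot_step(config)
    return len(config.splitlines())
```

With 280 simulated boots, `boot_append` yields 1 + 4 * 280 = 1121 lines, while `boot_regenerate` stays at 5.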

gkirchner · 2018-11-29T11:26:11Z

> It seems that you are not using the default console.
> The default console will be rebuilt every boot, so it will not be set repeatedly.

We are in fact using the Ubuntu console.

kordeviant · 2019-01-15T16:46:25Z

I have the same problem on the Ubuntu console. @Aisuko, could you please tell us how to fix our existing RancherOS installation, or should we install a new one?

kingsd041 · 2019-01-16T02:05:56Z

We will fix this issue in v1.5.1 @kordeviant

kingsd041 · 2019-02-07T05:21:41Z

Fixed this issue in RancherOS v1.5.1-rc1
@dan-osterrath Thank you for your feedback

niusmallnan added kind/bug area/console labels Nov 24, 2018

niusmallnan assigned Aisuko Nov 30, 2018

niusmallnan added this to the v1.5.1 milestone Dec 25, 2018

Aisuko mentioned this issue Dec 25, 2018

[WIP] Fix the issue for growing sshd_config contents after restarting. #2598

Closed

niusmallnan unassigned Aisuko Jan 8, 2019

niusmallnan self-assigned this Jan 16, 2019

niusmallnan added the NEEDS_AUTOMATED_TESTS label Jan 22, 2019

This was referenced Jan 26, 2019

Generate sshd_config by go template #2663

Merged

Add sshd_config.tpl for each console rancher/os-services#183

Merged

niusmallnan added the status/to-test label Feb 5, 2019

kingsd041 closed this as completed Feb 7, 2019

niusmallnan added the area/documentation label Feb 13, 2019

niusmallnan added status/tested and removed area/documentation status/to-test labels Mar 18, 2019

kingsd041 mentioned this issue Apr 2, 2019

Add test case for ssh config cnrancher/os-tests#42

Merged
