-
Notifications
You must be signed in to change notification settings - Fork 145
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Elastic Agent non-Windows host - Agent doesn't finish installing until after re-starting the service #150
Comments
Pinging @elastic/agent (Team:Agent) |
Pinging @elastic/elastic-agent-control-plane (Team:Elastic-Agent-Control-Plane) |
There were some fixes related to i18n. Can this be tested again? |
@dikshachauhan-qasource @amolnater-qasource thanks for the help, can you re-test and post back? |
Hi @EricDavisX Due to Vsphere issue today, we were unable to validate this. However, we will reattempt it again and will share observation accordingly. Thanks |
Hi @EricDavisX We have re-attempted to validate the current behavior of agent with endpoint on non-english VM and found same observations as shared above.
Build details: Please let me know if info is required from our end. Thanks |
I realize now the testing we did was after the expected fixes - I am not surprised here. @andresrc @jlind23 if you think this is expected world-wide usage, we could prioritize the research? At least Michal has Endgame cluster vSphere access to a non-English host we can spin up for him if desired / needed. and we can get more folks access as needed to spread the work-load, let me know in email. |
@EricDavisX did you check that the build number was with the related fixes? If yes, then it seems that we missed some things. @andresrc would you mind giving more context here? English hosts works fine? |
@jlind23 hi - glad to confirm, the 7.15.1 build is intended to have the fixes. Perhaps if we review the backport we'd find a problem? original pr is elastic/beats#26665 This is a testing hole we were lucky to find (Alvaro found it for us!) and after much review, we found code that needed to reference a translated-to-different-language word, in this case the folder on the host is called 'administradores' instead of 'administrators' in Windows so the code was updated. It was believed to be working, but QA reported this immediately after the fixes went into full testing and so we can follow up again. There is a pair of hosts in the Endgame vSphere cluster that are prepped for testing if we want them. IT helpdesk can grant access to you or others as needed. |
@EricDavisX saw another thread in which blake was saying that maybe the version was not downloaded properly. May it be related to this? #148 |
Thanks. Assigning to @amolnater-qasource to follow up and see if we can reproduce, and if so if we can isolate the downloading failure or prove it is something else. |
Hi @EricDavisX We have revalidated this issue by installing elastic-agent on French VM and found this issue still reproducible.
Kibana/Build details:
We observed below [error] log line for metricbeat under logs tab: Debug level logs: Note: cc: @jlind23 |
Port 9200 is used for elasticsearch communication isn't it? @amolnater-qasource is your elasticsearch pod running? |
Looks like a configuration issue as metricbeat cannot communicate with elasticsearch. Did you configure the output settings correctly in the Fleet UI? |
In the description it is cited as a 7.14 BC4 cloud stack, I'm betting the test didn't include changing anything in the fleet output settings. I'm unassigning Amol, seems like it needs an engineering owner |
Hi @jlind23
We were testing upon 7.15.1 cloud production environment and even agents installed Healthy on other OS's other than non-English Windows host.
Do we need to do any separate settings to install agent on non-English Windows host?
Please let us know if anything else is required from our end. |
As @blakerouse is out maybe @michalpristas could give us pointers about non-English Windows host configuration. |
@amolnater-qasource i'm digging hold issues. Is this something still relevant? |
Hi @jlind23
No issues are observed on Spanish and Korean VM
However this issue is still reproducible on French VM.
Screenshot: Build details: Please let us know if anything else is required from our end. |
Hi @jlind23
Build details: Thanks |
@amolnater-qasource do we have logs saying why the agent is stuck? @mdelapenya do you know if we can select the OS language in the e2e testing framework? |
Hi @jlind23 Logs: Further we have observed below
Internet on this VM is working fine. Please let us know if anything else is required from our end. |
@amolnater-qasource looking at the logs it seems that Elasticsearch is unreachable on its 9200 port. Does it ring a bell? |
No, we cannot, as we have a Windows 2019 AMI pre-baked with Packer to be used as the target instance for the E2E. I've just commented in elastic/e2e-testing#2805 (comment) that I'd suggest having manual tests on the language support |
Hi @jlind23 We have revalidated this issue on 8.6.0 release kibana cloud-production environment and found it fixed now. Host: Observations:
Build details: Screen Recording: Agents.-.Fleet.-.Elastic.-.Google.Chrome.2023-01-12.15-08-50.mp4Hence we are closing this issue and marking it as QA:Validated. Thanks |
Hi this is a spawn off of testing done in support of
elastic/beats#26665
I'm transferring this issue from the Endpoint team, to Beats / Agent.
From @dikshachauhan-qasource : we have attempted to validate the endpoint behavior on French VM machines and found it working fine with a small glitch.
Observations:
Scenario1:
Installed agent under a policy having endpoint.
Agent remained in updating state till we manually restart the elastic-agent service.
Host then updated to healthy status and was available under Endpoint tab with status 'success'.
Data streams were working fine.
All binaries were in running state.
Recording:
https://user-images.githubusercontent.com/12970373/127567223-9c1fd3ee-4216-4837-b0a6-2d6cb45d0300.mp4
Scenario2:
Unenrolled then Re-Installed agent under same policy having endpoint.
Observations same as mentioned above.
Scenario3:
Unenrolled then Re-Installed agent under Default policy. Later after installation of agent, we added Endpoint security.
Observations same as mentioned above.
screenshot:
Logs.zip:
logs-french-win-10-agent.zip
The text was updated successfully, but these errors were encountered: