Fix multi-host provisioning #110

apatard · 2021-05-25T08:51:27Z

This patchset is fixing multiplatform support. Until now, when multiple instances are declared, the create.yml playbook is looping over the instance list and calls the vagrant module. This leads to one Vagrantfile per instance which are overwritten at each step of the loop.
This make debugging harder and - in some cases - may confuse vagrant.
This patchset is allowing to directly give molecule.yml instances list to the vagrant module and then the module is producing a proper Vagrantfile.

apatard · 2021-05-25T11:48:30Z

recheck

sio · 2021-05-25T11:58:37Z

Vagrant Cloud is broken at the moment, these CI failures are not because of your changes: hashicorp/vagrant#12390

apatard · 2021-05-25T12:03:30Z

@sio oh... Thanks for the info. I will refrain me from sending new PR or recheck then.

sio · 2021-05-25T12:06:37Z

I didn't mean to dissuade you from submitting new PRs, I was just letting you know that CI will not be functional for some time due to upstream service provider outage :)

It took me quite some time this morning to understand that the problem is not on my end and I wanted to save you the same trouble.

apatard · 2021-05-25T18:28:51Z

recheck

apatard · 2021-05-26T12:59:05Z

hm. weird. Probably not related to this PR but ...

2021-05-26T07:59:49.8041450Z TASK [Bootstrap python for Ansible] ********************************************
2021-05-26T07:59:49.8151860Z Wednesday 26 May 2021  07:59:49 +0000 (0:00:00.065)       0:00:00.065 *********
2021-05-26T08:00:06.3782880Z �[32mok: [instance]�[0m

So the prepare.yml did install python but

2021-05-26T08:00:08.6030260Z TASK [sample task] *************************************************************
2021-05-26T08:00:08.6045050Z Wednesday 26 May 2021  08:00:08 +0000 (0:00:00.039)       0:00:00.039 *********
2021-05-26T08:00:09.4077040Z �[1;35m[WARNING]: No python interpreters found for host instance (tried�[0m
2021-05-26T08:00:09.4098840Z �[1;35m['/usr/bin/python', 'python3.9', 'python3.8', 'python3.7', 'python3.6',�[0m
2021-05-26T08:00:09.4157470Z �[1;35m'python3.5', 'python2.7', 'python2.6', '/usr/libexec/platform-python',�[0m
2021-05-26T08:00:09.4160370Z �[1;35m'/usr/bin/python3', 'python'])�[0m
2021-05-26T08:00:09.4163550Z �[31mfatal: [instance]: FAILED! => {"ansible_facts": {"discovered_interpreter_python": "/usr/bin/python"}, "changed": false, "module_stderr": "Shared connection to 127.0.0.1 closed.\r\n", "module_stdout": "/bin/sh: /usr/bin/python: not found\r\n", "msg": "The module failed to execute correctly, you probably need to set the interpreter.\nSee stdout/stderr for the exact error", "rc": 127}�[0m
2021-05-26T08:00:09.4165610Z

Given that the scenario is the default configuration, aka:

---
dependency:
  name: galaxy
driver:
  name: vagrant
  provider:
    name: libvirt
platforms:
  - name: instance
provisioner:
  name: ansible

I'm not convinced that setting the box to use the test one is a good idea....

yajo

I've been testing this PR in my local dev workflow.

Without it, I was almost about to dump molecule-vagrant. It became too complicated.

With it, it works like a charm. VMs spin up faster. If I need to debug anything or connect to one of them, I just have to cd to the scenario ephemeral directory and run vagrant ssh server0.

Non-deep code review looks good also.

Thanks! This is a must have. 😃

apatard · 2021-08-30T12:51:25Z

With it, it works like a charm. VMs spin up faster. If I need to debug anything or connect to one of them, I just have to cd to the scenario ephemeral directory and run vagrant ssh server0.

hm. molecule login is not working for you ?

yajo · 2021-08-31T07:26:24Z

hm. molecule login is not working for you ?

I didn't know it existed 😅

It's working great! (I mean: without the PR)

yajo · 2021-08-31T07:29:27Z

In any case, I still think this different approach is a good enhancement. It is more similar to what one would do if not using molecule, and is faster.

One problem I noticed is that, when using this PR, machines were created without a network interface.

When using a4bd8b1, they got created with networking.

My platforms definition is:

platforms:
  - &server
    box: generic/ubuntu2004
    groups:
      - k8s_server
      - k8s_node
    name: server0
  - <<: *server
    name: server1
  - <<: *server
    name: server2

apatard · 2021-08-31T08:19:36Z

@yajo Networking should work. Can you share your molecule.yml so that I can try to reproduce and fix over the week ? It would be bad to see this PR merged if there's a know bug...

yajo · 2021-08-31T08:26:05Z

Here it is:

dependency:
  # name: galaxy
  # options:
  #   role-file: requirements.yaml
  #   requirements-file: requirements.yaml
  name: shell
  command: ansible-galaxy install -r requirements.yaml
driver:
  name: vagrant
  provider:
    name: libvirt
platforms:
  - &server
    box: generic/ubuntu2004
    groups:
      - k8s_server
      - k8s_node
    name: server0
  - <<: *server
    name: server1
  - <<: *server
    name: server2
provisioner:
  name: ansible
  connection_options:
    ansible_become: true
  config_options:
    defaults:
      vault_password_file: ${MOLECULE_PROJECT_DIRECTORY}/.vault_password.txt
  inventory:
    links:
      group_vars: ../../group_vars
verifier:
  name: ansible

apatard · 2021-09-03T16:12:40Z

@yajo tried quickly your setup and I can use molecule login to connect so the network seems to work. Can you have a look at the vagrant .err and .out log files to see if there's a clue ?

yajo · 2021-09-07T10:09:00Z

It seems related to using libvirt.qemu_use_session = true which seems to have some problems on creating network interfaces. Not something specific to this PR.

yajo · 2021-09-28T09:50:18Z

Definitely the issues I'm experiencing are lower in the stack. Check vagrant-libvirt/vagrant-libvirt#1342 if you're interested. But nothing related to this PR.

Current code only handle 1 vagrant VM. Adding a 'instances' parameter and a loop in the jinja template is mostly enough. Unfortunately, some extra work has been needed: New options - An option called 'cachier' used to control vagrant-cachier has been added. From what I understant it's not possible to configure it at instance level, so the cachier option has been removed from instance definition. - An option called 'default_box' has been added since it can't be specified when using instances list directly from molecule.yml. - An option called 'parallel' has been added to allow setting VAGRANT_NO_PARALLEL environment variable, since people may want to disable parallel VM creation with multiple instances. Moreover, some care has been added to avoid breaking current playbooks. This allows people updating theirs and the ones from molecule-vagrant will be updated in a later commit. Signed-off-by: Arnaud Patard <apatard@hupstream.com>

Now that the vagrant module can take directly the list of instances from molecule, update the playbooks accordingly. This should solve a bunch of issues when the module is overwriting the Vagrantfile when bringing up each VM. Moreover this should make things easier to debug and more robust. Since the provision option is not VM specific, the templates are now expecting it in the molecule.yml driver: dict. Signed-off-by: Arnaud Patard <apatard@hupstream.com>

…ule.yml: fix provision usage the provision parameter should now be specified outside the instances list, so update the molecule configuration for that. Signed-off-by: Arnaud Patard <apatard@hupstream.com>

… options The default scenario should really be minimal, so remove extra options. Signed-off-by: Arnaud Patard <apatard@hupstream.com>

Since the vagrant module is supposed to remain compatible atm with old create/destroy playbooks, add a test scenario for that. Signed-off-by: Arnaud Patard <apatard@hupstream.com>

…enario This scenario had some troubles: - the prepare.yml scenario was useless - no verify.yml - the converge.yml tasks were not so useful since better testing is provided by verify.yml - fix molecule.yml network definition to be provider-agnostic and not use .1 IP since it's usually the IP of the hypervisor with libvirt - add provider options for libvirt to not use session, since we're not setting network configuration to allow working such configuration. - reduce memory usage to 256 for each box, to avoid needing to much memory Signed-off-by: Arnaud Patard <apatard@hupstream.com>

- add the default and default-compat scenarii - add the multi-node scenario and ensure that the generated Vagrantfile has both instances in it. Signed-off-by: Arnaud Patard <apatard@hupstream.com>

Update README.rst molecule.yml section according to the changes done in the create.yml/destroy.yml playbooks. Signed-off-by: Arnaud Patard <apatard@hupstream.com>

for more information, see https://pre-commit.ci

… network Avoid using dhcp as there are high chances it'll use a network forbidden by default vbox configuration in github actions. Signed-off-by: Arnaud Patard <apatard@hupstream.com>

apatard force-pushed the multiplatform branch from b8efffe to 832ca78 Compare May 25, 2021 09:40

apatard marked this pull request as ready for review May 29, 2021 08:42

ssbarnea added the bug Something isn't working label Jul 23, 2021

ssbarnea changed the title ~~Proper Multiplatform support~~ Fix multi-host provisioning Jul 23, 2021

apatard force-pushed the multiplatform branch from 832ca78 to 015caa9 Compare July 23, 2021 14:13

apatard requested a review from ssbarnea as a code owner July 23, 2021 14:13

yajo approved these changes Aug 30, 2021

View reviewed changes

apatard added 8 commits November 3, 2021 17:24

molecule_vagrant/test/scenarios/molecule/{network,vagrant_root}/molec…

42bd6ca

…ule.yml: fix provision usage the provision parameter should now be specified outside the instances list, so update the molecule configuration for that. Signed-off-by: Arnaud Patard <apatard@hupstream.com>

molecule_vagrant/test/scenarios/molecule/default/molecule.yml: Remove…

3813dd1

… options The default scenario should really be minimal, so remove extra options. Signed-off-by: Arnaud Patard <apatard@hupstream.com>

molecule_vagrant/test/scenarios/molecule/default-compat/: Test compat

4d2e996

Since the vagrant module is supposed to remain compatible atm with old create/destroy playbooks, add a test scenario for that. Signed-off-by: Arnaud Patard <apatard@hupstream.com>

molecule_vagrant/test/functional/test_func.py: add new scenarii

25c8704

- add the default and default-compat scenarii - add the multi-node scenario and ensure that the generated Vagrantfile has both instances in it. Signed-off-by: Arnaud Patard <apatard@hupstream.com>

README.rst: update

a007b3a

Update README.rst molecule.yml section according to the changes done in the create.yml/destroy.yml playbooks. Signed-off-by: Arnaud Patard <apatard@hupstream.com>

apatard force-pushed the multiplatform branch from 0afb169 to a007b3a Compare November 3, 2021 16:55

pre-commit-ci bot and others added 2 commits November 3, 2021 16:55

[pre-commit.ci] auto fixes from pre-commit.com hooks

2cb41df

for more information, see https://pre-commit.ci

molecule_vagrant/test/scenarios/molecule/multi-node/molecule.yml: fix…

e6e7024

… network Avoid using dhcp as there are high chances it'll use a network forbidden by default vbox configuration in github actions. Signed-off-by: Arnaud Patard <apatard@hupstream.com>

apatard force-pushed the multiplatform branch from 3afba4b to e6e7024 Compare November 4, 2021 10:31

Merge branch 'main' into multiplatform

2004353

ssbarnea approved these changes Nov 4, 2021

View reviewed changes

Merge branch 'main' into multiplatform

0788c4a

ssbarnea merged commit 929c3ad into ansible-community:main Nov 4, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix multi-host provisioning #110

Fix multi-host provisioning #110

apatard commented May 25, 2021

apatard commented May 25, 2021

sio commented May 25, 2021

apatard commented May 25, 2021

sio commented May 25, 2021

apatard commented May 25, 2021

apatard commented May 26, 2021

yajo left a comment

apatard commented Aug 30, 2021

yajo commented Aug 31, 2021 •

edited

Loading

yajo commented Aug 31, 2021

apatard commented Aug 31, 2021

yajo commented Aug 31, 2021

apatard commented Sep 3, 2021

yajo commented Sep 7, 2021 •

edited

Loading

yajo commented Sep 28, 2021

Fix multi-host provisioning #110

Fix multi-host provisioning #110

Conversation

apatard commented May 25, 2021

apatard commented May 25, 2021

sio commented May 25, 2021

apatard commented May 25, 2021

sio commented May 25, 2021

apatard commented May 25, 2021

apatard commented May 26, 2021

yajo left a comment

Choose a reason for hiding this comment

apatard commented Aug 30, 2021

yajo commented Aug 31, 2021 • edited Loading

yajo commented Aug 31, 2021

apatard commented Aug 31, 2021

yajo commented Aug 31, 2021

apatard commented Sep 3, 2021

yajo commented Sep 7, 2021 • edited Loading

yajo commented Sep 28, 2021

yajo commented Aug 31, 2021 •

edited

Loading

yajo commented Sep 7, 2021 •

edited

Loading