I’m encountering an issue while deploying VMs with VLAN ID 600 in OpenNebula. During the VM boot process or when attaching the network interface, I’m getting the following error:
Error:
RTNETLINK answers: Device or resource busy when creating br1.600. This issue occurs because OpenNebula tries to set up the VLAN sub-interface (br1.600) and encounters a conflict with already existing interfaces or improperly linked resources.
Steps Taken to Investigate:
Checked Network Configuration: Verified that the br1.600 interface is not already created manually outside OpenNebula.
Checked bridge link membership: Ensured there are no conflicts between br1.600 and other bridges.
VLAN configuration: Ensured the proper configuration of VLAN ID 600 in the virtual network template for OpenNebula.
Solution Tried:
Cleaned up existing interfaces: Removed any leftover VLAN interfaces manually using ip link del.
Checked OpenNebula network templates: Verified that the VLAN network is configured correctly with VN_MAD=802.1Q, and the PHYDEV and VLAN_ID parameters are properly set.
Ensured no manual creation of br1.600: Made sure that br1.600 isn’t pre-created in the system’s network configuration files (netplan, etc.).
Currently issue our fixed and Thanks for the response.
I am currently working on setting up High Availability (HA) for virtual machines in OpenNebula using two KVM hypervisors with different Linux distributions:
KVM Host 1: Ubuntu 22.04
KVM Host 2: Rocky Linux 8
Here is what I have done so far:
Both KVM hosts are added successfully to OpenNebula Shared NFS storage is mounted on both hosts at /mnt/shared A shared datastore is created and assigned Both hosts can create and run VMs correctly VM disks are stored on shared storage
Now I want to enable VM High Availability, so that if one KVM host fails, the running VM should automatically:
Migrate to the second host (if possible), OR
Recreate the VM on the second host (if migration is not possible)
I want to ask the community:
What is the correct and recommended way to set up VM HA in OpenNebula?
How does the host_error hook work in practice, and which method is better — -m migrate or -r recreate — in this scenario?
Is it safe and supported to use mixed operating systems (Ubuntu and Rocky Linux) in HA setups? Any gotchas to watch out for?
How can I set up fencing correctly to avoid split-brain? Is there a recommended fencing method (e.g., IPMI, SSH, etc.) for KVM nodes in OpenNebula?
What logs or metrics should I monitor to verify the HA is working as expected?
Would really appreciate any examples, official links, or your experience with such setups.
Thu Jun 19 12:11:00 2025 [Z0][VMM][D]: Message received: DEPLOY FAILURE 131 error: Failed to create domain from /var/lib/one//datastores/104/131/deployment.0 error: internal error: process exited while connecting to monitor: 2025-06-19T12:10:59.864766Z qemu-kvm-one: -blockdev {“driver”:“file”,“filename”:“/var/lib/one/datastores/103/9f29104471af45d9c360e2f22f0ec2b9”,“node-name”:“libvirt-3-storage”,“cache”:{“direct”:true,“no-flush”:false},“auto-read-only”:true,“discard”:“unmap”}: Could not open ‘/var/lib/one/datastores/103/9f29104471af45d9c360e2f22f0ec2b9’: Permission denied Could not create domain from /var/lib/one//datastores/104/131/deployment.0 ExitCode: 255