Hello @dclavijo @ahuertas @pczerny
I have got the following problem, but it is better first you look at this screenshot, taken from “Placement” tab:
Then, and I don’t understand why, in line #1 appears another “undeploy”, but this time VM appears in host “localhost”… but nobody has replaced this VM (anyway…)
In line #3, a user with admin group privileges, executes a “poweroff-hard”. Then, it seems VM falls in “some” error state, reason why I need to execute two “resume” operations and one “migrate” operation.
However, VMs tries to start but always fails. Then, I check logs and I find these lines:
Driver Error
Wed Oct 19 10:53:05 2022: DEPLOY: error: Failed to create domain from /var/lib/one//datastores/0/11128/deployment.8 error: Cannot access storage file '/var/lib/one//datastores/0/11128/disk.0' (as uid:9869, gid:9869): No such file or directory Could not create domain from /var/lib/one//datastores/0/11128/deployment.8 ExitCode: 255
Dismiss
Wed Oct 19 10:52:34 2022 [Z0][VM][I]: New state is PENDING
Wed Oct 19 10:53:02 2022 [Z0][VM][I]: New state is ACTIVE
Wed Oct 19 10:53:02 2022 [Z0][VM][I]: New LCM state is PROLOG_UNDEPLOY
Wed Oct 19 10:53:03 2022 [Z0][VM][I]: New LCM state is BOOT_UNDEPLOY
Wed Oct 19 10:53:03 2022 [Z0][VMM][I]: Generating deployment file: /var/lib/one/vms/11128/deployment.8
Wed Oct 19 10:53:03 2022 [Z0][VMM][I]: Successfully execute transfer manager driver operation: tm_context.
Wed Oct 19 10:53:04 2022 [Z0][VMM][I]: ExitCode: 0
Wed Oct 19 10:53:04 2022 [Z0][VMM][I]: Successfully execute network driver operation: pre.
Wed Oct 19 10:53:04 2022 [Z0][VMM][I]: ExitCode: 0
Wed Oct 19 10:53:04 2022 [Z0][VMM][I]: Successfully execute virtualization driver operation: /bin/mkdir -p.
Wed Oct 19 10:53:04 2022 [Z0][VMM][I]: ExitCode: 0
Wed Oct 19 10:53:04 2022 [Z0][VMM][I]: Successfully execute virtualization driver operation: /bin/cat - >/var/lib/one//datastores/0/11128/vm.xml.
Wed Oct 19 10:53:04 2022 [Z0][VMM][I]: ExitCode: 0
Wed Oct 19 10:53:04 2022 [Z0][VMM][I]: Successfully execute virtualization driver operation: /bin/cat - >/var/lib/one//datastores/0/11128/ds.xml.
Wed Oct 19 10:53:04 2022 [Z0][VMM][I]: Command execution fail (exit code: 255): cat << EOT | /var/tmp/one/vmm/kvm/deploy '/var/lib/one//datastores/0/11128/deployment.8' 'localhost' 11128 localhost
Wed Oct 19 10:53:04 2022 [Z0][VMM][I]: error: Failed to create domain from /var/lib/one//datastores/0/11128/deployment.8
Wed Oct 19 10:53:04 2022 [Z0][VMM][I]: error: Cannot access storage file '/var/lib/one//datastores/0/11128/disk.0' (as uid:9869, gid:9869): No such file or directory
Wed Oct 19 10:53:04 2022 [Z0][VMM][I]: Could not create domain from /var/lib/one//datastores/0/11128/deployment.8
Wed Oct 19 10:53:04 2022 [Z0][VMM][I]: ExitCode: 255
Wed Oct 19 10:53:05 2022 [Z0][VMM][I]: ExitCode: 0
Wed Oct 19 10:53:05 2022 [Z0][VMM][I]: Successfully execute network driver operation: clean.
Wed Oct 19 10:53:05 2022 [Z0][VMM][I]: Failed to execute virtualization driver operation: deploy.
Wed Oct 19 10:53:05 2022 [Z0][VMM][E]: DEPLOY: error: Failed to create domain from /var/lib/one//datastores/0/11128/deployment.8 error: Cannot access storage file '/var/lib/one//datastores/0/11128/disk.0' (as uid:9869, gid:9869): No such file or directory Could not create domain from /var/lib/one//datastores/0/11128/deployment.8 ExitCode: 255
Wed Oct 19 10:53:05 2022 [Z0][VM][I]: New LCM state is BOOT_UNDEPLOY_FAILURE
Wed Oct 19 12:00:05 2022 [Z0][VM][I]: New LCM state is RUNNING
Wed Oct 19 12:01:55 2022 [Z0][LCM][I]: VM running but monitor state is POWEROFF
Wed Oct 19 12:01:55 2022 [Z0][VM][I]: New LCM state is SHUTDOWN_POWEROFF
Wed Oct 19 12:01:55 2022 [Z0][VM][I]: New state is POWEROFF
Wed Oct 19 12:01:55 2022 [Z0][VM][I]: New LCM state is LCM_INIT
After comparing folder “11128” in /var/lib/one/datastores/0" in both servers, I find out that in nebulacaos2 (where VM was created in the first instantiation time) exists a “disk.0.snap” subfolder and a disk.0 to disk.0.snap soft-link, but in server “localhost”, folder “11128” not contains this subfolder and this soft-lnk, so VM can’t start.
User “oneadmin” has same UID and GID in both servers and, also, both belongs to the same groups.
Why has happened this?
Thanks.