OneDRS showing timeout

Hello,

I have an issue with OneDRS: it shows a timeout after migration, although the VM is successfully migrated.

OpenNebula: 7.0.0 CE
Frontend OS: Debian 12
Hypervisors: KVM

When migrating a VM with OneDRS, the migration itself is successful, but OneDRS doesn’t receive the status update. It stays in the APPLYING state and then ends with TIMEOUT. However, when I check the VM logs, I can confirm that the migration was completed successfully.

Can someone give me some help on that issue ?

Thanks a lot,
Benjamin

1 Like

I noticed that this issue only happens when the VMs are powered off, not when they are running.

Any idea?

1 Like

Maybe there is a way to tell OneDRS not to take action on powered-off VMs?

1 Like

It could be related to how resources (like storage or network) are handled when VMs are powered off. Some services may stop or unmount, causing the issue. Check logs for clues.

Hello,

Thanks for your answer. I checked the logs, but could not find the reason…

oned.log:
Mon Sep 8 06:18:54 2025 [Z0][ReM][D]: Req:5136 UID:2 IP:127.0.0.1 one.cluster.planexecute invoked , 0
Mon Sep 8 06:18:54 2025 [Z0][PLM][I]: Found 1 active plans
Mon Sep 8 06:18:54 2025 [Z0][DiM][D]: Migrating VM 5647
Mon Sep 8 06:18:54 2025 [Z0][ReM][D]: Req:5136 UID:2 one.cluster.planexecute result SUCCESS, 0
Mon Sep 8 06:18:55 2025 [Z0][ReM][D]: Req:5344 UID:2 IP:127.0.0.1 one.cluster.info invoked , 0, false
Mon Sep 8 06:18:55 2025 [Z0][ReM][D]: Req:5344 UID:2 one.cluster.info result SUCCESS, “0<…”
Mon Sep 8 06:18:55 2025 [Z0][ReM][D]: Req:7888 UID:0 IP:127.0.0.1 one.vm.info invoked , 5647, false
Mon Sep 8 06:18:55 2025 [Z0][ReM][D]: Req:7888 UID:0 one.vm.info result SUCCESS, “5647<UI…”
Mon Sep 8 06:18:59 2025 [Z0][PLM][I]: Starting Plan Manager timer action…
Mon Sep 8 06:18:59 2025 [Z0][PLM][I]: Found 1 active plans
Mon Sep 8 06:23:29 2025 [Z0][PLM][I]: Starting Plan Manager timer action…
Mon Sep 8 06:23:29 2025 [Z0][PLM][I]: Found 1 active plans
Mon Sep 8 06:23:30 2025 [Z0][TrM][I]: mv: Moving 10.1.74.3:/var/lib/one/datastores/0/5647 to 10.1.74.1:/var/lib/one/datastores/0/5647
Mon Sep 8 06:23:30 2025 [Z0][TrM][D]: Message received: TRANSFER SUCCESS 5647 -
Mon Sep 8 06:23:30 2025 [Z0][ReM][D]: Req:1840 UID:0 IP:127.0.0.1 one.vm.info invoked , 5647, false
Mon Sep 8 06:23:30 2025 [Z0][ReM][D]: Req:1840 UID:0 one.vm.info result SUCCESS, “5647<UI…”
Mon Sep 8 06:23:44 2025 [Z0][PLM][I]: Starting Plan Manager timer action…
Mon Sep 8 06:23:44 2025 [Z0][PLM][I]: Found 1 active plans
Mon Sep 8 06:23:59 2025 [Z0][PLM][I]: Starting Plan Manager timer action…
Mon Sep 8 06:23:59 2025 [Z0][PLM][I]: Found 1 active plans
Mon Sep 8 06:24:14 2025 [Z0][PLM][I]: Starting Plan Manager timer action…
Mon Sep 8 06:24:14 2025 [Z0][PLM][I]: Found 0 active plans

on 5647.log:
Mon Sep 8 06:18:54 2025 [Z0][VM][I]: New LCM state is PROLOG_MIGRATE_POWEROFF
Mon Sep 8 06:18:54 2025 [Z0][VM][I]: New state is ACTIVE
Mon Sep 8 06:23:30 2025 [Z0][VM][I]: New state is POWEROFF
Mon Sep 8 06:23:30 2025 [Z0][VM][I]: New LCM state is LCM_INIT

So, I could see that the transfer is started at 06:18:54 and finished at 06:23:30.
But still there is 1 active plan showed, until the 5 minutes timeout appears and then no active plan running. On the graphical interface, I can see STATE: TIMEOUT…

Do you have any idea on which other logs I can check ?

Thanks in advance,
Benjamin

Hello Benjamin,

thank you for your message, you’ve found a bug in tracking the DRS plan actions. I’ve created a GitHub issue for this.

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.