I upgraded from 5.10.1 to 5.12.0 with the migration package on CentOS 7, following the documentation. Everything went smoothly until I tried to deploy new VMs. All my hosts are now in the ERROR state. onehost sync fails to update the hosts and prints only this message:
Failed to update the following hosts:
* hostXXX
but SSH connections from oneadmin to the hosts work fine.
When I try to enable any host, I get this in oned.log:
Thu Jun 25 14:05:18 2020 [Z0][ReM][D]: Req:2464 UID:1 IP:127.0.0.1 one.documentpool.info invoked , -2, -1, -1, 100
Thu Jun 25 14:05:18 2020 [Z0][AuM][D]: Message received: LOG I 20 Command execution failed (exit code: 255): /var/lib/one/remotes/auth/server_cipher/authenticate
Thu Jun 25 14:05:18 2020 [Z0][AuM][I]: Command execution failed (exit code: 255): /var/lib/one/remotes/auth/server_cipher/authenticate
Thu Jun 25 14:05:18 2020 [Z0][AuM][D]: Message received: LOG E 20 login token expired
Thu Jun 25 14:05:18 2020 [Z0][AuM][I]: login token expired
Thu Jun 25 14:05:18 2020 [Z0][AuM][D]: Message received: AUTHENTICATE FAILURE 20 login token expired
Thu Jun 25 14:05:18 2020 [Z0][AuM][E]: Auth Error: login token expired
Thu Jun 25 14:05:18 2020 [Z0][ReM][D]: Req:1872 UID:-1 IP:127.0.0.1 one.vmpool.infoextended invoked , -2, -1, -1, -1, ""
Thu Jun 25 14:05:18 2020 [Z0][ReM][E]: Req:1872 UID:- one.vmpool.infoextended result FAILURE [one.vmpool.infoextended] User couldn't be authenticated, aborting call.
Is there something new here? Where can I configure the IP that appears in this line: Req:1872 UID:-1 IP:127.0.0.1 one.vmpool.infoextended invoked , -2, -1, -1, -1, ""
It also seems that the scheduler is constantly restarting. I can’t find any error in sched.log, just this:
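To make the failing calls easier to spot, the error-level lines can be pulled out of oned.log with a short script. This is only a minimal sketch, assuming the log line format shown above; the function name summarize_errors and the SAMPLE list are made up for illustration, and the sample lines are copied from the excerpt.

```python
import re
from collections import Counter

# Matches oned.log lines in the format pasted above, e.g.
# "Thu Jun 25 14:05:18 2020 [Z0][AuM][E]: Auth Error: login token expired"
LINE_RE = re.compile(
    r"^(?P<ts>\w{3} \w{3} +\d+ \d{2}:\d{2}:\d{2} \d{4}) "
    r"\[Z(?P<zone>\d+)\]\[(?P<module>\w+)\]\[(?P<level>[A-Z])\]: "
    r"(?P<msg>.*)$"
)

def summarize_errors(lines):
    """Count error-level ([E]) messages per (module, message) pair."""
    counts = Counter()
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m and m.group("level") == "E":
            counts[(m.group("module"), m.group("msg"))] += 1
    return counts

# Sample lines copied from the excerpt above (hypothetical variable name).
SAMPLE = [
    "Thu Jun 25 14:05:18 2020 [Z0][AuM][I]: login token expired",
    "Thu Jun 25 14:05:18 2020 [Z0][AuM][E]: Auth Error: login token expired",
    "Thu Jun 25 14:05:18 2020 [Z0][ReM][E]: Req:1872 UID:- one.vmpool.infoextended "
    "result FAILURE [one.vmpool.infoextended] User couldn't be authenticated, aborting call.",
]

for (module, msg), n in summarize_errors(SAMPLE).items():
    print(f"{n}x [{module}] {msg}")
```

Against the real file, the same function could be fed an open file handle, for example open("/var/log/one/oned.log"); that path is an assumption and may differ on other installations.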
juin 25 14:02:07 t110.lan systemd[1]: Started OpenNebula Cloud Scheduler Daemon.
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: Init Scheduler Log system
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: Starting Scheduler Daemon
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: ----------------------------------------
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: Scheduler Configuration File
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: ----------------------------------------
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: DEFAULT_DS_SCHED=POLICY=1
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: DEFAULT_NIC_SCHED=POLICY=1
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: DEFAULT_SCHED=POLICY=1
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: DIFFERENT_VNETS=YES
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: LIVE_RESCHEDS=0
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: LOG=DEBUG_LEVEL=5,SYSTEM=syslog
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: MAX_DISPATCH=30
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: MAX_HOST=1
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: MAX_VM=5000
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: MEMORY_SYSTEM_DS_SCALE=0
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: MESSAGE_SIZE=1073741824
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: ONE_XMLRPC=http://localhost:2633/RPC2
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: SCHED_INTERVAL=30
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: TIMEOUT=60
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: ----------------------------------------
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: XML-RPC client using 1073741824 bytes for response buffer.
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: oned successfully contacted.
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: Configuring scheduler for Zone ID: 0
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: Starting scheduler loop...
juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: Scheduler loop started.
juin 25 14:02:37 t110.lan systemd[1]: opennebula-scheduler.service: main process exited, code=killed, status=6/ABRT
juin 25 14:02:37 t110.lan systemd[1]: Unit opennebula-scheduler.service entered failed state.
juin 25 14:02:37 t110.lan systemd[1]: opennebula-scheduler.service failed.
juin 25 14:02:43 t110.lan systemd[1]: opennebula-scheduler.service holdoff time over, scheduling restart.
juin 25 14:02:43 t110.lan systemd[1]: Stopped OpenNebula Cloud Scheduler Daemon.
juin 25 14:02:43 t110.lan systemd[1]: Starting OpenNebula Cloud Scheduler Daemon...
juin 25 14:02:43 t110.lan systemd[1]: Started OpenNebula Cloud Scheduler Daemon.
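In the excerpt above, the abort comes 30 seconds after the start, which is the same value as SCHED_INTERVAL=30. That may mean the scheduler dies on its first scheduling pass, but this is only a guess from the timing. To check whether every restart follows the same pattern, the uptime between each start and abort can be computed from the journal lines. A minimal sketch, assuming the line format shown above; scheduler_uptimes and SAMPLE are hypothetical names, and only the HH:MM:SS part of each line is parsed.

```python
import re
from datetime import datetime

# Only the time of day is parsed; the journal lines above carry no year,
# and the month name depends on the system locale.
TIME_RE = re.compile(r"\b(\d{2}:\d{2}:\d{2})\b")

def scheduler_uptimes(lines):
    """Return the seconds between each 'Started' line and the next abnormal exit.

    Limitation: runs that cross midnight are not handled.
    """
    uptimes = []
    started = None
    for line in lines:
        m = TIME_RE.search(line)
        if not m:
            continue
        t = datetime.strptime(m.group(1), "%H:%M:%S")
        if "Started OpenNebula Cloud Scheduler Daemon" in line:
            started = t
        elif "main process exited" in line and started is not None:
            uptimes.append(int((t - started).total_seconds()))
            started = None
    return uptimes

# Sample lines copied from the excerpt above.
SAMPLE = [
    "juin 25 14:02:07 t110.lan systemd[1]: Started OpenNebula Cloud Scheduler Daemon.",
    "juin 25 14:02:07 t110.lan mm_sched[6213]: [Z0][SCHED][I]: Scheduler loop started.",
    "juin 25 14:02:37 t110.lan systemd[1]: opennebula-scheduler.service: main process exited, code=killed, status=6/ABRT",
    "juin 25 14:02:43 t110.lan systemd[1]: Started OpenNebula Cloud Scheduler Daemon.",
]

print(scheduler_uptimes(SAMPLE))
```

If every value in the result equals the configured interval, that would be consistent with the crash happening during the first scheduling cycle rather than at startup.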
Versions of the related components and OS (frontend, hypervisors, VMs):
Up-to-date CentOS 7
OpenNebula 5.12.0 RPMs
Steps to reproduce:
Upgrade from 5.10 following the documentation
Current results:
All hosts are in the ERROR state, with the following message in Sunstone:
Thu Jun 25 14:07:00 2020 : Error monitoring Host t110.lan (2):
Expected results:
Hosts should be OK