Ok I’m ready to figure out what is up with this one. My scheduler daemon process hangs and stops processing requests sometimes occasionally sometimes frequently. The scheduler log just stops updating. new instances stick in LCM_INIT until I restart the scheduler service.
I’m on CentOS7 and running the 4.14.2-2 rpms. when I run systemtctl restart|stop opennebula-scheduler it takes around 30 seconds for the process to stop/restart. where as if I do it before it’s wedged it happens quickly.
I’m not sure how to troubleshoot this and get eyes on the problem this any tips would be great. Thanks.