Error monitoring Host (2): Error executing probes

Hello Everyone,

I was able to create my frontend, so now I added a host and that went ok

oneadmin@intrepid:~$ onehost create intrepid -i kvm -v kvm -n dummy
ID: 0
oneadmin@intrepid:~$ onehost list
ID NAME CLUSTER RVM ALLOCATED_CPU ALLOCATED_MEM STAT
0 intrepid - 0 - - init

This didn’t (below) work but I was able to add it via the gui

oneadmin@intrepid:/tmp$ onevnet create /tmp/mynetwork.one
command create: argument 0 must be one of file

So I have my network ip leases assigned:

Then I restarted the fronend gui

root@intrepid:~# /etc/init.d/opennebula-sunstone restart * Restarting Sunstone Web interface sunstone-server sunstone-server stopped
VNC proxy started
sunstone-server started
[ OK ]
Then I synced everything

oneadmin@intrepid:~$ onehost sync --force

  • Adding intrepid to upgrade
    [========================================] 1/1 intrepid
    All hosts updated successfully.

Now I am getting this in the frontend

Information
id 0
Name intrepid
Cluster -
State ERROR
IM MAD kvm
VM MAD kvm
VN MAD dummy
Capacity
Allocated Memory
0KB / -
Allocated CPU
0 / -
Real Memory
0KB / -
Real CPU
0 / -
Running VMs 0
Attributes
ERROR Mon Sep 14 09:07:26 2015 : Error monitoring Host intrepid (0): Error executing probes

I looked at another post here and tried everything they suggest and it didn’t work. I am finally getting this somewhere after a week of futility.

I am looking to see what to do but I can’t find anything to help me resolve this. Any ideas?

Thanks in advance,
Michael

You may check oned.log, information about the error should be clearer there with the output of the monitor command. Usually it is because of a connectivity problem (ssh related) or missing packages in the host (e.g. ruby)

Hello Thanks for your reply

Here is what I have installed

root@intrepid:~# apt-get install ruby
Reading package lists… Done
Building dependency tree
Reading state information… Done
ruby is already the newest version.
ruby set to manually installed.
0 upgraded, 0 newly installed, 0 to remove and 109 not upgraded.

I don’t mean to be a bother but this is a bit frustrating. Thank you for your help.

The full error message is in oned.log, take a look there

Here is what I got from the oned.log

Tue Sep 15 06:34:25 2015 [ReM][D]: Req:1392 UID:0 VirtualMachinePoolInfo result SUCCESS, "<VM_POOL></VM_POOL>"
Tue Sep 15 06:34:43 2015 [ONE][E]: Error monitoring Host intrepid (0): Error executing probes
Tue Sep 15 06:34:44 2015 [InM][D]: Monitoring datastore default (1)
Tue Sep 15 06:34:44 2015 [InM][D]: Monitoring datastore files (2)
Tue Sep 15 06:34:45 2015 [ImM][D]: Datastore default (1) successfully monitored.
Tue Sep 15 06:34:45 2015 [ImM][D]: Datastore files (2) successfully monitored.
Tue Sep 15 06:34:54 2015 [ReM][D]: Req:9568 UID:0 VirtualMachinePoolInfo invoked, -2, -1, -1, -1
Tue Sep 15 06:34:54 2015 [ReM][D]: Req:9568 UID:0 VirtualMachinePoolInfo result SUCCESS, "<VM_POOL></VM_POOL>"
Tue Sep 15 06:34:54 2015 [ReM][D]: Req:9568 UID:0 VirtualMachinePoolInfo invoked, -2, -1, -1, -1
Tue Sep 15 06:34:54 2015 [ReM][D]: Req:9568 UID:0 VirtualMachinePoolInfo result SUCCESS, "<VM_POOL></VM_POOL>"
Tue Sep 15 06:35:03 2015 [ONE][E]: Error monitoring Host intrepid (0): Error executing probes
Tue Sep 15 06:35:23 2015 [ONE][E]: Error monitoring Host intrepid (0): Error executing probes
Tue Sep 15 06:35:24 2015 [ReM][D]: Req:1184 UID:0 VirtualMachinePoolInfo invoked, -2, -1, -1, -1
Tue Sep 15 06:35:24 2015 [ReM][D]: Req:1184 UID:0 VirtualMachinePoolInfo result SUCCESS, "<VM_POOL></VM_POOL>"
Tue Sep 15 06:35:24 2015 [ReM][D]: Req:1184 UID:0 VirtualMachinePoolInfo invoked, -2, -1, -1, -1
Tue Sep 15 06:35:24 2015 [ReM][D]: Req:1184 UID:0 VirtualMachinePoolInfo result SUCCESS, “<VM_POOL></VM_POOL>”

Can you execute the following, in intrepid as oneadmin ?

# Make sure you are oneadmin
# And that you are in intrepid:
/var/tmp/one/im/kvm.d/../run_probes kvm-probes /var/lib/one//datastores 4124 20 0 intrepid

Please copy/paste the ouput to this thread

By the way, it’s true that the debugging info for errors in the probes is totally lacking. I have opened a ticket to see this through. It will be implemented for this release (4.14).

http://dev.opennebula.org/issues/3988

oneadmin@intrepid:~$ /var/tmp/one/im/kvm.d/…/run_probes kvm-probes /var/lib/one//datastores 4124 20 0 intrepid
error: failed to connect to the hypervisor
error: Failed to connect socket to ‘/var/run/libvirt/libvirt-sock’: Permission denied
ERROR MESSAGE --8<------
Error executing kvm.rb
ERROR MESSAGE ------>8–
ARCH=x86_64
MODELNAME="Quad-Core AMD Opteron™ Processor 2354"
oneadmin@intrepid:~$

After adding oneadmin to the libvirtd group I ran that command again and the following is what I got

oneadmin@intrepid:~$ /var/tmp/one/im/kvm.d/…/run_probes kvm-probes /var/lib/one//datastores 4124 20 0 intrepid
ARCH=x86_64
MODELNAME="Quad-Core AMD Opteron™ Processor 2354"
HYPERVISOR=kvm
TOTALCPU=800
CPUSPEED=1100
TOTALMEMORY=14434192
USEDMEMORY=2523316
FREEMEMORY=11910876
FREECPU=770.4
USEDCPU=29.600000000000023
NETRX=3483027195
NETTX=269517426073
DS_LOCATION_USED_MB=1
DS_LOCATION_TOTAL_MB=42930
DS_LOCATION_FREE_MB=37621
DS = [
ID = 0,
USED_MB = 1,
TOTAL_MB = 42930,
FREE_MB = 37621
]
DS = [
ID = 1,
USED_MB = 1,
TOTAL_MB = 42930,
FREE_MB = 37621
]
DS = [
ID = 2,
USED_MB = 1,
TOTAL_MB = 42930,
FREE_MB = 37621
]
HOSTNAME=intrepid.coopfire.loc
VM_POLL=YES
VM=[
ID=-1,
DEPLOY_ID=Nagios_4-0-8,
POLL=“STATE=a USEDCPU=2.0 USEDMEMORY=2097152 NETRX=64864279 NETTX=116840410” ]
VM=[
ID=-1,
DEPLOY_ID=WebServer,
POLL=“STATE=a USEDCPU=4.5 USEDMEMORY=1121404 NETRX=203889688 NETTX=228625366” ]
VERSION=“4.6.2”

So i removed the ID 0 host and readded the host and now I am able to see it in my OpenNebula Console this is awesome. Now I have to figure out how to get the VMs that were already there to populate. Thanks Jaime you are awesome