cgonzalez
(Christian GonzΓ‘lez)
July 2, 2020, 3:09pm
22
Hello @madko ,
As it seems that running the probes manually in the hypervisor is working for you, letβs check why the frontend is not able to start them remotely. We need you to execute the following command in the frontend:
ssh localhost-kvm 'if [ -x /var/tmp/one/im/run_monitord_client ]; then /var/tmp/one/im/run_monitord_client kvm 0 localhost-kvm; else exit 42; fi' < <XML>
Replacing the <XML>
by a path to a file containing the XML you used for starting the probes manually.
Also, could you share the output of tree /var/lib/one/remotes
?
madko
(Edouard (Madko))
July 4, 2020, 5:13pm
23
same result, it works manually, no error
/var/lib/one/remotes
βββ auth
β βββ dummy
β β βββ authenticate
β βββ ldap
β β βββ authenticate
β βββ plain
β β βββ authenticate
β βββ server_cipher
β β βββ authenticate
β βββ server_x509
β β βββ authenticate
β βββ ssh
β β βββ authenticate
β βββ x509
β βββ authenticate
βββ create_container_image.sh
βββ create_docker_image.sh
βββ datastore
β βββ ceph
β β βββ ceph_utils.sh
β β βββ clone
β β βββ cp
β β βββ export
β β βββ mkfs
β β βββ monitor
β β βββ rm
β β βββ snap_delete
β β βββ snap_flatten
β β βββ snap_revert
β β βββ stat
β βββ dev
β β βββ clone
β β βββ cp
β β βββ mkfs
β β βββ monitor
β β βββ rm
β β βββ snap_delete
β β βββ snap_flatten
β β βββ snap_revert
β β βββ stat
β βββ docker_downloader.sh
β βββ downloader.sh
β βββ dummy
β β βββ clone
β β βββ cp
β β βββ export
β β βββ mkfs
β β βββ monitor
β β βββ rm
β β βββ snap_delete
β β βββ snap_flatten
β β βββ snap_revert
β β βββ stat
β βββ fs
β β βββ clone
β β βββ cp
β β βββ export
β β βββ mkfs
β β βββ monitor
β β βββ rm
β β βββ snap_delete
β β βββ snap_flatten
β β βββ snap_revert
β β βββ stat
β βββ iscsi_libvirt
β β βββ clone
β β βββ cp
β β βββ mkfs
β β βββ monitor
β β βββ rm
β β βββ snap_delete
β β βββ snap_flatten
β β βββ snap_revert
β β βββ stat
β βββ libfs.sh
β βββ lxd_downloader.sh
β βββ url.rb
β βββ vcenter
β β βββ clone
β β βββ cp
β β βββ export
β β βββ mkfs
β β βββ monitor
β β βββ rm
β β βββ snap_delete
β β βββ snap_flatten
β β βββ snap_revert
β β βββ stat
β βββ vcenter_downloader.rb
β βββ vcenter_uploader.rb
β βββ xpath.rb
βββ etc
β βββ datastore
β β βββ ceph
β β β βββ ceph.conf
β β βββ fs
β β βββ fs.conf
β βββ im
β β βββ firecracker-probes.d
β β β βββ probe_db.conf
β β βββ kvm-probes.d
β β β βββ pci.conf
β β β βββ probe_db.conf
β β βββ lxd-probes.d
β β βββ pci.conf
β β βββ probe_db.conf
β βββ market
β β βββ http
β β βββ http.conf
β βββ tm
β β βββ fs_lvm
β β βββ fs_lvm.conf
β βββ vmm
β β βββ firecracker
β β β βββ firecrackerrc
β β βββ kvm
β β β βββ kvmrc
β β β βββ kvmrc.rpmsave
β β βββ lxd
β β β βββ lxdrc
β β βββ vcenter
β β βββ vcenterrc
β βββ vnm
β βββ OpenNebulaNetwork.conf
βββ etc.2019-12-12 [error opening dir]
βββ etc.2020-06-24 [error opening dir]
βββ hooks
β βββ alias_ip
β β βββ alias_ip.rb
β βββ ft
β β βββ fence_host.sh
β β βββ host_error.rb
β βββ raft
β β βββ vip.sh
β βββ vcenter
β βββ create_vcenter_net.rb
β βββ delete_vcenter_net.rb
β βββ templates
β βββ create_vcenter_net.tmpl
β βββ delete_vcenter_net.tmpl
β βββ instantiate_vcenter_net.tmpl
βββ im
β βββ az.d
β β βββ monitord-client_control.sh
β β βββ monitord-client.rb
β βββ az-probes.d
β β βββ host
β β β βββ beacon
β β β β βββ monitord-client-shepherd_local.sh
β β β βββ monitor
β β β β βββ probe_host_monitor.rb
β β β βββ system
β β β βββ probe_host_system.rb
β β βββ vm
β β βββ monitor
β β β βββ probe_vm_monitor.rb
β β βββ status
β β βββ probe_vm_status.rb
β βββ dummy.d
β β βββ monitord-client_control.sh
β β βββ monitord-client.rb
β βββ dummy-probes.d
β β βββ host
β β β βββ beacon
β β β β βββ date.sh
β β β β βββ monitord-client-shepherd_local.sh
β β β βββ monitor
β β β β βββ monitor.rb
β β β βββ system
β β β βββ system.rb
β β βββ vm
β β βββ monitor
β β β βββ monitor.rb
β β βββ status
β βββ ec2.d
β β βββ monitord-client_control.sh
β β βββ monitord-client.rb
β βββ ec2-probes.d
β β βββ host
β β β βββ beacon
β β β β βββ monitord-client-shepherd_local.sh
β β β βββ monitor
β β β β βββ probe_host_monitor.rb
β β β βββ system
β β β βββ probe_host_system.rb
β β βββ vm
β β βββ monitor
β β β βββ probe_vm_monitor.rb
β β βββ status
β β βββ probe_vm_status.rb
β βββ firecracker.d
β β βββ monitord-client_control.sh
β β βββ monitord-client.rb
β βββ firecracker-probes.d
β β βββ host
β β β βββ beacon
β β β β βββ date.sh
β β β β βββ monitord-client-shepherd.sh
β β β βββ monitor
β β β β βββ linux_usage.rb
β β β β βββ numa_usage.rb
β β β βββ system
β β β βββ architecture.sh
β β β βββ cpu.sh
β β β βββ linux_host.rb
β β β βββ monitor_ds.rb
β β β βββ name.sh
β β β βββ numa_host.rb
β β β βββ version.sh
β β βββ vm
β β βββ monitor
β β β βββ monitor_ds_vm.rb
β β β βββ poll.rb
β β βββ status
β β βββ state.rb
β βββ kvm.d
β β βββ monitord-client_control.sh
β β βββ monitord-client.rb
β βββ kvm-probes.d
β β βββ host
β β β βββ beacon
β β β β βββ date.sh
β β β β βββ monitord-client-shepherd.sh
β β β βββ monitor
β β β β βββ linux_usage.rb
β β β β βββ numa_usage.rb
β β β βββ system
β β β βββ architecture.sh
β β β βββ cpu.sh
β β β βββ linux_host.rb
β β β βββ machines_models.rb
β β β βββ monitor_ds.rb
β β β βββ name.sh
β β β βββ numa_host.rb
β β β βββ pci.rb
β β β βββ version.sh
β β β βββ wild_vm.rb
β β βββ vm
β β βββ monitor
β β β βββ monitor_ds_vm.rb
β β β βββ poll.rb
β β βββ status
β β βββ state.rb
β βββ lib
β β βββ domain.rb
β β βββ firecracker.rb
β β βββ kvm.rb
β β βββ linux.rb
β β βββ lxd.rb
β β βββ monitord_client.rb
β β βββ numa_common.rb
β β βββ probe_db.rb
β β βββ process_list.rb
β β βββ vcenter_cluster.rb
β β βββ vcenter_monitor.rb
β βββ lxd.d
β β βββ monitord-client_control.sh
β β βββ monitord-client.rb
β βββ lxd-probes.d
β β βββ host
β β β βββ beacon
β β β β βββ date.sh
β β β β βββ monitord-client-shepherd.sh
β β β βββ monitor
β β β β βββ linux_usage.rb
β β β β βββ numa_usage.rb
β β β βββ system
β β β βββ architecture.sh
β β β βββ cpu.sh
β β β βββ linux_host.rb
β β β βββ monitor_ds.rb
β β β βββ name.sh
β β β βββ numa_host.rb
β β β βββ pci.rb
β β β βββ profiles.sh
β β β βββ version.sh
β β β βββ wild_vm.rb
β β βββ vm
β β βββ monitor
β β β βββ monitor_ds_vm.rb
β β β βββ poll.rb
β β βββ status
β β βββ state.rb
β βββ one.d
β β βββ monitord-client_control.sh
β β βββ monitord-client.rb
β βββ one-probes.d
β β βββ host
β β β βββ beacon
β β β β βββ monitord-client-shepherd_local.sh
β β β βββ monitor
β β β β βββ probe_host_monitor.rb
β β β βββ system
β β β βββ probe_host_system.rb
β β βββ vm
β β βββ monitor
β β β βββ probe_vm_monitor.rb
β β βββ status
β β βββ probe_vm_status.rb
β βββ packet.d
β β βββ monitord-client_control.sh
β β βββ monitord-client.rb
β βββ packet-probes.d
β β βββ host
β β β βββ beacon
β β β β βββ monitord-client-shepherd_local.sh
β β β βββ monitor
β β β β βββ probe_host_monitor.rb
β β β βββ system
β β β βββ probe_host_system.rb
β β βββ vm
β β βββ monitor
β β β βββ probe_vm_monitor.rb
β β βββ status
β β βββ probe_vm_status.rb
β βββ run_monitord_client
β βββ stop_monitord_client
β βββ vcenter.d
β βββ monitord-client_control.sh
βββ ipam
β βββ dummy
β β βββ allocate_address
β β βββ free_address
β β βββ get_address
β β βββ register_address_range
β β βββ unregister_address_range
β βββ packet
β βββ allocate_address
β βββ free_address
β βββ get_address
β βββ register_address_range
β βββ unregister_address_range
βββ market
β βββ common
β β βββ lxd.rb
β βββ dockerhub
β β βββ delete
β β βββ import
β β βββ monitor
β βββ http
β β βββ delete
β β βββ import
β β βββ monitor
β βββ linuxcontainers
β β βββ delete
β β βββ import
β β βββ monitor
β βββ one
β β βββ delete
β β βββ import
β β βββ monitor
β βββ s3
β β βββ delete
β β βββ import
β β βββ monitor
β β βββ S3.rb
β βββ turnkeylinux
β βββ delete
β βββ import
β βββ monitor
βββ pm
β βββ dummy
β β βββ cancel
β β βββ deploy
β β βββ poll
β β βββ reboot
β β βββ reset
β β βββ shutdown
β βββ ec2
β β βββ cancel
β β βββ deploy
β β βββ poll
β β βββ reboot
β β βββ reset
β β βββ shutdown
β βββ packet
β βββ cancel
β βββ deploy
β βββ poll
β βββ reboot
β βββ reset
β βββ shutdown
βββ scripts_common.rb
βββ scripts_common.sh
βββ tm
β βββ ceph
β β βββ clone
β β βββ clone.ssh
β β βββ context
β β βββ cpds
β β βββ cpds.ssh
β β βββ delete
β β βββ delete.ssh
β β βββ failmigrate
β β βββ ln
β β βββ ln.ssh
β β βββ mkimage
β β βββ mkswap
β β βββ monitor
β β βββ mv
β β βββ mvds
β β βββ mvds.ssh
β β βββ postmigrate
β β βββ premigrate
β β βββ resize
β β βββ resize.ssh
β β βββ snap_create
β β βββ snap_create_live
β β βββ snap_delete
β β βββ snap_revert
β βββ dev
β β βββ clone
β β βββ cpds
β β βββ delete
β β βββ failmigrate
β β βββ ln
β β βββ mv
β β βββ mvds
β β βββ postmigrate
β β βββ premigrate
β β βββ resize
β β βββ snap_create
β β βββ snap_create_live
β β βββ snap_delete
β β βββ snap_revert
β βββ dummy
β β βββ clone
β β βββ context
β β βββ cpds
β β βββ delete
β β βββ failmigrate
β β βββ ln
β β βββ mkimage
β β βββ mkswap
β β βββ monitor
β β βββ mv
β β βββ mvds
β β βββ postmigrate
β β βββ premigrate
β β βββ resize
β β βββ snap_create
β β βββ snap_create_live
β β βββ snap_delete
β β βββ snap_revert
β βββ fs_lvm
β β βββ activate
β β βββ clone
β β βββ context
β β βββ cpds
β β βββ delete
β β βββ failmigrate
β β βββ ln
β β βββ mkimage
β β βββ mkswap
β β βββ monitor
β β βββ mv
β β βββ mvds
β β βββ postmigrate
β β βββ premigrate
β β βββ resize
β β βββ snap_create
β β βββ snap_create_live
β β βββ snap_delete
β β βββ snap_revert
β βββ iscsi_libvirt
β β βββ clone
β β βββ cpds
β β βββ delete
β β βββ failmigrate
β β βββ ln
β β βββ mv
β β βββ mvds
β β βββ postmigrate
β β βββ premigrate
β β βββ resize
β β βββ snap_create
β β βββ snap_create_live
β β βββ snap_delete
β β βββ snap_revert
β βββ qcow2
β β βββ clone
β β βββ clone.ssh
β β βββ context
β β βββ cpds
β β βββ cpds.ssh
β β βββ delete
β β βββ failmigrate
β β βββ ln
β β βββ ln.ssh
β β βββ mkimage
β β βββ mkswap
β β βββ monitor
β β βββ mv
β β βββ mvds
β β βββ mvds.ssh
β β βββ mv.ssh
β β βββ postmigrate
β β βββ premigrate
β β βββ resize
β β βββ snap_create
β β βββ snap_create_live
β β βββ snap_create_live.ssh
β β βββ snap_create.ssh
β β βββ snap_delete
β β βββ snap_delete.ssh
β β βββ snap_revert
β β βββ snap_revert.ssh
β βββ shared
β β βββ clone
β β βββ context
β β βββ cpds
β β βββ delete
β β βββ failmigrate
β β βββ ln
β β βββ ln.ssh
β β βββ mkimage
β β βββ mkswap
β β βββ monitor
β β βββ mv
β β βββ mvds
β β βββ mvds.ssh
β β βββ postmigrate
β β βββ premigrate
β β βββ resize
β β βββ snap_create
β β βββ snap_create_live
β β βββ snap_delete
β β βββ snap_revert
β βββ ssh
β β βββ clone
β β βββ context
β β βββ cpds
β β βββ delete
β β βββ failmigrate
β β βββ ln
β β βββ mkimage
β β βββ mkswap
β β βββ monitor
β β βββ monitor_ds
β β βββ mv
β β βββ mvds
β β βββ postmigrate
β β βββ premigrate
β β βββ resize
β β βββ snap_create
β β βββ snap_create_live
β β βββ snap_delete
β β βββ snap_revert
β βββ tm_common.sh
β βββ vcenter
β βββ clone
β βββ context
β βββ cpds
β βββ delete
β βββ failmigrate
β βββ ln
β βββ mkimage
β βββ mkswap
β βββ monitor
β βββ mv
β βββ mvds
β βββ postmigrate
β βββ premigrate
β βββ resize
β βββ snap_create
β βββ snap_create_live
β βββ snap_delete
β βββ snap_revert
βββ VERSION
βββ vmm
β βββ az
β β βββ attach_disk
β β βββ attach_nic
β β βββ cancel
β β βββ deploy
β β βββ detach_disk
β β βββ detach_nic
β β βββ migrate
β β βββ prereconfigure
β β βββ reboot
β β βββ reconfigure
β β βββ reset
β β βββ resize_disk
β β βββ restore
β β βββ save
β β βββ shutdown
β β βββ snapshot_create
β β βββ snapshot_delete
β β βββ snapshot_revert
β βββ ec2
β β βββ attach_disk
β β βββ attach_nic
β β βββ cancel
β β βββ deploy
β β βββ detach_disk
β β βββ detach_nic
β β βββ migrate
β β βββ prereconfigure
β β βββ reboot
β β βββ reconfigure
β β βββ reset
β β βββ resize_disk
β β βββ restore
β β βββ save
β β βββ shutdown
β β βββ snapshot_create
β β βββ snapshot_delete
β β βββ snapshot_revert
β βββ firecracker
β β βββ cancel
β β βββ client.rb
β β βββ command.rb
β β βββ deploy
β β βββ map_context
β β βββ microvm.rb
β β βββ opennebula_vm.rb
β β βββ shutdown
β βββ kvm
β β βββ attach_disk
β β βββ attach_nic
β β βββ attach_nic.rpmsave
β β βββ cancel
β β βββ deploy
β β βββ detach_disk
β β βββ detach_nic
β β βββ kvmrc.rpmsave
β β βββ migrate
β β βββ migrate_local
β β βββ prereconfigure
β β βββ reboot
β β βββ reconfigure
β β βββ reset
β β βββ resize_disk
β β βββ restore
β β βββ restore.ceph
β β βββ save
β β βββ save.ceph
β β βββ shutdown
β β βββ snapshot_create
β β βββ snapshot_delete
β β βββ snapshot_revert
β βββ lib
β β βββ command.rb
β βββ lxd
β β βββ attach_disk
β β βββ attach_nic
β β βββ cancel
β β βββ client.rb
β β βββ command.rb
β β βββ container.rb
β β βββ deploy
β β βββ detach_disk
β β βββ detach_nic
β β βββ mapper.rb
β β βββ migrate
β β βββ migrate_local
β β βββ opennebula_vm.rb
β β βββ prereconfigure
β β βββ qcow2.rb
β β βββ raw.rb
β β βββ rbd.rb
β β βββ reboot
β β βββ reconfigure
β β βββ reset
β β βββ resize_disk
β β βββ restore
β β βββ save
β β βββ shutdown
β β βββ snapshot_create
β β βββ snapshot_delete
β β βββ snapshot_revert
β βββ one
β β βββ attach_disk
β β βββ attach_nic
β β βββ cancel
β β βββ deploy
β β βββ detach_disk
β β βββ detach_nic
β β βββ migrate
β β βββ migrate_local
β β βββ prereconfigure
β β βββ reboot
β β βββ reconfigure
β β βββ reset
β β βββ restore
β β βββ save
β β βββ shutdown
β β βββ snapshot_create
β β βββ snapshot_delete
β β βββ snapshot_revert
β βββ packet
β β βββ cancel
β β βββ deploy
β β βββ poll
β β βββ reboot
β β βββ reset
β β βββ shutdown
β βββ vcenter
β βββ attach_disk
β βββ attach_nic
β βββ cancel
β βββ deploy
β βββ detach_disk
β βββ detach_nic
β βββ migrate
β βββ poll
β βββ preconfigure
β βββ prereconfigure
β βββ reboot
β βββ reconfigure
β βββ reset
β βββ resize_disk
β βββ restore
β βββ save
β βββ shutdown
β βββ snapshot_create
β βββ snapshot_delete
β βββ snapshot_revert
βββ vnm
βββ 802.1Q
β βββ clean
β βββ clean.d
β βββ post
β βββ post.d
β βββ pre
β βββ pre.d
β βββ update_sg
β βββ vlan_tag_driver.rb
βββ address.rb
βββ alias_sdnat
β βββ AliasSDNAT.rb
β βββ clean
β βββ post
β βββ pre
β βββ update_sg
βββ bridge
β βββ clean
β βββ clean.d
β βββ post
β βββ post.d
β βββ pre
β βββ pre.d
β βββ update_sg
βββ command.rb
βββ dummy
β βββ clean
β βββ clean.d
β βββ post
β βββ post.d
β βββ pre
β βββ pre.d
β βββ update_sg
βββ ebtables
β βββ clean
β βββ clean.d
β βββ Ebtables.rb
β βββ post
β βββ post.d
β βββ pre
β βββ pre.d
β βββ update_sg
βββ fw
β βββ clean
β βββ clean.d
β βββ post
β βββ post.d
β βββ pre
β βββ pre.d
β βββ update_sg
βββ hooks
β βββ clean
β β βββ firecracker
β βββ post
β βββ pre
β βββ firecracker
βββ nic.rb
βββ no_vlan.rb
βββ ovswitch
β βββ clean
β βββ clean.d
β βββ OpenvSwitch.rb
β βββ OpenvSwitch.rb.rpmsave
β βββ post
β βββ post.d
β βββ pre
β βββ pre.d
β βββ update_sg
βββ ovswitch_vxlan
β βββ clean
β βββ clean.d
β βββ OpenvSwitchVXLAN.rb
β βββ post
β βββ post.d
β βββ pre
β βββ pre.d
β βββ update_sg
βββ security_groups_iptables.rb
βββ security_groups.rb
βββ sg_driver.rb
βββ vcenter
β βββ clean
β βββ clean.d
β βββ post
β βββ post.d
β βββ pre
β βββ pre.d
β βββ update_sg
βββ vlan.rb
βββ vm.rb
βββ vnm_driver.rb
βββ vnmmad.rb
βββ vxlan
βββ clean
βββ clean.d
βββ post
βββ post.d
βββ pre
βββ pre.d
βββ update_sg
βββ vxlan_driver.rb
βββ vxlan.rb
193 directories, 628 files
Marco
(Marco)
July 7, 2020, 6:30am
24
Hello everyone
I have the same problem as @madko
But, in my case, the command "echo
'<MONITOR_CONFIGURATION><DATASTORE_LOCATION><![CDATA[/var/lib/one/β¦ " suggested by @cgonzalez has in some way βrevivedβ / βreactivatedβ the communication between nodes and frontend.
Hi
I have realized we have a similar problem (Onehost sync fails during 5.12.0 upgrade ) upgrading from 5.8.1 to 5.12.0 in CentOS 7.8
Also we can confirm the this command executed from any hyp:
echo '<MONITOR_CONFIGURATION><DATASTORE_LOCATION><![CDATA[/var/lib/one//datastores]]></DATASTORE_LOCATION><DB><CONNECTIONS><![CDATA[15]]></CONNECTIONS></DB><HOST_MONITORING_EXPIRATION_TIME><![CDATA[43200]]></HOST_MONITORING_EXPIRATION_TIME><IM_MAD><ARGUMENTS><![CDATA[-r 3 -t 15 -w 90 kvm]]></ARGUMENTS><EXECUTABLE><![CDATA[one_im_ssh]]></EXECUTABLE><NAME><![CDATA[kvm]]></NAME><SUNSTONE_NAME><![CDATA[KVM]]></SUNSTONE_NAME><THREADS><![CDATA[0]]></THREADS></IM_MAD><IM_MAD><ARGUMENTS><![CDATA[-r 3 -t 15 -w 90 lxd]]></ARGUMENTS><EXECUTABLE><![CDATA[one_im_ssh]]></EXECUTABLE><NAME><![CDATA[lxd]]></NAME><SUNSTONE_NAME><![CDATA[LXD]]></SUNSTONE_NAME><THREADS><![CDATA[0]]></THREADS></IM_MAD><IM_MAD><ARGUMENTS><![CDATA[-r 3 -t 15 -w 90 firecracker]]></ARGUMENTS><EXECUTABLE><![CDATA[one_im_ssh]]></EXECUTABLE><NAME><![CDATA[firecracker]]></NAME><SUNSTONE_NAME><![CDATA[Firecracker]]></SUNSTONE_NAME><THREADS><![CDATA[0]]></THREADS></IM_MAD><IM_MAD><ARGUMENTS><![CDATA[-c -t 15 -r 0 vcenter]]></ARGUMENTS><EXECUTABLE><![CDATA[one_im_sh]]></EXECUTABLE><NAME><![CDATA[vcenter]]></NAME><SUNSTONE_NAME><![CDATA[VMWare vCenter]]></SUNSTONE_NAME></IM_MAD><IM_MAD><ARGUMENTS><![CDATA[-r 3 -t 15 -w 90 dummy]]></ARGUMENTS><EXECUTABLE><![CDATA[one_im_sh]]></EXECUTABLE><NAME><![CDATA[dummy]]></NAME><SUNSTONE_NAME><![CDATA[Dummy]]></SUNSTONE_NAME><THREADS><![CDATA[0]]></THREADS></IM_MAD><LOG><DEBUG_LEVEL><![CDATA[3]]></DEBUG_LEVEL><SYSTEM><![CDATA[FILE]]></SYSTEM></LOG><MANAGER_TIMER><![CDATA[15]]></MANAGER_TIMER><MONITORING_INTERVAL_HOST><![CDATA[180]]></MONITORING_INTERVAL_HOST><NETWORK><ADDRESS><![CDATA[0.0.0.0]]></ADDRESS><MONITOR_ADDRESS><![CDATA[auto]]></MONITOR_ADDRESS><PORT><![CDATA[4124]]></PORT><PRIKEY><![CDATA[]]></PRIKEY><PUBKEY><![CDATA[]]></PUBKEY><THREADS><![CDATA[8]]></THREADS></NETWORK><PROBES_PERIOD><BEACON_HOST><![CDATA[30]]></BEACON_HOST><MONITOR_HOST><![CDATA[120]]></MONITOR_HOST><MONITOR_VM><![CDATA[30]]></MONITOR_VM><STATE_VM><![CDATA[5]]></STATE_VM><SYNC_STATE_VM><![CDATA[180]]></SYNC_STATE_VM><SYSTEM_HOST><![CDATA[600]]></SYSTEM_HOST></PROBES_PERIOD><VM_MONITORING_EXPIRATION_TIME><![CDATA[43200]]></VM_MONITORING_EXPIRATION_TIME><HOST_ID>0</HOST_ID></MONITOR_CONFIGURATION>' | /var/tmp/one/im/run_monitord_client kvm 0 localhost-kvm
Somehow has triggered the monitoring for the hyp with id 0 from the server, so in our case that hyp now is in enabled, but the rest of them are still in error state.
Hi
Maybe this is related, but in our case we have fixed the onehost in error status with this workaround (Onehost sync fails during 5.12.0 upgrade ). The onehost sync should work correctly as oneadmin user. After that fix our hosts are now available again.
Cheers
Γlvaro
1 Like
cgonzalez
(Christian GonzΓ‘lez)
July 7, 2020, 8:55am
27
Hello @madko and @Marco , the last command I suggested is the same command that OpenNebula execute for starting the monitoring agents inside the hypervisor nodes. For some reason this command cannot be executed by OpenNebula in your environment. Can you both confirm that you executed the ssh
command as oneadmin
and in the frontend node?
If you did it, you can use this modified version of CommandManager.rb so we can get some debug information for your case. You will need to
Replace /usr/lib/one/ruby/CommandManager.rb
in your frontend by this one (backup the original previously).
Puts your host in offline
state.
Restart OpenNebula service.
Enable your host back again.
Check for the debug information in /tmp/debug_info
and send us the output.
Also you can try to execute the command with the arguments and STDIN shown in /tmp/debug_info
and let us know if you see any error.
1 Like
madko
(Edouard (Madko))
July 7, 2020, 8:53pm
28
Yeah! Removing all the backups etc.20xx-xx-xx made with an other user than oneadmin (like root) from previous upgrades fixed everything. onehost sync is now working perfectly, monitoring is working and hosts are not in ERROR state anymore.
I guess this part needs some improvement to raise better error messages in the logs. Because even in debug level I had no clue that this could be such a simple problem.
Anyway, thank you all for your help. Iβm now enjoying OpenNebula 5.12.
best regards,
Edouard
Marco
(Marco)
July 8, 2020, 4:18pm
29
Agree with @madko
I also have some files belonging to root user
After a chown, all problems are gone.
onehost sync is working and the hosts are no longer in error.
Thanks everybody!
Regards,
Marco
Thanks. @Marco
This helped me as well with an upgrade on one of our environments.