Hi
We have found a new issue in our clusters running 5.12.0.1 release. OpenNebula is not able to create/delete images in our ceph datastore. oneimage create does not show any error:
# oneimage create -d ceph.altaria --name datatest3 --type DATABLOCK --size 400
ID: 14
but:
# oneimage list
ID USER GROUP NAME DATASTORE SIZE TYPE PER STAT RVMS
14 oneadmin oneadmin datatest3 ceph.altar 400M DB No err 0
and it shows this strange error (it was working before the upgrade):
IMAGE TEMPLATE
DEV_PREFIX="vd"
ERROR="Sat Aug 15 12:45:59 2020 : Error creating datablock: Datastore driver 'ceph' not available"
From the opennebula logs we can also see this error message:
020-08-15T11:53:09.331756+02:00 one02 oned[676348]: [Z0][ImM][I]: Creating disk at of 400Mb (type: raw)
2020-08-15T11:53:09.344430+02:00 one02 oned[676348]: [Z0][ImM][E]: Error creating datablock: Datastore driver 'ceph' not available
2020-08-15T11:53:09.352447+02:00 one02 oned[676348]: [Z0][ImM][E]: Error monitoring datastore 100: Datastore driver 'ceph' not available
2020-08-15T11:53:14.847495+02:00 one02 one_monitor[676512]: [Z0][HMM][I]: Successfully monitored VM: 16
2020-08-15T11:53:22.567502+02:00 one02 oned[676348]: [Z0][AuM][I]: Command execution failed (exit code: 255): /var/lib/one/remotes/auth/server_cipher/authenticate
2020-08-15T11:53:22.567626+02:00 one02 oned[676348]: [Z0][AuM][I]: login token expired
2020-08-15T11:53:22.567928+02:00 one02 oned[676348]: [Z0][AuM][E]: Auth Error: login token expired
2020-08-15T11:53:22.568042+02:00 one02 oned[676348]: [Z0][ReM][E]: Req:4464 UID:- one.vmpool.infoextended result FAILURE [one.vmpool.infoextended] User couldn't be authenticated, aborting call.
this setup was working before, this is our ceph datastore:
ALLOW_ORPHANS="mixed"
BRIDGE_LIST="one20.swablu.os"
CEPH_HOST="ceph031.swablu.data ceph032.swablu.data ceph033.swablu.data"
CEPH_SECRET="xxxxxxxxxxxxxxxxxxxxxxxx"
CEPH_USER="libvirt"
CLONE_TARGET="SELF"
CLONE_TARGET_SHARED="SELF"
CLONE_TARGET_SSH="SYSTEM"
DATASTORE_CAPACITY_CHECK="yes"
DISK_TYPE="RBD"
DISK_TYPE_SHARED="rbd"
DISK_TYPE_SSH="FILE"
DRIVER="raw"
DS_MAD="ceph"
LN_TARGET="NONE"
LN_TARGET_SHARED="NONE"
LN_TARGET_SSH="SYSTEM"
NAME="ceph.swablu"
POOL_NAME="one"
QUATTOR="1"
RBD_FORMAT="2"
TM_MAD="ceph"
TM_MAD_SYSTEM="ssh,shared"
TYPE="IMAGE_DS"
Any idea why the ceph auth is not working now? anyone else found the same issue?
Cheers
Álvaro