Odd datastore usage: "[TemplateInstantiate] Failed to clone images: Not enough space in datastore" on shared system datastore for (remote) ssh datastore

I have (NFS-backed) datastores 0 to 2 set up as shared, mounted on each hypervisor at /var/lib/one/datastores/?, as well as datastores 103 (image) and 102 (system) of type ssh. I want most VMs to be easily relocatable (hence the shared storage), but some VMs need all the IOPS they can get, so they should stay on the local disks of the HV.
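
For reference, this is roughly how I double-check which transfer driver each datastore uses (just a sketch with the IDs from my setup; the exact attribute values depend on how the datastores were created):

# TM_MAD should report "ssh" for 102/103 and "shared" (or "qcow2") for 0-2
onedatastore list
onedatastore show 102 | grep TM_MAD
onedatastore show 0 | grep TM_MAD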

My template for deploying VMs to HV-local storage has:

SCHED_DS_RANK = "FREE_MB"
SCHED_DS_REQUIREMENTS = "ID=\"102\""
SCHED_RANK = "FREE_CPU"
SCHED_REQUIREMENTS = "ID=\"2\" | ID=\"3\" | ID=\"9\" | CLUSTER_ID=\"100\""
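
The template is then instantiated more or less like this (a sketch; the template and VM names are placeholders, and I use the persistent option so the image gets cloned for the new VM):

# placeholder names; --persistent clones the template's image(s) into
# new persistent images owned by the VM
onetemplate instantiate "hv-local-vm" --name db01 --persistent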

When I try to deploy a new persistent VM (3.3 GB actual size, 20 GB virtual, qcow2) from this template, I now get [TemplateInstantiate] Failed to clone images: Not enough space in datastore

… which is most likely because the NFS share is nearly full:

root@one-1:~# df -h /var/lib/one/datastores/
Filesystem           Size  Used Avail Use% Mounted on
nfs-int:/nfs         204G  177G   17G  92% /nfs

On the Frontend, 103 is on NFS as well:

root@one-1:~# df -h /var/lib/one/datastores/*
Filesystem           Size  Used Avail Use% Mounted on
nfs-int:/nfs         204G  177G   17G  92% /nfs
nfs-int:/nfs         204G  177G   17G  92% /nfs
nfs-int:/nfs         204G  177G   17G  92% /nfs
nfs-int:/nfs         204G  177G   17G  92% /nfs
nfs-int:/nfs         204G  177G   17G  92% /nfs

On the HVs, the directories are linked (and 103 stays empty?):

root@hv-03:~# ls -la /var/lib/one/datastores/
total 8
drwxr-xr-x 2 root     root 4096 Feb 19 02:41 .
drwxr-xr-x 6 oneadmin root 4096 Feb 19 01:01 ..
lrwxrwxrwx 1 root     root   17 Feb 19 01:01 0 -> /nfs/datastores/0
lrwxrwxrwx 1 root     root   17 Feb 19 01:01 1 -> /nfs/datastores/1
lrwxrwxrwx 1 root     root   23 Feb 19 01:28 102 -> /var/lib/libvirt/images
lrwxrwxrwx 1 root     root   34 Feb 19 02:41 103 -> /var/lib/libvirt/images/one-images
lrwxrwxrwx 1 root     root   17 Feb 19 01:01 2 -> /nfs/datastores/2
lrwxrwxrwx 1 root     root   25 Feb 19 01:01 .isofiles -> /nfs/datastores/.isofiles
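
In case it matters, the paths OpenNebula itself expects for these datastores can be compared against the symlink targets above (a sketch, assuming a stock install):

# the "BASE PATH" line in the output should match the symlink targets
onedatastore show 102
onedatastore show 103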

Did I foobar the setup somehow? I mean, why would cloning a VM from a 3.3 GB qcow2 image to a persistent image on a remote datastore need >17 GB of space on the local datastore? I would expect the 3.3 GB qcow2 file to be ssh’d to the destination and then some qemu-img magic to explode it to its full size on the remote datastore, no? Less wear and tear on the frontend’s storage and the overall network? Or did I just miss some concept when setting up my OpenNebula cloud? :wink:

The accounted size is the virtual size of the qcow2 disk, not the size of the file, because the image can grow up to that size. Can you check if that’s what is happening in your case?
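
You can compare the two numbers directly, for example (the image path and ID are placeholders):

# "virtual size" is what OpenNebula accounts for (20 GB in your case),
# "disk size" is what the qcow2 file actually occupies (~3.3 GB)
qemu-img info /var/lib/one/datastores/103/<image-file>

# the SIZE shown for the image reflects the virtual size as well
oneimage show <image-id> | grep SIZE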

Yes, that seems to be the issue: 20 GB > 17 GB, even though only about 3 GB is actually in use right now. Thanks for the clarification.
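
For anyone who hits the same message: the free space that the clone is checked against can be seen in the capacity section of the image datastore (ID from my setup):

# the DATASTORE CAPACITY section lists TOTAL/FREE/USED as OpenNebula sees them;
# the clone fails when the image's virtual size exceeds FREE
onedatastore show 103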