Non-persistent to persistent and back again (CEPH)

Yenya · May 4, 2016, 1:47pm

Hello,

[TL;DR: when I make an image from non-persistent to persistent, make some changes,and return it to non-persistent again, the modifications made during the persistent state are lost.]

Longer description:
I want to make some changes to an existing image, which has been previously used as non-persistent. I delete all VMs which have been using it, change the status to persistent, instantiate a new VM using this image (the image is visible as USED_PERS in Sunstone), make some changes, shut the VM down, wait for its state to be poweroff/lcm_init in Sunstone, delete the VM, make the image non-persistent, and instantiate yet another VM on top of it. The disk looks the same as it was at the beginning, the modifications made during the last persistent state are lost.

What works is to clone the image, immediately make it persistent (before instantiating any VMs on top of it), only then instantiate the first VM on top of it, make the necessary modifications, shut the VM down, make the image non-persistent, (delete the original image and rename the cloned one,) and instantiate VMs on top of it as needed.

It seems that the difference is whether there previously has been a non-persistent VM instantiated on top of the image or not. Is it a bug or an expected behaviour?

Thanks!

ruben · May 5, 2016, 8:05am

Hi

No, there should not be any functional difference, in fact the logic
underneath used in the both scenarios is the same, so I am not really sure
how clone+persistent works but the persistent to not-persistent doesn’t.
We’ll try to reproduce and update this thread if find any difference.

Yenya · May 5, 2016, 10:39am

OK, thanks. I have repeatedly tested this on several different source images (Fedora, Windows), but obviously on the same ONe/CEPH cluster.

Maybe the switch to persistent takes some time, and the new VM is instantiated before that? I can test it again if you give me hints what to look for - would there be for example a visible difference in the qemu command line for persistent and non-persistent images?

ruben · May 5, 2016, 12:51pm

Not really, when an image is persistent is simply takes the original rbd as
source for disk. Maybe you can try to instantiate a VM and check the
deployment file /var/lib/one/datastores/<system_ds_id>/<vm_id>/deployment.0
there you should see that the VM is using the original Ceph volume…

Yenya · May 5, 2016, 8:58pm

Okay, I captured the deployment.0 files for the following cases:

clone the image (as non-persistent), instantiate vm on top of it
the above, make the image persistent, and instantiate another vm on top of it
the above, make the image non-persistent, and instantiate another vm on top of it
clone another image, make it persistent, and instantiate vm on top of it
the above, make the image non-persistent, and instantiate another vm on top of it

The only difference is (obviously) the VM ID, and for non-persistent instances the name of the disk is one-X-Y-0 instead of one-X.

So I think the problem is not in instantiating the VM itself, but in making the image persistent and non-persistent again. How is it done? I tried to look at /var/lib/one/remotes/datastore/ceph/, but I am not sure which scripts are called when the image is made persistent and non-persistent.

Yenya · January 10, 2017, 7:55am

Reading the latest release notes, this seems to be yet another instance of the following issue, fixed in 5.2.1:

https://dev.opennebula.org/issues/4878

Yenya · May 11, 2023, 7:49am

Hi all,

many years later, and it seems this problem is back. It can be reproduced as follows:

one# oneimage create -d cephds --persistent --size 1 --name 'persistence test'
ID: 1894
one# onevm disk-attach --image 1894 7258
vm7258# echo AAAAAAAAAAAAAAAAAAAAAA > /dev/sdb; sync
one# onevm disk-detach 7258 2

one# oneimage nonpersistent 1894
one# onevm disk-attach --image 1894 7258
vm7258# dd if=/dev/sdb bs=8 count=1
AAAAAAAA
vm7258# echo BBBBBBBBBBBBBBBBBBBBBB >/dev/sdb; sync
one# onevm disk-detach 7258 2

one# oneimage persistent 1894
one# onevm disk-attach --image 1894 7258
vm7258# dd if=/dev/sdb bs=8 count=1
AAAAAAAA
# OK, the BBBBBBB written while non-persistent indeed did not persist
# Lets make another write, which will hopefully persist:
vm7258# echo CCCCCCCCCCCCCCCCCCCC >/dev/sdb; sync
one# onevm disk-detach 7258 2

one# onevm disk-attach --image 1894 7258
vm7258# dd if=/dev/sdb bs=8 count=1
CCCCCCCC
one# onevm disk-detach 7258 2
# OK, the last persistent modification is still there

one# oneimage nonpersistent 1894
one# onevm disk-attach --image 1894 7258
vm7258# dd if=/dev/sdb bs=8 count=1
AAAAAAAA
one# onevm disk-detach 7258 2
# what? I do not expect A's anymore, they should be C's instead!

one# oneimage persistent 1894
one# onevm disk-attach --image 1894 7258
vm7258# dd if=/dev/sdb bs=8 count=1
CCCCCCCC

So it seems that making the image nonpersistent again reverts incorrectly to the first persistent state, instead of the last persistent state. However, making the image persistent again reveals the modifications from the last persistent state, which is a bit strange.

Should I reopen the bug #4878 as a github issue?

This is ONe 6.4.0 CE, Ceph 17, CentOS 8Stream.

Thanks,

-Yenya

Yenya · May 11, 2023, 11:57am

A github issue:

github.com/OpenNebula/one

Ceph persitent image "loses" modifications

opened 11:56AM - 11 May 23 UTC

Yenya

Type: Bug

**Description** When I make an image from non-persistent to persistent, make so…me changes,and return it to non-persistent again, the modifications made during the persistent state are lost. **To Reproduce** ``` one# oneimage create -d cephds --persistent --size 1 --name 'persistence test' ID: 1894 one# onevm disk-attach --image 1894 7258 vm7258# echo AAAAAAAAAAAAAAAAAAAAAA > /dev/sdb; sync one# onevm disk-detach 7258 2 one# oneimage nonpersistent 1894 one# onevm disk-attach --image 1894 7258 vm7258# dd if=/dev/sdb bs=8 count=1 AAAAAAAA vm7258# echo BBBBBBBBBBBBBBBBBBBBBB >/dev/sdb; sync one# onevm disk-detach 7258 2 one# oneimage persistent 1894 one# onevm disk-attach --image 1894 7258 vm7258# dd if=/dev/sdb bs=8 count=1 AAAAAAAA # OK, the BBBBBBB written while non-persistent indeed did not persist # Lets make another write, which will hopefully persist: vm7258# echo CCCCCCCCCCCCCCCCCCCC >/dev/sdb; sync one# onevm disk-detach 7258 2 one# onevm disk-attach --image 1894 7258 vm7258# dd if=/dev/sdb bs=8 count=1 CCCCCCCC one# onevm disk-detach 7258 2 # OK, the last persistent modification is still there one# oneimage nonpersistent 1894 one# onevm disk-attach --image 1894 7258 vm7258# dd if=/dev/sdb bs=8 count=1 AAAAAAAA one# onevm disk-detach 7258 2 # what? I do not expect A's anymore, they should be C's instead! one# oneimage persistent 1894 one# onevm disk-attach --image 1894 7258 vm7258# dd if=/dev/sdb bs=8 count=1 CCCCCCCC ``` **Expected behavior** There should be `CCCCCCCC` all the time since the first time the C's are written to the disk. **Details** - Affected Component: storage - Hypervisor: kvm - Version: 6.4.0 CE - Ceph 17 Quincy - CentOS Stream 8 **Additional context** This was previously reported here in 2016, and is said to be fixed, but it looks like the bug appeared again: https://dev.opennebula.org/issues/4878.html https://forum.opennebula.io/t/non-persistent-to-persistent-and-back-again-ceph/2199/5 ## Progress Status - [ ] Code committed - [ ] Testing - QA - [ ] Documentation (Release notes - resolved issues, compatibility, known issues)

Topic		Replies	Views
Persistent vs Non-Persistent Storage- here we go again... Storage	1	158	August 12, 2024
Question about non-persistent disk Community Support	2	3274	September 22, 2016
Use cases for persistent & non-persisent Storage	2	2814	October 9, 2018
[SOLVED] Non-persistent Ceph image results in clone failure Community Support	1	852	January 26, 2017
Non-persistent VM upgrades: best practices? Discuss	5	1207	January 11, 2017

Non-persistent to persistent and back again (CEPH)

Related topics