-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Snapshot stuck on DELETING #32
Comments
(Not really a fatal error but it there's a workaround to get the snapshot and then the resource deleted, I'd appreciate it.) |
what is your output of |
Can you also provide the linstor command history you did? In case you used linstor in |
Here you go. No interactive history, sorry. The stuck resource definition is
|
Ok thanks for the update. |
I have same problem but with different reason:
Satellite logs:
lv Probably I was called rollback before deletion, and now it is somehow stuck UPD:: If I create snapshot manually via
|
Can I remove this snapshot forcefully somehow? - It is not existing anymore in backing lvm datastore. |
Have you tried to simply remove the snapshot using |
Yes, it can't be removed because linstor tries to merge it:
|
Okay, I just wanted to suggest to manually create a snapshot so that linstor can happily merge something, but as I see, you have tried that already. So, correct me if I am mistaken, but you are in a situation, where you have deleted a resource, linstor knows about a snapshot that is also already gone on LVM-level. Is that correct? If so, try to re-create a dummy-resource (with the expected name) and manually create the snapshot to make linstor happy (afterwards you can also remove the dummy-resource). |
No, resource is not deleted yet, but snapshot is
on the node
now try to restart satellite:
looks like it tries to remove same snapshot multiple times |
Not sure where the "rollback to snapshot" came from, but there may be two problems:
|
Can you please restart the linstor-satellite but this time with TRACE logging enabled and attach here the full log? Also, for completeness' sake, please attach all new ErrorReports, just to be sure |
Thanks, that helped. I was able to somehow reproduce the issue (I am not totally happy with that because the only way I could reproduce this issue was manually removing the snapshot, which I hope you did not do :) ) Anyways, here is what happens (although, as mentioned, I am not entirely sure how you got into this situation): When the controller connects to the satellite, it sends all data the satellite needs. In your case we have a strange combination of data such that the snapshot I am still thinking about how to add a safety check for preventing this unrecoverable situation from happening again. However, this might take some time, but this is on my todo-list :) To fix your current setup, I am afraid you will have to manually edit the database (usually you should NOT do that, but right now I cannot think of an alternative) and execute
These two statements will basically make linstor forget about rolling back those two resources. Edit: In case |
It might be, I was try to delete this snapshot by multiple ways after it went into this situation. Even manually, I don't remember all the actions what I made, sorry.
I don't remember it well, but presume this situation has occurred after the next actions:
I'll let you know if I could reproduce this and the problem will occur again.
OK, good to know, Thank you! |
This operation wasn't work, linstor strongly restrict to remove resources which have snapshots, even after I removed this volume using lvremove command
This was helpful. Thanks now snapshot has disappeared. |
Similar to #19 (which is about a Resource), I have a snapshot that's stuck on DELETING with a NullPointerException:
Satellite error report:
The text was updated successfully, but these errors were encountered: