how to monitor dashboard Failures when performing backup and other operations #118
-
|
npbackup3.0.0-rc13-pub As of now i have a working setup including the grafana dashboard. When I first started a backup on this machine i got multiple Failure operations logged. One failure at action=backup, and one failure at action=snapshots. The only thing I see from the commandline is an ERROR: permission denied: /var/log/npbackup-gui.log because I run this as normal user not as root or sudo. But it reverts to use /tmp/ofunctions.logger_utils.log instead. Running npbackup-cli with the gui config file and with --snapshots gives the same snapshots failure in the dashboard and no clue in the terminal log apart from the already mentioned log file folder error and workaround. How would I go ahead and investigate the cause of the reported dashboard Failures?f EDIT: I run the rest-server in append-only mode, the gui is configured with Full permissions and the default Retention policy is still active. Could that conflict? I will change it and try again and report back later. |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 5 replies
-
|
experiment 1. experiment 2. so this seems to boil down to a problem with permissions? |
Beta Was this translation helpful? Give feedback.
-
|
So basically, there are two important metrics.
While the first metric shows that the backup itself suceed, the second metric shows if backup succeed and eveything around backup (pre/post exec scripts, backup minimum size checks...) have succeed also I guess I could document that a bit better. In your case, having a log permission error will trigger an error on the second metric, even if the actual backup succeed. Does this answer your question ? |
Beta Was this translation helpful? Give feedback.
So basically, there are two important metrics.
restic_backup_failurewhich is restic backend backup status (only appears on backup operations)npbackup_exec_state{action='backup} which is the full backup status (appears on every operation, whith action=operation)While the first metric shows that the backup itself suceed, the second metric shows if backup succeed and eveything around backup (pre/post exec scripts, backup minimum size checks...) have succeed also
The second metric will show an error if there is any log message with a level greater than warning.
I guess I could document that a bit better.
In your case, having a log permission error will trigger an error on the second metr…