Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Usability][HA] Provide CLI to show Failover/ Failback occurrences and/or dump logs to an user accessible log location for ease of debugging #561

Open
rahullepakshi opened this issue Apr 8, 2024 · 0 comments

Comments

@rahullepakshi
Copy link
Contributor

rahullepakshi commented Apr 8, 2024

From usability perspective, we need a CLI which user can issue to query to know history of failover/ failback with timestamp and its status(success/failure) along with entire sequence of operations and its status during failover/ failback

As of now there is no go-to place for user to look for present status and past occurrences of HA and which would make very difficult for user to understand present state of system if it would have failed over overnight/ in weekend.
Also recording this to a journal is best to debug. No ceph status detects this as of now, except searching along huge logs which is difficult as at #545, there is no other way user can be alerted on this

Further, alerting the same on dashboard would be more beneficial

Output of CLI or logs should look like below-(basically it will be good if we include the sequence and its status) Few of below parameters can be marked optional upon discussion

**Operation- Failover
Status: Success/ Failure**
Time of occurrence - dd/mm/yyyy hrs:min:sec
Source Gateway - <GW_NAME>
Destination Gateway - <GW_NAME>
output of ceph nvme-gw show nvmeof ''
Blocklisting - success/Failure
New monmap boradcast
Time of End - dd/mm/yyyy hrs:min:sec

**Operation- Failback
Status: Success/ Failure**
Time of occurrence - dd/mm/yyyy hrs:min:sec
Source Gateway - <GW_NAME>
Destination Gateway - <GW_NAME>
output of ceph nvme-gw show nvmeof ''
Blocklisting - success/Failure
New monmap boradcast
Time of End - dd/mm/yyyy hrs:min:sec

and / or
all above information should be recorded a journal to keep track off and an user can refer at any time.
Open for discussion.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: 🆕 New
Development

No branches or pull requests

1 participant