Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

scrubber: enable --exit-code and route alerts #10064

Open
3 tasks
skyzh opened this issue Dec 9, 2024 · 0 comments
Open
3 tasks

scrubber: enable --exit-code and route alerts #10064

skyzh opened this issue Dec 9, 2024 · 0 comments
Assignees
Labels
c/storage/pageserver Component: storage: pageserver

Comments

@skyzh
Copy link
Member

skyzh commented Dec 9, 2024

Currently the scrubber only exits with an error code if the scrubber suffers fatal errors itself. If something goes wrong with the tenant, it doesn't currently affect the exit code of the scrubber until we provide --exit-code in the parameter.

We will need to identify current fatal errors in prod, fix them or put them into warnings, and then add this parameter to the scrubber. That way, we can receive alerts in Slack if something goes wrong with the user.

  • resolve staging error
  • resolve prod error
  • enable exit code -> alert
@skyzh skyzh added the t/bug Issue Type: Bug label Dec 9, 2024
@skyzh skyzh self-assigned this Dec 9, 2024
@skyzh skyzh removed the t/bug Issue Type: Bug label Dec 9, 2024
@skyzh skyzh changed the title scrubber: enable --exit-code scrubber: enable --exit-code and route alerts Dec 16, 2024
@skyzh skyzh added the c/storage/pageserver Component: storage: pageserver label Dec 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
c/storage/pageserver Component: storage: pageserver
Projects
None yet
Development

No branches or pull requests

1 participant