Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update metrics not to double count blob failures #434

Merged
merged 1 commit into from
Apr 2, 2024

Conversation

ian-shim
Copy link
Contributor

@ian-shim ian-shim commented Apr 2, 2024

Why are these changes needed?

Right now, it's reporting retryable failures as Failed blobs. It should only be reported if it permanently failed dispersal.

Checks

  • I've made sure the lint is passing in this PR.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, in that case, please comment that they are not relevant.
  • Testing Strategy
    • Unit tests
    • Integration tests
    • This PR is not tested :(

@ian-shim ian-shim requested review from mooselumph and jianoaix April 2, 2024 21:13
@ian-shim ian-shim marked this pull request as ready for review April 2, 2024 21:13
@ian-shim ian-shim marked this pull request as draft April 2, 2024 21:16
@ian-shim ian-shim force-pushed the failure-retry-metrics branch from 60fcff5 to 8c39a4f Compare April 2, 2024 21:17
@ian-shim ian-shim marked this pull request as ready for review April 2, 2024 21:18
@@ -165,7 +165,8 @@ type BlobStore interface {
// GetBlobMetadata returns a blob metadata given a metadata key
GetBlobMetadata(ctx context.Context, blobKey BlobKey) (*BlobMetadata, error)
// HandleBlobFailure handles a blob failure by either incrementing the retry count or marking the blob as failed
HandleBlobFailure(ctx context.Context, metadata *BlobMetadata, maxRetry uint) error
// Returns a boolean indicating whether the blob should be retried and an error
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: It may be fine to omit subject but may need to keep it consistent with methods above

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you clarify the suggestion here? This is consistent with methods above L141, L144

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I see, there is "HandleBlobFailure handles" at L167

@ian-shim ian-shim merged commit 7f012ec into Layr-Labs:master Apr 2, 2024
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants