
fix(k8sprocessor): Pod Service cache invalidation #1425

Merged: 1 commit into main on Jan 15, 2024

Conversation

swiatekm

Fixes #1414 by separately handling update events for EndpointSlices. The assumption behind the fix is that every Pod appears exactly once across the EndpointSlices associated with a Service, and that we can never receive the updates in the wrong order if a Pod is re-added to the Service after having been removed from it.

A more robust solution, where we periodically recompute the whole Pod to Service map based on EndpointSlice data, is possible, but more expensive and more complex.

I've also replaced some homegrown slice manipulation code with the slices package, which is part of the standard library as of Go 1.21.

@swiatekm swiatekm marked this pull request as ready for review January 11, 2024 18:14
@swiatekm swiatekm requested a review from a team as a code owner January 11, 2024 18:14
@swiatekm swiatekm force-pushed the fix/k8sprocessor/endpoint-memory-leak branch from 7d736c9 to 8c7502f Compare January 12, 2024 13:26
require.NoError(t, err)
assert.Eventually(t, func() bool {
services := op.GetServices(pod.Name)
if len(services) != 2 {
@rnishtala-sumo (Contributor) commented on Jan 12, 2024:
Why are we checking if the length is not 2 here?

@swiatekm (Author) replied:

It should be 2; we continue waiting while it isn't. And it should be 2 because we updated back to the original state, where the EndpointSlice contains the Pod.

podName := endpoint.TargetRef.Name
if slices.Index(newPodNames, podName) == -1 {
// not a deferred delete, as this is a dynamic property which can change often
op.deleteServiceFromPod(podName, serviceName)
@rnishtala-sumo (Contributor) commented:

Is a call to deleteServiceFromPod going to ensure that our cache doesn't have any stale entries? My understanding is that whenever any EndpointSlice for a Service is updated, this method gets called with the params described here: https://pkg.go.dev/k8s.io/[email protected]/tools/cache#ResourceEventHandlerFuncs.OnUpdate

@swiatekm (Author) replied:

Yes, that's the idea. An update to an EndpointSlice can mean that some Pods were removed from the Service, and this is what we do here.

@swiatekm swiatekm force-pushed the fix/k8sprocessor/endpoint-memory-leak branch from 8c7502f to bb038b5 Compare January 15, 2024 14:52
@swiatekm swiatekm force-pushed the fix/k8sprocessor/endpoint-memory-leak branch from bb038b5 to 2ef9816 Compare January 15, 2024 14:55
@swiatekm swiatekm merged commit 3891b3c into main Jan 15, 2024
28 checks passed
@swiatekm swiatekm deleted the fix/k8sprocessor/endpoint-memory-leak branch January 15, 2024 16:35
Development

Successfully merging this pull request may close these issues.

k8s_tagger: Service metadata is not removed from cache after Pods are deleted
2 participants