return poll status after first load finish #742

ivyxjc · 2023-06-07T05:55:57Z

Now, spawner does not wait for fist load finish. So it cannot detect the running pod and return incorrect status to hub.

Update by Erik

This is a bugfix for a regression introduced with KubeSpawner version 5.0.0 and Z2JH since 3.0.0 (or the pre-release 3.0.0-alpha.1 or the development release 3.0.0-0.dev.git.6133.hbfc583f8). It is resolved in KubeSpawner 6.1.0 and z2jh 3.1.0.

For more information and help cleaning up orphaned user pods, see https://discourse.jupyter.org/t/how-to-cleanup-orphaned-user-pods-after-bug-in-z2jh-3-0-and-kubespawner-6-0/21677

welcome · 2023-06-07T05:55:59Z

Thanks for submitting your first pull request! You are awesome! 🤗

If you haven't done so already, check out Jupyter's Code of Conduct. Also, please make sure you followed the pull request template, as this will help us review your contribution more quickly.

You can meet the other Jovyans by joining our Discourse forum. There is also a intro thread there where you can stop by and say Hi! 👋

Welcome to the Jupyter community! 🎉

consideRatio

I think its super tricky to follow how KubeSpawner works with this, so I'm struggling to review this.

Did you run into issues with this using enable_user_namespaces set to True @ivyxjc ?

@minrk I think this makes sense, but would appreciate your review help.

I'm especially thinking about if we should put this code in _start_watching_pods or _start_reflector.

Currently, self._start_watching_pods is awaited from _start, stop, and poll, where _start also awaits await self._start_watching_events.

Should do the await on first_load_future if needed from _start_reflector?

I think awaiting it in _start_reflector makes sense. If I were to guess based on my foggy memory of the distant past, I think perhaps _start_reflector maybe used to have a requirement to not be async, so it couldn't wait for things? That doesn't appear to be the case anymore.

danilopeixoto · 2023-08-12T00:49:57Z

We've implemented a copy of KubeSpawner with minor changes. We also noticed the Hub was deleting the spawner object of running servers because it couldn't find the server resources in the reflector at startup (init_spawners). The solution presented in the pull request solved our problem. Now the poll method waits for the reflector to flood for the first time.

We did not test the implementation in the original KubeSpawner.

minrk · 2023-08-14T07:51:07Z

Thanks for the comment @danilopeixoto! I think that this bug may be the cause of jupyterhub/mybinder.org-deploy#2686 leaving orphan pods taking up space on mybinder.org.

I moved the await of first_load to inside _start_reflector, so it's always awaited and hopefully less likely to get missed.

welcome · 2023-08-14T07:51:13Z

Congrats on your first merged pull request in this project! 🎉

Thank you for contributing, we are very proud of you! ❤️

consideRatio reviewed Jun 7, 2023

View reviewed changes

kubespawner/spawner.py Outdated Show resolved Hide resolved

return poll status after first load finish

698d54b

ivyxjc force-pushed the main branch from 28ea529 to 698d54b Compare June 7, 2023 08:24

consideRatio added the bug label Jun 7, 2023

consideRatio reviewed Jun 7, 2023

View reviewed changes

await first_load_future in reflectors

67d66ab

minrk force-pushed the main branch from f210225 to 67d66ab Compare August 14, 2023 07:44

minrk merged commit 0e5bb46 into jupyterhub:main Aug 14, 2023
7 checks passed

This was referenced Aug 28, 2023

sweep outdated pods in minesweeper jupyterhub/mybinder.org-deploy#2730

Merged

Mechanism for Spawners to detect and cleanup orphan resources jupyterhub/jupyterhub#4544

Open

jabbera mentioned this pull request Sep 16, 2023

Servers being reported as down after hub restart that are not #786

Closed

This was referenced Sep 18, 2023

2i2c/ucmerced have 34 user pods running for 6-7 days 2i2c-org/infrastructure#3130

Closed

Release planning for 6.1.0 #767

Closed

yuvipanda mentioned this pull request Sep 22, 2023

user pods severely impacted after hub pod restart jupyterhub/zero-to-jupyterhub-k8s#3229

Closed

This was referenced Sep 22, 2023

Specify tags & pullPolicy for alpine/git: image 2i2c-org/infrastructure#3165

Merged

Use main branch of kubespawner with important bugfix 2i2c-org/infrastructure#3168

Merged

jabbera mentioned this pull request Mar 26, 2024

JupyterHub/KubeSpawner lost track of started user server, pod kept running #697

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

return poll status after first load finish #742

return poll status after first load finish #742

ivyxjc commented Jun 7, 2023 •

edited by consideRatio

Loading

welcome bot commented Jun 7, 2023

consideRatio left a comment

ivyxjc commented Jun 7, 2023 •

edited

Loading

consideRatio Jun 7, 2023

minrk Jun 7, 2023

danilopeixoto commented Aug 12, 2023 •

edited

Loading

minrk commented Aug 14, 2023

welcome bot commented Aug 14, 2023

+                      reflector = await self._start_watching_pods()
+                      if not reflector.first_load_future.done():
+                          await reflector.first_load_future

return poll status after first load finish #742

return poll status after first load finish #742

Conversation

ivyxjc commented Jun 7, 2023 • edited by consideRatio Loading

Update by Erik

welcome bot commented Jun 7, 2023

consideRatio left a comment

Choose a reason for hiding this comment

Related

ivyxjc commented Jun 7, 2023 • edited Loading

consideRatio Jun 7, 2023

Choose a reason for hiding this comment

minrk Jun 7, 2023

Choose a reason for hiding this comment

danilopeixoto commented Aug 12, 2023 • edited Loading

minrk commented Aug 14, 2023

welcome bot commented Aug 14, 2023

ivyxjc commented Jun 7, 2023 •

edited by consideRatio

Loading

ivyxjc commented Jun 7, 2023 •

edited

Loading

danilopeixoto commented Aug 12, 2023 •

edited

Loading