Skip to content

remove "unknown" job state, simplify poll logic#329

Draft
kellyrowland wants to merge 10 commits intojupyterhub:mainfrom
kellyrowland:fix-poll
Draft

remove "unknown" job state, simplify poll logic#329
kellyrowland wants to merge 10 commits intojupyterhub:mainfrom
kellyrowland:fix-poll

Conversation

@kellyrowland
Copy link
Copy Markdown

draft PR for anyone interested in this work, possibly related to #314. this removes the "unknown" job status and simplifies the logic in poll.

with the project version of the polling logic, we were seeing the hub 'losing track' of batchspawner servers which required manual intervention to delete the users from the hub db in order to clear out the server's bad child state and allow the user to spawn another batch server.

we're been running the version of batchspawner from my branch here for several months now which has addressed the issue; we are no longer seeing this behavior.

I'm opening this PR as a draft since it's incomplete in that:

  • there are some debugging server ID logs to clean up
  • only some Slurm-related tests have been updated in the branch

I'm not familiar enough with other schedulers to update other tests in the code so this would need some help if the community is interested in upstreaming the work.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant