Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Job-level max_run_time not being taken into account after worker crash. #1208

Open
tomyyn opened this issue Mar 20, 2024 · 1 comment
Open

Comments

@tomyyn
Copy link

tomyyn commented Mar 20, 2024

Open this issue after not getting a response from this comment: #832 (comment)

Currently, we are implementing job-level max_run_time for our application. Normally, if the process doesn't die, it works as expected. However, When restarting after a worker process crash, the job-level max_run_time is ignored, and only Worker.max_run_time is taken into cosideration.

As we run in a containerised environment, sometimes, containers that may have initiated processing a job are killed due to scaling down, but job remains in locked state until default max_run_time (Worker.max_run_time).

Is there any solution for this? If not, is it in the plans to implement it?

@tomyyn tomyyn changed the title Jobs being picked up by workers after system death even When job-level max_run_time is exceeded. Job-level max_run_time not being taken into account after worker crash. Mar 20, 2024
@lreddickGNA
Copy link

There is no known solution AFAIK within delayed_job to resolve this however there is a package that tracks locked jobs and can unlock them provided your params:

https://github.com/salsify/delayed_job_heartbeat_plugin

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants