-
Notifications
You must be signed in to change notification settings - Fork 27
Allow admins to restart tasks and added worker stats to the dashboard #270
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
|
In this change I adjusted the path to the healthcheck cron job since it looked incorrect to me. I can't verify this as I can't SSH into worker instances to verify. Please let me know if I need to update the cron job definitions. |
|
Ah yes, I'm checking if I can grant you SSH access now. |
|
@Amdrel now I was able to add you as member of:
So now you should be able to become video2commons(-io,-test) on toolforge bastion, and ssh to the cloudvps instances. Do you need some guidance to try? |
|
Thank you! I am able to ssh in and Edit: I found https://wikitech.wikimedia.org/wiki/Help:Accessing_Cloud_VPS_instances and I'm currently reading it |
|
I'm able to ssh into encoding instances now. Thank you! |
|
I've noticed a misconfiguration with the cron jobs (not just the new one), so please don't merge this for now. I'll update T407375 with more information soon as it pertains to that ticket. |
fe2df45 to
95721d6
Compare
|
I've added some changes, though I had to squash it to make merging in minified JavaScript changes from Since the new job needs to directly connect to workers using Celery, which only the workers have as a dependency, I kept the job there. To prevent all the workers from clobbering each other with inspect requests I added some random sleep and a timestamp check. I also updated the job to run once an hour. For context, the job is only needed for both the initial stats collection, and to update the active worker count that's displayed on the frontend. The workers will update the stats as they take on and complete jobs. The workers will need to be restarted for this change, and stats will be a little inaccurate until all of them get the patch. |
Description
Phabricator Ticket: https://phabricator.wikimedia.org/T410732
Changes
sudoerslist in Redis (maintainers) can now manage other users' tasks (relogin required for old sessions)backend.pp, this currently runs on all worker instances although it doesn't need to. Ideally this job would only run on one instance.