Failure to remove temp role due to falsely detecting role is not present #229
Labels
No labels
bug
config update
documentation
duplicate
enhancement
in dev
input needed
new feature
not in draft
refactor
waiting on docs update
wont-do
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
Sirdog/EnduraBot#229
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Problem
Fleff on Feb 4 at 1:41pm gave himself the
@Lrole using the/trolefunction. This role was never removed or adjusted by him.Today, when the role was scheduled for removal, EnduraBot reports the following:
A review of the server audit logs shows that Fleff did indeed have the role at the time; it was never manipulated. This suggests some kind of bug with the logic at
tasks/temp_role_monitor.py.Possible cause
Now, the server also had this occur without prompting, with large error stacks that I deleted for log brevity:
It is possible that the bot state got all messed up because of it. At time of writing 2 users were given temporary roles which are scheduled for removal tomorrow. I'm gonna reboot EnduraBot to give it a kick in the pants.
If those roles are removed correctly after the reboot I'm gonna consider this issue resolved and chock it up to a reboot being needed after those errors. If not, I'll have to do more investigation.
The issue occurred again this morning. I've restarted the bot again. That will likely cause the roles to be removed correctly. Reviewing the logs more in-depth seems to indicate the issue is network related. I have come across some solutions related to the Docker image configuration that seem like they could pan out.
I am going to manually adjust the Docker related files in production and give it 48 hours to see if the issue is successfully mitigated. If so, I will make a PR to put them into the code-base.
Well, this is solidly a networking issue and isn't related to EnduraBot. 24 hours have passed without another instability event.