You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Worker depends on an external Redis for rate-limiting purposes. Since this Redis runs on Heroku, its endpoint can sometimes change.
If it cannot connect to this Redis, worker fails to start, so no jobs can run.
We should:
make worker report this issue (and add an alert)
make worker handle this failure more gracefully
Open question: How should worker handle a situation where it cannot connect to its Redis? Should it:
fail to start (as currently)
start, but disregard rate limits completely
start, but in some different fallback mode where it makes fewer requests
something else?
Alternatively, it would not hurt to reevaluate worker's behavior towards the GCE API. Is there any way we can get rid of the dependency on an external rate-limit-checker completely?
Alternatively, it would not hurt to reevaluate worker's behavior towards the GCE API. Is there any way we can get rid of the dependency on an external rate-limit-checker completely?
I am strongly in favour of getting rid of it.
According to the Quotas page we are well within current quota limits. According to our redis rate limiting metrics we are still heavily rate limiting ourselves every now and then, but mostly are not applying any rate limits.
I would be in favour of turning it and applying for quota increases if we hit quota limits on their end.
Worker depends on an external Redis for rate-limiting purposes. Since this Redis runs on Heroku, its endpoint can sometimes change.
If it cannot connect to this Redis, worker fails to start, so no jobs can run.
We should:
Open question: How should worker handle a situation where it cannot connect to its Redis? Should it:
Alternatively, it would not hurt to reevaluate worker's behavior towards the GCE API. Is there any way we can get rid of the dependency on an external rate-limit-checker completely?
References
The text was updated successfully, but these errors were encountered: