I created and pushed a new branch called sandbox-old-api. It includes a change to how batch processing is done, which evolved from the refactoring work Ya and I have been doing.
When deployed around 5:54 Eastern, this had a huge impact on pipeline throughput. The impact was much larger than anticipated, so we should try to understand what was happening before and after the deployment.
Is this just because this effectively rolled back changes that Ya deployed in another sandbox?
NOTE: The last merge on master likely broke annotation. Since it autodeployed to staging, annotation in staging may no longer be working properly.
The sandbox-old-api branch should fix the breakage, so we may want to merge it sooner rather than later.
Looks like the weirdness is actually from the previous sandbox deployment.
From 5:30pm on the 28th (Eastern) to about 4pm on the 29th, there was negligible traffic to sandbox from NDT or sidestream. This may be because of PT traffic problems. The annotator response time appears to have been in excess of 30 seconds, which may have caused the very poor PT throughput, and that in turn may have left virtually no workers available to NDT.
The Prometheus logs show an annotator version (not ss) from 20181126t112333. This is a bit confusing, as there doesn't seem to be a corresponding Travis build. There were "load-multiple" builds around 1pm (Travis time), but no sandbox builds on any nearby days. Perhaps this was a manual deployment.
The same deployment seems to have behaved better at other times. Things seemed very bad only during this one-day window from the 28th to the 29th.