-
Notifications
You must be signed in to change notification settings - Fork 53
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Offsets accumulating in the producer ack state (take 2) #139
Comments
If you recently changed the ack option, I would first investigate if it is related to that. :) |
Sounds like a sensible place to start, thanks A bit more context, we have 2 pipelines with the exact same configuration, one of them is affected the other one isn't. Of course this could be a coincidence 🤷 |
@danmarcab is it something that you can reproduce in a small application? We are running version
children = [
MyApp.ConsumerBroadway,
MyApp.ConsumerBroadwayV2,
MyApp.ConsumerBroadwayV3,
MyApp.Telemetry
# Starts a worker by calling: MyApp.Worker.start_link(arg)
# {MyApp.Worker, arg}
] I'm asking because I'd like to see the issue in a small project and avoid updating until we can find the root cause. |
No we haven't. This only happens occasionally after a few hours in our main app, so it's very hard to reproduce. I'll keep the issue updated when we find anything. |
Another data point. We are not using batchers. Mentioning it because the fix from dashbitco/broadway@602931c was for the batcher stage. |
We don't cancel timers in Broadway Kafka, so I can't see it being an issue. We do use timers in the Broadway's rate limiter and it could have the same bug, but there it would raise (and not accumulate messages forever). So it may not be related to this. |
Good morning/afternoon/evening all 👋
We are experiencing an issue which shows itself as pretty much the same as #100 (cannot reopen that one)
We can see some messages stuck in the ack state:
The processor itself keeps processing, it just never acks the messages.
That original issue was fixed by dashbitco/broadway@602931c. Which was an obscure timer issue. I wonder if there is another edge case we are missing.
We are running the latest versions of broadway and broadway_kafka.
A couple of things that are different in our code/env:
offset_commit_on_ack
is set to false, (was true previously)Any ideas of where to look?
The text was updated successfully, but these errors were encountered: