Resize broadway pipeline #228

fcevado · 2021-04-01T13:48:59Z

Discussing on broadway_rabbitmq @josevalim commented about this possibility.

I was thinking that restarting everything could be avoided by changing BatchProcessorSupervisor and ProcessorSupervisor to a :simple_one_for_one supervisor(or a DynamicSupervisor), but I guess it would require broadway callbacks to be stateful. At least it's what I'm getting from broadway architecture docs

The idea is that if any process fails, we want to restart the rest of the tree. Since Broadway callbacks are stateless, we can handle errors and provide reports without crashing processes.

The text was updated successfully, but these errors were encountered:

josevalim · 2021-04-01T13:58:16Z

DynamicSupervisor is not needed. You can add children to any supervisor while they are running. It won't be as fast as a dynamic one - but this is not like an operation you would do every second.

The biggest issue with adding this feature is actually making sure the new entries supervise to their producers and their children supervise to them - and how to handle failures if anything goes wrong in this process.

There is another approach which is to restart the rest of the tree except the producer. This will be necessary whenever there is a PartitionDispatcher, since the number of partitions are specified upfront.

fcevado · 2021-04-01T14:14:17Z

You can add children to any supervisor while they are running.

I was thinking of a scenario where you want to scale down. A :one_for_all supervisor would always require to restart the entire tree when doing that, right?

josevalim · 2021-04-01T14:45:46Z

@fcevado no, you can still stop children in a regular supervisor without restarting. The restart is always about unexpected failures.

guigaoliveira · 2021-08-16T05:14:27Z

I'm not sure I understand correctly, but changing broadway configuration dynamically can be interesting to optimize pipeline according to metrics and then have best possible typology while it's running. Or change values according to specific scenarios (usually based on some metric), not necessarily optimizing, but increasing / decreasing parameters. If it is possible not only change concurrency of processors, but also other parameters such as batch_size, batch_timeout, etc.

PranavRam · 2021-09-16T12:11:37Z

I'm not sure I understand correctly, but changing broadway configuration dynamically can be interesting to optimize pipeline according to metrics and then have best possible typology while it's running. Or change values according to specific scenarios (usually based on some metric), not necessarily optimizing, but increasing / decreasing parameters. If it is possible not only change concurrency of processors, but also other parameters such as batch_size, batch_timeout, etc.

This would be really useful. Are you thinking something as simple as updating the config (producer + processor concurrency and min_demand, max_demand, etc) either sync or async?

josevalim · 2021-09-16T13:29:19Z

The only way to do that at the moment would be by restarting the pipeline. This is mostly OK except for Kafka which doesn't cope well with constant reconections.

fcevado · 2021-09-17T20:21:19Z

@josevalim I was thinking, just increasing the amount of childs of the ProcessorSupervisor and restarting the BatchProcessorSupervisor wouldn't do it? Given the genstage subscription structure, just the BatchProcessor needs to know about all the Processors, is that right? I guess it's not needed to mess up with the the producer section of the supervision tree.

josevalim · 2021-09-17T21:15:04Z

Not quite. You still need to go through the rest of the pipeline and tell them to subscribe to the new processors. Sizing down has its own challenges as you need to subscribe, drain, and then terminate. To be clear, it is all doable, but it is a sizeable amount of work and testing.

josevalim added the discussion label Sep 25, 2021

whatyouhide added Kind:Discussion and removed discussion labels Jan 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Resize broadway pipeline #228

Resize broadway pipeline #228

fcevado commented Apr 1, 2021

josevalim commented Apr 1, 2021

fcevado commented Apr 1, 2021

josevalim commented Apr 1, 2021

guigaoliveira commented Aug 16, 2021

PranavRam commented Sep 16, 2021

josevalim commented Sep 16, 2021

fcevado commented Sep 17, 2021

josevalim commented Sep 17, 2021

Resize broadway pipeline #228

Resize broadway pipeline #228

Comments

fcevado commented Apr 1, 2021

josevalim commented Apr 1, 2021

fcevado commented Apr 1, 2021

josevalim commented Apr 1, 2021

guigaoliveira commented Aug 16, 2021

PranavRam commented Sep 16, 2021

josevalim commented Sep 16, 2021

fcevado commented Sep 17, 2021

josevalim commented Sep 17, 2021