Concurrent scrapyrt requests #141

BalzySte · 2022-03-25T15:59:11Z

Hello, I'm using scrapyrt to provide an HTTP interface to a large Scrapy project. I'm running a single scrapyrt instance in a Docker container. Some spiders take roughly 60-120 seconds to complete, and I've noticed that requests are handled sequentially, which causes substantial delays.

Is that the expected behavior? I know scrapyrt is not suitable for long-running spiders, but I'm wondering if there is a quick fix, for example running multiple workers or threads. I'm asking here because I'm not very familiar with the Twisted framework.
Another solution would be to run multiple scrapyrt instances behind a load balancer, but I'd rather not go down that path.

doverradio · 2022-11-15T21:44:52Z

I'm facing this same issue right now.

Previously, I solved it by simply running more instances of the Scrapy scripts on different ports.

Did you solve it?
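
For anyone landing here, the multi-port workaround mentioned above could be sketched roughly like this. It is only a minimal sketch under assumptions, not an official scrapyrt recipe: the number of instances, the port range, and the helper names (`build_commands`, `endpoint_cycle`, `start_instances`) are all hypothetical. It relies on scrapyrt's `-p` port option and its `/crawl.json` endpoint, and rotates requests across the instances on the client side instead of using a real load balancer.

```python
import itertools
import subprocess

# Assumptions for illustration: 9080 is scrapyrt's default port; the
# instance count and the contiguous port range are arbitrary choices.
BASE_PORT = 9080
NUM_INSTANCES = 3


def build_commands(base_port, count):
    """Return one `scrapyrt -p PORT` command per instance."""
    return [["scrapyrt", "-p", str(base_port + i)] for i in range(count)]


def endpoint_cycle(base_port, count, host="localhost"):
    """Return an endless round-robin iterator over the crawl endpoints."""
    urls = [f"http://{host}:{base_port + i}/crawl.json" for i in range(count)]
    return itertools.cycle(urls)


def start_instances(commands, project_dir):
    """Start each scrapyrt process inside the Scrapy project directory.

    scrapyrt must be installed and `project_dir` must contain scrapy.cfg.
    The caller is responsible for terminating the returned processes.
    """
    return [subprocess.Popen(cmd, cwd=project_dir) for cmd in commands]
```

To use it, call `start_instances(build_commands(BASE_PORT, NUM_INSTANCES), "/path/to/project")` once, then take `next(...)` from `endpoint_cycle(BASE_PORT, NUM_INSTANCES)` for each incoming request, so that a slow spider only occupies one instance at a time.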
