Replies: 5 comments 1 reply
-
We use Redis as the caching layer so the cached data can be shared across workers.
-
Use an external store. Gunicorn workers are designed to share nothing.
-
In machine learning it is often necessary to share a few hundred megabytes of objects across workers and then serve prediction functions through the web interface. Is there any good solution for that?
-
Again, Gunicorn workers are designed to share nothing. The only way to share things between them is to use OS-level memory sharing. A common pattern for any web service that does heavy work is to submit that work to a queue and handle it with a separate service: you can have some lower-memory machines serving web requests and higher-memory machines performing the prediction work. Consider whether you really need to run multiple prediction processes on the same machine. If you do, and you really care about de-duplicating their memory consumption, look into ways to share memory between those processes. Sharing Python objects is not easy.
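The OS-level sharing mentioned above can be sketched with the standard library's `multiprocessing.shared_memory`. This toy example creates and attaches to a named segment within one process to show the API; in a real deployment the attach-by-name step would run in each worker process, and the payload name is illustrative:

```python
from multiprocessing import shared_memory

payload = b"model weights would live here"

# "Writer" side: create a named shared-memory segment and fill it once.
shm = shared_memory.SharedMemory(create=True, size=len(payload))
shm.buf[:len(payload)] = payload

# "Reader" side (in a real setup this runs in another worker, which only
# needs the segment's name): attach by name; no copy of the bytes is made.
other = shared_memory.SharedMemory(name=shm.name)
assert bytes(other.buf[:len(payload)]) == payload

other.close()
shm.close()
shm.unlink()  # free the segment once no process needs it
```

Note this shares raw bytes, not live Python objects: anything structured has to be serialized into the buffer and deserialized on the other side, which is why sharing arbitrary Python objects remains hard.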
-
Consider also whether Python is really the best language for your memory- and CPU-intensive work.
-
How do you share a variable across multiple workers? Each worker runs under a different PID.