Configurable object serialisation #119

Open
aidansteele opened this issue Feb 2, 2015 · 3 comments

@aidansteele

Hi,

I really appreciate the library. I've been considering a potential optimisation and would like to know your thoughts before submitting a pull request. Currently, Marshal is used to serialise objects that cross process boundaries.

It would be great if we could opt to use a different serialisation class, e.g. Oj or MessagePack; both are quite a bit faster than Marshal. Parallel could be configured to use another (de)serialisation method via a parameter, a configuration block, or similar.
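For illustration only, here is a minimal, self-contained sketch of the idea: data crossing a process boundary over a pipe, with the (de)serialiser chosen by the caller rather than hard-coded to Marshal. The `SERIALIZER` constant and the pipe plumbing are just for this example, not part of the gem:

```ruby
require 'oj' # Oj.dump / Oj.load mirror Marshal.dump / Marshal.load

SERIALIZER = Oj # swap in Marshal to compare; this constant is illustrative only

reader, writer = IO.pipe

# Requires a platform with fork (e.g. MRI on Unix).
pid = fork do
  reader.close
  result = { "squares" => (1..5).map { |i| i * i } } # stand-in for real work
  writer.write(SERIALIZER.dump(result))              # serialise across the boundary
  writer.close
end

writer.close
payload = reader.read
Process.wait(pid)

p SERIALIZER.load(payload) # => {"squares"=>[1, 4, 9, 16, 25]}
```

One caveat: unlike Marshal, JSON-style serialisers may not round-trip arbitrary Ruby objects (symbols, custom classes, etc.) without extra configuration, so an opt-in would presumably keep Marshal as the default.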

What do you think?

@grosser
Owner

grosser commented Feb 2, 2015

Sounds good. Parallel.serializer = JSON / Marshal / XXX should do the trick; all of them respond to .load and .dump, afaik.

The amount of data must be gigantic for this to make any real difference, but I can see this being useful ...
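A minimal sketch of what such a hook might look like, assuming a module-level `serializer` accessor (a hypothetical name, not an existing option of the gem):

```ruby
# Hypothetical configuration hook -- a sketch of the proposal, not the gem's API.
require 'json'

module Parallel
  class << self
    # Accept anything that responds to .dump and .load (Marshal, JSON, Oj, ...).
    attr_writer :serializer

    def serializer
      @serializer || Marshal # keep Marshal as the default behaviour
    end
  end
end

# Internally, the hard-coded calls would then become:
#   Parallel.serializer.dump(item)  instead of  Marshal.dump(item)
#   Parallel.serializer.load(data)  instead of  Marshal.load(data)

Parallel.serializer = JSON
p Parallel.serializer.load(Parallel.serializer.dump([1, 2, 3])) # => [1, 2, 3]
```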

@aidansteele
Author

You're right, it's a bit of an odd request. If one cares about the performance that much, why would Ruby be used? But flexibility is always nice :)

That said, I've just done a very quick implementation and another issue popped up. All of my parallel jobs took almost exactly the same amount of time to run and their resulting giant encoded blobs are deserialised serially. It would be neat to deserialise them concurrently in threads on the receiving end (assuming these libraries even release the GVL), but that adds a whole lot of additional complexity for what is probably minimal gain in the general case.
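For reference, a rough sketch of the receive-side change being described; the `encoded_blobs` array stands in for the payloads read back from the workers, and whether the threads actually overlap depends on the parser releasing the GVL:

```ruby
require 'oj'

# Stand-in for the large payloads read back from each worker process.
encoded_blobs = 4.times.map do |i|
  Oj.dump({ "worker" => i, "data" => Array.new(100_000) { rand } })
end

# Current behaviour as described: each blob is deserialised one after another.
serial_results = encoded_blobs.map { |blob| Oj.load(blob) }

# Proposed alternative: deserialise concurrently in threads. This only helps
# if the parser releases the GVL while working; otherwise the threads take turns.
threads = encoded_blobs.map { |blob| Thread.new { Oj.load(blob) } }
threaded_results = threads.map(&:value)

puts serial_results.size == threaded_results.size # => true
```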

@grosser
Owner

grosser commented Feb 2, 2015

Strange, I thought the deserialization was done in threads too oO

