Add grouped binary convolution support (1/3): the converter. #549
Conversation
if (total_input_channels % filter_input_channels != 0) {
  mlir::emitError(filter_val.getLoc())
      << "Filter dimensions invalid: the number of filter input channels "
      << filter_input_channels
      << " does not divide the total number of input channels "
      << total_input_channels << "\n";
  num_groups = 1;
}
Technically I don't think this is required, because the Conv2D op throws an error if this condition is violated, but it's probably better safe than sorry.
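As a standalone sketch of the logic above (hypothetical function name, plain C++ without the MLIR machinery), the group inference amounts to checking that the filter's channels-in dimension evenly divides the total input channels:

```cpp
#include <optional>

// Hypothetical standalone version of the converter's group inference: the
// number of groups is the ratio of total input channels to the filter's
// channels-in dimension, and is only valid when the division is exact.
std::optional<int> InferNumGroups(int total_input_channels,
                                  int filter_input_channels) {
  if (filter_input_channels <= 0 ||
      total_input_channels % filter_input_channels != 0) {
    // Invalid filter shape; the converter falls back to num_groups = 1.
    return std::nullopt;
  }
  return total_input_channels / filter_input_channels;
}
```

For example, 64 input channels with a filter channels-in dimension of 32 infers 2 groups, while a channels-in dimension of 48 is rejected.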
if (num_groups > 1 && filter_input_channels % bitpacking_bitwidth != 0) {
  mlir::emitError(filter_val.getLoc())
      << "Invalid binary group convolution: the number of input channels "
         "per-group must be a multiple of "
      << bitpacking_bitwidth << ", but is " << filter_input_channels << "\n";
  num_groups = 1;
}
We could choose not to throw an error here, and simply not apply the transformation if this condition is violated, so that there will be a float op in the model. I'm not sure what the right approach is.
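The constraint being checked here can be sketched in isolation (hypothetical names, assuming a bitpacking bitwidth of 32 as in the snippet above):

```cpp
// Hypothetical sketch of the bitpacking constraint: a grouped binary
// convolution is only convertible when the per-group input-channel count
// (the filter's channels-in dimension) is a multiple of the bitwidth.
constexpr int kBitpackingBitwidth = 32;

bool IsValidBinaryGroupConv(int num_groups, int filter_input_channels) {
  if (num_groups <= 1) return true;  // not grouped; nothing to check
  return filter_input_channels % kBitpackingBitwidth == 0;
}
```

So a grouped convolution with 32 or 64 input channels per group passes, while one with 16 channels per group would be rejected (or, as suggested above, left as a float op).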
Upon reflection and discussions with @Tombana, there's actually technically no need to have a `groups` attribute at all, since it can be inferred. I'm not sure what the right thing to do here is. Conciseness is nice, but I worry that inferring the number of groups by relying on that constraint being satisfied might lead to subtle bugs in the future.
Even if we can infer it in theory, I think it's good to have it as a separate parameter, even if only as an extra correctness check.
I'd actually prefer not to have a dedicated `groups` attribute. I think we can do all the necessary validation during conversion. In both cases we should always make sure that the number of filters per group is divisible by 32 in the converter anyway.
I think I'm onboard with that. In case somebody tries to convert a grouped Larq bconv with non-multiple-of-32 group size, do you think we should throw a converter error (as this PR currently does) or fall back to an emulated bsign + float conv?
I think it's fair to throw an error for now, since TFLite doesn't support group convolutions anyway, so it would fail at runtime regardless.
Force-pushed from 9c0acf3 to 1392f3e.
Add a validation check to the converter to ensure that grouped convolutions have a group size that is a multiple of 32, and report an error otherwise.
Force-pushed from 1392f3e to 1ba74aa.
@lgeiger I've updated the PR so that it only performs the check, rather than adding the `groups` attribute.
This is great!
Do you think it would make sense to move this check into an op verifier for the bconv op, as mentioned in #406, instead? That way the check would be op specific instead of limited to this transformation. Although I am not sure if we still have all of the information there, since the bconv op will already have bitpacked inputs and might have either float or binary filters.
I really like this idea in theory, but you're right that it won't be possible once the filters have been bitpacked. E.g. the number of input channels could be 64 and the bitpacked filter channels-in dimension could be 1, and we don't know whether that's because the filter has 32 input channels (which is valid) or only 1 that's been padded by bitpacking (1 input channel, as in a depthwise conv, which would not be valid).
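The ambiguity described above can be sketched numerically (hypothetical helper; assuming channels are packed 32 per word with ceiling division):

```cpp
// Illustration of why a post-bitpacking verifier can't recover group
// information: channels are packed 32 per word with ceiling division, so
// very different channel counts collapse to the same packed dimension.
int BitpackedChannels(int channels, int bitwidth = 32) {
  return (channels + bitwidth - 1) / bitwidth;  // ceiling division
}
```

Both 32 input channels (a valid group size) and 1 input channel (a depthwise-style filter, invalid here) bitpack to a channels-in dimension of 1, so the verifier cannot distinguish them.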
That's true. In theory I think it would still work, since the filters will be bitpacked in a later pass, but this means that the verifier would be tied to this transformation anyway. Let's keep it like this for now.
What do these changes do?
This is the first of a group of PRs to add support for grouped binary convolutions. I've split the work into three PRs to make review easier.
First, this PR adds support to the converter by adding an attribute `groups` to the op definition. The correct value for `groups` is inferred by dividing the input channels by the `channels_in` dimension of the filter. An error is raised if the size of each group (the input channels divided by `groups`) is not a multiple of 32 -- I have chosen to add this constraint because it significantly simplifies the implementation in the kernels.
How Has This Been Tested?
MLIR FileCheck tests have been added to check that the attribute is correctly set, and that the error is raised if the group size is not a multiple of 32.
Benchmark Results
N/A.
Related issue number
#550, #551.