Add cropping metadata #346

mattrwoz · 2024-03-19T20:55:33Z

This adds cropping metadata as proposed in CWG-D038, which was accepted at the 10/10/2023 CWG meeting.
Most of the text is the same as the proposal document, with some minor adjustments:

- Replaced "valid picture data" with "rendered"
- Clarified the scope of the metadata to be from "metadata_crop OBU to Keyframe and/or next metadata_crop OBU" instead of just "until next metadata_crop OBU"

podborski · 2024-04-01T17:09:40Z

06.bitstream.syntax.md

+| metadata_crop( ) {                                        | **Type**
+|     @@crop_width_minus_1                                  | f(16)
+|     @@crop_height_minus_1                                 | f(16)
+|     @@crop_offset_present                                 | f(1)


Why don't we make it byte-aligned in the first place? Having something like:

f(7) reserved f(1) crop_offset_present;

would be better IMO than relying on the trailing_bits at the end of the OBU.
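To make the byte-alignment suggestion concrete, here is a minimal sketch of a parser for the layout proposed in this comment. It is not the spec's syntax: the `BitReader` helper and the function name are hypothetical, and the 16-bit width used for `crop_x_offset` and `crop_y_offset` is an assumption, since the quoted diff does not show those rows.

```python
class BitReader:
    """Reads MSB-first fixed-width unsigned fields from a byte string."""

    def __init__(self, data: bytes):
        self.data = data
        self.pos = 0  # current read position, in bits

    def f(self, n: int) -> int:
        """Read an n-bit unsigned value, most significant bit first."""
        value = 0
        for _ in range(n):
            byte = self.data[self.pos // 8]
            bit = (byte >> (7 - self.pos % 8)) & 1
            value = (value << 1) | bit
            self.pos += 1
        return value


def parse_metadata_crop_aligned(payload: bytes) -> dict:
    """Parse the byte-aligned variant suggested in the review comment."""
    r = BitReader(payload)
    fields = {
        "crop_width_minus_1": r.f(16),
        "crop_height_minus_1": r.f(16),
    }
    r.f(7)  # reserved bits, as suggested in the review
    fields["crop_offset_present"] = r.f(1)
    if fields["crop_offset_present"]:
        # ASSUMPTION: offset field widths are not shown in the quoted diff;
        # f(16) is used here only for illustration.
        fields["crop_x_offset"] = r.f(16)
        fields["crop_y_offset"] = r.f(16)
    # With 7 reserved bits, every field ends on a byte boundary.
    assert r.pos % 8 == 0
    return fields
```

With this layout the payload is 5 bytes without offsets and 9 bytes with them (under the assumed offset widths), so nothing depends on the trailing bits for alignment.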

podborski · 2024-04-01T17:23:26Z

07.bitstream.semantics.md

+#### Metadata crop semantics
+
+When present the metadata_crop OBU applies starting at the next frame in the sequence 
+with matching temporal_id and spatial_id, and shall apply to all matching frames until the next Key Frame


When present the metadata_crop OBU applies starting at the next frame in the sequence
with matching temporal_id and spatial_id, ...

Alexis and I are wondering what the behavior should be if no extension header is present. Perhaps in that case it should apply to all layers? If so, the language on persistence should also be clarified.

Also instead of:

and shall apply to all matching frames

it would be better to write "and shall apply to all frames with a matching temporal_id and spatial_id"

Yes. I think it should apply to all layers if no header is present. I will add wording clarifying that.
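The layer-matching rule discussed in this thread can be sketched as follows. This is only an illustration of the agreed intent (match on `temporal_id` and `spatial_id`, or apply to all layers when no extension header is present), not wording from the spec. The `CropScope` type and `crop_applies` function are hypothetical names, and persistence ending at the next Key Frame or metadata_crop OBU is not modeled here.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class CropScope:
    """Layer ids carried by the metadata_crop OBU's extension header.

    Both ids are None when the OBU has no extension header.
    """
    temporal_id: Optional[int] = None
    spatial_id: Optional[int] = None


def crop_applies(scope: CropScope, frame_temporal_id: int, frame_spatial_id: int) -> bool:
    """Return True if the crop metadata applies to a frame on the given layer."""
    if scope.temporal_id is None or scope.spatial_id is None:
        # No extension header: per the discussion, apply to all layers.
        return True
    return (scope.temporal_id == frame_temporal_id
            and scope.spatial_id == frame_spatial_id)
```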

podborski · 2024-04-01T17:37:17Z

07.bitstream.semantics.md

+When present the metadata_crop OBU applies starting at the next frame in the sequence 
+with matching temporal_id and spatial_id, and shall apply to all matching frames until the next Key Frame
+or metadata_crop OBU. The output picture should be cropped to the region as specified in the 
+cropping metadata OBU. When applied, the crop shall be after all normal decode operations as a 


When applied, the crop shall be after all normal decode operations as a post-processing step (after film grain synthesis in the output process). This metadata has no effect on the decoding process.

Maybe it would be better to re-write like this:

The crop shall be applied after all normal decode operations as a post-processing step. This metadata information has no effect on the decoding process.

Used your new wording.

podborski · 2024-04-01T17:41:36Z

07.bitstream.semantics.md

+
+**crop_y_offset** specifies the minimum pixel row containing picture data which should be rendered.
+
+**crop_width_minus_1** specifies the number of pixel columns minus 1 which which should be rendered after applying crop_x_offset.


which which => which
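As a rough illustration of the semantics quoted above, the crop can be treated as a pure post-processing step on the decoded output picture. The sketch below assumes the frame is a list of pixel rows and that `crop_x_offset` and `crop_height_minus_1` mirror the quoted definitions of `crop_y_offset` and `crop_width_minus_1`; the function name `apply_crop` is hypothetical.

```python
def apply_crop(frame, crop_x_offset, crop_y_offset,
               crop_width_minus_1, crop_height_minus_1):
    """Return the rendered region of an already-decoded frame.

    `frame` is a list of rows, each row a list of pixel values. The decoded
    frame itself is not modified, which reflects that the metadata has no
    effect on the decoding process.
    """
    width = crop_width_minus_1 + 1
    height = crop_height_minus_1 + 1
    # crop_y_offset is the first rendered row, crop_x_offset the first
    # rendered column (assumed symmetric to crop_y_offset).
    rows = frame[crop_y_offset:crop_y_offset + height]
    return [row[crop_x_offset:crop_x_offset + width] for row in rows]
```

For example, on a 4x3 frame, offsets of (1, 1) with `crop_width_minus_1 = 1` and `crop_height_minus_1 = 0` select a 2x1 region starting at column 1 of row 1.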

podborski · 2024-04-01T18:25:07Z

07.bitstream.semantics.md

+When muxed into a container that supports signaling cropping information, this metadata should 
+be removed from the bitstream and included in the container’s signaling mechanism.  
+If both the container and bitstream signal cropping information, then the container’s cropping 
+information takes precedence.


Instead of:

When muxed into a container that supports signaling cropping information, this metadata should
be removed from the bitstream and included in the container’s signaling mechanism.
If both the container and bitstream signal cropping information, then the container’s cropping
information takes precedence.

We could write:

In cases where cropping information is present in both the bitstream and the delivery or container format, the latter should be preferred.

NOTE: Container or delivery formats that package the AV1 bitstream are recommended to address this redundancy, potentially by excluding bitstream cropping information when it is already available in the container or delivery format.

Then we will have to update the AV1 ISOBMFF spec to say that we use the clap box and remove this metadata OBU. Thoughts, @cconcolato?

Updated to new wording.
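A player-side reading of the proposed wording might look like the sketch below: if the container or delivery format carries cropping information, prefer it, and otherwise fall back to the bitstream metadata. The function name and the use of `None` for "not signaled" are assumptions for illustration only.

```python
from typing import Optional


def select_crop(container_crop: Optional[dict],
                bitstream_crop: Optional[dict]) -> Optional[dict]:
    """Pick which cropping information to use for rendering.

    Each argument is None when that source signals no cropping information.
    """
    if container_crop is not None:
        # Container or delivery format information is preferred when both exist.
        return container_crop
    return bitstream_crop
```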

podborski · 2024-06-25T15:08:15Z

There is some relevant discussion in the file format group on how such metadata can coexist: MPEGGroup/FileFormat#77

Unlike H264/H265, AV1 contains no fields to crop encoded output to specific sizes. AMD's hardware cannot handle encoding of unaligned dimensions for AV1, hence it codes 1920x1080 as 1920x1088. Add side data to crop the output back to the original dimensions. There's an AV1-spec extension planned to fix this here: AOMediaCodec/av1-spec#346, but it seems to be stuck for now.
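For the 1920x1088 case described above, the crop fields from this PR would work out as in the following sketch. It assumes the padding sits at the bottom and right of the coded picture, so both offsets are zero; the function name is hypothetical.

```python
def crop_fields_for(coded_width, coded_height, display_width, display_height):
    """Compute metadata_crop field values that restore the display size.

    ASSUMPTION: padding was added at the bottom/right edges, so the rendered
    region starts at the top-left corner of the coded picture.
    """
    if display_width > coded_width or display_height > coded_height:
        raise ValueError("display size must fit inside the coded size")
    return {
        "crop_width_minus_1": display_width - 1,
        "crop_height_minus_1": display_height - 1,
        "crop_x_offset": 0,
        "crop_y_offset": 0,
    }
```

Calling `crop_fields_for(1920, 1088, 1920, 1080)` gives `crop_width_minus_1 = 1919` and `crop_height_minus_1 = 1079`, which drops the 8 padded rows at the bottom.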

mattrwoz added 3 commits March 18, 2024 16:46

Define new metadata for cropping

3b33d6f

Re-scope to key frames

2ed153a

clarify both key and additional metadata

5ac8502


Feedback fixes

bb04abd
