Defragment incoming TLS handshake messages #9872

rojer · 2024-12-25T12:36:36Z

Description

Defragment incoming TLS handshake messages.

Fixes #1840

PR checklist

changelog provided
development PR provided
TF-PSA-Crypto PR not required because: TLS only
framework PR not required
3.6 PR provided [Backport 3.6] Defragment incoming TLS handshake messages (reuse badmac_seen) #9981
2.28 PR not required because: in practice this mostly matters for TLS 1.3 which is not in 2.28
tests deferred to Basic ssl-opt testing for TLS HS defragmentation #9887, Extended ssl-opt tests for TLS HS defragmentation #9987 and Extended test_suite_ssl testing for TLS HS defragmentation #9968

Signed-off-by: Deomid rojer Ryabkov <[email protected]>

Neustradamus · 2024-12-27T01:04:00Z

To follow this PR.

waleed-elmelegy-arm · 2025-01-09T14:58:10Z

Thanks @rojer for the PR. do you currently have the capacity to add automated tests?

rojer · 2025-01-09T20:25:43Z

> Thanks @rojer for the PR. do you currently have the capacity to add automated tests?

not in the near future, unfortunately. holidays are over, and so is my free time.

ChangeLog.d/tls-hs-defrag-in.txt

minosgalanakis

Looks good overall, some minor comments.

library/ssl_msg.c

library/ssl_misc.h

library/ssl_msg.c

waleed-elmelegy-arm · 2025-01-24T18:20:30Z

This check here

    if (ssl->in_msglen < mbedtls_ssl_hs_hdr_len(ssl)) {
        MBEDTLS_SSL_DEBUG_MSG(1, ("handshake message too short: %" MBEDTLS_PRINTF_SIZET,
                                  ssl->in_msglen));
        return MBEDTLS_ERR_SSL_INVALID_RECORD;
    }

causes failures when a fragment is shorter than the handshake header. I added a fix in #9928 that only applies the check when hslen is zero, but this still fails if the first fragment is only 1 byte long. We also need to check whether the final message is shorter than the minimum, so I'm not sure this is the best way.

Also we might want to add a config to enable or disable handshake fragmentation?

rojer · 2025-01-26T09:09:24Z

> if (ssl->in_msglen < mbedtls_ssl_hs_hdr_len(ssl)) {

ok, i see what you mean - good catch: we require that fragment be at least 4 bytes, which is somewhat of an arbitrary requirement. your change relaxes it to only the first fragment being at least 4 bytes.
i adopted your check with a slight change: i am testing ssl->in_hslen, which i think is the same for the purpose of checking if it is the first fragment or not. in practice, of course, for a fragmented handshake we will have in_hslen == in_hsfraglen == 0 on the first fragment, then in_hslen set to some value X from the header, and in_hsfraglen set to some Y < X.
we could test either of them but i find that in_hslen == 0 is easier to understand.
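
A minimal sketch of the relaxed check described above, assuming a simplified stand-in for the SSL context. `struct hs_state`, `check_fragment_len` and `HS_HDR_LEN` are hypothetical names used for illustration, not Mbed TLS API; only the field names `in_hslen` and `in_hsfraglen` come from this thread. The 4-byte handshake header is required only while no message is in progress:

```c
#include <assert.h>
#include <stddef.h>

/* TLS handshake header: 1-byte message type + 3-byte body length. */
#define HS_HDR_LEN 4

/* Hypothetical, simplified stand-in for the context fields discussed above. */
struct hs_state {
    size_t in_hslen;     /* expected message length, 0 while no message is in progress */
    size_t in_hsfraglen; /* bytes of the message accumulated so far */
};

/* Return 0 if a fragment of in_msglen bytes is acceptable, -1 if it is too short. */
static int check_fragment_len(const struct hs_state *st, size_t in_msglen)
{
    assert(st != NULL);
    assert(st->in_hsfraglen <= st->in_hslen);

    if (st->in_hslen == 0 && in_msglen < HS_HDR_LEN) {
        return -1; /* the first fragment must contain the whole header */
    }
    return 0;      /* later fragments may be shorter than the header */
}
```

With this shape, a 1-byte tail fragment passes, while a first fragment shorter than 4 bytes is still rejected.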

in theory we should be able to reassemble a message chopped into 1 byte fragments, but i think it's unlikely to occur in practice (whereas tail fragments of 1 byte are more likely). supporting fragmented handshake message header would require significant change to the code, so perhaps leave it for the future?

> Also we might want to add a config to enable or disable handshake fragmentation?

@mpg suggested that defragmentation (which is what we're doing here) should always happen, and fragmentation on the way out is controlled by negotiated max fragment length / record size (or rather will be in the future).

mpg · 2025-01-27T08:29:36Z

> in theory we should be able to reassemble a message chopped into 1 byte fragments, but i think it's unlikely to occur in practice (whereas tail fragments of 1 byte are more likely). supporting fragmented handshake message header would require significant change to the code, so perhaps leave it for the future?

Agreed, we're not trying to achieve maximal coverage of all cases that could theoretically happen, just the ones that are likely to happen in practice.

> @mpg suggested that defragmentation (which is what we're doing here) should always happen, and fragmentation on the way out is controlled by negotiated max fragment length / record size (or rather will be in the future).

Indeed, and that's still my opinion. There's a big difference between defragmentation (reassembly) and fragmentation: we control what we send, not what we receive. The peer is free to send fragmented messages at any time for any reason, and this doesn't need to be negotiated, that's part of the core TLS standard and now it's something that popular servers happen to do in practice. So I'm inclined to think we should always be prepared for it: if we're not, we'll fail handshakes for no good reason (the current situation before this PR).

minosgalanakis · 2025-02-03T17:43:45Z

This PR has been tested alongside #9928 in #9948. We may use the latter as a way of integrating and merging both PRs.

Also, as a side note, commit afa11db needed the change introduced by 31f2d82, but I may address that in #9948.

It looks good so far. Thank you for updating the contributions promptly.

mpg

As Waleed is on holiday, I'm taking over as a reviewer. I didn't re-review everything, just the last commits that Waleed hadn't approved yet, which look good to me, but I couldn't help looking at the ChangeLog entry.

Also, as Minos already mentioned, can you cherry-pick the commit that fixes the "unused variable" warning/error in builds without DEBUG_C? CI is unlikely to pass without that.

mpg · 2025-02-04T09:21:17Z

ChangeLog.d/tls-hs-defrag-in.txt

@@ -0,0 +1,2 @@
+Changes
+   * Defragment incoming TLS handshake messages.


I think we can even classify that as a bugfix: (1) there is nothing in the spec that says support for receiving fragmented messages is optional (quite the opposite: it's the first point in the implementation pitfalls section), and (2) it was causing interop failures in practice.

Also, I think we should expand a bit so that users who are not aware of TLS internals may make more sense of the entry, perhaps:

Bugfix
   * Support re-assembly of fragmented handshake messages in TLS, as mandated by the spec. Lack of support was causing handshake failures with some servers, especially with TLS 1.3 in practice (though both protocol versions could be affected in principle, and both are fixed now).

minosgalanakis · 2025-02-05T10:37:14Z

include/mbedtls/ssl.h

@@ -1808,6 +1808,8 @@ struct mbedtls_ssl_context {

    size_t MBEDTLS_PRIVATE(in_hslen);            /*!< current handshake message length,
                                                    including the handshake header   */
+    unsigned char *MBEDTLS_PRIVATE(in_hshdr);    /*!< original handshake header start  */


One more comment that was posted on the tests' PR, #9928:

@gilles-peskine-arm commented:

> Introducing a new buffer pointer creates two risks: buffer overflow, and use-after-free.
>
> I would prefer to have more clarity as to the size of the (sub-)buffer accessible from in_hshdr (it isn't in_hsfraglen), and also the impact on the in_hdr field (which has become less straightforward). I don't have any bad feeling about the current code here, but I'd prefer to have better documentation to help future maintainers (I do have a bad feeling about someone later fixing a bug that's unrelated to fragmentation, and missing some interaction between their bug and fragmentation).
>
> I am a little worried about a use-after-free. handle_buffer_resizing takes care of this pointer, and it's the only risky place I can think of, but I am not very confident. Please reset in_hshdr in mbedtls_ssl_update_in_pointers.

rojer · 2025-02-05

in_hshdr points to the first handshake fragment and may legitimately stay behind the current/last message and thus should not be reset in mbedtls_ssl_update_in_pointers.

the overview of the approach to defragmentation is as follows:

we start with an empty buffer, in_hslen == 0 and in_hshdr == NULL

when the first record of the handshake message is received (which has to be at least 4 bytes) we parse the expected length into in_hslen, set in_hshdr = in_hdr and in_hsfraglen = in_msglen.

as more fragments arrive, we accumulate them in the buffer and keep track of the total length so far in_hsfraglen until it covers the entire expected handshake message. during this time in_hdr advances forward while in_hshdr stays behind.

once we have enough fragments buffered, we merge them, starting from in_hshdr, removing the record headers and obtaining a complete handshake message, which we then process and start over.

i hope this makes it clear. any suggestions wrt comments or documentation are welcome.
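
The accumulate-then-merge steps above can be pictured with the following sketch. It is an illustration under simplifying assumptions, not the code from this PR: `merge_records` and `REC_HDR_LEN` are hypothetical names, records are assumed to sit back to back in the buffer as a 5-byte record header followed by plaintext payload, and the sequence-number prefix, explicit IVs and the `in_hslen` bookkeeping are all ignored.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* TLS record header: type (1) + version (2) + length (2). */
#define REC_HDR_LEN 5

/* Merge the records stored back to back in buf[0..buf_len) into a single
 * record, in place: the first header is kept, later headers are dropped and
 * the payloads are moved so they become contiguous.
 * Returns the merged payload length, or 0 if the input is empty or malformed. */
static size_t merge_records(unsigned char *buf, size_t buf_len)
{
    assert(buf != NULL);
    if (buf_len < REC_HDR_LEN) {
        return 0;
    }

    unsigned char *dst = buf + REC_HDR_LEN; /* start of the merged payload */
    size_t off = 0;                         /* offset of the record being read */
    size_t merged = 0;                      /* payload bytes merged so far */

    while (off < buf_len) {
        if (buf_len - off < REC_HDR_LEN) {
            return 0;                       /* truncated record header */
        }
        size_t rec_len = ((size_t) buf[off + 3] << 8) | (size_t) buf[off + 4];
        if (rec_len > buf_len - off - REC_HDR_LEN) {
            return 0;                       /* record claims more than we hold */
        }
        /* Source and destination may overlap, hence memmove. */
        memmove(dst + merged, buf + off + REC_HDR_LEN, rec_len);
        merged += rec_len;
        off += REC_HDR_LEN + rec_len;
    }

    if (merged > 0xFFFF) {
        return 0;                           /* does not fit the 2-byte length */
    }
    buf[3] = (unsigned char) (merged >> 8); /* patch the first header's length */
    buf[4] = (unsigned char) (merged & 0xFF);
    return merged;
}
```

Because the walk re-reads every record header, any unexpected bytes between records would throw off all later offsets, which is why the exact buffer layout matters so much for this approach.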

as a more general note, i think that an extensive comment or perhaps even a readme explaining the ways in/out buffers and pointers are managed would be helpful. it took me a while to wrap my head around it when i started.

gilles-peskine-arm · 2025-02-05

Thanks for the explanation, that's very useful!

Would you mind adding comments? Both on the context fields, and also a comment in mbedtls_ssl_update_in_pointers to explain why this in-pointer should not be updated.

> i think that an extensive comment or perhaps even a readme explaining the ways in/out buffers and pointers are managed would be helpful. it took me a while to wrap my head around it when i started.

Absolutely, yes! The more we add features like defragmentation, the more complicated it is, and it doesn't help that there's so little documentation. Whenever we identify new wisdom about how things work, we should write them down inside the code, even if it's incomplete.

rojer · 2025-02-13

actually, upon reflection i realized that we don't need to store this pointer: it will always be the first message in the buffer. so, instead in dd14c0a i'm removing it.

i tested manually with facebook as well as #9928 with this change - defrag tests are passing.

i also expanded in_hsfraglen comment a bit to clarify that it goes from 0 up to in_hslen.

mpg

LGTM (again, incremental review, did not review things that were already approved by Waleed).

mpg · 2025-02-19T09:46:00Z

@rojer For your information, our new plan (since Monday) is to merge this in a temporary feature branch as soon as this is approved (and the 3.6 backport), so that (1) this doesn't have to wait for the various testing PRs and (2) the various testing PRs have a more stable basis in the meantime. Once all the testing PRs (and possibly additional PRs fixing any issues found by the tests) are merged in the feature branch, we'll merge the feature branch into the main branches (development and 3.6).

minosgalanakis

LGTM

gilles-peskine-arm · 2025-02-24T13:44:36Z

library/ssl_msg.c

+            ssl->in_hdr = ssl->in_msg + ssl->in_msglen;
+            ssl->in_msglen = 0;
+            mbedtls_ssl_update_in_pointers(ssl);


Testing in #9989 reveals a bug: defragmentation of encrypted records does not work correctly in TLS 1.2 when the symmetric encryption is CBC (EtM or not), CCM or GCM. It works with any TLS 1.3 cipher suite, and with ChachaPoly and null encryption in TLS 1.2. The problematic cases are exactly the ones where the record includes an explicit IV (always 8 bytes).

In the initial handshake, only the Finished message is likely to be affected, and it's only 16 bytes, so this happens when the fragment size is less than 16 bytes. This can happen with larger fragment sizes during renegotiation.

In problematic scenarios, the fragment reassembly loop looks for the second fragment 8 bytes too early in the buffer. There is an 8-byte gap between the fragments, but the reassembly expects them to be consecutive.

It's not 100% clear to me yet, but I think the offset update here isn't correct. As far as I can tell from looking at an example in a debugger, at the end of the first incomplete fragment, ssl->in_msg and ssl->in_hdr end up pointing 8 bytes past the end of the fragment.

gilles-peskine-arm · 2025-02-24T13:51:52Z

library/ssl_msg.c

+            unsigned char *in_first_hdr = ssl->in_buf + MBEDTLS_SSL_SEQUENCE_NUMBER_LEN;
+            unsigned char *p = in_first_hdr, *q = NULL;
+            size_t merged_rec_len = 0;
+            do {


As I was trying to analyze and fix https://github.com/Mbed-TLS/mbedtls/pull/9872/files#r1967669302, I tried to make sense of what the various pointers and offsets should be at any given point (ssl->in_msg, ssl->in_hdr, ssl->in_hsfraglen, etc., as well as the various record fields while processing records), and I have a hard time figuring out whether the value I'm seeing is the value that should be there at any given point during parsing. I'm not sure if the reassembly loop should be changed, or if it's getting unexpected data.

One of the difficulties in understanding the code is that fragment accumulation is completely disconnected from fragment reassembly, so reassembly has to re-parse data. I would find it easier to understand if the structure of the code was: when we have finished parsing a fragment, if it wasn't the initial fragment, then merge it with the initial fragment. With this structure, there'd be fewer moving parts. Fragment reassembly would have access to the offsets and record data from the latest fragment, and wouldn't need to do any parsing, so it would be easier to figure out offsets. There would also be fewer opportunities for parsing errors or integer/buffer overflows. In addition, the input buffer would fill out less quickly — the current structure adds a 5- or 13-byte overhead per fragment. @rojer Did you try doing fragment reassembly incrementally? Are there any difficulties in doing it that way?
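
One way the incremental structure proposed here could look is sketched below. This is not code from the PR: `struct reasm`, `reasm_add` and `HS_HDR_LEN` are hypothetical, and the sketch makes simplifying assumptions. Each fragment is taken to be already-decrypted record payload, a record is assumed never to carry the end of one message together with the start of the next, and the first fragment must hold the whole 4-byte handshake header, as required elsewhere in this thread.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* TLS handshake header: 1-byte message type + 3-byte body length. */
#define HS_HDR_LEN 4

/* Hypothetical reassembly state; hslen and fraglen play the roles of
 * in_hslen and in_hsfraglen from the discussion. */
struct reasm {
    unsigned char *msg; /* where the handshake message is assembled */
    size_t cap;         /* capacity of msg in bytes */
    size_t hslen;       /* expected total length, header included; 0 = idle */
    size_t fraglen;     /* bytes assembled so far, from 0 up to hslen */
};

/* Append one fragment as soon as it has been parsed.
 * Returns 1 when the message is complete, 0 if more fragments are needed,
 * -1 on error. The state is left unchanged on error. Once a message is
 * complete, the caller sets hslen and fraglen back to 0 before the next one. */
static int reasm_add(struct reasm *r, const unsigned char *frag, size_t len)
{
    assert(r != NULL && r->msg != NULL && frag != NULL);

    size_t hslen = r->hslen;
    size_t fraglen = r->fraglen;

    if (hslen == 0) {
        /* First fragment: it must contain the full handshake header. */
        if (len < HS_HDR_LEN) {
            return -1;
        }
        hslen = HS_HDR_LEN + (((size_t) frag[1] << 16) |
                              ((size_t) frag[2] << 8) |
                              (size_t) frag[3]);
        fraglen = 0;
        if (hslen > r->cap) {
            return -1;
        }
    }

    /* Reject empty fragments and fragments running past the message end. */
    if (len == 0 || len > hslen - fraglen) {
        return -1;
    }

    memmove(r->msg + fraglen, frag, len);
    r->hslen = hslen;
    r->fraglen = fraglen + len;
    return r->fraglen == hslen ? 1 : 0;
}
```

Since each fragment is merged when it arrives, no record headers accumulate in the buffer and nothing needs to be parsed a second time.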

rojer · 2025-02-25

no, i don't see any reason why it couldn't be done that way. this just happened to be the way i went at it at the time.

hrushikesh430 · 2025-03-05T11:12:20Z

Hi, will this fix be backported to version 3.6.2?

davidhorstmann-arm · 2025-03-05T11:19:55Z

> Hi, will this fix be backported to version 3.6.2?

It will be backported to the 3.6 LTS, so it will appear in 3.6.3 which can be easily upgraded-to from 3.6.2 (see #9981). We don't backport to specific point releases, only to LTS versions, but the upgrade between them should be almost entirely painless.

hrushikesh430 · 2025-03-05T11:22:19Z

> > Hi, will this fix be backported to version 3.6.2?
>
> It will be backported to the 3.6 LTS, so it will appear in 3.6.3 which can be easily upgraded-to from 3.6.2 (see #9981). We don't backport to specific point releases, only to LTS versions, but the upgrade between them should be almost entirely painless.

Thank you, we are only concerned about 3.6 LTS.

mpg · 2025-03-05T11:26:27Z

> so it will appear in 3.6.3

Sorry, but it will appear in 3.6.3 only if it is ready in time; otherwise it will have to wait for 3.6.4. We are working very hard to include this in 3.6.3, but it won't ship if we don't have a sufficient level of testing to be confident it's not introducing security issues (we already found at least one while extending testing), and we're not there yet. Again, we're hard at work on this, because we'd really like to have a resolution in 3.6.3.

Defragment incoming TLS handshake messages

ac2cf1f

Signed-off-by: Deomid rojer Ryabkov <[email protected]>

This was referenced Dec 25, 2024

TLS handshake fragmentation support #8981

Open

TLS Handshake record layer fragmentation not working #1840

Open

Harry-Ramsey added the enhancement, needs-review, and needs-reviewer labels Dec 27, 2024

waleed-elmelegy-arm added the component-tls label Dec 30, 2024

mpg requested review from mpg and waleed-elmelegy-arm January 2, 2025 09:31

mpg mentioned this pull request Jan 9, 2025

Basic ssl-opt testing for TLS HS defragmentation #9887

Open

minosgalanakis self-requested a review January 13, 2025 10:39

mpg removed their request for review January 13, 2025 10:39

mpg removed the needs-reviewer label Jan 13, 2025

minosgalanakis reviewed Jan 15, 2025

ChangeLog.d/tls-hs-defrag-in.txt Outdated

Update ChangeLog.d/tls-hs-defrag-in.txt

5f7c2c2

Co-authored-by: minosgalanakis <[email protected]> Signed-off-by: Deomid Ryabkov <[email protected]>

minosgalanakis self-requested a review January 16, 2025 15:27

minosgalanakis reviewed Jan 17, 2025

library/ssl_msg.c Outdated

library/ssl_msg.c Outdated

library/ssl_misc.h Outdated

library/ssl_msg.c

library/ssl_msg.c

Review comments

cad11ad

Signed-off-by: Deomid rojer Ryabkov <[email protected]>

rojer force-pushed the tls_hs_defrag_in branch from f0e848a to cad11ad Compare January 18, 2025 13:59

waleed-elmelegy-arm mentioned this pull request Jan 24, 2025

Add TLS Handshake defragmentation tests #9928

Open

6 tasks

rojer added 2 commits January 26, 2025 11:12

Remove mbedtls_ssl_reset_in_out_pointers

3dfe75e

Signed-off-by: Deomid rojer Ryabkov <[email protected]>

Allow fragments less HS msg header size (4 bytes)

aaa152e

Except the first Signed-off-by: Deomid rojer Ryabkov <[email protected]>

rojer force-pushed the tls_hs_defrag_in branch from 60d0e43 to aaa152e Compare January 26, 2025 09:12

Add a safety check for in_hsfraglen

b70e76a

Signed-off-by: Deomid rojer Ryabkov <[email protected]>

mpg reviewed Feb 4, 2025

minosgalanakis reviewed Feb 5, 2025

rojer and others added 2 commits February 5, 2025 13:09

Update the changelog message

eb77e5b

Signed-off-by: Deomid rojer Ryabkov <[email protected]>

Remove unused variable in ssl_server.c

cf4e6a1

Signed-off-by: Waleed Elmelegy <[email protected]> Signed-off-by: Deomid rojer Ryabkov <[email protected]>

rojer force-pushed the tls_hs_defrag_in branch from 788aed3 to cf4e6a1 Compare February 5, 2025 11:10

Faless mentioned this pull request Feb 11, 2025

[4.4 beta 1] TLS Handshake Error with Godot HTTPRequest godotengine/godot#101910

Closed

Remove in_hshdr

dd14c0a

The first fragment of a fragmented handshake message always starts at the beginning of the buffer so there's no need to store it. Signed-off-by: Deomid rojer Ryabkov <[email protected]>

gilles-peskine-arm mentioned this pull request Feb 13, 2025

[Backport 3.6] Defragment incoming TLS handshake messages (reuse badmac_seen) #9981

Merged

6 tasks

mpg changed the base branch from development to features/tls-defragmentation/development February 17, 2025 12:02

mpg requested review from mpg and minosgalanakis February 17, 2025 12:02

This was referenced Feb 17, 2025

Add basic handshake defragmentation tests in ssl-opt #9989

Merged

Issue9887 extend add basic defragmentation tests #9990

Open

mpg approved these changes Feb 19, 2025

minosgalanakis approved these changes Feb 19, 2025

mpg added this pull request to the merge queue Feb 21, 2025

github-merge-queue bot removed this pull request from the merge queue due to no response for status checks Feb 21, 2025

mpg merged commit 28f8e20 into Mbed-TLS:features/tls-defragmentation/development Feb 24, 2025
3 of 4 checks passed

gilles-peskine-arm reviewed Feb 24, 2025
