fix key update fail in non-blocking by hou2gou · Pull Request #16021 · openssl/openssl

hou2gou · 2021-07-07T23:17:00Z

In tls13 key update， the block socket is rigiht. Then in non-blocking, The keyupdate is sending when local is writing, that will get the error code SSL_R_BAD_LENGTH . The keyupdate is sending when local or peer is reading, that will get the error code SSL_R_DECRYPTION_FAILED_OR_BAD_RECORD_MAC . The key update msg interrupted the complete encryption record.
My current thinking is that when there is reading and writing, it is delayed until the reading and writing are completed to send and receive key updates.

t8m · 2021-07-08T07:52:45Z

Could you please submit a CLA? https://www.openssl.org/policies/cla.html

hou2gou · 2021-07-08T11:58:07Z

ok，I sent a successfully signed pdf email yesterday.

hou2gou · 2021-07-09T02:30:25Z

@t8m the CLA email always send fail， may I send other email address？
rcpt handle timeout,last handle info: Can not connect to opensslfoundation.org:185.231.103.75:25

hou2gou · 2021-07-13T09:10:13Z

@mattcaswell @kaduk Can you help review this PR?

mattcaswell · 2021-07-13T10:35:40Z

@mattcaswell @kaduk Can you help review this PR?

Yes, I spent some time looking at this yesterday, but need to spend a bit more time on it before I provide comments.

mattcaswell · 2021-07-13T17:12:18Z

I don't think the fix presented in this PR is the correct solution.

There are two problems being addressed.

A key update operation that is triggered while a partial record read has occurred fails when the remainder of the record is subsequently read.

The root cause of this problem is a bug in the function ssl3_setup_read_buffer(). This function may be called at times when the read buffer has already been set up. Notably this occurs at the start of the "init" phase (as occurs during a key update operation). In that case this function should "do nothing". One particular line in that function resets the value of the s->rlayer.packet pointer. This pointer is used to determine where the start of the read buffer is after alignment has been taken into account. ssl3_setup_read_buffer() doesn't correctly calculate this and just sets it to the start of the buffer regardless of alignment. This doesn't normally make a difference because it gets reset to the correct value when we start reading a new record in ssl3_read_n:

openssl/ssl/record/rec_layer_s3.c

Lines 201 to 227 in 53111a8

    
               if (!extend) { 
        
                   /* start with empty packet ... */ 
        
                   if (left == 0) 
        
                       rb->offset = align; 
        
                   else if (align != 0 && left >= SSL3_RT_HEADER_LENGTH) { 
        
                       /* 
        
                        * check if next packet length is large enough to justify payload 
        
                        * alignment... 
        
                        */ 
        
                       pkt = rb->buf + rb->offset; 
        
                       if (pkt[0] == SSL3_RT_APPLICATION_DATA 
        
                           && (pkt[3] << 8 | pkt[4]) >= 128) { 
        
                           /* 
        
                            * Note that even if packet is corrupted and its length field 
        
                            * is insane, we can only be led to wrong decision about 
        
                            * whether memmove will occur or not. Header values has no 
        
                            * effect on memmove arguments and therefore no buffer 
        
                            * overrun can be triggered. 
        
                            */ 
        
                           memmove(rb->buf + align, pkt, left); 
        
                           rb->offset = align; 
        
                       } 
        
                   } 
        
                   s->rlayer.packet = rb->buf + rb->offset; 
        
                   s->rlayer.packet_length = 0; 
        
                   /* ... now we can act as if 'extend' was set */ 
        
               }

This goes wrong when we read a partial record, perform a key update operation (and hence end up calling ssl3_setup_read_buffer) and then continue the read operation afterwards.

The reset of the packet pointer in ssl3_setup_read_buffer is completely unnecessary, and simply removing it solves the problem:

Author:     Matt Caswell <[email protected]>
AuthorDate: Tue Jul 13 17:19:12 2021 +0100
Commit:     Matt Caswell <[email protected]>
CommitDate: Tue Jul 13 17:19:12 2021 +0100

    Don't reset the packet pointer in ssl3_setup_read_buffer
    
    Sometimes this function gets called when the buffers have already been
    set up. If there is already a partial packet in the read buffer then the
    packet pointer will be set to an incorrect value. The packet pointer already
    gets reset to the correct value when we first read a packet anyway, so we
    don't also need to do it in ssl3_setup_read_buffer.

diff --git a/ssl/record/ssl3_buffer.c b/ssl/record/ssl3_buffer.c
index 861610a08b..daa175d98c 100644
--- a/ssl/record/ssl3_buffer.c
+++ b/ssl/record/ssl3_buffer.c
@@ -73,7 +73,6 @@ int ssl3_setup_read_buffer(SSL *s)
         b->len = len;
     }
 
-    RECORD_LAYER_set_packet(&s->rlayer, &(b->buf[0]));
     return 1;
 }

The second problem being addressed is where SSL_write() is called and SSL_ERROR_WANT_WRITE is returned, before subsequently calling SSL_key_update().

IMO this is just incorrect API usage. If you get SSL_ERROR_WANT_WRITE returned from SSL_write() then you must not do any other write operations other than a repeat of the previous SSL_write() call.

This is what the docs say about it:

It is safe to call SSL_read() or SSL_read_ex() when more data is available
even when the call that set this error was an SSL_write() or SSL_write_ex().
However, if the call was an SSL_write() or SSL_write_ex(), it should be called
again to continue sending the application data.

I think the docs need to be updated to be even more explicit, i.e. if you get SSL_ERROR_WANT_WRITE from SSL_write then you should not do any other IO operation other than a repeat of the previous SSL_write call.

It would also be sensible to immediately fail if SSL_key_update is called when this is the case:

Author:     Matt Caswell <[email protected]>
AuthorDate: Tue Jul 13 17:44:44 2021 +0100
Commit:     Matt Caswell <[email protected]>
CommitDate: Tue Jul 13 17:44:44 2021 +0100

    Disallow SSL_key_update() if there are writes pending
    
    If an application is halfway through writing application data it should
    not be allowed to attempt an SSL_key_update() operation. Instead the
    SSL_write() operation should be completed.

diff --git a/ssl/ssl_lib.c b/ssl/ssl_lib.c
index c1e8e41f02..892a417d93 100644
--- a/ssl/ssl_lib.c
+++ b/ssl/ssl_lib.c
@@ -2262,6 +2262,11 @@ int SSL_key_update(SSL *s, int updatetype)
         return 0;
     }
 
+    if (RECORD_LAYER_write_pending(&s->rlayer)) {
+        ERR_raise(ERR_LIB_SSL, SSL_R_BAD_WRITE_RETRY);
+        return 0;
+    }
+
     ossl_statem_set_in_init(s, 1);
     s->key_update = updatetype;
     return 1;

With all of that said I think the tests presented in this PR are very useful and should be retained, with the exception of test_key_update_local_in_write which should be modified to confirm that calling SSL_key_update when a partial write has occurred produces an error as per my patch above.

@yangyangtiantianlonglong - would you be willing to update this PR to incorporate my suggested fixes above, remove yours, and update the tests as I suggest?

hou2gou · 2021-07-14T01:38:53Z

@mattcaswell Thank you for having to review it in detail. OK, I agree with the two points you made above, and I'll update the PR and test cases.
On the second point, I think the docs need to be updated to be even more explicit, i.e. if you get SSL_ERROR_WANT_WRITE from SSL_write then you should not do any other IO operation other than a repeat of the previous SSL_write call. Is this only limited to other write I/O operations? It is safe to call SSL_read() or SSL_read_ex().

t8m · 2021-07-14T07:13:50Z

@mattcaswell to make things easier, I'd suggest submitting your two fixes as a separate PR, which I'd approve immediately. After it is merged @yangyangtiantianlonglong can rebase this PR and remove the unneeded changes but keep the tests (with the necessary adjustment).

mattcaswell · 2021-07-14T08:50:07Z

Is this only limited to other write I/O operations? It is safe to call SSL_read() or SSL_read_ex().

No, if you call SSL_write() and get SSL_ERROR_WANT_WRITE, then the next IO operation must be SSL_write().

@mattcaswell to make things easier, I'd suggest submitting your two fixes as a separate PR, which I'd approve immediately. After it is merged @yangyangtiantianlonglong can rebase this PR and remove the unneeded changes but keep the tests (with the necessary adjustment).

Yes, ok. I can do that.

mattcaswell · 2021-07-14T13:50:57Z

Yes, ok. I can do that.

Raised as #16077

hou2gou · 2021-07-15T01:41:54Z

yes, After merge in #16077，I will rebase this PR, commit testcase and doc.

hou2gou · 2021-07-15T12:02:50Z

My operation problem caused this PR to be closed, and I have recreated it in #16085，already rebase #16077.

openssl-machine added the hold: cla required The contributor needs to submit a license agreement label Jul 7, 2021

t8m closed this Jul 8, 2021

t8m reopened this Jul 8, 2021

hou2gou force-pushed the SSL_key_update branch from d8a1a27 to 38fd615 Compare July 8, 2021 14:12

hou2gou mentioned this pull request Jul 9, 2021

I can't reach the [email protected] when I submit a CLA #16034

Closed

paulidale closed this Jul 9, 2021

paulidale reopened this Jul 9, 2021

openssl-machine removed the hold: cla required The contributor needs to submit a license agreement label Jul 9, 2021

hou2gou force-pushed the SSL_key_update branch from ae2cc03 to 38fd615 Compare July 9, 2021 05:39

t8m added branch: 1.1.1 Applies to OpenSSL_1_1_1-stable branch (EOL) branch: master Applies to master branch triaged: bug The issue/pr is/fixes a bug labels Jul 9, 2021

mattcaswell mentioned this pull request Jul 14, 2021

Fix SSL_key_update() problems #16077

Closed

hou2gou closed this Jul 15, 2021

hou2gou force-pushed the SSL_key_update branch from 3cff296 to 3a1d2b5 Compare July 15, 2021 08:45

hou2gou mentioned this pull request Jul 15, 2021

Add testcases for SSL_key_update problems #16085

Closed

2 tasks

Uh oh!

Comments

Conversation

hou2gou commented Jul 7, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

t8m commented Jul 8, 2021

Uh oh!

hou2gou commented Jul 8, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hou2gou commented Jul 9, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hou2gou commented Jul 13, 2021

Uh oh!

mattcaswell commented Jul 13, 2021

Uh oh!

mattcaswell commented Jul 13, 2021

Uh oh!

hou2gou commented Jul 14, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

t8m commented Jul 14, 2021

Uh oh!

mattcaswell commented Jul 14, 2021

Uh oh!

mattcaswell commented Jul 14, 2021

Uh oh!

hou2gou commented Jul 15, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hou2gou commented Jul 15, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

hou2gou commented Jul 7, 2021 •

edited

Loading

hou2gou commented Jul 8, 2021 •

edited

Loading

hou2gou commented Jul 9, 2021 •

edited

Loading

hou2gou commented Jul 14, 2021 •

edited

Loading

hou2gou commented Jul 15, 2021 •

edited

Loading