x509_vfy.c: Make chain building more intuitive, flexible, and complete by DDvO · Pull Request #13748 · openssl/openssl

DDvO · 2020-12-29T12:42:51Z

UPDATE: this may need further improvements and is put on hold until v3.0.0 is released.

This PR includes some general improvements of the chain building algorithm and fixes a design flaw of build_chain():
it makes little sense to try finding an issuer of the ~~current end of the chain~~ target cert if this cert can be found immediately in the trust store (and is actually trusted).
So better stop chain building as early as possible, i.e, as soon as reaching a trust anchor.
This is more in line with RFC 4158 advising for an efficient construction of a chain with minimal length
and should also minimize the risk of picking a non-suitable path,
which is of particular importance in the given implementation because it does not backtrack.

Note that the changes do not aim at changing the implicit trust anchor behavior:
If a certificate is in the trust store and does not have trust attributes,
accept it as implicit trust anchor if it is self-signed or X509_V_FLAG_PARTIAL_CHAIN is set.

Also improve ossl_cmp_build_cert_chain() using the improve implementation
and make it public under the name OSSL_build_cert_chain(). Update: this side-topic moved to #14128.

Moreover, generalize the internal check_trust() to work not with leaf but with current cert.
This allows getting rid of strange exceptional cases in find_issuer()
(and maybe also in other parts of the overall chain building algorithm).
Among others, this provides a more general fix (compared to #13762)
of the regression introduced reported for v1.1.1 in #13739.

Checklist

documentation is added or updated
tests are added or updated

richsalz · 2020-12-29T14:41:03Z

I am worried about changing behavior. Please make the new behavior off by default.

DDvO · 2021-01-01T20:42:50Z

I am worried about changing behavior.

@richsalz can you make more concrete what you fear could go wrong here?
The change in question just prefers short chains during chain building, which improves its completeness.
The correctness of chain building is anyway checked later during chain verification, right?
Moreover, one should hope that the multitude of existing tests on chain building and verification covers all critical cases, and none of them fails after this change.
So I cannot see where a real danger should come in here.

DDvO · 2021-01-01T20:56:24Z

The provided change happens to fix an issue that recently was reported on the openssl-users mailing list:

Someone wants to use a directly trusted self-signed SSL server cert
and for some reason puts this cert in the list of trusted certs after another version of this cert with same subject (and SAN extensions) but with a different public key.
So far, the chain building chose the other cert (because it matches first) and then failed to verify the server cert (also because no backtracking is done).
Instead, after the change provided here the chain building immediately takes the right server cert (found in second position) in the trusted list and verification succeeds.

Please make the new behavior off by default.

As the example just given shows, this would be counter-productive.

richsalz · 2021-01-01T22:55:11Z

With this change, what is the need for the PARTIAL_CHAINS flag?

richsalz · 2021-01-01T23:14:40Z

The fact that someone reported an issue does not outweigh the current users who are depending on the present semantics.

kroeckx · 2021-01-01T23:23:14Z

Have you seen the discussion in #7871?

richsalz · 2021-01-02T14:12:00Z

@kroeckx yes I have read the discussion in #7871. (My comment there raised the same issue as I did here: this is a behavior change happening very late in the 3.0 release process and I am opposed.) I wryly note @vdukhovni's comment, #7871 (comment), that @DDvO gave a thumbs-up to. :)

@DDvO, my question "what does this to the requiring PARTIAL_CHAINS" was directed to you.

This is a behavior change. There are no tests added that show the behavior change before and after. And when I pushed on that, the response was "one should hope that the multitude of existing tests on chain building and verification covers all critical cases, and none of them fails after this change." HOPE IS NOT GOOD ENOUGH. Here is a use-case that is not covered, for example: all sorts of certs are thrown into a trust store, including "trusted roots" and intermediaries. A few users, that want a particular path through a new intermediary, turn on PARTIAL_CHAINS. Consider when a CA adds a new digest mechanism (or crypto algorithm) that isn't supported on some devices.

Chain building and trust validation is hard, full of dark and mysterious corner cases, and changing it should be done upfront EARLY in the release plan, to socialize it (as Viktor said). It should not be done LATE while we are trying to count the days to get the BETA out.

kroeckx · 2021-01-02T15:13:15Z

@richsalz: My comment was directed at @DDvO, since I did not notice him participating in that issue, unlike you. I have to agree with most of what you say.

beldmit · 2021-01-02T15:24:50Z

@openssl/otc, I think this PR deserves a special OTC discussion.

DDvO · 2021-01-02T15:30:54Z

I became aware of the discussion in #7871 just a couple of hours back, and meanwhile I've added to two major contributions to it.
In the light of that discussion I've started refining this PR w.r.t. the treatment of partial chains...

DDvO · 2021-01-04T15:50:29Z

With this change, what is the need for the PARTIAL_CHAINS flag?

This PR does not aim at changing the semantics of X509_V_FLAG_PARTIAL_CHAIN, which is a different issue, still under discussion in #7871.
The improvements done here just make sure that more valid chains are found (assuming the same set of trust anchors as before).
In other words, it should not change trust policies but does increase chain building completeness.

DDvO · 2021-01-04T15:57:01Z

The fact that someone reported an issue does not outweigh the current users who are depending on the present semantics.

Sure, but I still doubt that someone is depending on the incompleteness of the hitherto implementation.
A counterexample would convince me of the contrary. @richsalz, the challenge to provide one is still open 😉

DDvO · 2021-01-04T16:03:21Z

There are no tests added that show the behavior change before and after.

I've meanwhile added the example discussed recently on the openssl-users list that I mentioned above:
https://www.mail-archive.com/[email protected]/msg88961.html

It demonstrates the effectiveness of the enhancements done here for an extremely simple case.
Conversely, the hitherto implementation had been failing even in such extremely simple cases.

richsalz · 2021-01-04T16:05:40Z

It will take me a couple of days of elapsed time to run some experiments.

vdukhovni · 2021-01-24T03:19:14Z

I haven't looked at this PR in a little while. There are some good bits here IIRC and some things here and some more controversial. Perhaps you could stage just the parts that are cleaning up code and not changing policy into a smaller separate PR, and we can then think about just the policy changes separately, if you're still keen on those too?

DDvO · 2021-02-01T22:29:29Z

Yeah I'll do that (as soon as I find the time for it).

DDvO · 2021-02-09T14:15:23Z

I haven't looked at this PR in a little while. There are some good bits here IIRC and some things here and some more controversial. Perhaps you could stage just the parts that are cleaning up code and not changing policy into a smaller separate PR, and we can then think about just the policy changes separately, if you're still keen on those too?

@vdukhovni, I've meanwhlle carved out all unrelated improvements,
fixed the minor issues you pointed out here,
updated/corrected/extended some comments, and
re-visited the topics discussed above (see there for details).

In a nutshell:
I'm meanwhile more convinced than ever that the changes proposed here are correct and even needed for improving completeness and to avoid inefficient and even counterproductive extra checks (which can lead also to spurious errors).
Moreover, the code has become a little cleaner and thus better to read and maintain.

richsalz · 2021-02-09T14:18:16Z

@DDvO, does your latest comment mean you want this in 3.0?

DDvO · 2021-02-09T14:29:41Z

@DDvO, does your latest comment mean you want this in 3.0?

Not really - I did not change the milestone, which still is Post 3.0.0.
I do not see the proposed improvements as urgent
but just wanted to update the contents of this PR
and give the possibility to advance the discussion on it.

vdukhovni · 2021-02-22T18:08:23Z

crypto/x509/x509_vfy.c

Previously exact match of untrusted certificates was only for the EE certificate, allowing one to specify the EE cert as a trust anchor (typically a non-default CAfile with just that cert) for the connection in question.

Now we appear to be doing something similar for intermediate CAs, but why???

As I wrote in the description of this PR:

it makes little sense to try finding an issuer of the current end of the chain if this end can be found immediately in the trust store (and is actually trusted).

Other than the EE cert, we always know the provenance of each subsequent certificate as we're building the chain, because we added it, either by getting it from the trust store or from the untrusted store (often peer-provided chain). This is tracked by virtue of the "num_untrusted" variable. Therefore, I still fail to see the point (which you're failing to explain precisely in a non hand waving way) of applying the EE-special trade-in for a trust-store equivalent at any other depth in the chain.

You just have to explain this much better.

Good point that for each cert in the chain being constructed, except for the target cert (which BTW does not need to be an EE cert) we already know whether its is from the trust store, and in fact we already call check_trust() on the current cert if it is from the trust store, and stop the loop if its is either trusted or rejected!
This was not clear to me because the whole chain building loop is so spread-out and complicated with detail (which partly could and better should be abstracted away).
So thanks to your comment I meanwhile see that for the certs after the target cert the new check is actually redundant!

Good point that for each cert in the chain being constructed, except for the target cert (which BTW does not need to be an EE cert) we already know whether its is from the trust store, and in fact we already call check_trust() on the current cert if it is from the trust store, and stop the loop if its is either trusted or rejected!
This was not clear to me because the whole chain building loop is so spread-out and complicated with detail (which partly could and better should be abstracted away).
So thanks to your comment I meanwhile see that for the certs after the target cert the new check is actually redundant!

We're making progress! If you limit the new check for just depth 0, with a suitable comment, my objections may be resolved. You're then just handling EE direct match for partial chain early, skipping the chain construction. That's probably OK, if that's what you're after. I can support that change, building a more complete chain seems redundant (the caller asked for partial chains, and has the EE cert in the store, could they possible indicate intent any more clearly???)

Done this, as soon as possible, as follows:

/* * Stop immediately if target cert is directly trusted/rejected. * This improves completeness and prevents inefficiency and spurious * errors, which can occur when it makes no sense to look for an issuer. */ if (num == 1 && (trust = check_trust(ctx, 1)) != X509_TRUST_UNTRUSTED) break;

vdukhovni · 2021-02-22T18:13:03Z

crypto/x509/x509_vfy.c

This used to make the EE cert "trusted", is that now done elsewhere?

Yes, this is now done within check_trust() -
as I wrote in the new preliminary comment above:

It is now covered by the generalized check_trust() called above, which checks direct trust for current cert (at index num_trusted - 1)

vdukhovni · 2021-02-22T20:45:37Z

crypto/x509/x509_vfy.c

Please explain exactly which cases this changes from prior behavior...
Is this just intended to handle the EE case of partial chain sooner?

Previous code tried to only do exact EE match as a last resort, building a chain to some trusted issuer if at all possible. I think just in case applications prefer that outcome. This was Stephen Henson's decision, and he's no longer with the project, so there's no certainty as to why, but I think that's plausible.

What's the new rationale? And why did the code have to change?

A pity that Stephen Henson did not document the rationale for that (IMHO strange) behavior.

As stated several times before, the very idea of this PR is to stop chain building as soon as possible, making the resulting chain as short as possible, which has several advantages:

completeness - the new test case that would not have succeeded before:
"accept ee-cert2 although deceptively matching also first cert in ee-certs"

IMO more security because only a minimal number of certs are involved in the chain, which less chances of errors/bug/attacks in between.

efficiency

sometimes it simply does not make sense to try and find an issuer first, and doing so can cause needless spurious errors

crypto/x509/x509_vfy.c

vdukhovni · 2021-02-22T20:59:16Z

crypto/x509/x509_vfy.c

Does this mean we can't as easily go back to not trusted-first, just by changing the flags at the top of the function?
I guess going back to "untrusted first" is not likely to happen...

I also think that going back to "untrusted first" is not likely to happen,
but if we wanna keep this option we can remove the tentative #if 0 and #endif,
which I used to verify/demonstrate that it is not necessary (anymore).

crypto/x509/x509_vfy.c

DDvO · 2021-02-23T09:05:02Z

Rebased to fix a merge conflict.
A few further adaptations and TODOs added.

mkris86

Geezzeee this is hard

vdukhovni · 2021-03-04T02:19:23Z

crypto/x509/x509_vfy.c

Didn't we finally agree that this is not quite right? Was that in a separate PR that overlaps with this one?

Right. It was in this PR, namely in #13748 (comment) - please place any further comments on that specific topic there.
~~I just haven't found the time yet to adapt this PR accordingly.~~

What you could do in the meantime is to answer to my latest reactions to your comments in #13735. This has been waiting for your further input since end of January,
and would be great if we can finalize that soon.

Update: @vdukhovni, I've just done that as discussed above.

vdukhovni · 2021-03-19T03:56:44Z

I think we need to set up a voice call and go through all the outstanding open questions in the multiple X509 PRs. There are just too many places to find all the unresolved questions. I've lost track of where to look.

My timezone is GMT-0400. I can do 9AM to 10AM some morning, pick a day and send me a conference link, along with a list of URLs for the relevant PRs and specific unresolved comments inside those PRs.

Stop chain building immediately if target cert is directly trusted/rejected. This is required because it not always makes sense to try and find an issuer cert. This allows test case "accept ee-cert although deceptively matching also first cert in ee-certs" succeed and prevents spurious errors for instance in test case "accept last-resort direct leaf match Ed25519-signed self-issued cert". This also allows getting rid of exceptional cases in find_issuer() and build_chain(). Also simplify internal_verify() and refactor the invocation of check_dane_issuer().

t8m · 2024-07-03T13:45:59Z

Could you please rebase the PR to resolve conflicts?

DDvO · 2026-01-30T12:38:22Z

@vdukhovni, @t8m, unfortunately, I won't have the time to continue working on this.
Can someone take over, or shall I close this PR, or how else to handle this?

DDvO · 2026-02-18T07:03:35Z

Closing for the time being.

DDvO added the approval: otc review pending label Dec 29, 2020

DDvO changed the title ~~x509_vfy.c: Make chain building more intuitive and complete~~ x509_vfy.c: Make chain building more intuitive, flexible, and complete Dec 29, 2020

This was referenced Dec 29, 2020

x509_vfy.c: Fix a regression in find_issuer() for v1.1.1 #13749

Closed

Regression in X509_verify_cert #13739

Closed

DDvO added the triaged: bug The issue/pr is/fixes a bug label Dec 29, 2020

DDvO added this to the 3.0.0 beta1 milestone Dec 29, 2020

DDvO mentioned this pull request Dec 30, 2020

Improve the documentation of cert path building and validation #13735

Closed

2 tasks

richsalz mentioned this pull request Jan 2, 2021

Why is X509_V_FLAG_PARTIAL_CHAIN flag required, should it be default? #7871

Closed

DDvO modified the milestones: 3.0.0 beta1, Post 3.0.0 Jan 2, 2021

DDvO removed approval: otc review pending triaged: bug The issue/pr is/fixes a bug labels Jan 2, 2021

DDvO changed the title ~~x509_vfy.c: Make chain building more intuitive, flexible, and complete~~ WIP: x509_vfy.c: Make chain building more intuitive, flexible, and complete Jan 2, 2021

DDvO force-pushed the chain_building_completeness branch 2 times, most recently from 296668a to 2511fb9 Compare January 4, 2021 15:41

DDvO removed this from the Post 3.0.0 milestone Jan 4, 2021

DDvO mentioned this pull request Feb 8, 2021

Improve ossl_cmp_build_cert_chain() and export as X509_build_chain() #14128

Closed

2 tasks

DDvO force-pushed the chain_building_completeness branch from 8c48e5f to 65b90fe Compare February 8, 2021 14:26

DDvO mentioned this pull request Feb 8, 2021

X509_STORE_CTX_get1_issuer(): Make preference w.r.t. expired certs consistent with find_issuer() #14130

Closed

DDvO force-pushed the chain_building_completeness branch from 1a1bfe3 to e6c093c Compare February 9, 2021 14:01

vdukhovni reviewed Feb 22, 2021

View reviewed changes

DDvO force-pushed the chain_building_completeness branch from e6c093c to 2c5bc52 Compare February 23, 2021 09:04

mkris86 approved these changes Mar 4, 2021

View reviewed changes

vdukhovni reviewed Mar 4, 2021

View reviewed changes

DDvO force-pushed the chain_building_completeness branch from 2c5bc52 to 9b44459 Compare March 5, 2021 18:22

DDvO changed the title ~~WIP: x509_vfy.c: Make chain building more intuitive, flexible, and complete~~ x509_vfy.c: Make chain building more intuitive, flexible, and complete Mar 5, 2021

DDvO force-pushed the chain_building_completeness branch from 9b44459 to cf695c7 Compare April 28, 2021 19:05

t8m added triaged: feature The issue/pr requests/adds a feature triaged: refactor The issue/pr requests/implements refactoring labels Aug 9, 2021

DDvO modified the milestones: Post 3.0.0, 4.0.0 Apr 28, 2023

t8m assigned vdukhovni Jul 3, 2024

DDvO added the help wanted label Jan 30, 2026

DDvO removed this from the 4.0.0 milestone Feb 14, 2026

DDvO closed this Feb 18, 2026

Uh oh!

Comments

Conversation

DDvO commented Dec 29, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Checklist

Uh oh!

richsalz commented Dec 29, 2020

Uh oh!

DDvO commented Jan 1, 2021

Uh oh!

DDvO commented Jan 1, 2021

Uh oh!

richsalz commented Jan 1, 2021

Uh oh!

richsalz commented Jan 1, 2021

Uh oh!

kroeckx commented Jan 1, 2021 via email

Uh oh!

richsalz commented Jan 2, 2021

Uh oh!

kroeckx commented Jan 2, 2021

Uh oh!

beldmit commented Jan 2, 2021

Uh oh!

DDvO commented Jan 2, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DDvO commented Jan 4, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DDvO commented Jan 4, 2021

Uh oh!

DDvO commented Jan 4, 2021

Uh oh!

richsalz commented Jan 4, 2021

Uh oh!

vdukhovni commented Jan 24, 2021

Uh oh!

DDvO commented Feb 1, 2021

Uh oh!

DDvO commented Feb 9, 2021

Uh oh!

richsalz commented Feb 9, 2021

Uh oh!

DDvO commented Feb 9, 2021

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

DDvO Feb 23, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

vdukhovni Feb 22, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

DDvO Feb 23, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

DDvO Feb 23, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

DDvO commented Dec 29, 2020 •

edited

Loading

DDvO commented Jan 2, 2021 •

edited

Loading

DDvO commented Jan 4, 2021 •

edited

Loading

DDvO Feb 23, 2021 •

edited

Loading

vdukhovni Feb 22, 2021 •

edited

Loading

DDvO Feb 23, 2021 •

edited

Loading

DDvO Feb 23, 2021 •

edited

Loading

DDvO Mar 4, 2021 •

edited

Loading