-
Notifications
You must be signed in to change notification settings - Fork 13.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Modules] Incorrect ODR checks caused by preferred_name
attribute.
#56490
Comments
@llvm/issue-subscribers-clang-modules |
I guess I located the problem. But it is really hard to handle. The problem here is that when the compiler reads Although due to the design of ASTReader, it wouldn't fall into the infinite reading loop. But the reading result is not correct somehow. I don't find a good solution to fix this. |
…interface Currently, the use of preferred_name would block implementing std modules in libcxx. See #56490 for example. The problem is pretty hard and it looks like we couldn't solve it in a short time. So we sent this patch as a workaround to avoid blocking us to modularize STL. This is intended to be fixed properly in the future. Reviewed By: erichkeane, aaron.ballman, tahonermann Differential Revision: https://reviews.llvm.org/D130331
Sent a2772fc as a workaround but we still need to fix it. |
…interface Currently, the use of preferred_name would block implementing std modules in libcxx. See llvm/llvm-project#56490 for example. The problem is pretty hard and it looks like we couldn't solve it in a short time. So we sent this patch as a workaround to avoid blocking us to modularize STL. This is intended to be fixed properly in the future. Reviewed By: erichkeane, aaron.ballman, tahonermann Differential Revision: https://reviews.llvm.org/D130331
…interface Currently, the use of preferred_name would block implementing std modules in libcxx. See llvm/llvm-project#56490 for example. The problem is pretty hard and it looks like we couldn't solve it in a short time. So we sent this patch as a workaround to avoid blocking us to modularize STL. This is intended to be fixed properly in the future. Reviewed By: erichkeane, aaron.ballman, tahonermann Differential Revision: https://reviews.llvm.org/D130331
It seems like the problem here is that reading So either reading the |
We would read the local redeclarations for TagDecl. So the first suggestion might not be straight forward. The second suggestion looks promising. |
I did some initial experiments. And it shows it at least not straight forward since we may read some attributes during the deserialization. So if we continue this direction, we may have to divide the attributes as lazy attributes and eager attributes.. |
The same error should be happening for header modules, I'm surprised we're not hitting it at Google with libc++ already. |
Answering my own question here: we're not hitting it at Google because the I've checked that if that actually happens, we also hit the same error with header modules. |
I've tried to compare the deserialisation of A in the versions of code with and without the The primary difference is that the version without the attribute does not visit the typedef decl for During the merge step, the typedef imported from A cannot be found Type of the existing typedef:
Type of the imported typedef:
They seem identical, but pointers are not the same, and so the typedefs aren't merged. In the code version with the attribute, the In the code version with the attribute, the constructor merge step is then triggered again for this constructor decl
which I therefore assume is connected to the typedef merge step. These are probably still all symptoms rather than the actual problem. So far I haven't identified the exact location where deserialization attempts to go on a loop, as described in the previous comments. |
This seems totally expected as
The fact that pointers are different should be the curlprit of the problem. This should be stemming from the fact that if we have the attribute, we need to deserialize the typedef when deserializing the template itself. I suspect we just don't have the information that
is the same as (previously created)
and so we end up with wrong results of merging. From a different standpoint, without the attribute:
With the attribute, the (2) still holds, but now we also need to deserialize the typedef when deserializing the template itself. And I believe it's this new aspect that screws us up. The merging of the type inside typedef fails to canonicalize the type to point to the previously parsed To confirm this theory, I'd recommend looking at:
Intuitively, this recursion coming from the attributes should be avoided and we could probably fix this by postponing the merging of this specific attribute to happen after the deserialization step for the corresponding |
My memory to the problem got vanished slowly. But I think my previous analysis might be true |
Some elaboration on the reason why deserialization of the At some point in the deserialization of the A module in Use.cppm, Visiting a
When the typedef decl is read for the first time, its entry is absent from the Reading the decl record for the typedef first sets it to loaded and then issues the actual visitation (
Visiting this decl triggers the visitation of its attributes, and visiting the This again bubbles down to
Essentially, reading the typedef type is triggered while reading the typedef type in a cycle, but the literal cycle doesn't occur, because the decl is marked as loaded once before visititation, and at the second attempt to visit it the unfinished loaded decl is returned. Then the ClassTemplateSpecialization decl visitation is finished with the result
which is also unfinished because of the unfinished attribute reading. Once typedef visitation is finished, the resulting typedef decl is
After the typedef visitation, an existing typedef is found in the decl context, but the existing typedef and the new typedef above aren't merged, since they are deemed to be different entities. And once the CXXRecordType visitation mentioned in the beginning of the post is finished, the result is
which also seems to be unfinished. |
I've been trying to find the location where the two typedefs (global and imported) should be linked as one (imported) being a redeclaration of the other (global), but this doesn't happen. One idea I had is that they might be linked here in the However, in the case of the imported typedef, reading the Decl ID inside the record returns 0 as a first decl ID, which is invalid, and the conclusion from this redeclarable visitation is that the imported typedef is the only declaration. Reading through the code, it's probably the expected behavior. This makes me think that the behavior is probably expected and |
Continuing on the previous comment, I believe that the linking of imported decls with their global counterparts happens in For this specific snippet, attaching the previous decl is triggered for:
.. but not triggered for the This makes me think that the redeclarable visit for the typedef should have taken a different branch, but I can't tell at the moment why it didn't. |
I don't think that's the place we should be looking at. The redecl chain that we are deserializing should be just one typedef (because there are no redeclarations in the module we are loading). However, I think you've already mentioned that we fail to merge the typedef we are importing either here or here because their types are different, but should be the same. We've also concluded they they are different because the TypeForDecl is assigned before we set the previous link for the redecl chain. Normally, this should not happen and it only happens because this same typedef is being referenced while we try to deserialize the type of the typedef itself. I think that the obvious fix would be to postpone deserializing and setting the Could you try implementing something along those lines and see if that fixes the issue? |
Thanks. I have tried to assert that this indeed holds
Unfortunately, the assertion that the type of In both case, type of the
|
Also tried to assert that the types of
In this case, there's a difference of behavior between the code with attribute and the code without attribute. Adding previous link to
yields different types for
and the assertion triggers. This doesn't happen for the version without the attribute. |
I've looked in a bit more detail into the PendingUpdateRecords. I am not sure this exact mechanism is what we need to fix the problem. The way I understand the mechanism:
As mentioned, I'm not sure what UPDATE_VISIBLE means and why the entire translation unit processing is delayed, but it seems to me that PendingUpdateRecords mechanism is too low-level for the purpose of delaying a single decl like the typedef decl for the preferred_name attribute, since we'd like to defer the processing of the typedef earlier than the binary record processing time. There are some more Pending.. structures involved in AST reading that might be better suited to delay processing at the level of a decl, it might be worth looking into those. |
I agree generally. BTW #121245 may be related. |
The other mechanisms (other than PendingUpdateRecords) are, e.g., PendingDeducedVarTypes or PendingDeclContextInfos. In my understanding, they allow to skip deserialization of a specific Decl or QualType. The way it works:
What's different about the preferred_name attribute:
|
The linked PR delays reading the type source info of the preferred_name attribute. Missing piece:
|
It looks like the fix was reverted in f63e8ed. I think we'd better to re-open the issue. |
I met an incorrect ODR checks caused by
preferred_name
. Here is a reduced example.The input command line is:
The output would be the incorrect ODR warning:
string_view.h:11:5: error: 'basic_string_view<char>::basic_string_view' from module 'A.<global>' is not present in definition of 'string_view' provided earlier
I've been fighting with the problem for several days. My current process is that when the ASTReader reads the
preferred_name
attribute, it would read thestring_view
type and find an existing one. However, the current behavior would create another RecordType forstring_view
despite there is already one. So here is the problem.The text was updated successfully, but these errors were encountered: