-
-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[java] Resolve explicit types using FQCNs, without hitting the classloader #1207
Comments
Some additionnal thoughts: We wouldn't really need a separate TypeScope object. A more flexible approach would be to enrich the SourceFileScope, which for the moment is next to useless. The point of having a symbol table is to resolve names to the entity they reference. For now, the symbol table only cares about the entities that are defined in the same file, but it should be more general than that, i.e., be able to resolve a reference even though we have no accompanying node. This could later be used as a stepping stone for multifile analysis. The idea of "reference" here would have to be defined precisely, since it's more general than the idea of FQName . E.g. a name could make a reference to a local variable, even though these variables don't have a canonical name. Among other things, classifying names w.r.t. the entity they describe might help improving our symbol table framework, as well as simplify some analysis steps like type resolution. This is described in JLS 6.5, and most of it doesn't even require a ClassLoader but uses only syntactical context information, which suggests that we could set some attribute in the ASTName node directly in the parser. |
@oowekyala I'm not sure what your idea actually is... without hitting the classloader at all, the only names you can unambiguously resolve are:
For everything else, you need to hit the classloader to make sure if a class exists according to the order in which they take precedence:
Notice this last one is counter-intuitive, but finding a FQCN in the code is actually hard to deal with, as it can technically be anything:
Currently, I agree the current type resolution codebase duplicates a lot of this logic (mostly down to the current split between symbol table and type resolution), and that's something worth fixing. I'd also love to see type resolution being more on-demand (ie: actually computing types as needed when asking for a node type) rather than done for everything just in case, but that's probably gonna be harder. |
Yes you're right, that was very optimistic. We will anyway need to rely on the classloader for type resolution.
That is the precedence order at the top of a compilation unit. Inside classes the shadowing order is much more complicated. TypeSet relies only on the classloader, and doesn't account at all for the context the reference is found in, which is crucial. E.g.
The JLS goes in details about when and how shadowing occurs, and how to resolve names. It supports itself with the notion of scope, which TypeSet is oblivious to. On the other end of the spectrum, our current Scope infrastructure only indexes the local AST and is oblivious to things that happen outside of the compilation unit. The framework I present in #1566 aims to unify those approaches. That PR only presents the reference framework, but what I have implemented around SymbolTable already supersedes TypeSet, and is general enough to replace Scope eventually. It's designed with multifile analysis in mind, and it's already lazy --which may allow some to someday resolve types on-demand. I think it's an attractive alternative to our current situation, and it will allow type resolution to concentrate on things like method selection and type inference rather than resolving explicit things. |
I think this is already implemented in PMD 7. Should we add this to the release notes? |
Many types are completely explicit in Java given the info that's present in the source file. These includes types in an
extends
/implements
clause, import statements, method and constructor formal parameter types, wildcard bounds, return types, (non-var
) local variable types, etc. Even though the actual.class
files may not be on our auxclasspath, we can resolve the qualified name of these in most cases, which could enhance the robustness of our type resolution when the auxclasspath is not set.Implementation
The key would be to build some part of our type resolution around TypeQualifiedNames.
The text was updated successfully, but these errors were encountered: