Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[apex] Apexlink POC #2830

Merged
merged 25 commits into from
Apr 16, 2021
Merged

[apex] Apexlink POC #2830

merged 25 commits into from
Apr 16, 2021

Conversation

nawforce
Copy link
Contributor

@nawforce nawforce commented Oct 13, 2020

Describe the PR

This is a POC for using apexlink to provide unused method analysis.

To control the global analysis I have added the flag -multifileanalysisdirectory which should be set to the root directory of the Salesforce metadata, where sfdx-project.json resides. The POC code will fail on non-SFDX projects currently but could be fixed to handle them.

If the flag is set then the apexlink analysis runs when the first class file is parsed. On large projects this analysis can take some time so this isn't a good place for it but was OK for a quick POC.

I have added an AvoidUnusedMethodRule as an example, this peeks into the 'Issues' reported by the apexlink analysis and adds a violation if there is a match. The handling here could be made more efficient, it iterates all Issues for the class on each method, but again this was sufficient to prove the approach was viable.

I have put a few extra comments in ApexMultifileAnalysis.java that will hopefully help with understanding what is happening.

Related issues

@nawforce nawforce changed the title Apexlink Apexlink POC Oct 13, 2020
@rsoesemann
Copy link
Member

@oowekyala@adangel I guess with Multifile rules we cannot use the XML based rule testing of PMD anymore. Or are there examples for tests that require multiple files in other languages?

@rsoesemann rsoesemann requested a review from oowekyala October 13, 2020 20:38
@rsoesemann rsoesemann added the a:new-rule Proposal to add a new built-in rule label Oct 13, 2020
@rsoesemann rsoesemann requested a review from adangel October 13, 2020 20:39
@oowekyala oowekyala changed the title Apexlink POC [apex] Apexlink POC Oct 13, 2020
@oowekyala oowekyala added this to the 7.0.0 milestone Oct 13, 2020
Copy link
Member

@oowekyala oowekyala left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks good!

A couple of changes can be made when #2518 is implemented:

  • The new CLI option should be converted to a language property
  • The ApexMultifileAnalysis class should be part of the language state

Other than that I have very little to nitpick on. Thanks for the good work!

Copy link
Member

@adangel adangel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the PoC, looks promising!

@@ -75,6 +75,10 @@
description = "Specifies the string that marks a line which PMD should ignore; default is NOPMD.")
private String suppressmarker = "NOPMD";

@Parameter(names = "-multifileanalysisdirectory",
description = "Specifies a directory that contains sources the may be used in a multi-file analysis; default is empty.")
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This option is language dependent, I agree with @oowekyala that this would be a fit for language properties.

So that I understand, how this works: The directory, you specify here, contains the file "sfdx-project.json". Where does this file come from? Is it generated by an execution of ApexLink before executing PMD? Is this directory a different directory than you would specify using "-dir" (the "Root directory for sources.")? I'm unfamiliar with the structure of Apex projects.

For me, this looks like a property to enable/make use of ApexLink, rather than enabling multifileanalysis... and ApexLink happens to understand already the relationships between the files that are being analyzed (that's what we want with multifile analysis). PMD already has the knowledge where all files a located (-dir), but doesn't take advantage of this, because we analyze file by file and don't build up a global metadata structure like "sfdx-project.json". Hence my question: Who builds this file and when?

Ok, looking at e.g. https://github.com/apex-enterprise-patterns/fflib-apex-common/blob/master/sfdx-project.json this file actually just contains the path to the sources (possibly multiple paths). So, ApexLink then builds the metadata structure, when creating the Org, right?

I assume, if the paths specified by sfdx-project.json and the paths specified by "-dir" don't match, that would be a configuration error and I'm wondering, what we could do to make it easier for users to call PMD. For apex projects, that have a sfdx-project structure (https://developer.salesforce.com/docs/atlas.en-us.sfdx_dev.meta/sfdx_dev/sfdx_dev_source_file_format.htm) we could say, that you need to specify the directory where sfdx-project.json is located via "-dir" and PMD would then add the referenced directories, as if specified manually via "-dir". In that case, we might have instead a parameter to opt-in/opt-out for that feature (apex language property: consider directories specified in sfdx-project.json).

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ok, looking at e.g. https://github.com/apex-enterprise-patterns/fflib-apex-common/blob/master/sfdx-project.json this file actually just contains the path to the sources (possibly multiple paths). So, ApexLink then builds the metadata structure, when creating the Org, right?

Yes, it reads the directories from sfdx-project.json to work out what to include in the analysis. The handling of this used to be built into ApexLink but I recently split into a separate library. There is a bit more to the format than being multiple directories, it's really more like multiple modules with either implicit or explicit dependencies between them. Treating them as modules matters in a multi-file analysis because there are rules around visibility and duplicates etc that you can't handle if you think of them as just the sum of a set of directories. There is also .forceignore which is similar in nature to .gitignore, if present its in the same directory as sfdx-project.json.

The sfdx-project.json also contains a namespace for the project. It's critical this is available for multi-file work as it enables the analysis to understand package boundaries. For single file analysis I would imagine a few rules might find it also handy if it could be made available.

I assume, if the paths specified by sfdx-project.json and the paths specified by "-dir" don't match, that would be a configuration error and I'm wondering, what we could do to make it easier for users to call PMD. For apex projects, that have a sfdx-project structure (https://developer.salesforce.com/docs/atlas.en-us.sfdx_dev.meta/sfdx_dev/sfdx_dev_source_file_format.htm) we could say, that you need to specify the directory where sfdx-project.json is located via "-dir" and PMD would then add the referenced directories, as if specified manually via "-dir". In that case, we might have instead a parameter to opt-in/opt-out for that feature (apex language property: consider directories specified in sfdx-project.json).

Having "-dir" understand sfdx-project.json makes sense to me as a general approach but likely you want to delegate resolving "-dir" to a set of class files for analysis to a Salesforce specific handler that works alongside the existing handling.

The library I have for this might be a bit overweight for what PMD needs as ApexLink is interested in other project files and they can have some quite unusual handling behaviours. The part I have that is probably most valuable to PMD is the forceignore handling which should be straight forward to port to Java from its current Scala implementation if that made sense.

@adangel

This comment was marked as off-topic.

@oowekyala oowekyala mentioned this pull request Oct 18, 2020
4 tasks
@nawforce
Copy link
Contributor Author

Yes, that will need to be considered. As PMD doesn't support multifiles in any language, there are no examples.

I didn't look at testing as part of this work but thought I should mention that I approached this using virtual file system support, the FileSystemHelper.run sets this up. You do get some issues around conversions between Paths & Strings but mostly it works well for me.

  test("Unused method") {
    FileSystemHelper.run(
      Map(
        "Dummy.cls" -> "public class Dummy {public void foo() {}}"
      )
    ) { root: PathLike =>
      val org = Org.newOrg().asInstanceOf[OrgImpl]
      val pkg = org.newMDAPIPackageInternal(None, Seq(root), Seq())
      assert(!org.issues.hasMessages)
      assert(
        pkg.reportUnused().getMessages(root.join("Dummy.cls").toString) == 
          "Unused: line 1 at 32-35: Unused Method 'void foo()'\n")
    }
  }

# Conflicts:
#	pmd-apex/pom.xml
#	pmd-apex/src/main/java/net/sourceforge/pmd/lang/apex/ApexParser.java
#	pmd-apex/src/main/java/net/sourceforge/pmd/lang/apex/ast/ApexParser.java
#	pmd-apex/src/main/java/net/sourceforge/pmd/lang/apex/ast/ApexRootNode.java
#	pmd-core/src/main/java/net/sourceforge/pmd/ant/internal/PMDTaskImpl.java
#	pmd-core/src/main/java/net/sourceforge/pmd/lang/ParserOptions.java
#	pom.xml
@ghost
Copy link

ghost commented Oct 22, 2020

2 Messages
📖 Compared to pmd/7.0.x:
This changeset changes 0 violations,
introduces 0 new violations, 0 new errors and 0 new configuration errors,
removes 0 violations, 0 errors and 0 configuration errors.
Full report
📖 Compared to master:
This changeset changes 9192 violations,
introduces 8327 new violations, 1 new errors and 0 new configuration errors,
removes 15657 violations, 10 errors and 2 configuration errors.
Full report
Compared to pmd/7.0.x:
This changeset changes 0 violations,
introduces 0 new violations, 0 new errors and 0 new configuration errors,
removes 0 violations, 0 errors and 0 configuration errors.
Full report
Compared to master:
This changeset changes 2916 violations,
introduces 8103 new violations, 1 new errors and 0 new configuration errors,
removes 15648 violations, 10 errors and 2 configuration errors.
Full report
Compared to pmd/7.0.x:
This changeset changes 0 violations,
introduces 0 new violations, 0 new errors and 0 new configuration errors,
removes 0 violations, 0 errors and 0 configuration errors.
Full report
Compared to master:
This changeset changes 2916 violations,
introduces 8103 new violations, 1 new errors and 0 new configuration errors,
removes 15648 violations, 10 errors and 2 configuration errors.
Full report
Compared to pmd/7.0.x:
This changeset changes 0 violations,
introduces 0 new violations, 0 new errors and 0 new configuration errors,
removes 0 violations, 0 errors and 0 configuration errors.
Full report
Compared to master:
This changeset changes 2916 violations,
introduces 8103 new violations, 1 new errors and 0 new configuration errors,
removes 15648 violations, 10 errors and 2 configuration errors.
Full report
Compared to pmd/7.0.x:
This changeset changes 0 violations,
introduces 0 new violations, 0 new errors and 0 new configuration errors,
removes 0 violations, 0 errors and 0 configuration errors.
Full report
Compared to master:
This changeset changes 2916 violations,
introduces 8103 new violations, 1 new errors and 0 new configuration errors,
removes 15648 violations, 10 errors and 2 configuration errors.
Full report
Compared to pmd/7.0.x:
This changeset changes 0 violations,
introduces 0 new violations, 0 new errors and 0 new configuration errors,
removes 0 violations, 0 errors and 0 configuration errors.
Full report
Compared to master:
This changeset changes 2919 violations,
introduces 8103 new violations, 1 new errors and 0 new configuration errors,
removes 15645 violations, 10 errors and 2 configuration errors.
Full report
Compared to pmd/7.0.x:
This changeset changes 0 violations,
introduces 0 new violations, 0 new errors and 0 new configuration errors,
removes 0 violations, 0 errors and 0 configuration errors.
Full report
Compared to master:
This changeset changes 2919 violations,
introduces 8103 new violations, 1 new errors and 0 new configuration errors,
removes 15645 violations, 10 errors and 2 configuration errors.
Full report
Compared to pmd/7.0.x:
This changeset changes 0 violations,
introduces 0 new violations, 0 new errors and 0 new configuration errors,
removes 0 violations, 0 errors and 0 configuration errors.
Full report
Compared to master:
This changeset changes 2068 violations,
introduces 6043 new violations, 2 new errors and 0 new configuration errors,
removes 15167 violations, 10 errors and 2 configuration errors.
Full report
No java rules are changed!
Compared to pmd/7.0.x:
This changeset introduces 0 new violations, 0 new errors and 0 new configuration errors,
removes 0 violations, 0 errors and 0 configuration errors.
Full report
Compared to master:
This changeset introduces 791 new violations, 6 new errors and 0 new configuration errors,
removes 11 violations, 16 errors and 2 configuration errors.
Full report
Compared to pmd/7.0.x:
This changeset introduces 0 new violations, 0 new errors and 0 new configuration errors,
removes 0 violations, 0 errors and 0 configuration errors.
Full report
Compared to master:
This changeset introduces 791 new violations, 6 new errors and 0 new configuration errors,
removes 11 violations, 16 errors and 2 configuration errors.
Full report

Generated by 🚫 Danger

Copy link
Member

@adangel adangel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks pretty ready for now. I think, the only part we should add, is a rule test for the new rule AvoidUnusedMethod - otherwise we'll never notice, when it doesn't work anymore.
There are of course some parts to be improved, e.g. the language properties. But that could evolve later.
Wdyt @oowekyala ?

@rsoesemann
Copy link
Member

@rmohan20 and @jbartolotta-sfdc I want to point your attention to this. The addition of ApexLinnk will bring PMD closer to Multifile Analysis and give us a chance to remove a huge source of false positives in other rules. All the existing rules currently report security issues because a Check is delegated to another method or class.

Does it make sense that your team looks closer into this and maybe join forces?

@jbartolotta-sfdc
Copy link
Contributor

@rmohan20 and @jbartolotta-sfdc I want to point your attention to this. The addition of ApexLinnk will bring PMD closer to Multifile Analysis and give us a chance to remove a huge source of false positives in other rules. All the existing rules currently report security issues because a Check is delegated to another method or class.

Does it make sense that your team looks closer into this and maybe join forces?

@rsoesemann Thanks for pointing this out, this looks like a good addition for multi file analysis.

We are currently investigating Dataflow analysis for Apex. We will circle back when we have more information on our planned roadmap and see how this might all fit together.

FYI @rmohan20

@nawforce
Copy link
Contributor Author

nawforce commented Apr 8, 2021

I have moved the apexlink version to latest, there was a bugfix in the caching that could impact the output.

oowekyala added a commit that referenced this pull request Apr 16, 2021
@oowekyala oowekyala merged commit 73f86ce into pmd:pmd/7.0.x Apr 16, 2021
@nawforce
Copy link
Contributor Author

Thanks for all the help getting my rough POC merged. Looking forward to see where we can take this next.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
a:new-rule Proposal to add a new built-in rule
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[apex] Integrate nawforce/ApexLink to build robust Unused rule
5 participants