-
-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Pull #16628: use SLL prediction mode for fast javadoc parsing to improve performance #16628
Conversation
Interesting!
|
Since the Javadoc grammar contains many ambiguities and many nested rules, it may not be safe to rely on single lookahead parsing. However, if we can rewrite the grammar to reduce nesting and ambiguity, we could significantly improve parsing time by using SLL parsing @nrmancuso @rnveach @romani |
This is a 16% deduction right? Maybe consider running the performance multiple times to confirm it is a stable deduction. What are the downsides of this mode? I assume they didn't enable it by default for a reason. Just curious if we also seen what this does on the Java side? |
Yes, 100 seconds deduction which is 16%. I restarted CI by pushing again to see if this reduction is stable.
From Javadoc of this option:
As per my understanding SLL mode doesn't handle ambiguities well. If a grammar contains multiple possible parse paths for a given input, SLL mode will fail. and It doesn't backtrack, once it hits an ambiguity, it stops. As it works with a single lookahead and not taking the whole parser context into account
I didn't try this may ne we should try it in another PR but I think that Java can be more complex than Javadoc which might make it require the full LL approach for correct parsing. @nrmancuso can confirm this We can use SLL mode as the default parsing strategy, and if it encounters any issues or fails to parse, we can switch to LL mode. |
Github, generate report |
I believe we need regression without suppressing grammar failures on all our regression projects. We need to know if this change is now increasing grammar failures. Since I believe CI was passing, it seems its not really big concern. |
Github, generate report |
1 similar comment
Github, generate report |
if you think this config is frequently to use you can put it as folder to https://github.com/checkstyle/test-configs and re-use at any time. |
Time decreased by 16% so I reduced the baseline. please see if this is good and update is approved I can do similar approach for java parsing |
We are going to zero mutation phase, some most weird should be covered by pure unit testing, if no reasonable coverage can be provided. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Very good, I tried this when I implemented the Java grammar and it did not work at the time. Maybe some improvements have been made since then, we should at least try it out again.
I’m don’t see any point in holding up this PR over a pitest suppression.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Items
@@ -112,6 +113,7 @@ public ParseStatus parseJavadocAsDetailNode(DetailAST javadocCommentAst) { | |||
|
|||
try { | |||
final JavadocParser javadocParser = createJavadocParser(javadocComment, errorListener); | |||
javadocParser.getInterpreter().setPredictionMode(PredictionMode.SLL); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please move it inside create..... method.
Let try to update pitest to avoid call interpreter
Like
Line 3100 in f123b09
<avoidCallsTo>org.apache.commons.logging</avoidCallsTo> |
We need to find way avoid pitest suppression.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Last resort is approach like
Line 4453 in f123b09
<param>lazyLoad</param> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done with avoidCallsTo and xwiki is unrelated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ok to merge,
thanks a lot
Diff Regression config: https://gist.githubusercontent.com/mahfouz72/33967bf86e4a29cafa429771b0e95808/raw/f621465b3c7dd6e64ac603d396570bf92a1a69d6/check_patch.xml