-
-
Notifications
You must be signed in to change notification settings - Fork 913
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
preg_match missing array shape for pattern #11323
Comments
@mvorisek could you have a look? |
Note that both |
I can help with grammar, but my time is very limited and creating unit tests would help me - phpstan/phpstan-src#2589 (comment) |
@mvorisek put the contents of https://phpstan.org/r/3a6f6eef-9ecc-4304-b707-b9a688d7f48c I can build a proper unit test when the fix is made |
I will look into the hoa regex grammar fixes. Is |
Seems like without P it works yes https://phpstan.org/r/89d1bf05-2714-48fa-8274-0ee0dde90218 |
here is another example (which also uses |
here are more examples, but I don't know whether these are related to hoa parsing |
@mvorisek thanks for looking into it. the suggested patch leads to errors (regressions), see phpstan/phpstan-src#3228 |
@mvorisek I might have spoken to fast and needs another look from my side. I mixed something up |
Found another grammar issue @mvorisek https://phpstan.org/r/94070d37-7a16-4487-9098-f2a5677dfd43 The issue is that See https://3v4l.org/m0Vrv for runtime proof :) |
I will look into that. |
@staabm I think it might be fixable by changing this line https://github.com/hoaproject/Regex/blob/master/Source/Grammar.pp#L168 to this:
|
Would https://github.com/hoaproject/Regex/blob/master/Source/Grammar.pp#L76C9-L76C23 be matched then too, ie. It would be good to run/parse all https://github.com/PCRE2Project/pcre2/tree/master/testdata testcases to verify all PCRE2 supported grammars are supported. |
@Seldaek After the latest push in 1.12.x, PHPStan now reports different result with your code snippet: @@ @@
9: Dumped type: array<string>
-13: Dumped type: array{0: string, name: string, 1: string, email?: string, 2?: string}
+13: Dumped type: array<string>
18: Dumped type: array<string>
22: Dumped type: array{string} Full report
|
From PCRE:
Everything else should be just parsed as literal in there. So probably the better option is to add a new token to parse all the leftover characters:
Then again change line 168 to include that as last resort:
I am not super familiar with the syntax here so hopefully there is no mistake, but I think if this passes tests and it can parse |
BTW just to test the pattern is valid https://regex101.com/r/9SxE17/1 |
As noted above, https://regex101.com/r/pOq5Jj/1 should be parsed as inner class, otherwise we might be limited in the future when we will analyse classes (like numerical string). |
Where did you note this above? And well, right now Anyway if you want a fully compliant parser it might be better to go with https://github.com/bkiers/pcre-parser/tree/master/src/main/antlr4/nl/bigo/pcreparser + https://packagist.org/packages/antlr/antlr4-php-runtime or something else than retrofitting everything that's missing onto hoa/regex? But in the meantime I'd still be happy if this additional fix can be applied if it indeed works, as it should solve the last chunk of problem I've encountered so far. |
I have no idea what to make from your comments. If there is a grammar change I should apply please advise and give me a test-sample to verify the change Thank you both |
@staabm what I'd do is add this below https://github.com/hoaproject/Regex/blob/master/Source/Grammar.pp#L45-L49 :
then change https://github.com/hoaproject/Regex/blob/master/Source/Grammar.pp#L168 to:
Or this, I am not sure what the difference between
That should make it so that patterns like If the patch fails maybe you can create a draft PR with instructions how to run it then I can try to figure it out here? |
@Seldaek I have put the suggested change with a unit test into phpstan/phpstan-src#3241 atm it doesn't work though |
@Seldaek After the latest push in 1.12.x, PHPStan now reports different result with your code snippet: @@ @@
10: Expected type array{0: string, currency: string, 1: string}, actual: array<string>
-14: Expected type array{}|array{0: string, currency: string, 1: string}, actual: array<string>
+14: Expected type array{}|array{0: string, currency: string, 1: string}, actual: array<string>
+17: Expected type array{0: string, currency: string, 1: string}, actual: array{0: string, currency: non-empty-string, 1: non-empty-string}
+21: Expected type array{}|array{0: string, currency: string, 1: string}, actual: array{}|array{0: string, currency: non-empty-string, 1: non-empty-string} Full report
|
I think this one is good enough for now |
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs. |
Bug report
for some reason the snippet does not get a proper array shape.
hoa reports
Unexpected token "?" (zero_or_one) at line 1 and column 10:/Price: (?P<currency>£|€)\d+/
while it works at php runtime: https://3v4l.org/rugXe
Code snippet that reproduces the problem
https://phpstan.org/r/3a6f6eef-9ecc-4304-b707-b9a688d7f48c
Expected output
expectations in the snippet
Did PHPStan help you today? Did it make you happy in any way?
No response
The text was updated successfully, but these errors were encountered: