Hello,
if all you did to change the existing to this form was removing spaces and comments, then
yes, that's almost the same regex.
You also need to remove the x modifier, at the very end, that says "extended
regex", ie, regex where spaces and comments are allowed (and ignored).
I think that it's a good idea to have such complex regex to be extended. Comments are
more than welcome here. Don't you think?
If you did another edit during the transformation, I did not spot it...
Mt
----- Le 18 Juil 21, à 18:28, Jean-Christophe Helary lists(a)traduction-libre.org a écrit :
> Hello Martin,
>
> Thank you for the reference...
> Well, I'd love to provide a patch but for that I'd need to understand that
> regex... :(
>
> Basically, the regex is this, right ?
>
> $buffer =~
> /^((?:.*?\n)?(?:[^%]|(?<!\\)(?:\\\\)*\\%)*?)(${RE_ESCAPE}(?:begin|end)\{.*)$/sx
>
> JC
>
>> On Jul 19, 2021, at 0:39, Martin Quinson <martin.quinson(a)ens-rennes.fr>
wrote:
>>
>> Hello Jean-Christophe,
>>
>> this is
https://github.com/mquinson/po4a/issues/37 that was "fixed" in
the past
>> by adding a new line to org.texi.
>> I still fail to understand how to fix it properly, and a patch would be more
>> than welcome here.
>>
>> Thanks, Mt
>>
>> ----- Le 18 Juil 21, à 17:11, Jean-Christophe Helary lists(a)traduction-libre.org
>> a écrit :
>>
>>> When trying to convert org.texi to PO I get the following error:
>>>
>>> Complex regular subexpression recursion limit (32766) exceeded at
>>> /opt/local/lib/perl5/5.26/Locale/Po4a/TeX.pm line 697.
>>>
>>> That happens between line 11180 and 11200 of the file and it does not seem to
be
>>> triggered by any weird code.
>>>
>>> Checking the error on the web, I found that it is possibly related to using *
in
>>> a regex:
>>>
>>>
https://metacpan.org/pod/XML::Easy::Syntax
>>>
>>>> BUGS
>>>>
>>>> Many of these regular expressions are liable to tickle a serious bug in
perl's
>>>> regexp engine. The bug is that the * and + repeat operators don't
always match
>>>> an unlimited number of repeats: in some cases they are limited to 32767
>>>> iterations. Whether this bogus limit applies depends on the complexity of
the
>>>> expression being repeated, whether the string being examined is
internally
>>>> encoded in UTF-8, and the version of perl. In some cases, but not all, a
false
>>>> match failure is preceded by a warning "Complex regular
subexpression recursion
>>>> limit (32766) exceeded".
>>>>
>>>> This bug is present, in various forms, in all perl versions up to at
least 5.8.9
>>>> and 5.10.0. Pre-5.10 perls may also overflow their stack space, in
similar
>>>> circumstances, if a resource limit is imposed.
>>>>
>>>> There is no known feasible workaround for this perl bug. The regular
expressions
>>>> supplied by this module will therefore, unavoidably, fail to accept some
>>>> lengthy valid inputs. Where this occurs, though, it is likely that other
>>>> regular expressions being applied to the same or related input will also
suffer
>>>> the same problem. It is pervasive. Do not rely on this module (or perl)
to
>>>> process long inputs on affected perl versions.
>>>
>>> Line 697 of TeX.pm is:
>>>
>>> # detect \begin and \end (if they are not commented)
>>> → if ($buffer =~ /^((?:.*?\n)? # $1 is
>>> (?:[^%] # either not a %
>>> | # or
>>> (?<!\\)(?:\\\\)*\\%)*? # a % preceded by an odd nb of
\
>>> ) # $2 is a \begin{ with the end of
the line
>>> (${RE_ESCAPE}(?:begin|end)\{.*)$/sx
>>>
>>>
>>> If there could be a way to simplify this regex, maybe the issue would go
away...
>>>
>>> org.texi is the only file in the emacs distribution that chokes on this
regex.
>>>
>>> --
>>> Jean-Christophe Helary @brandelune
>>>
https://mac4translators.blogspot.com
>>>
https://sr.ht/~brandelune/omegat-as-a-book/
>>> _______________________________________________
>>> Devel mailing list -- devel(a)lists.po4a.org
>>> To unsubscribe send an email to devel-leave(a)lists.po4a.org
>> _______________________________________________
>> Devel mailing list -- devel(a)lists.po4a.org
>> To unsubscribe send an email to devel-leave(a)lists.po4a.org
>
> --
> Jean-Christophe Helary @brandelune
>
https://mac4translators.blogspot.com
>
https://sr.ht/~brandelune/omegat-as-a-book/