
Lines Matching +full:- +full:- +full:without +full:- +full:perl

13   Long-term ideas (wish list)
30 site. The generated file contains the tables for a 2-stage lookup of Unicode
48 pcre2_chartables.c.non-standard
79 sequence of bytes in the UTF-8 encoding, and vice versa. If its argument is a
80 hex number such as 0x1234, it outputs a list of the equivalent UTF-8 bytes.
81 If its argument is a sequence of concatenated UTF-8 bytes (e.g. 12e188b4) it
82 treats them as a UTF-8 string and outputs the equivalent code points in hex.
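
The two conversions described in lines 79-82 can be sketched in Python as
follows. This is only an illustration, not the utility's own source: the
function names are hypothetical, and the sketch covers only valid Unicode
scalar values (surrogates and out-of-range values raise an error).

```python
def code_point_to_utf8_hex(arg: str) -> str:
    """Return the UTF-8 bytes, as hex text, of a code point such as '0x1234'."""
    code_point = int(arg, 16)  # base 16 accepts an optional 0x prefix
    return chr(code_point).encode("utf-8").hex()


def utf8_hex_to_code_points(arg: str) -> list[str]:
    """Decode concatenated UTF-8 bytes given as hex text such as '12e188b4'."""
    text = bytes.fromhex(arg).decode("utf-8")  # raises on malformed UTF-8
    return [f"{ord(ch):04x}" for ch in text]


if __name__ == "__main__":
    print(code_point_to_utf8_hex("0x1234"))     # e188b4
    print(utf8_hex_to_code_points("12e188b4"))  # ['0012', '1234']
```

In the second call, the byte 12 is a one-byte character (U+0012) and e1 88 b4
is the three-byte encoding of U+1234, so the string decodes to two code
points.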
143 files are CMakeLists.txt and config-cmake.h.in. After making a release, test
146 . Run ./autogen.sh to ensure everything is up-to-date.
154 . Run tests in both 32-bit and 64-bit environments if possible. I can no longer
155 run 32-bit tests.
158 make use of -fsanitize=address and friends where possible. For gcc,
159 -fsanitize=undefined -std=gnu99 picks up undefined behaviour at runtime, but
160 needs -fno-sanitize=shift to get rid of warnings for shifts of negative
161 numbers in the JIT compiler. For clang, -fsanitize=address,undefined,integer
162 can be used but -fno-sanitize=alignment,shift,unsigned-integer-overflow must
164 -fsanitize=signed-integer-overflow
168 still works: [c]cmake -DCMAKE_UNITY_BUILD=ON sets up a unity build.
172 each test. Sometimes there are other differences in test 4 if PCRE2 and Perl
173 are using different Unicode releases. The other tests are not Perl-compatible
174 (they use various PCRE2-specific features or options).
180 NEWS (check version and date), NON-AUTOTOOLS-BUILD, and README. Many of these
185 compiler as a change from gcc. Adding -xarch=v9 to the cc options does a
186 64-bit test, but it also needs -S 64 for pcre2test to increase the stack size
224 1. Programs using the previous version may use the new version as a drop-in
229 2. Programs using the previous version may use the new version as a drop-in
249 and the zipball. I then sign these files. Double-check with "git status" that
250 the repository is fully up-to-date, then create a new tag and a release on
269 matching. Most of them try to save work by recognizing a non-match without
272 * /((A{0,5}){0,5}){0,5}(something complex)/ on a non-matching string is very
273 slow, though Perl is fast. Can we speed up somehow? Convert to {0,125}?
274 OTOH, this is pathological - the user could easily fix it.
279 * "Ends with literal string" - note that a single character doesn't gain much
285 * A required code unit from alternatives - not just the last unit, but an
292 setting the bits for 0-9 and all values >= xC0 (in 8-bit mode) when a
295 . If Perl gets to a consistent state over the settings of capturing sub-
298 leaves $2 set. In Perl, it's unset. Changing this in PCRE2 will be very hard
305 . A (non-Unix) user wanted pcregrep options to (a) list a file name just once,
307 support --outputfile=name.
316 . pcre2grep: add -rs for a sorted recurse. Having to store file names and sort
319 . Someone suggested --disable-callout to save code space when callouts are
336 non-thread-safe patch showed that this can help performance for patterns
337 where there are many alternatives. However, a simple thread-safe
344 perhaps be used as a way round this problem. However, note that Perl does not
345 distinguish: like PCRE2, a name is just an alias for a number in Perl.
352 . If Perl ever supports the POSIX notation [[.something.]] PCRE2 should try
360 . Perl supports [\N{x}-\N{y}] as a Unicode range, even in EBCDIC. PCRE2
363 . Unicode stuff from Perl:
374 . Allow callouts to return *PRUNE, *COMMIT, *THEN, *SKIP, with and without
375 continuing (that is, with and without an implied *FAIL). A new option,
387 . Perl's /a modifier sets Unicode, but restricts \d etc to ASCII characters,
389 Perl also has /aa, which in addition, disables ASCII/non-ASCII caseless
393 . There is more that could be done to the oss-fuzz setup (needs some research).
403 the line itself. For example, --before-context=-1 would list the line *after*
404 each matched line, without showing the matched line. The problem here is what
406 flag to disable showing matched lines, only valid with either -A or -B?
409 general PCRE2GREP_OPT, but only for some options - not file names or patterns.
416 . If a function could be written to find 3-character (or other length) fixed
418 pre-searching of large datasets could be implemented.
420 . If pcre2grep had --first-line (match only in the first line) it could be
421 efficiently used to find files "starting with xxx". What about --last-line?
423 start of a file. I am not keen - this is the job of "head".
426 the start-of-match optimizations, or by running the match engine. Easy enough
435 re-defining \w (and therefore \W, \b, and \B). An in-pattern sequence such as
439 amount of stack would do - how deep would anyone want to nest these things?
443 . A user suggested something like --with-build-info to set a build information
453 . There are regular requests for variable-length lookbehinds.