• Home
  • Raw
  • Download

Lines Matching +full:posix +full:- +full:character +full:- +full:classes

20 <li><a name="TOC5" href="#SEC5">CHARACTER TYPES</a>
26 <li><a name="TOC11" href="#SEC11">CHARACTER CLASSES</a>
38 <li><a name="TOC23" href="#SEC23">NON-ATOMIC LOOKAROUND ASSERTIONS</a>
54 documentation. This document contains a quick-reference summary of the syntax.
59 \x where x is non-alphanumeric is a literal x
71 is \u{...} which is not Perl-compatible and is recognized only when
80 \a alarm, that is, the BEL character (hex 07)
81 \cx "control-x", where x is a non-control ASCII character
87 \0dd character with octal code 0dd
88 \ddd character with octal code ddd, or backreference
89 \o{ddd..} character with octal code ddd..
90 \N{U+hh..} character with Unicode code point hh.. (Unicode mode only)
91 \xhh character with hex code hh
92 \x{hh..} character with hex code hh..
97 \U the character "U"
98 \uhhhh character with hex code hhhh
99 \u{hh..} character with hex code hh.. but only for EXTRA_ALT_BSUX
110 a non-zero digit is complicated; for details see the section
111 <a href="pcre2pattern.html#digitsafterbackslash">"Non-printing characters"</a>
119 <br><a name="SEC5" href="#TOC1">CHARACTER TYPES</a><br>
122 . any character except newline;
123 in dotall mode, any character whatsoever
126 \D a character that is not a decimal digit
127 \h a horizontal white space character
128 \H a character that is not a horizontal white space character
129 \N a character that is not a newline
130 \p{<i>xx</i>} a character with the <i>xx</i> property
131 \P{<i>xx</i>} a character without the <i>xx</i> property
133 \s a white space character
134 \S a character that is not a white space character
135 \v a vertical white space character
136 \V a character that is not a vertical white space character
137 \w a "word" character
138 \W a "non-word" character
142 of a UTF-8 or UTF-16 character. The application can lock out the use of \C by
147 By default, \d, \s, and \w match only ASCII characters, even in UTF-8 mode
148 or in the 16-bit and 32-bit libraries. However, if locale-specific matching is
150 128-255. If the PCRE2_UCP option is set, the behaviour of these escape
182 Mn Non-spacing mark
214 Xps POSIX space: property Z or tab, NL, VT, FF, CR
216 Xuc Universally-named character: one that can be
217 represented by a Universal Character Name
220 Perl and POSIX space are now the same. Perl added VT to its space character set
229 pcre2test -LP
234 Many script names and their 4-letter abbreviations are recognized in
238 pcre2test -LS
244 \p{Bidi_Class:&#60;class&#62;} matches a character with the given class
245 \p{BC:&#60;class&#62;} matches a character with the given class
247 The recognized classes are:
258 L left-to-right
259 LRE left-to-right embedding
260 LRI left-to-right isolate
261 LRO left-to-right override
262 NSM non-spacing mark
266 R right-to-left
267 RLE right-to-left embedding
268 RLI right-to-left isolate
269 RLO right-to-left override
274 <br><a name="SEC11" href="#TOC1">CHARACTER CLASSES</a><br>
277 [...] positive character class
278 [^...] negative character class
279 [x-y] range (can be used for hex characters)
280 [[:xxx:]] positive POSIX named set
281 [[:^xxx:]] negative POSIX named set
285 ascii 0-127
287 cntrl control character
298 In PCRE2, POSIX character set names recognize only ASCII characters by default,
300 \Q...\E inside a character class.
351 option is set, the previous behaviour is re-enabled. When this option is set,
367 (?:...) non-capture group
368 (?|...) non-capture group; reset group numbers for
371 In non-UTF modes, names may contain underscores and ASCII letters and digits;
378 (?&#62;...) atomic non-capture group
379 (*atomic:...) atomic non-capture group
397 (?aP) restrict all POSIX classes to ASCII in UCP mode
398 (?aT) restrict POSIX digit classes to ASCII in UCP mode
403 (?r) restrict caseless to either ASCII or non-ASCII
406 (?x) ignore white space except in classes or \Q...\E
407 (?xx) as (?x) but also ignore space and tab in classes
408 (?-...) unset the given option(s)
412 means that (?-aP) is really (?-PT) which disables all ASCII restrictions for
413 POSIX classes.
417 mixture of setting and unsetting such as (?i-x) is allowed, but there may be
419 (?^in). An option setting may appear at the start of a non-capture group, for
432 (*NO_AUTO_POSSESS) no auto-possessification (PCRE2_NO_AUTO_POSSESS)
435 (*NO_START_OPT) no start-match optimization (PCRE2_NO_START_OPTIMIZE)
455 (*NUL) the NUL character (binary zero)
486 Each top-level branch of a lookbehind must have a limit for the number of
493 <br><a name="SEC23" href="#TOC1">NON-ATOMIC LOOKAROUND ASSERTIONS</a><br>
495 These assertions are specific to PCRE2 and are not Perl-compatible.
523 \g-n relative reference by number
525 \g{-n} relative reference by number
539 (?-n) call subroutine by relative number
548 \g&#60;-n&#62; call subroutine by relative number (PCRE2 extension)
549 \g'-n' call subroutine by relative number (PCRE2 extension)
555 (?(condition)yes-pattern)
556 (?(condition)yes-pattern|no-pattern)
560 (?(-n) relative reference condition (PCRE2 extension)
589 afterwards. Those that advance the start-of-match point do so only if the
593 (*PRUNE) advance to next starting character
631 Copyright &copy; 1997-2023 University of Cambridge.