Lines Matching +full:posix +full:- +full:character +full:- +full:classes
3 PCRE2 - Perl-compatible regular expressions (revised API)
12 documentation. This document contains a quick-reference summary of the syntax.
18 \ex where x is non-alphanumeric is a literal x
32 is \eu{...} which is not Perl-compatible and is recognized only when
43 \ea alarm, that is, the BEL character (hex 07)
44 \ecx "control-x", where x is a non-control ASCII character
50 \e0dd character with octal code 0dd
51 \eddd character with octal code ddd, or backreference
52 \eo{ddd..} character with octal code ddd..
53 \eN{U+hh..} character with Unicode code point hh.. (Unicode mode only)
54 \exhh character with hex code hh
55 \ex{hh..} character with hex code hh..
60 \eU the character "U"
61 \euhhhh character with hex code hhhh
62 \eu{hh..} character with hex code hh.. but only for EXTRA_ALT_BSUX
72 a non-zero digit is complicated; for details see the section
75 "Non-printing characters"
87 .SH "CHARACTER TYPES"
90 . any character except newline;
91 in dotall mode, any character whatsoever
94 \eD a character that is not a decimal digit
95 \eh a horizontal white space character
96 \eH a character that is not a horizontal white space character
97 \eN a character that is not a newline
98 \ep{\fIxx\fP} a character with the \fIxx\fP property
99 \eP{\fIxx\fP} a character without the \fIxx\fP property
101 \es a white space character
102 \eS a character that is not a white space character
103 \ev a vertical white space character
104 \eV a character that is not a vertical white space character
105 \ew a "word" character
106 \eW a "non-word" character
110 of a UTF-8 or UTF-16 character. The application can lock out the use of \eC by
114 By default, \ed, \es, and \ew match only ASCII characters, even in UTF-8 mode
115 or in the 16-bit and 32-bit libraries. However, if locale-specific matching is
117 128-255. If the PCRE2_UCP option is set, the behaviour of these escape
149 Mn Non-spacing mark
181 Xps POSIX space: property Z or tab, NL, VT, FF, CR
183 Xuc Universally-named character: one that can be
184 represented by a Universal Character Name
187 Perl and POSIX space are now the same. Perl added VT to its space character set
198 pcre2test -LP
205 Many script names and their 4-letter abbreviations are recognized in
209 pcre2test -LS
216 \ep{Bidi_Class:<class>} matches a character with the given class
217 \ep{BC:<class>} matches a character with the given class
219 The recognized classes are:
230 L left-to-right
231 LRE left-to-right embedding
232 LRI left-to-right isolate
233 LRO left-to-right override
234 NSM non-spacing mark
238 R right-to-left
239 RLE right-to-left embedding
240 RLI right-to-left isolate
241 RLO right-to-left override
246 .SH "CHARACTER CLASSES"
249 [...] positive character class
250 [^...] negative character class
251 [x-y] range (can be used for hex characters)
252 [[:xxx:]] positive POSIX named set
253 [[:^xxx:]] negative POSIX named set
257 ascii 0-127
259 cntrl control character
270 In PCRE2, POSIX character set names recognize only ASCII characters by default,
272 \eQ...\eE inside a character class.
324 option is set, the previous behaviour is re-enabled. When this option is set,
341 (?:...) non-capture group
342 (?|...) non-capture group; reset group numbers for
345 In non-UTF modes, names may contain underscores and ASCII letters and digits;
353 (?>...) atomic non-capture group
354 (*atomic:...) atomic non-capture group
372 (?aP) restrict all POSIX classes to ASCII in UCP mode
373 (?aT) restrict POSIX digit classes to ASCII in UCP mode
378 (?r) restrict caseless to either ASCII or non-ASCII
381 (?x) ignore white space except in classes or \eQ...\eE
382 (?xx) as (?x) but also ignore space and tab in classes
383 (?-...) unset the given option(s)
387 means that (?-aP) is really (?-PT) which disables all ASCII restrictions for
388 POSIX classes.
391 mixture of setting and unsetting such as (?i-x) is allowed, but there may be
393 (?^in). An option setting may appear at the start of a non-capture group, for
405 (*NO_AUTO_POSSESS) no auto-possessification (PCRE2_NO_AUTO_POSSESS)
408 (*NO_START_OPT) no start-match optimization (PCRE2_NO_START_OPTIMIZE)
430 (*NUL) the NUL character (binary zero)
462 Each top-level branch of a lookbehind must have a limit for the number of
470 .SH "NON-ATOMIC LOOKAROUND ASSERTIONS"
473 These assertions are specific to PCRE2 and are not Perl-compatible.
501 \eg-n relative reference by number
503 \eg{-n} relative reference by number
517 (?-n) call subroutine by relative number
526 \eg<-n> call subroutine by relative number (PCRE2 extension)
527 \eg'-n' call subroutine by relative number (PCRE2 extension)
533 (?(condition)yes-pattern)
534 (?(condition)yes-pattern|no-pattern)
538 (?(-n) relative reference condition (PCRE2 extension)
569 afterwards. Those that advance the start-of-match point do so only if the
573 (*PRUNE) advance to next starting character
617 Copyright (c) 1997-2023 University of Cambridge.