• Home
  • Line#
  • Scopes#
  • Navigate#
  • Raw
  • Download
1[/
2  Copyright 2006-2007 John Maddock.
3  Distributed under the Boost Software License, Version 1.0.
4  (See accompanying file LICENSE_1_0.txt or copy at
5  http://www.boost.org/LICENSE_1_0.txt).
6]
7
8
9[section:collating_names Collating Names]
10
11[section:digraphs Digraphs]
12
13The following are treated as valid digraphs when used as a collating name:
14
15"ae", "Ae", "AE", "ch", "Ch", "CH", "ll", "Ll", "LL", "ss", "Ss", "SS", "nj", "Nj", "NJ", "dz", "Dz", "DZ", "lj", "Lj", "LJ".
16
17So for example the expression:
18
19[pre \[\[.ae.\]-c\] ]
20
21will match any character that collates between the digraph "ae" and the character "c".
22
23[endsect]
24
25[section:posix_symbolic_names POSIX Symbolic Names]
26
27The following symbolic names are recognised as valid collating element names,
28in addition to any single character, this allows you to write for example:
29
30[pre \[\[.left-square-bracket.\]\[.right-square-bracket.\]\]]
31
32if you wanted to match either "\[" or "\]".
33
34[table
35[[Name][Character]]
36[[NUL] 	[\\x00]]
37[[SOH] 	[\\x01]]
38[[STX] 	[\\x02]]
39[[ETX] 	[\\x03]]
40[[EOT] 	[\\x04]]
41[[ENQ] 	[\\x05]]
42[[ACK] 	[\\x06]]
43[[alert] 	[\\x07]]
44[[backspace] 	[\\x08]]
45[[tab] 	[\\t]]
46[[newline] 	[\\n]]
47[[vertical-tab] 	[\\v]]
48[[form-feed] 	[\\f]]
49[[carriage-return] 	[\\r]]
50[[SO] 	[\\xE]]
51[[SI] 	[\\xF]]
52[[DLE] 	[\\x10]]
53[[DC1] 	[\\x11]]
54[[DC2] 	[\\x12]]
55[[DC3] 	[\\x13]]
56[[DC4] 	[\\x14]]
57[[NAK] 	[\\x15]]
58[[SYN] 	[\\x16]]
59[[ETB] 	[\\x17]]
60[[CAN] 	[\\x18]]
61[[EM] 	[\\x19]]
62[[SUB] 	[\\x1A]]
63[[ESC] 	[\\x1B]]
64[[IS4] 	[\\x1C]]
65[[IS3] 	[\\x1D]]
66[[IS2] 	[\\x1E]]
67[[IS1] 	[\\x1F]]
68[[space] 	[\\x20]]
69[[exclamation-mark] 	[!]]
70[[quotation-mark] 	["]]
71[[number-sign] 	[#]]
72[[dollar-sign] 	[$]]
73[[percent-sign] 	[%]]
74[[ampersand] 	[&]]
75[[apostrophe] 	[\']]
76[[left-parenthesis] 	[(]]
77[[right-parenthesis] 	[)]]
78[[asterisk] 	[\*]]
79[[plus-sign] 	[+]]
80[[comma] 	[,]]
81[[hyphen] 	[-]]
82[[period] 	[.]]
83[[slash] 	[ / ]]
84[[zero] 	[0]]
85[[one] 	[1]]
86[[two] 	[2]]
87[[three] 	[3]]
88[[four] 	[4]]
89[[five] 	[5]]
90[[six] 	[6]]
91[[seven] 	[7]]
92[[eight] 	[8]]
93[[nine] 	[9]]
94[[colon] 	[\:]]
95[[semicolon] 	[;]]
96[[less-than-sign] 	[<]]
97[[equals-sign] 	[=]]
98[[greater-than-sign] 	[>]]
99[[question-mark] 	[?]]
100[[commercial-at] 	[@]]
101[[left-square-bracket] 	[\[]]
102[[backslash][\\]]
103[[right-square-bracket][\]]]
104[[circumflex][~]]
105[[underscore][_]]
106[[grave-accent][`]]
107[[left-curly-bracket][{]]
108[[vertical-line][|]]
109[[right-curly-bracket][}]]
110[[tilde][~]]
111[[DEL][\\x7F]]
112]
113
114[endsect]
115
116[section:named_unicode Named Unicode Characters]
117
118When using [link boost_regex.unicode Unicode aware regular expressions] (with the `u32regex` type), all
119the normal symbolic names for Unicode characters (those given in Unidata.txt)
120are recognised.  So for example:
121
122[pre \[\[.CYRILLIC CAPITAL LETTER I.\]\] ]
123
124would match the Unicode character 0x0418.
125
126[endsect]
127[endsect]
128
129