• Home
  • Raw
  • Download

Lines Matching full:unicode

45     Hir::literal(hir::Literal::Unicode('a')),
46 Hir::literal(hir::Literal::Unicode('b')),
98 An important feature provided by this crate is its Unicode support. This
100 scripts and Unicode-aware support for the Perl classes `\w`, `\s` and `\d`.
102 Unicode data tables that are substantial in size.
104 A fair number of use cases do not require full Unicode support. For this
105 reason, this crate exposes a number of features to control which Unicode
108 If a regular expression attempts to use a Unicode feature that is not available
111 construct an `Ast` for such a regular expression, since Unicode data is not
119 * **unicode** -
120 Enables all Unicode features. This feature is enabled by default, and will
121 always cover all Unicode features, even if more are added in the future.
122 * **unicode-age** -
124 [Unicode `Age` property](https://www.unicode.org/reports/tr44/tr44-24.html#Character_Age).
126 codepoints first introduced in Unicode 6.0
127 * **unicode-bool** -
128 Provide the data for numerous Unicode boolean properties. The full list
131 * **unicode-case** -
133 …[Unicode's "simple loose matches" specification](https://www.unicode.org/reports/tr18/#Simple_Loos…
134 * **unicode-gencat** -
136 …[Uncode general categories](https://www.unicode.org/reports/tr44/tr44-24.html#General_Category_Val…
139 * **unicode-perl** -
140 Provide the data for supporting the Unicode-aware Perl character classes,
142 Unicode-aware word boundary assertions. Note that if this feature is
144 `unicode-bool` and `unicode-gencat` features are enabled, respectively.
145 * **unicode-script** -
147 [Unicode scripts and script extensions](https://www.unicode.org/reports/tr24/).
150 * **unicode-segment** -
152 [Unicode text segmentation algorithms](https://www.unicode.org/reports/tr29/).
163 pub use crate::unicode::UnicodeWordError;
170 mod unicode; module
215 /// Returns true if and only if the given character is a Unicode word
218 /// A Unicode word character is defined by
219 /// [UTS#18 Annex C](https://unicode.org/reports/tr18/#Compatibility_Properties).
227 /// If the `unicode-perl` feature is not enabled, then this function panics.
232 try_is_word_character(c).expect("unicode-perl feature must be enabled") in is_word_character()
235 /// Returns true if and only if the given character is a Unicode word
238 /// A Unicode word character is defined by
239 /// [UTS#18 Annex C](https://unicode.org/reports/tr18/#Compatibility_Properties).
247 /// If the `unicode-perl` feature is not enabled, then this function always
252 unicode::is_word_character(c) in try_is_word_character()
285 #[cfg(feature = "unicode-perl")]
290 assert!(is_word_character('\u{11011}'), "Brahmi (Unicode 6.0)"); in word_char()
291 assert!(is_word_character('\u{11611}'), "Modi (Unicode 7.0)"); in word_char()
292 assert!(is_word_character('\u{11711}'), "Ahom (Unicode 8.0)"); in word_char()
293 assert!(is_word_character('\u{17828}'), "Tangut (Unicode 9.0)"); in word_char()
294 assert!(is_word_character('\u{1B1B1}'), "Nushu (Unicode 10.0)"); in word_char()
295 assert!(is_word_character('\u{16E40}'), "Medefaidrin (Unicode 11.0)"); in word_char()
302 #[cfg(not(feature = "unicode-perl"))]
308 #[cfg(not(feature = "unicode-perl"))]