Lines Matching +full:emoji +full:- +full:regex
13 * [`Hir`](hir/struct.Hir.html) is the high-level intermediate representation
14 ("HIR" or "high-level IR" for short) of a regular expression. It corresponds to
17 executing a regular expression search. Given some high-level IR, it is not
20 the original pattern). To a first approximation, the high-level IR is simple
31 the top-level [`Parser`](struct.Parser.html) type. This `Parser` will first
54 [`regex` crate](https://docs.rs/regex/%2A/regex/#syntax).
93 done automatically in the `regex` crate.
100 scripts and Unicode-aware support for the Perl classes `\w`, `\s` and `\d`.
119 * **unicode** -
122 * **unicode-age** -
124 [Unicode `Age` property](https://www.unicode.org/reports/tr44/tr44-24.html#Character_Age).
127 * **unicode-bool** -
129 is not included here, but contains properties like `Alphabetic`, `Emoji`,
131 * **unicode-case** -
134 * **unicode-gencat** -
136 …[Unicode general categories](https://www.unicode.org/reports/tr44/tr44-24.html#General_Category_Val…
139 * **unicode-perl** -
140 Provide the data for supporting the Unicode-aware Perl character classes,
142 Unicode-aware word boundary assertions. Note that if this feature is
144 `unicode-bool` and `unicode-gencat` features are enabled, respectively.
145 * **unicode-script** -
150 * **unicode-segment** -
178 pub fn escape(text: &str) -> String { in escape()
198 /// Returns true if the given character has significance in a regex.
207 pub fn is_meta_character(c: char) -> bool { in is_meta_character()
210 | '}' | '^' | '$' | '#' | '&' | '-' | '~' => true, in is_meta_character()
227 /// If the `unicode-perl` feature is not enabled, then this function panics.
231 pub fn is_word_character(c: char) -> bool { in is_word_character()
232 try_is_word_character(c).expect("unicode-perl feature must be enabled") in is_word_character()
247 /// If the `unicode-perl` feature is not enabled, then this function always
251 ) -> std::result::Result<bool, UnicodeWordError> { in try_is_word_character()
258 /// `[_0-9a-zA-Z]`.
259 pub fn is_word_byte(c: u8) -> bool { in is_word_byte()
273 escape(r"\.+*?()|[]{}^$#&-~"), in escape_meta()
274 r"\\\.\+\*\?\(\)\|\[\]\{\}\^\$\#\&\-\~".to_string() in escape_meta()
281 assert!(!is_word_byte(b'-')); in word_byte()
285 #[cfg(feature = "unicode-perl")]
288 assert!(is_word_character('à'), "Latin-1"); in word_char()
296 assert!(!is_word_character('-')); in word_char()
302 #[cfg(not(feature = "unicode-perl"))]
308 #[cfg(not(feature = "unicode-perl"))]
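
The matched fragments of `escape()`, `is_meta_character()` and `is_word_byte()` suggest how these helpers behave. The following is a minimal standalone sketch, not the crate's actual source: the function names mirror the listing, but the bodies are assumptions reconstructed from the visible fragments. The meta character set is taken from the input string in `escape_meta()`, and the word-byte class from the `[_0-9a-zA-Z]` doc comment.

```rust
// Standalone sketch. The names mirror the listing above, but the bodies are
// reconstructed from the visible fragments and are not the crate's source.

/// Returns true for the characters exercised by `escape_meta()` in the listing.
fn is_meta_character(c: char) -> bool {
    matches!(
        c,
        '\\' | '.' | '+' | '*' | '?' | '(' | ')' | '|' | '[' | ']' | '{' | '}'
            | '^' | '$' | '#' | '&' | '-' | '~'
    )
}

/// Prefixes every meta character in `text` with a backslash.
fn escape(text: &str) -> String {
    let mut quoted = String::with_capacity(text.len());
    for c in text.chars() {
        if is_meta_character(c) {
            quoted.push('\\');
        }
        quoted.push(c);
    }
    quoted
}

/// Returns true for bytes in the ASCII class `[_0-9a-zA-Z]`.
fn is_word_byte(b: u8) -> bool {
    matches!(b, b'_' | b'0'..=b'9' | b'a'..=b'z' | b'A'..=b'Z')
}

fn main() {
    // Same input and expected output as the `escape_meta()` test fragment.
    assert_eq!(
        escape(r"\.+*?()|[]{}^$#&-~"),
        r"\\\.\+\*\?\(\)\|\[\]\{\}\^\$\#\&\-\~"
    );
    // Same check as the `word_byte()` test fragment.
    assert!(!is_word_byte(b'-'));
    assert!(is_word_byte(b'_'));
}
```

The Unicode-aware `is_word_character()` is left out of the sketch because it depends on the data gated behind the `unicode-perl` feature, which cannot be reproduced from the fragments shown here.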