Home
last modified time | relevance | path

Searched refs:UTF (Results 1 – 25 of 880) sorted by relevance

12345678910>>...36

/external/pcre/dist2/testdata/
Dtestoutput14-81 # These test special (mostly error) UTF features of DFA matching. They are a
9 Failed: error -16: UTF-8 error: code points 0xd800-0xdfff are not defined at offset 2
11 Error -36 (bad UTF-8 offset)
15 Failed: error -16: UTF-8 error: code points 0xd800-0xdfff are not defined at offset 2
19 Failed: error -16: UTF-8 error: code points 0xd800-0xdfff are not defined at offset 2
23 Failed: error -16: UTF-8 error: code points 0xd800-0xdfff are not defined at offset 2
27 Failed: error -16: UTF-8 error: code points 0xd800-0xdfff are not defined at offset 2
31 Failed: error -15: UTF-8 error: code points greater than 0x10ffff are not defined at offset 2
33 Failed: error -16: UTF-8 error: code points 0xd800-0xdfff are not defined at offset 2
37 Failed: error -3: UTF-8 error: 1 byte missing at end at offset 1
[all …]
Dtestoutput101 # This set of tests is for UTF-8 support and Unicode property support, with
4 # The next 4 patterns have UTF-8 errors
7 Failed: error -8 at offset 1: UTF-8 error: byte 2 top bits not 0x80
10 Failed: error -3 at offset 0: UTF-8 error: 1 byte missing at end
13 Failed: error -8 at offset 0: UTF-8 error: byte 2 top bits not 0x80
16 Failed: error -22 at offset 2: UTF-8 error: isolated byte with 0x80 bit set
21 \= Expect UTF-8 errors
23 Failed: error -3: UTF-8 error: 1 byte missing at end at offset 1
25 Failed: error -4: UTF-8 error: 2 bytes missing at end at offset 2
27 Failed: error -3: UTF-8 error: 1 byte missing at end at offset 3
[all …]
Dtestoutput14-161 # These test special (mostly error) UTF features of DFA matching. They are a
9 Failed: error -24: UTF-16 error: missing low surrogate at end at offset 2
15 Failed: error -24: UTF-16 error: missing low surrogate at end at offset 2
19 Failed: error -26: UTF-16 error: isolated low surrogate at offset 2
23 Failed: error -26: UTF-16 error: isolated low surrogate at offset 2
27 Failed: error -26: UTF-16 error: isolated low surrogate at offset 2
31 ** Failed: character \x{110000} is greater than 0x10ffff and so cannot be converted to UTF-16
33 Failed: error -25: UTF-16 error: invalid low surrogate at offset 3
Dtestoutput14-321 # These test special (mostly error) UTF features of DFA matching. They are a
9 Failed: error -27: UTF-32 error: code points 0xd800-0xdfff are not defined at offset 2
15 Failed: error -27: UTF-32 error: code points 0xd800-0xdfff are not defined at offset 2
19 Failed: error -27: UTF-32 error: code points 0xd800-0xdfff are not defined at offset 2
23 Failed: error -27: UTF-32 error: code points 0xd800-0xdfff are not defined at offset 2
27 Failed: error -27: UTF-32 error: code points 0xd800-0xdfff are not defined at offset 2
31 Failed: error -28: UTF-32 error: code points greater than 0x10ffff are not defined at offset 2
33 Failed: error -27: UTF-32 error: code points 0xd800-0xdfff are not defined at offset 2
Dtestinput101 # This set of tests is for UTF-8 support and Unicode property support, with
4 # The next 4 patterns have UTF-8 errors
17 \= Expect UTF-8 errors
59 \= Expect UTF-8 errors
65 \= Expect UTF-8 errors
83 \= Expect UTF-8 errors
110 \= Expect UTF-8 errors
117 \= Expect UTF-8 errors
126 \= Expect UTF-8 errors
133 \= Expect UTF-8 errors
[all …]
Dtestinput121 # This set of tests is for UTF-16 and UTF-32 support, including Unicode
106 \= Expect UTF error
123 /(*UTF)\x{11234}/I
126 /(*UTF-32)\x{11234}/
129 /(*UTF-32)\x{112}/
202 \= Expect bad UTF-16 offset, or no match in 32-bit
252 /(*UTF)abc/never_utf
280 # Non-UTF characters should give errors in both 16-bit and 32-bit modes.
346 /(*UTF)ab������z/B
367 # but subjects containing them must not be UTF-checked. These patterns give
Dtestoutput12-161 # This set of tests is for UTF-16 and UTF-32 support, including Unicode
6 ** Failed: invalid UTF-8 string cannot be converted to 16-bit string
10 ** Failed: invalid UTF-8 string cannot be used as input in UTF mode
522 \= Expect UTF error
524 Failed: error -24: UTF-16 error: missing low surrogate at end at offset 2
526 Failed: error -24: UTF-16 error: missing low surrogate at end at offset 2
528 Failed: error -26: UTF-16 error: isolated low surrogate at offset 2
530 Failed: error -26: UTF-16 error: isolated low surrogate at offset 2
532 Failed: error -26: UTF-16 error: isolated low surrogate at offset 2
534 ** Failed: character \x{110000} is greater than 0x10ffff and so cannot be converted to UTF-16
[all …]
/external/unicode/
Dreadme.txt7 UTF-32 to UTF-16
8 UTF-32 to UTF-8
9 UTF-16 to UTF-32
10 UTF-16 to UTF-8
11 UTF-8 to UTF-16
12 UTF-8 to UTF-32
19 between UCS2 (i.e., the BMP characters only) and UTF-7. Surrogates are
37 Version 1.3: Updated UTF-8 legality check;
38 updated to use UNI_MAX_LEGAL_UTF32 in UTF-32 conversions
39 Updated UTF-8 legality tests in harness.c
/external/nanohttpd/nanolets/.settings/
Dorg.eclipse.core.resources.prefs2 encoding//src/main/java=UTF-8
3 encoding//src/main/resources=UTF-8
4 encoding//src/test/java=UTF-8
5 encoding//src/test/resources=UTF-8
6 encoding/<project>=UTF-8
/external/tensorflow/tensorflow/core/api_def/base_api/
Dapi_def_UnicodeTranscode.pbtxt19 by ICU ucnv algorithmic converters. Examples: `"UTF-16", "US ASCII", "UTF-8"`.
26 `"UTF-8", "UTF-16-BE", "UTF-32-BE"`. Multi-byte encodings will be big-endian.
49 Note that for UTF-8, passing a replacement character expressible in 1 byte, such
51 replaced with a 1-byte replacement. For UTF-16-BE and UTF-16-LE, any 1 or 2 byte
80 (e.g. if the encoding is UTF-16 and the BOM indicates big-endian), then that
82 is marked with an explicit endianness (e.g. UTF-16-BE), then the BOM is
84 always for UTF-8).
Dapi_def_UnicodeEncode.pbtxt24 Unicode encoding of the output strings. Valid encodings are: `"UTF-8",
25 "UTF-16-BE", and "UTF-32-BE"`.
68 output_encoding = 'UTF-8'
/external/icu/icu4c/source/test/perf/utrie2perf/
Dutrie2perf.bat17 %PERF% CheckFCD -f \temp\udhr\%%f -v -e UTF-8 --passes 3 --iterations 30000
18 rem %PERF% CheckFCDAlwaysGet -f \temp\udhr\%%f -v -e UTF-8 --passes 3 --iterations 30000
19 rem %PERF% CheckFCDUTF8 -f \temp\udhr\%%f -v -e UTF-8 --passes 3 --iterations 30000
20 %PERF% ToNFC -f \temp\udhr\%%f -v -e UTF-8 --passes 3 --iterations 30000
21 %PERF% GetBiDiClass -f \temp\udhr\%%f -v -e UTF-8 --passes 3 --iterations 30000
Dutrie2perf.sh23 $PERF CheckFCD -f ~/udhr/$file -v -e UTF-8 --passes 3 --iterations 30000
26 $PERF ToNFC -f ~/udhr/$file -v -e UTF-8 --passes 3 --iterations 30000
27 $PERF GetBiDiClass -f ~/udhr/$file -v -e UTF-8 --passes 3 --iterations 30000
/external/icu/icu4c/source/test/testdata/
Dconversion.txt56 // Standard UTF-16
57 …{ "UTF-16", :bin{ 00610062 }, "ab", :intvector{ 0,2 }, :int{1}, :int{0}, "",…
58 …{ "UTF-16", :bin{ feff0061 }, "a", :intvector{ 2 }, :int{1}, :int{0}, "",…
59 …{ "UTF-16", :bin{ fffe0061 }, "\u6100", :intvector{ 2 }, :int{1}, :int{0}, "",…
61 …{ "+UTF-16,version=1", :bin{ 00610062 }, "\\x00\\x61b", :intvector{ 0,0,0,0,0,0,0,0,2 }, :int{1}…
62 …{ "+UTF-16,version=1", :bin{ feff0061 }, "a", :intvector{ 2 }, :int{1}, :int{0}, "",…
63 …{ "+UTF-16,version=1", :bin{ fffe0061 }, "\u6100", :intvector{ 2 }, :int{1}, :int{0}, "",…
64 // Standard UTF-16BE
65 …{ "UTF-16BE", :bin{ 00610062 }, "ab", :intvector{ 0,2 }, :int{1}, :int{0}, "",…
66 …{ "UTF-16BE", :bin{ feff0061 }, "\ufeffa", :intvector{ 0,2 }, :int{1}, :int{0}, "",…
[all …]
/external/libxml2/os400/libxmlrpg/
Dencoding.rpgle6 * rfc2044 (UTF-8 and UTF-16) F. Yergeau Alis Technologies
7 * [ISO-10646] UTF-8 and UTF-16 in Annexes
11 * Wesley, Volume 1, 1991, Volume 2, 1992. UTF-8 is
43 * Note that the generic "UTF-16" is not a predefined value. Instead, only
44 * the specific UTF-16LE and UTF-16BE are present.
53 d XML_CHAR_ENCODING_UTF8... UTF-8
55 d XML_CHAR_ENCODING_UTF16LE... UTF-16 little endian
57 d XML_CHAR_ENCODING_UTF16BE... UTF-16 big endian
99 * @out: a pointer to an array of bytes to store the UTF-8 result
105 * it to an UTF-8 block of chars out.
[all …]
/external/jacoco/org.jacoco.core.test/.settings/
Dorg.eclipse.core.resources.prefs2 encoding/src/test/resources=UTF-8
3 encoding/<project>=UTF-8
4 encoding/src=UTF-8
/external/icu/icu4c/source/samples/uciter8/
Dreadme.txt8 8-bit Unicode text leniently, accepting a mix of UTF-8 and CESU-8
10 UTF-8-style macros are defined as well as a UCharIterator.
14 If you wish to use the lenient-UTF/CESU-8 UCharIterator in a context outside of
19 This UCharIterator reads an arbitrary mix of UTF-8 and CESU-8 text.
26 uit_len8.c Lenient-UTF/CESU-8 UCharIterator implementation
27 uit_len8.h Header file with the prototoype for the lenient-UTF/CESU-8 UCharIterator
/external/icu/icu4c/source/data/mappings/
Dconvrtrs.txt142 UTF-8 { IANA* MIME* JAVA* WINDOWS }
143 ibm-1208 { IBM* } # UTF-8 with IBM PUA
144 ibm-1209 { IBM } # UTF-8
145 ibm-5304 { IBM } # Unicode 2.0, UTF-8 with IBM PUA
146 ibm-5305 { IBM } # Unicode 2.0, UTF-8
147 ibm-13496 { IBM } # Unicode 3.0, UTF-8 with IBM PUA
148 ibm-13497 { IBM } # Unicode 3.0, UTF-8
149 ibm-17592 { IBM } # Unicode 4.0, UTF-8 with IBM PUA
150 ibm-17593 { IBM } # Unicode 4.0, UTF-8
157 # The ICU 2.2 UTF-16/32 converters detect and write a BOM.
[all …]
/external/swiftshader/third_party/llvm-7.0/llvm/test/tools/llvm-objdump/X86/
Dmacho-info-plist.test5 .asciz "<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n"
8 # CHECK: <?xml version="1.0" encoding="UTF-8"?>
11 # NOHEADER: <?xml version="1.0" encoding="UTF-8"?>
/external/libprotobuf-mutator/
DREADME.md78 ## UTF-8 strings
79 "proto2" and "proto3" handle invalid UTF-8 strings differently. In both cases
80 string should be UTF-8, however only "proto3" enforces that. So if fuzzer is
82 invalid UTF-8. If it's a "proto3" message type, only valid UTF-8 will be used.
/external/jacoco/org.jacoco.core/.settings/
Dorg.eclipse.core.resources.prefs2 encoding/<project>=UTF-8
3 encoding/src=UTF-8
/external/icu/icu4j/main/classes/collate/.settings/
Dorg.eclipse.core.resources.prefs3 encoding//src/com/ibm/icu/text/AlphabeticIndex.java=UTF-8
4 encoding/<project>=UTF-8
/external/python/cpython3/Doc/c-api/
Dsys.rst111 * ``UTF-8`` on macOS and Android;
112 * ``UTF-8`` if the Python UTF-8 mode is enabled;
141 The function now uses the UTF-8 encoding in the UTF-8 mode.
152 * ``UTF-8`` on macOS and Android;
153 * ``UTF-8`` if the Python UTF-8 mode is enabled;
160 The function uses the UTF-8 encoding in the Python UTF-8 mode.
173 The function now uses the UTF-8 encoding in the UTF-8 mode.
183 The function now supports the UTF-8 mode.
/external/swiftshader/third_party/llvm-7.0/llvm/test/tools/llvm-rc/Inputs/
Dutf8-escape-narrow.rc2 // One can't pass UTF-8 sequences via multiple escaped chars - in narrow
3 // strings in UTF-8 mode, only ASCII chars can be entered via escapes.
/external/curl/tests/data/
Dtest144845 LC_CTYPE=en_US.UTF-8
48 perl -MI18N::Langinfo=langinfo,CODESET -e 'die "Needs a UTF-8 locale" if (lc(langinfo(CODESET())) n…
51 Redirect following to UTF-8 IDN host name

12345678910>>...36