README.txt
1This directory contains some really large test files, mostly used to
2benchmark various aspects of Expat's performance.
3
4(As files are added, they should be described here, including what
5benchmark program they're intended to be used with and what that
6resulting measurements tell us.)
7
8* nes96.xml (~2.8 MB):
9 - properties: no namespaces, mixed content, average nesting depth
10 - source: http://sda.berkeley.edu:7502/ddi/nes96/
11 (no indication of license or copyright there)
12 - purpose: mostly for performance testing with the benchmark utility
13
14* wordnet_glossary-20010201.xml (~14.4 MB):
15 - properties: namespaces, element content, flat
16 - source: http://www.semanticweb.org/library/wordnet/
17 (license looks Open Source, see license.html file on the same page)
18 - purpose: mostly for performance testing with the benchmark utility
19
20* recset.xml (~29.1 MB):
21 - properties: small portion with namespaces, bulk without, element
22 content, flat
23 - source: test data donated by Karl Waclawek
24 - purpose: mostly for performance testing with the benchmark utility
25
26* ns_att_test.xml (~34.2 MB):
27 - properties: lots of prefixed attributes (28 on average), element
28 content, flat
29 - source: test data donated by Karl Waclawek
30 - purpose: mostly for performance testing with the benchmark
31 utility, specifically for testing the duplicate attribute check in
32 storeAttributes()
33
34* aaaaaa_attr.xml (~10 MB):
35 - properties: trivial file with a huge attribute value
36 - source: generated by a simple shell script
37 - purpose: performance/regression test
38
39* aaaaaa_cdata.xml (~10 MB):
40 - properties: trivial file with huge cdata content
41 - source: generated by a simple shell script
42 - purpose: performance/regression test
43
44* aaaaaa_comment.xml (~10 MB):
45 - properties: trivial file with a huge comment
46 - source: generated by a simple shell script
47 - purpose: performance/regression test
48
49* aaaaaa_tag.xml (~10 MB):
50 - properties: trivial file with a huge tag name
51 - source: generated by a simple shell script
52 - purpose: performance/regression test
53
54* aaaaaa_text.xml (~10 MB):
55 - properties: trivial file with a huge text segment (no newlines)
56 - source: generated by a simple shell script
57 - purpose: performance/regression test
58