These were the original inputs used for the memchr benchmarks. In theory, they could be replaced with the subtitle text in order to trim down the number of inputs we use in the benchmark suite. The nice thing about the subtitle corpus is that it gives us a translation into Russian and Chinese, which lets us measure our code on non-ASCII text.