1This directory contains benchmark corpora. Each sub-directory contains a README 2documenting the corpus a bit more. 3