Text


TEXT is a dataset directory which contains some texts.

Licensing:

The computer code and data files described and made available on this web page are distributed under the GNU LGPL license.

Related Data and Programs:

CHAIN_LETTERS, a dataset directory which contains several examples of chain letters.

GERMAN, a dataset directory which contains some short German texts;

NGRAMS, a dataset directory which contains information about the observed frequency of "ngrams" (particular sequences of n letters) in English text.

WORDS, a dataset directory which contains lists of words;

Datasets:

You can go up one level to the DATASETS directory.


Last revised on 25 September 2017.