WORDS


WORDS is a dataset directory which contains some examples of collections of words.

Licensing:

The computer code and data files described and made available on this web page are distributed under the GNU LGPL license.

Related Data and Programs:

CHAIN_LETTERS, a dataset directory which contains several examples of chain letters.

NGRAMS, a dataset directory which contains information about the observed frequency of "ngrams" (particular sequences of n letters) in English text.

TEXT, a dataset directory which contains actual "texts", such as the Gettysburg Address;

Datasets:

You can go up one level to the DATASETS directory.


Last revised on 07 March 2016.