Recursos

Frequency dictionaries

Resource Short description Licence
Absolute and percentile frequencies This resource, a Python dictionary saved in JSON format, contains the absolute and percentile frequencies of the words in (a selection of) the SCAP corpora. The words are stored per part of speech with lemma as the lexical unit. ODC-By
N-gram frequencies This resource, a Python dictionary saved in JSON format, contains the absolute frequencies of n-grams in (a selection of) the SCAP corpora. ODC-By

Word family resources

Resource Short description Licence
Lemma to source This resource, in TSV format, contains a mapping between Spanish lemmas and their "source lemma" (i.e. the "parent" of the lemma in the "word family tree"). ODC-By