npm install huge-word-list
using the efrt word compression library, this library packs more words into a very small space.
Most human vocabulary size estimates range from ~30k words.
This library is able to compress that into ~45k
, which means it can be easily used on the client-side.
the wordlist is generated by using the highly-varied texts in nlp-corpus
some editorial decisions:
- case insensitive
- no quotations or punctuation or numbers
- accepts unicode letters but not only-unicode words
- must be used at least twice
See also:
- word-list by sindresorhus
- nlp-compromise
MIT