Merge pull request first20hours#12 from jakebathman/master
Add swear-free lists and lists grouped by word length, courtesy of jakebathman
worldlywisdom authored Oct 21, 2016
2 parents 61de6b9 + 8dfc1d6 commit c39523c
Showing 6 changed files with 29,742 additions and 0 deletions.
22 changes: 22 additions & 0 deletions README.md
@@ -15,6 +15,28 @@ This repo is derived from [Peter Norvig's](http://norvig.com/ngrams/) compilatio

Special thanks to [koseki](https://github.com/koseki) for [de-duplicating the list](https://github.com/first20hours/google-10000-english/issues/6).

Swear-free lists
-----

Two additional lists are identical to the original 10,000-word list, but with swear words removed. They are suited to generating URLs, temporary passwords, or other uses where swear words are undesirable.

Swears were removed based on these lists:

* [reimertz/curse-words](https://github.com/reimertz/curse-words)
* [MauriceButler/badwords](https://github.com/MauriceButler/badwords)
* [LDNOOBW/List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words](https://github.com/LDNOOBW/List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words)
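
The filtering step could look roughly like the sketch below. This is not the script used to produce the lists: the function name `remove_swears` and the sample data are hypothetical, and it assumes a case-insensitive exact match against the combined block lists.

```python
def remove_swears(words, blocklists):
    """Return words in their original order, dropping any found in a block list."""
    # Merge every block list into one lowercase set for fast lookups.
    blocked = {w.strip().lower() for blocklist in blocklists for w in blocklist}
    # The list comprehension keeps the input ordering (frequency, descending).
    return [w for w in words if w.lower() not in blocked]


# Hypothetical sample data standing in for the word list and the block lists.
words = ["the", "of", "darn", "and", "heck", "to"]
blocklists = [["darn"], ["heck", "darn"]]

clean = remove_swears(words, blocklists)
print(clean)
```

Because the block lists are merged into a set, a word that appears in more than one source list is still removed only once, and the relative order of the remaining words is unchanged.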

Word length lists
-----

Three of the lists (all derived from the US English list) are grouped by word length:

* **Short**: 1-4 characters
* **Medium**: 5-8 characters
* **Long**: 9+ characters

Each list retains the sorting of the original list (by frequency, descending).
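
As an illustration of the grouping, here is a minimal sketch that splits a frequency-ordered list into the three length buckets. The function name `split_by_length` and the sample words are hypothetical; only the length boundaries (1-4, 5-8, 9+) come from the description above.

```python
def split_by_length(words):
    """Split words into short (1-4), medium (5-8), and long (9+) groups."""
    groups = {"short": [], "medium": [], "long": []}
    for word in words:
        length = len(word)
        if length <= 4:
            groups["short"].append(word)
        elif length <= 8:
            groups["medium"].append(word)
        else:
            groups["long"].append(word)
    # Appending in input order keeps each group sorted by frequency, descending.
    return groups


# Hypothetical sample, assumed to be already ordered by frequency.
sample = ["the", "information", "about", "a", "business", "available"]
groups = split_by_length(sample)
print(groups)
```

A single pass over the input is enough, since each word falls into exactly one bucket and the order within each bucket follows the order of the source list.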

Usage
-----
