Skip to content

Latest commit

 

History

History
47 lines (31 loc) · 1.2 KB

README.md

File metadata and controls

47 lines (31 loc) · 1.2 KB

rhyme-es

Find rhyming words in Spanish.

Resources

Already prepared in the repo, but in case I forget:

Frequency list: RAE Corpus, ./CREA_total.TXT. Note that the list is iso-8859-1 encoded, It has to be converted to UTF8 before use.

Dictionary: hunspell dictionaries from https://github.com/wooorm/dictionaries, extracted to word list with unmunch, ./words.txt.

Usage

# Initialize word frequency map
cargo run --release --bin init
# Start the console app
cargo run --release --bin query
# Start the HTTP API
cargo run --release --bin serve

Features

  • Order by number of syllables
  • Order by frequency
  • Ability to choose dictionary
  • Speed
  • Metric syllables
  • Assonant rhymes
  • homophonous consonants, namely ll and y

And the assorted functionalities that exist in https://buscapalabras.com.ar/rimas.php

Performance

query binary takes 100ms to be ready. Memory footprint stays below 50M.

Used

  • syllabize-es crate, my own dog food.
  • serde with bincode, good stuff.

License

Dunno. The dictionary I used is GPL, but they are just data. Is it OK to use a more permissive license for the code?