PyKhmerNLP is a library for processing and analyzing Khmer language data. It includes modules for working with addresses, dictionaries, and tokenization. This documentation guides you through the functionality of each module and provides examples to help you get started.
Check out our detailed documentation here
Install from PyPI
pip install pykhmernlp
Install from source
git clone https://github.com/MetythornPenn/pykhmernlp.git
cd pykhmernlp
pip install -e .
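To confirm that an installation succeeded, a minimal check along these lines may help. It is only a sketch: it assumes the package is registered under the distribution name `pykhmernlp` (the name used in the pip command), and the helper `installed_version` is a hypothetical name introduced here, not part of the library.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str = "pykhmernlp") -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent.

    Hypothetical helper for illustration; the default name is assumed to
    match the distribution installed by `pip install pykhmernlp`.
    """
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        # The distribution is not installed in the current environment.
        return None


if __name__ == "__main__":
    version = installed_version()
    if version is None:
        print("pykhmernlp is not installed in this environment")
    else:
        print(f"pykhmernlp {version} is installed")
```

The check reads package metadata only, so it does not import the library or load any of its data files.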
- Corpus
- Khmer words
- English words
- Khmer to Khmer Dictionary
- English to English Dictionary
- Khmer Address
- Tokenizer
- Pronounce
- Tha
This library wraps around other awesome Khmer libraries. Without these other libraries, this library wouldn't exist.
Libraries:
- khmercut: from seanghay
- khmerpronounce: from seanghay
- tha: from seanghay
Datasets:
- khmer words: from unicode-org/icu