Stars
KOMBO: Korean Character Representations Based on the Combination Rules of Subcharacters (Kim et al., ACL Findings 2024)
You like pytorch? You like micrograd? You love tinygrad! ❤️
Yeah, Right, Uh-Huh: A Deep Learning Backchannel Predictor
Repository for the CoRL 2021 paper "Learning Backchanneling Behaviors for a Social Robot via Data Augmentation from Human-Human Conversations"
A Toolkit for ToBI Labeling with Python Data Structures
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Repository containing code for "How to Train BERT with an Academic Budget" paper
Library for Korean morpheme and word vector representation
Improving Korean NLP Tasks with Linguistically Informed Subword Tokenization and Sub-character Decomposition
The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)
한국어 임베딩 (Sentence Embeddings Using Korean Corpora)
빠른 속도와 준수한 정확도를 목표로하는 한국어 띄어쓰기 교정 모델입니다. (It is a Korean spacing correction model that aims for fast speed and moderate accuracy.)