Yume (夢)

Yume is a Japanese large language model (LLM) with 1.5 billion parameters, inspired by the work of Andrej Karpathy. It is trained on dialogue from anime and manga and is aimed at generating anime-style dialogue. Future plans include an improved successor, Yuumi, intended as a lightweight LLM for daily tasks.

Features

Large language model for Japanese
Trained on anime and manga dialogues
Configurable with various model sizes
Supports pretraining and fine-tuning
Integrates with Hugging Face for model management
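
To give a feel for how model size relates to configuration, here is a rough parameter-count sketch. It assumes a GPT-2-style decoder with tied input and output embeddings, which this README does not confirm for Yume. The `SizeSketch` class and its field names are hypothetical and are not Yume's `Config`, and the vocabulary and context sizes are GPT-2's, used only for illustration.

```python
from dataclasses import dataclass


@dataclass
class SizeSketch:
    """Hypothetical size settings; not Yume's actual Config fields."""
    n_layer: int
    n_embd: int
    vocab_size: int = 50257   # GPT-2 vocabulary size, assumed for illustration
    block_size: int = 1024    # GPT-2 context length, assumed for illustration


def count_parameters(cfg: SizeSketch) -> int:
    """Count weights and biases of a GPT-2-style decoder with a tied output head."""
    d = cfg.n_embd
    # Token embeddings plus learned position embeddings.
    embeddings = cfg.vocab_size * d + cfg.block_size * d
    # Attention: fused query/key/value projection, then the output projection.
    attention = (d * 3 * d + 3 * d) + (d * d + d)
    # Feed-forward: expand to 4*d, then project back to d.
    mlp = (d * 4 * d + 4 * d) + (4 * d * d + d)
    # Two layer norms per block, each with a scale and a shift vector.
    layer_norms = 2 * (2 * d)
    per_block = attention + mlp + layer_norms
    final_norm = 2 * d
    return embeddings + cfg.n_layer * per_block + final_norm


small = count_parameters(SizeSketch(n_layer=12, n_embd=768))
xl = count_parameters(SizeSketch(n_layer=48, n_embd=1600))
print(f"{small:,}")  # 124,439,808
print(f"{xl:,}")     # 1,557,611,200
```

Under these assumptions, a 48-layer model with an embedding width of 1600 lands at roughly 1.5 billion parameters. Yume's real layer counts, widths, and vocabulary may differ.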

Usage

Sampling Text

You can use Yume to generate text samples. Here's an example:

from yume import Yume
from yume.config import yume_small

# Optional: Create a custom config if needed (Config is importable from yume.config)
# from yume.config import Config
# dummy_config = Config(...)

# Initialize the Yume model with a pre-defined small configuration
yume = Yume(config=yume_small)

# Load pretrained weights from the given repository ('zaibutcooler/yume')
yume.load_pretrained('zaibutcooler/yume')

# Generate a sample with the prompt '犬とは' (What is a dog?)
yume.sample('犬とは')
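
This README does not describe how `sample` chooses each token. As general background, here is a minimal sketch of temperature and top-k sampling, a common way for language models to pick the next token from raw scores. The function name, parameters, and toy scores are illustrative assumptions and are not Yume's API.

```python
import math
import random


def sample_next_token(logits, temperature=1.0, top_k=None, rng=random):
    """Pick one token index from raw scores using temperature and optional top-k."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    if top_k is not None and top_k < 1:
        raise ValueError("top_k must be at least 1")
    # Rank candidate indices from highest to lowest score.
    indices = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)
    # Keep only the k best candidates when top_k is set.
    if top_k is not None:
        indices = indices[:top_k]
    # Softmax numerators over the scaled scores, shifted by the max for stability.
    scaled = [logits[i] / temperature for i in indices]
    peak = max(scaled)
    weights = [math.exp(s - peak) for s in scaled]
    # random.choices normalises the weights internally.
    return rng.choices(indices, weights=weights, k=1)[0]


rng = random.Random(0)               # seeded so runs are repeatable
toy_logits = [2.0, 1.0, 0.1, -1.0]   # made-up scores for a 4-token vocabulary
picked = sample_next_token(toy_logits, temperature=0.8, top_k=2, rng=rng)
greedy = sample_next_token(toy_logits, top_k=1, rng=rng)  # top_k=1 always picks the best score
```

Lower temperatures concentrate probability on the highest-scoring tokens, while `top_k` removes unlikely candidates entirely.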

Training the Model

You can also train Yume on your own dataset. Here's how:

from yume import Yume
from yume.dataset import Trainset
from yume.config import yume_medium, Config

# Initialize the dataset with the desired URL
dataset = Trainset(dataset_url="zaibutcooler/nihon-wiki")

# Build the dataset
dataset.build_dataset()

# Optional: Create a custom config if needed
# dummy_config = Config(...)

# Initialize the Yume model with a pre-defined medium configuration
yume = Yume(config=yume_medium)

# Pretrain the model with the dataset
yume.pretrain(dataset)

# Optional: Fine-tune the model with the dataset
# yume.fine_tune(dataset)

# Optional: Upload the model to Hugging Face
# yume.huggingface_login("your_hf_tokens")
# yume.save_pretrained("zaibutcooler/yume")
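
The commented upload step above passes the token as a string literal. Here is a minimal sketch of one way to keep the token out of source control by reading it from an environment variable instead. The variable name `HF_TOKEN` and the helpers `read_hf_token` and `upload_model` are illustrative choices and are not part of Yume. The sketch calls only `huggingface_login` and `save_pretrained`, the two methods shown above.

```python
import os


def read_hf_token(env_var: str = "HF_TOKEN") -> str:
    """Return the token stored in env_var, or raise if it is missing or blank."""
    token = os.environ.get(env_var, "").strip()
    if not token:
        raise RuntimeError(f"Set the {env_var} environment variable before uploading")
    return token


def upload_model(model, repo_id: str, env_var: str = "HF_TOKEN") -> None:
    """Log in with the token from the environment, then save the model to repo_id."""
    model.huggingface_login(read_hf_token(env_var))
    model.save_pretrained(repo_id)


# Usage with a trained model (requires the yume package):
# upload_model(yume, "zaibutcooler/yume")
```

Failing early on a missing token avoids finding out about an upload problem only after a long training run.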

License

This project is licensed under the MIT License. See the LICENSE file for details.

Acknowledgements

This project is inspired by Andrej Karpathy and utilizes dialogues from various anime and manga sources for training.
