ML-Danbooru: Anime image tags detector

Introduction

An anime image tag detector based on modified ML-Decoder. Model trained with cleaned danbooru2021.

Designed a new TResNet-D structure as backbone to enhance the learning of low-level features.
Replace the ReLU in backbone with FReLU.
Using learnable queries for transformer decoder.

Model Structure

Model-Zoo

https://huggingface.co/7eu7d7/ML-Danbooru

Usage

Download the model and run below command:

python demo.py --data <path to image or directory> --model_name tresnet_d --num_of_groups 32 --ckpt <path to ckpt> --thr 0.7 --image_size 640

Keep the image ratio invariant:

python demo.py --data <path to image or directory> --model_name tresnet_d --num_of_groups 32 --ckpt <path to ckpt> --thr 0.7 --image_size 640 --keep_ratio True

ML_CAFormer

python demo_ca.py --data <path to image or directory> --model_name caformer_m36 --ckpt <path to ckpt> --thr 0.7 --image_size 448

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
imgs		imgs
src_files		src_files
LICENSE		LICENSE
README.md		README.md
class.json		class.json
demo.py		demo.py
demo_ca.py		demo_ca.py
requirements.txt		requirements.txt
train_ac.py		train_ac.py
train_ca_ac.py		train_ca_ac.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ML-Danbooru: Anime image tags detector

Introduction

Model Structure

Model-Zoo

Usage

ML_CAFormer

About

Releases

Packages

Languages

License

IrisRainbowNeko/ML-Danbooru

Folders and files

Latest commit

History

Repository files navigation

ML-Danbooru: Anime image tags detector

Introduction

Model Structure

Model-Zoo

Usage

ML_CAFormer

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages