📖 Introduction

We provide the Kolors-IP-Adapter-FaceID-Plus module weights and inference code, built on Kolors-BaseModel. Examples of Kolors-IP-Adapter-FaceID-Plus results are as follows:

Example Results

| Reference Image | Prompt | Result Image |
| :---: | :--- | :---: |
| (image) | 穿着晚礼服,在星光下的晚宴场景中,烛光闪闪,整个场景洋溢着浪漫而奢华的氛围<br>Wearing an evening dress, in a starry night banquet scene, candlelight flickering, the whole scene exudes a romantic and luxurious atmosphere. | (image) |
| (image) | 西部牛仔,牛仔帽,荒野大镖客,背景是西部小镇,仙人掌,,日落余晖, 暖色调, 使用XT4胶片拍摄, 噪点, 晕影, 柯达胶卷,复古<br>Western cowboy, cowboy hat, Red Dead Redemption, background is a western town, cactus, sunset glow, warm tones, shot with XT4 film, grain, vignette, Kodak film, retro. | (image) |

  • Our Kolors-IP-Adapter-FaceID-Plus module is trained on a large-scale, high-quality face dataset. It combines the face ID embeddings generated by insightface with CLIP features of the face region to preserve both identity and facial structure.
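
As a rough illustration of the identity-embedding step, the sketch below uses insightface's `FaceAnalysis` to detect faces, keeps the largest one, and returns its normalized ID embedding together with the face crop that would be handed to a CLIP image encoder. The `buffalo_l` model pack, the largest-face heuristic, and the helper names `bbox_area`, `pick_largest_face`, and `extract_face_id` are assumptions made for illustration; the repository's own inference script may do this differently.

```python
def bbox_area(bbox):
    """Area of an (x1, y1, x2, y2) box; degenerate boxes count as 0."""
    x1, y1, x2, y2 = bbox
    return max(0.0, x2 - x1) * max(0.0, y2 - y1)


def pick_largest_face(faces):
    """Return the detection with the largest bounding box, or None if empty."""
    return max(faces, key=lambda f: bbox_area(f.bbox), default=None)


def extract_face_id(image_path, model_pack="buffalo_l"):
    """Return (id_embedding, face_crop) for the largest face in the image.

    Assumption: `model_pack` and the cropping logic are illustrative only.
    Requires insightface, onnxruntime and opencv-python at call time.
    """
    import cv2
    from insightface.app import FaceAnalysis

    app = FaceAnalysis(
        name=model_pack,
        providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
    )
    app.prepare(ctx_id=0, det_size=(640, 640))

    image = cv2.imread(image_path)  # BGR array, the layout insightface expects
    if image is None:
        raise FileNotFoundError(image_path)

    face = pick_largest_face(app.get(image))
    if face is None:
        raise ValueError(f"no face detected in {image_path}")

    # Clamp the box to non-negative pixel indices before slicing.
    x1, y1, x2, y2 = (max(0, int(v)) for v in face.bbox)
    face_crop = image[y1:y2, x1:x2]  # face region for a CLIP image encoder
    return face.normed_embedding, face_crop
```

The heavy imports live inside `extract_face_id` so the selection helpers can be reused and checked without a GPU or model download.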

📊 Evaluation

For evaluation, we constructed a test set of over 200 reference images and text prompts, and invited several image experts to rate the results generated by different models. The experts assessed the generated images on five criteria: visual appeal, text faithfulness, face similarity, facial aesthetics, and overall satisfaction. Visual appeal and text faithfulness measure text-to-image generation capability and follow the evaluation standards of Kolors-BaseModel. Face similarity and facial aesthetics evaluate the performance of the proposed Kolors-IP-Adapter-FaceID-Plus. The results are summarized in the table below, where Kolors-IP-Adapter-FaceID-Plus outperforms SDXL-IP-Adapter-FaceID-Plus on all five metrics.

| Model | Average Text Faithfulness | Average Visual Appeal | Average Face Similarity | Average Facial Aesthetics | Average Overall Satisfaction |
| :--- | :---: | :---: | :---: | :---: | :---: |
| SDXL-IP-Adapter-FaceID-Plus | 4.014 | 3.455 | 3.05 | 2.584 | 2.448 |
| Kolors-IP-Adapter-FaceID-Plus | 4.235 | 4.374 | 4.415 | 3.887 | 3.561 |
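
To see where the two models differ most, the short sketch below computes the per-metric gap between them from the scores in the table above. The names `METRICS`, `SCORES`, and `score_gaps` are illustrative and not part of the repository.

```python
# Mean expert ratings copied from the evaluation table.
METRICS = (
    "Text Faithfulness",
    "Visual Appeal",
    "Face Similarity",
    "Facial Aesthetics",
    "Overall Satisfaction",
)
SCORES = {
    "SDXL-IP-Adapter-FaceID-Plus": (4.014, 3.455, 3.05, 2.584, 2.448),
    "Kolors-IP-Adapter-FaceID-Plus": (4.235, 4.374, 4.415, 3.887, 3.561),
}


def score_gaps(baseline, candidate):
    """Per-metric difference (candidate - baseline), rounded to 3 decimals."""
    return {
        metric: round(c - b, 3)
        for metric, b, c in zip(METRICS, SCORES[baseline], SCORES[candidate])
    }


if __name__ == "__main__":
    gaps = score_gaps("SDXL-IP-Adapter-FaceID-Plus", "Kolors-IP-Adapter-FaceID-Plus")
    for metric, gap in gaps.items():
        print(f"{metric:<22}{gap:+.3f}")
```

On these numbers the gap is largest for face similarity (1.365) and smallest for text faithfulness (0.221).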


Comparison Results

| Reference Image | Prompt | SDXL-IP-Adapter-FaceID-Plus | Kolors-IP-Adapter-FaceID-Plus |
| :---: | :--- | :---: | :---: |
| (image) | 古典油画风格,油彩厚重, 古典美感,历史气息<br>Classical oil painting style, thick oil paint, classical aesthetic, historical atmosphere. | (image) | (image) |
| (image) | 夜晚,浪漫的海边,落日余晖洒在海面上,晚霞映照着整个海滩,头戴花环,花短袖,飘逸的头发,背景是美丽的海滩,可爱年轻的半身照,优雅梦幻,细节繁复,超逼真,高分辨率,柔和的背景,低对比度<br>Night, romantic seaside, sunset glow on the sea, evening glow reflecting on the whole beach, wearing a flower crown, short floral sleeves, flowing hair, background is a beautiful beach, cute young half-body portrait, elegant and dreamy, intricate details, ultra-realistic, high resolution, soft background, low contrast. | (image) | (image) |
| (image) | F1赛车手, 法拉利,戴着着红黑白相间的赛车手头盔,帅气的赛车手,飞舞的彩带,背景赛车车库和天花板泛光,璀璨闪光,穿红白黑相间赛车服,色调统一且明艳,面部白皙,面部特写,正视图<br>F1 racer, Ferrari, wearing a red, black, and white racing helmet, handsome racer, flying ribbons, background race car garage and ceiling lights, dazzling flashes, wearing red, white, and black racing suit, unified and bright color tone, fair face, facial close-up, front view. | (image) | (image) |
| (image) | 和服,日本传统服饰,在海边的黄昏,远山的背景,在远处的烟火,柔和的灯光,长焦镜头,夜间摄影风格,凉爽的色调,浪漫的气氛,火花四溅,时尚摄影,胶片滤镜<br>Kimono, traditional Japanese clothing, at the seaside at dusk, distant mountain background, fireworks in the distance, soft lighting, telephoto lens, night photography style, cool tones, romantic atmosphere, sparks flying, fashion photography, film filter. | (image) | (image) |

Kolors-IP-Adapter-FaceID-Plus uses Chinese prompts, while SDXL-IP-Adapter-FaceID-Plus uses English prompts.

🛠️ Usage

Requirements

The dependencies and installation steps are largely the same as for Kolors-BaseModel.


  1. Repository Cloning and Dependency Installation
apt-get install git-lfs
git clone https://github.com/Kwai-Kolors/Kolors
cd Kolors
conda create --name kolors python=3.8
conda activate kolors
pip install -r requirements.txt
pip install insightface onnxruntime-gpu
python3 setup.py install
  2. Weights download
huggingface-cli download --resume-download Kwai-Kolors/Kolors-IP-Adapter-FaceID-Plus --local-dir weights/Kolors-IP-Adapter-FaceID-Plus

or

git lfs clone https://huggingface.co/Kwai-Kolors/Kolors-IP-Adapter-FaceID-Plus weights/Kolors-IP-Adapter-FaceID-Plus
  3. Inference:
python ipadapter_FaceID/sample_ipadapter_faceid_plus.py ./ipadapter_FaceID/assets/image1.png "穿着晚礼服,在星光下的晚宴场景中,烛光闪闪,整个场景洋溢着浪漫而奢华的氛围"

python ipadapter_FaceID/sample_ipadapter_faceid_plus.py ./ipadapter_FaceID/assets/image2.png "西部牛仔,牛仔帽,荒野大镖客,背景是西部小镇,仙人掌,,日落余晖, 暖色调, 使用XT4胶片拍摄, 噪点, 晕影, 柯达胶卷,复古"

# The image will be saved to "scripts/outputs/"
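
To run several reference images in one go, a small wrapper around the inference script can help. The sketch below is a minimal example, assuming it is run from the root of the cloned Kolors repository and that the script takes exactly the two positional arguments shown above (image path, then prompt). The names `build_command` and `run_batch`, and the check on the weights directory, are illustrative additions and not part of the repository.

```python
import subprocess
import sys
from pathlib import Path

# Paths taken from the commands above, relative to the Kolors repo root.
SCRIPT = "ipadapter_FaceID/sample_ipadapter_faceid_plus.py"
WEIGHTS_DIR = Path("weights/Kolors-IP-Adapter-FaceID-Plus")


def build_command(image_path, prompt, python=sys.executable):
    """Build the argument list for one inference call (image path, then prompt)."""
    return [python, SCRIPT, str(image_path), prompt]


def run_batch(jobs, dry_run=False):
    """Run the inference script once per (image_path, prompt) pair.

    With dry_run=True the commands are only built and returned, not executed.
    """
    # Assumption: the weights were downloaded to the directory used in step 2.
    if not dry_run and not WEIGHTS_DIR.is_dir():
        raise FileNotFoundError(f"weights not found at {WEIGHTS_DIR}")

    commands = [build_command(image, prompt) for image, prompt in jobs]
    if not dry_run:
        for command in commands:
            # Passing a list avoids shell quoting issues with long prompts.
            subprocess.run(command, check=True)
    return commands
```

For example, `run_batch([("./ipadapter_FaceID/assets/image1.png", "穿着晚礼服,在星光下的晚宴场景中,烛光闪闪,整个场景洋溢着浪漫而奢华的氛围")])` would issue the first command shown above; passing `dry_run=True` returns the commands without running them, which is handy for checking paths first.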

Acknowledgments