ClipVision Enhancer #641
cubiq
announced in
Announcements
Replies: 1 comment 2 replies
-
Maybe we can use a CLIP model with a higher resolution (e.g. 512x512).
2 replies
-
I've added a ClipVision enhancer node. It's very experimental and I'm not even sure the math behind it is 100% correct, but the preliminary results are pretty good. In the image below, the enhanced version is in the middle, standard IPAdapter is on the left, and the reference image is on the right. Basically, I tile the image, generate the embeds for each tile, recompose the embeds in the same positions the tiles had in the original image, and finally pool everything down to the default embed size. I hope the image below explains this.
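For readers who prefer code to pictures, here is a rough sketch of the tile, embed, recompose, pool idea in NumPy. It is not the node's actual code: `toy_encoder` is a hypothetical stand-in for the CLIP vision encoder, `PATCH` and `DIM` are made-up sizes, average pooling is an assumption (the post only says the embeds are pooled), and the class token that a real CLIP model produces is left out.

```python
import numpy as np

PATCH = 16  # assumed patch size of the stand-in encoder
DIM = 8     # assumed embedding width (real CLIP models are much wider)


def toy_encoder(tile):
    """Hypothetical stand-in for a CLIP vision encoder.

    Takes an (H, W, C) tile and returns an (H/PATCH, W/PATCH, DIM) grid of
    patch embeddings: mean colour per patch, then a fixed linear projection.
    """
    h, w, c = tile.shape
    g_h, g_w = h // PATCH, w // PATCH
    patches = tile.reshape(g_h, PATCH, g_w, PATCH, c).mean(axis=(1, 3))
    proj = np.random.default_rng(0).standard_normal((c, DIM))
    return patches @ proj


def enhanced_embeds(image, tiles=2, encoder=toy_encoder):
    """Tile the image, embed each tile, recompose the grids, pool back down."""
    h, w, _ = image.shape
    th, tw = h // tiles, w // tiles

    # 1) Embed every tile, 2) put each tile's grid back where the tile was.
    rows = []
    for r in range(tiles):
        row = [
            encoder(image[r * th:(r + 1) * th, c * tw:(c + 1) * tw])
            for c in range(tiles)
        ]
        rows.append(np.concatenate(row, axis=1))
    full = np.concatenate(rows, axis=0)  # (tiles*g, tiles*g, DIM)

    # 3) Average-pool (assumed) back to the grid size of a single tile.
    gh, gw, d = full.shape
    g_h, g_w = gh // tiles, gw // tiles
    pooled = full.reshape(g_h, tiles, g_w, tiles, d).mean(axis=(1, 3))

    # Flatten to the usual (tokens, dim) layout.
    return pooled.reshape(g_h * g_w, d)


image = np.random.default_rng(1).random((448, 448, 3))
embeds = enhanced_embeds(image, tiles=2)
print(embeds.shape)
```

With a 448x448 input and 2x2 tiles, every tile is 224x224, so the output has the same token count as encoding a single 224x224 image, while each token summarises embeddings computed at twice the resolution.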
@xiaohu2015 do you think this could be used in training too? Have you tried something like this?