feat: Add ComfyUI tool for Stable Diffusion #8160

QunBB · 2024-09-09T13:52:10Z


api/core/tools/provider/builtin/comfyui/_assets/icon.png

api/core/tools/provider/builtin/comfyui/comfyui.yaml

api/core/tools/provider/builtin/comfyui/tools/txt2img.json

crazywoola

See comments

QunBB · 2024-09-10T06:19:54Z

@crazywoola Done. Check it again please.

api/core/tools/provider/builtin/comfyui/comfyui.yaml

crazywoola

See the author

api/core/tools/provider/builtin/comfyui/tools/comfyui_stable_diffusion.py

crazywoola

See comments

WeepsDanky · 2024-09-12T06:46:58Z

Hi @QunBB. When I was testing this tool, an error showed up: Failed to get models, Please input model. My ComfyUI is running at http://127.0.0.1:8188/ with a blank canvas. How can I resolve this error?
I have my models loaded in /models/checkpoints.
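
For readers debugging the same error, here is a minimal sketch of how a client might ask a running ComfyUI server which checkpoints it can see. It assumes the server exposes `GET /object_info/CheckpointLoaderSimple` and that the file names sit in the first element of `input.required.ckpt_name`. The helper names `extract_checkpoint_names` and `list_checkpoints` are hypothetical, and this is not necessarily how the tool's validate_models is implemented.

```python
import json
from urllib.request import urlopen


def extract_checkpoint_names(object_info: dict) -> list[str]:
    """Pull checkpoint file names out of an /object_info response (assumed layout)."""
    node = object_info.get("CheckpointLoaderSimple", {})
    ckpt_field = node.get("input", {}).get("required", {}).get("ckpt_name", [])
    # Assumption: the first element of the field is the list of selectable files.
    if ckpt_field and isinstance(ckpt_field[0], list):
        return list(ckpt_field[0])
    return []


def list_checkpoints(base_url: str = "http://127.0.0.1:8188", timeout: float = 5.0) -> list[str]:
    """Query a ComfyUI server (assumed endpoint) and return its checkpoint names."""
    url = f"{base_url.rstrip('/')}/object_info/CheckpointLoaderSimple"
    with urlopen(url, timeout=timeout) as resp:
        return extract_checkpoint_names(json.load(resp))
```

If `list_checkpoints()` returns an empty list while the server is reachable, the files are probably not in a directory ComfyUI scans for checkpoints.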

QunBB · 2024-09-12T08:23:21Z

@crazywoola @WeepsDanky The model name should also be exposed when verifying the tool's credentials. Some of the modified code comes after credential verification, so I missed testing the changes to validate_models. I will fix it later.

QunBB · 2024-09-12T10:57:39Z

@crazywoola @WeepsDanky I have fixed it, please check it again.

QunBB · 2024-09-12T11:02:53Z

@crazywoola But I am having trouble with the Ruff check. Should I reformat the files?

WeepsDanky · 2024-09-13T03:23:03Z

@QunBB Great, thanks. It is working now.

WeepsDanky · 2024-09-13T03:27:07Z

@QunBB I noticed the tool can currently only use the pre-defined workflow txt2img.json. Can you please be more specific about the name and description in the YAML? We need to make sure other users understand that this tool can only use this workflow.

For example:
name: txt2img workflow,
description: a pre-defined ComfyUI workflow that can use one model and up to 3 LoRAs to generate images. Does not support newer models such as Stable Diffusion 3 that require a triple CLIP loader.

QunBB · 2024-09-13T07:21:15Z

@WeepsDanky Hi, I have changed the name and description in the YAML. In addition, I added support for SD3 and FLUX.
You can try them with these examples:

SD3
flux-fp8

wisepmlin · 2024-09-21T06:06:41Z

Comflowy on Mac
Failed to get models, [Errno 111] Connection refused


QunBB · 2024-09-23T01:57:36Z

@wisepmlin This seems to be a network issue. I installed Comflowy on Mac and it worked for me too.
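
`[Errno 111]` is the Linux error code for a refused connection, meaning nothing accepted the TCP connection at the configured address. A rough way to narrow this down is to test reachability from the same machine or container that runs the Dify API, since that is where the tool's requests originate. The sketch below uses only the standard library; `can_connect` is a hypothetical helper, not part of Dify or ComfyUI.

```python
import socket
from urllib.parse import urlparse


def can_connect(base_url: str, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to the URL's host and port succeeds."""
    parsed = urlparse(base_url)
    host = parsed.hostname
    if host is None:
        return False
    # Fall back to the scheme's default port when the URL does not name one.
    port = parsed.port or (443 if parsed.scheme == "https" else 80)
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False
```

For example, `can_connect("http://127.0.0.1:8188")` returning `False` in the Dify API environment would suggest the base URL configured for the tool does not reach the ComfyUI server from there.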

Dongnc1017 · 2024-09-25T08:04:01Z

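The link https://docs.dify.ai/tutorials/tool-configuration/comfyui is not accessible.
1. I want to use the Flux models, Flux Dev and Flux Schnell. Are both supported? Is it enough to download them and put them in ComfyUI/models/unet/?
2. It also says that SD1.5, SDXL, SD3, and FLUX models that include a text encoder/CLIP are supported, but that models requiring a CLIP loader are not.
What does this sentence mean? For example, t5xxl fp16.safetensors or t5xxl fp8 e4m3fn.safetensors?
Could you provide a detailed tutorial?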

QunBB · 2024-09-25T08:32:24Z

@Dongnc1017 The comments above have already covered this. You should download models that include the text encoders, then put them into ComfyUI/models/checkpoints/, as in the official ComfyUI examples:

SD3
flux-fp8

Flux Dev and Flux Schnell are both supported.

laipz8200 · 2024-09-25T11:15:52Z

Hi @QunBB, would you be interested in updating the documentation for this tool? Or perhaps removing the inaccessible links from the configuration?

QunBB · 2024-09-25T13:36:54Z

@laipz8200 I'm interested in updating it later when I'm free. I will first replace the current link with the official ComfyUI website, and add it back once I finish the documentation in Dify.

hjlarry · 2024-09-27T09:39:47Z

Hi @QunBB, I think the current ComfyUI tool could be improved by reducing the user input and selections, and just giving the user a prompt_text input.

A ComfyUI workflow is similar to Dify's workflow and supports a variety of nodes. The current implementation seems to define specific steps, and the user can only select the parameters of those steps, which seems inflexible.

A more reasonable use case is: the user edits the workflow in ComfyUI, exports the workflow's JSON, which is similar to Dify's DSL, and then pastes it into this tool to get the image in Dify. That way they combine Dify's and ComfyUI's workflows.

This is the export API button:

This is a simple workflow that just changes an icon's style, but it has 40 nodes, and the current solution can't handle this:

What do you guys think about this?

QunBB · 2024-09-29T08:21:01Z

@hjlarry Sure, we could add that, and it would then support any image generation workflow in ComfyUI. But I don't think an LLM could correctly generate the prompt text via an agent, so it may only be usable in Dify's workflow.
By the way, I think the current tool could continue to be used by beginners or for simplicity, following the pattern of the Stable Diffusion tool. It can be used in both Dify's agent and workflow.

hjlarry · 2024-09-29T12:56:07Z

@QunBB Sure, we can add a new tool and keep both of them.
I think the new tool could let users configure the workflow's JSON and which node is the text node, so that the LLM only generates the text prompt and the tool can then be used in an agent app.
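
A minimal sketch of that idea, assuming the workflow was exported in ComfyUI's API format (a JSON object mapping node IDs to `{"class_type": ..., "inputs": {...}}`), that the chosen text node stores its prompt under `inputs["text"]` (as `CLIPTextEncode` does), and that the server accepts a POST to `/prompt` with a body containing `prompt` and `client_id`. The function names are hypothetical and this is not the implementation discussed in this PR.

```python
import json


def inject_prompt(workflow_json: str, text_node_id: str, prompt_text: str) -> dict:
    """Parse an exported workflow and overwrite the text of one node."""
    workflow = json.loads(workflow_json)
    node = workflow.get(text_node_id)
    if node is None:
        raise ValueError(f"node {text_node_id!r} not found in workflow")
    # Assumption: the text node keeps its prompt under inputs["text"].
    node.setdefault("inputs", {})["text"] = prompt_text
    return workflow


def build_prompt_request(workflow: dict, client_id: str) -> bytes:
    """Build the JSON body for a POST to /prompt (assumed request shape)."""
    return json.dumps({"prompt": workflow, "client_id": client_id}).encode("utf-8")
```

With this split, the user supplies only the exported JSON and the text node's ID once, and the LLM only has to produce `prompt_text`.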

QunBB · 2024-09-30T01:18:36Z

@hjlarry It is a good idea.

Add ComfyUI tool for Stable Diffusion

9b4415d

dosubot bot added the size:XL (This PR changes 500-999 lines, ignoring generated files.) and 🔨 feat:tools (Tools for agent, function call related stuff.) labels on Sep 9, 2024

QunBB added 2 commits September 9, 2024 22:13

for Ruff check and Super-linter

132fe4c

Organize imports

8bae80c