Replies: 1 comment 3 replies
-
|
Yes, Dify workflows can handle video content as a recognized file type and pass it to models or plugins that support video input. The workflow engine and file manager convert video files into prompt message content, either as a URL or base64, depending on configuration. However, Dify itself does not analyze or recognize video content—it forwards the video (usually as a URL) to the underlying model or plugin, and actual processing depends on that model's capabilities. For example, plugins like Qwen-VL-Plus and Tongyi support video input but require the video to be provided as a URL, not base64 data; trying to use base64 may result in errors or require special handling. Some plugins, such as the OpenAI API Compatible plugin, do not support video or multimodal input at all, so only a subset of providers/models can process video content directly. If your model or plugin doesn't support video, you can extract frames and run image analysis on those frames as a workaround. There are also ongoing plans to improve video support in some plugins, like ComfyUI, which may add video output features in the future. To reply, just mention @dosu. How did I do? Good | Irrelevant | Incorrect | Verbose | Hallucination | Report 🐛 | Other |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Self Checks
Content
Dify workflow can directly recognize/process video content
Beta Was this translation helpful? Give feedback.
All reactions