feat: add speech-to-video and animation pipelines #197

ehsanrs2 · 2025-09-28T06:23:53Z

Summary

add WanS2V and WanAnimate pipelines with configs, modules, and CLI wiring
introduce shared text-conditioning/memory-diagnostic utilities plus CosyVoice download helpers
document new workflows and optional dependency installs alongside unit tests

Motivation

Implementation Notes

CLI now accepts audio/TTS/animation flags and applies safer defaults for 5B jobs
CosyVoice helpers respect WAN_COSYVOICE_* env overrides and guard against missing git/huggingface-hub
Tests import modules dynamically to avoid optional dependency noise
isort/black executed per-file due to repo size; pytest warns about CUDA on CPU sandbox

Breaking Changes

none
Testing
python -m pytest tests/test_utils.py # passes (CUDA availability warning expected)

ehsanrs2 added 2 commits September 28, 2025 09:31

feat(utils): add text conditioning and memory diagnostics

8b5049e

feat(pipelines): add speech-to-video and animation tasks

f34b8fc

Provide feedback