Starred repositories
A programming framework for agentic AI 🤖 PyPI: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
The official Python SDK for Model Context Protocol servers and clients
SD.Next: An all-in-one tool for AI image generation
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
Integrate the DeepSeek API into popular software
Pippo: High-Resolution Multi-View Humans from a Single Image
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings
A generative speech model for daily dialogue.
Make websites accessible for AI agents
[CVPR 2025] Official implementation of "MangaNinja: Line Art Colorization with Precise Reference Following"
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
Streamlit — A faster way to build and share data apps.
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
A Gradio web UI for Depth-Pro, Sharp Monocular Metric Depth Estimation
A tab for sd-webui for replacing objects in pictures or videos using a detection prompt
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
A machine learning image inpainting task that instinctively removes watermarks from images, producing output indistinguishable from the ground-truth image
Official PyTorch implementation of Revisiting Image Pyramid Structure for High Resolution Salient Object Detection (ACCV 2022)
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
Set prompts for divided regions
LayerDiffuse in pure diffusers without any GUI
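🚀 "Douyin_TikTok_Download_API" is an out-of-the-box, high-performance asynchronous data scraping tool for Douyin, Kuaishou, TikTok, and Bilibili, supporting API calls as well as online batch parsing and downloading.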