Stars
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRChat and Overlays in most Streaming Applications
An MIT License of YOLOv9, YOLOv7, YOLO-RD
Speech To Speech: an effort for an open-sourced and modular GPT4-o
🔄 A tool for object detection and image segmentation dataset format conversion.
A Pythonic framework to simplify AI service building
serving a torch model using Celery, Redis and RabbitMQ to serve users asynchronously
This repo is the homebase of a community driven course on Computer Vision with Neural Networks. Feel free to join us on the Hugging Face discord: hf.co/join/discord
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Code and dataset for photorealistic Codec Avatars driven from audio
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
A guidance language for controlling large language models.
Official implementation of "Separate Anything You Describe"