-
Soonchunhyang University
- South Korea
-
11:14
- 9h ahead
Stars
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Command & Conquer: Remastered Collection
Port of OpenAI's Whisper model in C/C++
A community-maintained Python framework for creating mathematical animations.
CodexLabsLLC / Colosseum
Forked from microsoft/AirSimOpen source simulator for autonomous robotics built on Unreal Engine with support for Unity
An open source light-weight and high performance inference framework for Hailo devices
ciderapp / node_airtunes2
Forked from xuan25/node_airtunes2node.js AirTunes v2 implementation: stream wirelessly to Apple audio devices.
Checkmate is an open-source, self-hosted tool designed to track and monitor server hardware, uptime, response times, and incidents in real-time with beautiful visualizations.
💬Speech recognition for your React app
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Open source API development ecosystem - https://hoppscotch.io (open-source alternative to Postman, Insomnia)
A high-level build system based on llbuild, used by Xcode, Swift Playground, and the Swift Package Manager
// Aesthetic, dynamic and minimal dots for Arch hyprland
Series of modern looking themes for SDDM.
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Example app using MAVSDK for iOS (Swift)
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero
A creative environment for the 21st century
A generative world for general-purpose robotics & embodied AI learning.
🔉 A Client for the Spotify Web API, written in C#/.NET
A nearly-live implementation of OpenAI's Whisper.
Bananas🍌, Cross-Platform screen 🖥️ sharing 📡 made simple ⚡.
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Whisper realtime streaming for long speech-to-text transcription and translation