Stars
unslothai / llama.cpp
Forked from ggml-org/llama.cppLLM inference in C/C++
This repo includes Claude prompt curation to use Claude better.
Tools to build web AI agents that can authenticate, interact with and extract data from any website.
A generative world for general-purpose robotics & embodied AI learning.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.
A python program that turns an LLM, running on Ollama, into an automated researcher, which will with a single query determine focus areas to investigate, do websearches and scrape content from vari…
A simple screen parsing tool towards pure vision based GUI agent
A natural language interface for computers
Follow along with my AI Agents Masterclass videos! All of the code I create and use in this series on YouTube will be here for you to use and even build on top of!
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
Chain of Thought decoding without prompting, integration to the open-webui pipelines
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
VPTQ, A Flexible and Extreme low-bit quantization algorithm
Use PEFT or Full-parameter to finetune 500+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Llama3.2-Vision, Llava…
Paint by numbers generator
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Everything-Reactivity in ComfyUI (audio, MIDI, motion, proximity, and more).
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge managemen…
An open-source & self-hostable Heroku / Netlify / Vercel alternative.
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…
curtified / FluxMusicGUI
Forked from camenduru/FluxMusicText-to-Music Generation with Rectified Flow Transformer