- Red Buffer
- Islamabad, Pakistan
- https://www.linkedin.com/in/nauyan/
Stars
SpatialLM: Large Language Model for Spatial Understanding
🤖 Build voice-based LLM agents. Modular + open source.
Model Context Protocol Servers
A Conversational Speech Generation Model
🤗 smolagents: a barebones library for agents that think in python code.
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.
Go from no deep learning knowledge to implementing GPT.
Benchmarks of approximate nearest neighbor libraries in Python
Famous Vision Language Models and Their Architectures
Multimodal Whole Slide Foundation Model for Pathology
🐍 Geometric Computer Vision Library for Spatial AI
Data processing with ML, LLM and Vision LLM
Implementation of Nougat Neural Optical Understanding for Academic Documents
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
Ansari is an AI assistant that helps Muslims practice more effectively and non-Muslims understand Islam
FastMLX is a high-performance, production-ready API to host MLX models.
Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
Real-time interactive streaming digital human