fujingling

fujingling fujingling

6 followers · 8 following

Stars

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 9,279 642 Updated Mar 27, 2025

showlab / Show-o

[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,293 56 Updated Mar 24, 2025

applicaai / CCpdf

Index of URLs to pdf files all over the internet and scripts

Shell 23 3 Updated May 2, 2023

open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 5,045 527 Updated Mar 25, 2025

vectorch-ai / ScaleLLM

A high-performance inference system for large language models, designed for production environments.

C++ 426 34 Updated Mar 20, 2025

LLaVA-VL / LLaVA-NeXT

Python 3,617 334 Updated Feb 24, 2025

Yuliang-Liu / Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Python 1,735 121 Updated Mar 20, 2025

SpursGoZmy / Table-LLaVA

Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train Dataset for table understanding and develop a generalist tab…

Python 190 7 Updated Sep 27, 2024

OpenBMB / MiniCPM-o

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,083 1,374 Updated Mar 3, 2025

NoEdgeAI / pdfdeal

A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装，同时附带本地的文本处理(提升PDF在RAG中的召回率)。

Python 247 15 Updated Feb 19, 2025

THUDM / LVBench

LVBench: An Extreme Long Video Understanding Benchmark

Python 85 1 Updated Aug 30, 2024

PeterH0323 / Streamer-Sales

Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁，一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭…

Python 3,109 481 Updated Mar 8, 2025

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,738 1,592 Updated Dec 25, 2024