Skip to content
@MME-Benchmarks

MME Benchmarks

Multimodal LLM Benchmarks of MME series

Pinned Loading

  1. Video-MME Public

    ✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

    535 20

  2. MME-RealWorld Public

    ✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

    Python 114 8

  3. MME-CoT Public

    MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency

    Python 103 3

  4. MME-Unify Public

    MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models

    Python 33 2

Repositories

Showing 4 of 4 repositories
  • Video-MME Public

    ✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

    535 20 6 0 Updated Apr 17, 2025
  • MME-Unify Public

    MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models

    Python 33 2 0 0 Updated Apr 10, 2025
  • MME-CoT Public

    MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency

    Python 103 3 3 0 Updated Mar 29, 2025
  • MME-RealWorld Public

    ✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

    Python 114 8 3 0 Updated Mar 4, 2025