Skip to content
View soeaver's full-sized avatar
  • BUPT
  • Beijing

Highlights

  • Pro

Block or report soeaver

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Unified Tokenizer for Visual Generation and Understanding

Python 174 4 Updated Mar 3, 2025

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 824 41 Updated Feb 25, 2025

[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence

Python 373 32 Updated Mar 5, 2025

Official implementation of the WACV 2025 ( Oral ) paper. RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision.

Python 106 5 Updated Feb 27, 2025

Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496

Jupyter Notebook 86 5 Updated Jul 22, 2024

Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed

Jupyter Notebook 79 6 Updated Oct 25, 2024

EVE Series: Encoder-Free Vision-Language Models from BAAI

Python 307 7 Updated Mar 1, 2025
Python 19 Updated Feb 27, 2025

Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

Python 160 8 Updated Jan 24, 2025

[CVPR 2025] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Python 692 44 Updated Feb 26, 2025
Python 17 4 Updated Aug 9, 2024

[NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception

Python 40 4 Updated Mar 25, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,543 365 Updated Mar 6, 2025

Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025)

Python 202 10 Updated Feb 19, 2025

Code of AAAI2025 Paper 《VIoTGPT: Learning to Schedule Vision Tools in LLMs towards Intelligent Video Internet of Things》

Python 11 3 Updated Jan 16, 2025

An official implementation of "Hulk: A Universal Knowledge Translator for Human-Centric Tasks"

Python 121 5 Updated Dec 4, 2024

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 896 37 Updated Jan 21, 2025

Next-Token Prediction is All You Need

Python 2,020 78 Updated Oct 24, 2024

High-resolution models for human tasks.

Python 4,857 289 Updated Nov 18, 2024

Free, simple, and intuitive online database diagram editor and SQL generator.

JavaScript 25,691 1,809 Updated Mar 5, 2025

[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild

Python 4,156 463 Updated Nov 18, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,385 1,484 Updated Dec 25, 2024

A programming language exclusively designed for cybersecurity

Go 435 50 Updated Mar 6, 2025

Cyber Security ALL-IN-ONE Platform

TypeScript 6,210 721 Updated Mar 6, 2025

RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.

Python 308 40 Updated Feb 28, 2025
Python 318 36 Updated Dec 19, 2024
Python 488 59 Updated Aug 22, 2024

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 4,804 426 Updated Jan 22, 2025
Next
Showing results