Skip to content
View hxhcreate's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Fudan Univ | BIT
  • Shanghai, China

Highlights

  • Pro

Block or report hxhcreate

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Fix bug and add a new task

Python 30 6 Updated Oct 14, 2024

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python 698 74 Updated Feb 20, 2025

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints

Python 21 Updated Mar 25, 2025
Python 5 Updated Mar 8, 2025

Situational Awareness Dataset

HTML 26 3 Updated Dec 14, 2024
Python 10 1 Updated Jun 11, 2024

Public repository containing METR's DVC pipeline for eval data analysis

Python 32 7 Updated Mar 24, 2025

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

77 Updated Mar 26, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 1,741 114 Updated Mar 27, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

844 31 Updated Mar 27, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 656 38 Updated Mar 27, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.

Python 94 5 Updated Mar 10, 2025

MM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Python 445 15 Updated Mar 27, 2025

Explore the Multimodal “Aha Moment” on 2B Model

Python 535 17 Updated Mar 18, 2025
Python 25 1 Updated Mar 18, 2025

Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking

Python 4 Updated Mar 9, 2025

Ola: Pushing the Frontiers of Omni-Modal Language Model

Python 320 14 Updated Feb 28, 2025

[Official Repo] CrossEarth: Geospatial Vision Foundation Model for Cross-Domain Generalization in Remote Sensing Semantic Segmentation

Python 133 9 Updated Jan 5, 2025

Awesome Large Reasoning Model(LRM) Safety.This repository is used to collect security-related research on large reasoning models such as DeepSeek-R1 and OpenAI o1, which are currently very popular.

Python 53 2 Updated Mar 27, 2025

Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Python 254 14 Updated Feb 24, 2025

Official codebase for "STAIR: Improving Safety Alignment with Introspective Reasoning"

Python 28 Updated Feb 26, 2025

Awesome RL-based LLM Reasoning

348 19 Updated Mar 26, 2025

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

512 29 Updated Mar 23, 2025

TrustEval: A modular and extensible toolkit for comprehensive trust evaluation of generative foundation models (GenFMs)

Python 90 5 Updated Feb 25, 2025

Code for Paper: Teaching Language Models to Critique via Reinforcement Learning

Python 84 4 Updated Feb 17, 2025

Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"

Python 131 9 Updated Feb 11, 2025

Official Repo for Open-Reasoner-Zero

Python 1,681 80 Updated Mar 5, 2025

A fork to add multimodal model training to open-r1

Python 1,130 58 Updated Feb 8, 2025
Next
Showing results