hxhcreate

Follow

🎯

Focusing

HXH hxhcreate

🎯

Focusing

Follow

22 followers · 147 following

Fudan Univ | BIT
Shanghai, China

Achievements

Achievements

Highlights

Pro

Lists (8)

Sort

Agent

alignment

25 repositories

Awareness

EAI

LRM

16 repositories

Safety

school

17 repositories

Tools

27 repositories

Stars

Bailey-24 / Rekep

Fix bug and add a new task

Python 30 6 Updated Oct 14, 2024

huangwl18 / ReKep

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python 698 74 Updated Feb 20, 2025

MARS-EAI / RoboFactory

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints

Python 21 Updated Mar 25, 2025

saferlhf-v / saferlhf-v

Python 5 Updated Mar 8, 2025

LRudL / sad

Situational Awareness Dataset

HTML 26 3 Updated Dec 14, 2024

HowieHwong / Awareness-in-LLM

Python 10 1 Updated Jun 11, 2024

XuchanBao / behavioral-self-awareness

Python 23 4 Updated Feb 20, 2025

METR / eval-analysis-public

Public repository containing METR's DVC pipeline for eval data analysis

Python 32 7 Updated Mar 24, 2025

jingyi0000 / R1-VL

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

77 Updated Mar 26, 2025

Video-R1 / Awesome-Multimodal-Reasoning

92 8 Updated Mar 26, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 1,741 114 Updated Mar 27, 2025

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

844 31 Updated Mar 27, 2025

TideDra / lmm-r1

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 656 38 Updated Mar 27, 2025

OpenRLHF / OpenRLHF-M

An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.

Python 94 5 Updated Mar 10, 2025

ModalMinds / MM-EUREKA

MM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Python 445 15 Updated Mar 27, 2025

turningpoint-ai / VisualThinker-R1-Zero

Explore the Multimodal “Aha Moment” on 2B Model

Python 535 17 Updated Mar 18, 2025

PKU-Alignment / SafeVLA

Python 25 1 Updated Mar 18, 2025

chuhac / Reasoning-to-Defend

Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking

Python 4 Updated Mar 9, 2025

Ola-Omni / Ola

Ola: Pushing the Frontiers of Omni-Modal Language Model

Python 320 14 Updated Feb 28, 2025

Cuzyoung / CrossEarth

[Official Repo] CrossEarth: Geospatial Vision Foundation Model for Cross-Domain Generalization in Remote Sensing Semantic Segmentation

Python 133 9 Updated Jan 5, 2025

wonderNefelibata / Awesome-LRM-Safety

Awesome Large Reasoning Model(LRM) Safety.This repository is used to collect security-related research on large reasoning models such as DeepSeek-R1 and OpenAI o1, which are currently very popular.

Python 53 2 Updated Mar 27, 2025

LeslieTrue / SFTvsRL

Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Python 254 14 Updated Feb 24, 2025

thu-ml / STAIR

Official codebase for "STAIR: Improving Safety Alignment with Introspective Reasoning"

Python 28 Updated Feb 26, 2025

bruno686 / Awesome-RL-based-LLM-Reasoning

Awesome RL-based LLM Reasoning

348 19 Updated Mar 26, 2025

Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

512 29 Updated Mar 23, 2025

TrustGen / TrustEval-toolkit

TrustEval: A modular and extensible toolkit for comprehensive trust evaluation of generative foundation models (GenFMs)

Python 90 5 Updated Feb 25, 2025

HKUNLP / critic-rl

Code for Paper: Teaching Language Models to Critique via Reinforcement Learning

Python 84 4 Updated Feb 17, 2025

TIGER-AI-Lab / CritiqueFineTuning

Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"

Python 131 9 Updated Feb 11, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 1,681 80 Updated Mar 5, 2025

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 1,130 58 Updated Feb 8, 2025