Skip to main content

R1 Zero GRPO Resources

Tiu MoLess than 1 minute

R1 Zero GRPO Resources

A curated collection of resources related to R1 Zero and GRPO (Generative Reward-Penalty Optimization) implementations and research.

Official Implementations

Training Tools & Frameworks

T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT https://github.com/CaraJ7/T2I-R1

Documentation

Additional Resources

https://github.com/qiwang067/awesome-visual-rl

https://github.com/datawhalechina/easy-rl?tab=readme-ov-file 强化学习教程

训练框架

https://github.com/Simple-Efficient/RL-Factory

物理规则推理模型 https://github.com/nvidia-cosmos/cosmos-reason1