R1 Zero GRPO Resources
A curated collection of resources on R1 Zero and GRPO (Group Relative Policy Optimization), covering implementations and research.
Official Implementations
- Open R1 - Hugging Face's open reproduction of DeepSeek-R1
- X-R1 - Low-cost R1-Zero-style training framework
- R1-Onevision - Vision-language model implementation
- Open R1 Multimodal - Multimodal implementation
Training Tools & Frameworks
- LLaMA Factory - Training framework with quickstart guide
- EasyR1 - Simplified R1 implementation
- VERL - Volcengine's reinforcement learning training library for LLMs
- VLM-R1 - Vision-language model implementation
Documentation
- Swift GRPO Documentation - GRPO training guide from the ms-swift framework
Additional Resources
- Awesome LLM Resources - Comprehensive collection of LLM resources