Featured Projects
DeepScaleR
DeepScaleR: Surpassing O1-Preview with a 1.5B Model
A 1.5B model that surpasses O1-Preview by scaling RL
DeepCoder
DeepCoder: A Fully Open-Source 14B Coder at O3-mini Level
A 14B coding model matching O3-mini performance
o3-mini-2025-01-031 (Low) and o1-2024-12-17 performance
Release Date: April 8, 2025
DeepSWE
DeepSWE: Training a State-of-the-Art Coding Agent by Scaling RL
A 32B software engineering agent trained purely with RL
Tongyi DeepResearch
Tongyi DeepResearch
A New Era of Open-Source AI Researchers
Terminal-Bench-RL
Terminal-Bench-RL
Training Long-Horizon Terminal Agents with Reinforcement Learning
PettingLLMs
PettingLLMs
Using On-Policy Reinforcement Learning for Stronger Multi-Agent Systems
SETA
SETA: Scaling Environments for Terminal Agents
Scaling environments for terminal agent training
LLM-in-Sandbox
LLM-in-Sandbox
Building General Agents by running LLMs in a sandbox (virtual computer)
Research Projects
Cogito, Ergo Ludo
Cogito, Ergo Ludo: An Agent that Learns to Play by Reasoning and Planning
Game-playing agent using reasoning and planning
Cut the Bill, Keep the Turns
Cut the Bill, Keep the Turns: Affordable Multi-Turn Search RL
Cost-efficient multi-turn search with RL
Experiential Reinforcement Learning
Experiential Reinforcement Learning
Reinforcement Learning with an Experience–Reflection–Consolidation Loop
Official Projects
rLLM-FinQA-4B
rLLM-FinQA-4B
A 4B Financial Analysis Agent that Outperforms 235B Models
Project Categories
Reasoning
DeepScaleR (AIME 43.1%)
Coding
DeepCoder (60.6% LiveCodeBench)DeepSWE (59% SWEBench-Verified)
Research
Tongyi DeepResearchExperiential RL
Terminal Agents
Terminal-Bench-RLSETA
Multi-Agent
PettingLLMs
Finance
rLLM-FinQA-4B
Contributing Your Project
To add your project to this list:- Open a Pull Request on GitHub
-
Include the following information:
- Project name and description
- GitHub repository or project page link
- Key achievements or benchmarks
- Any published papers or blog posts
- Join our community:
Resources
Getting Started
Install rLLM and start building
Tutorials
Learn through step-by-step guides
Discord Community
Join the rLLM community
GitHub
Contribute to rLLM