© 2026 Clique Media Group Pty Ltd (ABN 22 614 800 341) · Learn Cursor is an independent study tool. Not affiliated with Cursor / Anysphere.


What is long-horizon RL?

long-horizon reinforcement learning

Long-horizon RL trains a coding agent by running many rollouts (complete attempts at a task, from first action to final result) on real problems and reinforcing the rollouts that succeed. A single rollout can reach 200K tokens and hundreds of tool calls.
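
The loop described above can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the names `Rollout`, `run_rollout`, and `train` are hypothetical, the "agent" is just two action weights, each step stands in for a tool call, and the update is a simple "boost the actions seen in successful rollouts" rule rather than the policy-gradient methods a real training system would likely use. The point it shows is that the success signal arrives only once, at the end of a long sequence of steps.

```python
import random
from dataclasses import dataclass


@dataclass
class Rollout:
    actions: list[int]  # one entry per step (a stand-in for a tool call)
    succeeded: bool     # outcome is known only once the episode ends


def run_rollout(weights: list[float], horizon: int, rng: random.Random) -> Rollout:
    """Sample `horizon` actions; the toy task succeeds if most actions are 1."""
    p_good = weights[1] / (weights[0] + weights[1])
    actions = [1 if rng.random() < p_good else 0 for _ in range(horizon)]
    return Rollout(actions, sum(actions) > horizon // 2)


def train(rounds: int = 50, rollouts_per_round: int = 16,
          horizon: int = 20, seed: int = 0) -> list[float]:
    """Run many rollouts and reinforce only the ones that succeed."""
    rng = random.Random(seed)
    weights = [1.0, 1.0]  # unnormalised preference for action 0 and action 1
    for _ in range(rounds):
        batch = [run_rollout(weights, horizon, rng)
                 for _ in range(rollouts_per_round)]
        for rollout in batch:
            if not rollout.succeeded:
                continue  # failed rollouts contribute nothing in this sketch
            for action in rollout.actions:
                weights[action] += 0.01  # reinforce every step of a success
    return weights


if __name__ == "__main__":
    print(train())
```

In this sketch the horizon is 20 steps; in the setting the definition describes, the same end-of-episode signal has to be spread across hundreds of tool calls, which is what makes the long-horizon case hard.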