Guide
How to Use a Failing Test as an AI Coding Spec
A failing test is one of the best specs for an AI coding agent because it defines the target behavior and the check. Give the agent the failing output, relevant file, expected behavior and one repair boundary. Ask it to make the smallest patch that turns the test green.
What is the working pattern for failing test as an AI coding spec?
- Move
- Start with a bounded task
- Use this when
- Developers using test-first or bug-fix loops with AI coding agents.
- Proof to save
- Issue, files, checks and owner are named
- Move
- Give the agent context
- Use this when
- The repo has patterns the agent must follow
- Proof to save
- Prompt cites files, errors and constraints
- Move
- Review the diff
- Use this when
- The task changes production code
- Proof to save
- Changed files, test output and risks are visible
| Move | Use this when | Proof to save |
|---|---|---|
| Start with a bounded task | Developers using test-first or bug-fix loops with AI coding agents. | Issue, files, checks and owner are named |
| Give the agent context | The repo has patterns the agent must follow | Prompt cites files, errors and constraints |
| Review the diff | The task changes production code | Changed files, test output and risks are visible |
A good AI coding workflow is specific enough to review and small enough to recover.
Interactive diagram. Use Tab to move through hotspots or use the step controls when shown.
Use this loop when the change is larger than autocomplete and smaller than a full project rewrite.

The public guide connects to lessons, recall and readiness checks inside Learn Cursor.
Can I adapt the prompt to my repo?
Interactive diagram. Use Tab to move through hotspots or use the step controls when shown.
Task: Add keyboard support to the command menu Context: - Use src/components/command-palette.tsx and current button patterns Boundary: - Do not change routing or auth code Done when: - Run npm run lint and test keyboard focus in browser Before editing, write a short plan with files, risk and checks.
Keep the static prompt frame on the page. The builder only helps readers adapt it to their repo.
Change the fields, then copy the prompt into your AI coding tool.
Task: [one outcome] Context: [files, errors, docs and examples] Boundary: [what not to touch] Done when: [test, typecheck, screenshot or review proof] Before editing, write a short plan with files, risk and checks.
How should a team run failing test as an AI coding spec?
- 1Pick one real backlog item with a clear owner and expected result.
- 2Add only the context the agent needs: files, failing output, constraints and done state.
- 3Ask for a plan before code when the task touches more than one file.
- 4Run checks that match the risk: unit test, typecheck, visual pass or review checklist.
- 5Capture the prompt, diff, result and reviewer note so the workflow can be repeated.
Task, context, constraints, done state and checks.
Open the diff, read changed files and rerun the check yourself.
The guide uses one failing assertion as context, boundary, patch and rerun steps.
What should you keep after the run?
- The prompt or plan that shaped the work.
- The files changed and the reason each file changed.
- The command, screenshot or review note that proved the result.
- The rule, checklist or template you would reuse next time.
Frequently asked questions
Who is How to Use a Failing Test as an AI Coding Spec for?
Developers using test-first or bug-fix loops with AI coding agents.
What makes this page credible?
The guide uses one failing assertion as context, boundary, patch and rerun steps.
What should I do next?
Start with one real repo task, capture the prompt and review the result before scaling the workflow.
Sources & last verified
- Cursor agent best practices
- Cursor Learn: working with agents
- Cursor Learn: context
- Cursor docs: prompting agents
Cursor ships frequently. Facts verified against primary sources on June 23, 2026.
