Skip to content

Commit 21236d8

Browse files
committed
chore(keyword-detector): add verification guarantee section to ultrawork prompt
Added comprehensive VERIFICATION GUARANTEE section to ultrawork prompt to enforce proof-based task completion. Includes: - Pre-implementation success criteria definition (Functional, Observable, Pass/Fail) - Mandatory Test Plan template for non-trivial tasks - Execution & Evidence requirements table (Build, Test, Manual Verify, Regression) - TDD workflow with evidence requirements - Verification anti-patterns and blocking violations This enhancement ensures agents must provide PROOF that something works before claiming completion - eliminating vague "it should work now" claims without evidence. 🤖 GENERATED WITH ASSISTANCE OF [OhMyOpenCode](https://github.com/code-yeongyu/oh-my-opencode)
1 parent ea8ca1a commit 21236d8

File tree

1 file changed

+60
-9
lines changed

1 file changed

+60
-9
lines changed

src/hooks/keyword-detector/constants.ts

Lines changed: 60 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -112,15 +112,66 @@ TELL THE USER WHAT AGENTS YOU WILL LEVERAGE NOW TO SATISFY USER'S REQUEST.
112112
3. Always Use Plan agent with gathered context to create detailed work breakdown
113113
4. Execute with continuous verification against original requirements
114114
115-
## TDD (if test infrastructure exists)
116-
117-
1. Write spec (requirements)
118-
2. Write tests (failing)
119-
3. RED: tests fail
120-
4. Implement minimal code
121-
5. GREEN: tests pass
122-
6. Refactor if needed (must stay green)
123-
7. Next feature, repeat
115+
## VERIFICATION GUARANTEE (NON-NEGOTIABLE)
116+
117+
**NOTHING is "done" without PROOF it works.**
118+
119+
### Pre-Implementation: Define Success Criteria
120+
121+
BEFORE writing ANY code, you MUST define:
122+
123+
| Criteria Type | Description | Example |
124+
|---------------|-------------|---------|
125+
| **Functional** | What specific behavior must work | "Button click triggers API call" |
126+
| **Observable** | What can be measured/seen | "Console shows 'success', no errors" |
127+
| **Pass/Fail** | Binary, no ambiguity | "Returns 200 OK" not "should work" |
128+
129+
Write these criteria explicitly. Share with user if scope is non-trivial.
130+
131+
### Test Plan Template (MANDATORY for non-trivial tasks)
132+
133+
\`\`\`
134+
## Test Plan
135+
### Objective: [What we're verifying]
136+
### Prerequisites: [Setup needed]
137+
### Test Cases:
138+
1. [Test Name]: [Input] → [Expected Output] → [How to verify]
139+
2. ...
140+
### Success Criteria: ALL test cases pass
141+
### How to Execute: [Exact commands/steps]
142+
\`\`\`
143+
144+
### Execution & Evidence Requirements
145+
146+
| Phase | Action | Required Evidence |
147+
|-------|--------|-------------------|
148+
| **Build** | Run build command | Exit code 0, no errors |
149+
| **Test** | Execute test suite | All tests pass (screenshot/output) |
150+
| **Manual Verify** | Test the actual feature | Demonstrate it works (describe what you observed) |
151+
| **Regression** | Ensure nothing broke | Existing tests still pass |
152+
153+
**WITHOUT evidence = NOT verified = NOT done.**
154+
155+
### TDD Workflow (when test infrastructure exists)
156+
157+
1. **SPEC**: Define what "working" means (success criteria above)
158+
2. **RED**: Write failing test → Run it → Confirm it FAILS
159+
3. **GREEN**: Write minimal code → Run test → Confirm it PASSES
160+
4. **REFACTOR**: Clean up → Tests MUST stay green
161+
5. **VERIFY**: Run full test suite, confirm no regressions
162+
6. **EVIDENCE**: Report what you ran and what output you saw
163+
164+
### Verification Anti-Patterns (BLOCKING)
165+
166+
| Violation | Why It Fails |
167+
|-----------|--------------|
168+
| "It should work now" | No evidence. Run it. |
169+
| "I added the tests" | Did they pass? Show output. |
170+
| "Fixed the bug" | How do you know? What did you test? |
171+
| "Implementation complete" | Did you verify against success criteria? |
172+
| Skipping test execution | Tests exist to be RUN, not just written |
173+
174+
**CLAIM NOTHING WITHOUT PROOF. EXECUTE. VERIFY. SHOW EVIDENCE.**
124175
125176
## ZERO TOLERANCE FAILURES
126177
- **NO Scope Reduction**: Never make "demo", "skeleton", "simplified", "basic" versions - deliver FULL implementation

0 commit comments

Comments
 (0)