[Bug]: ⚠️Cron isolated tasks without failure limit trigger infinite retry loop → API cooldown

## Summary
When a cron task with `sessionTarget="isolated"` fails, the cron scheduler retries it infinitely. Each retry spawns a new isolated agent session and calls the LLM API, causing rapid rate_limit cooldown that freezes the entire OpenClaw instance.

## Steps to Reproduce

1. Create a cron task that will fail:
```json
{
  "name": "Test failing isolated task",
  "schedule": { "kind": "at", "atMs": 1770165091000 },
  "sessionTarget": "isolated",
  "payload": {
    "kind": "agentTurn",
    "message": "Execute: pkill -f 'process_that_does_not_exist_xyz'"
  }
}
```

2. Wait for the scheduled time
3. Task fails because the process doesn't exist
4. Observe the logs:
   - Cron immediately retries the failed task
   - Each retry creates a new isolated session
   - Each session calls the LLM API
   - Rapid 429 rate_limit errors flood the logs
   - API enters cooldown: "Provider anthropic is in cooldown (all profiles unavailable)"
   - All sessions become unresponsive

## Current Behavior
```
Error: All models failed (2): anthropic/claude-haiku-4-5: Provider anthropic is in cooldown (all profiles unavailable) (rate_limit) | anthropic/claude-opus-4-5: Provider anthropic is in cooldown (all profiles unavailable) (rate_limit)
```
This error repeats indefinitely, preventing all message processing.

## Expected Behavior
1. Cron tasks should **NOT** infinitely retry on failure
2. After N failures (e.g., 3), the task should:
   - Be automatically disabled/marked as failed
   - Stop retrying
   - Notify the user with error details
3. Failing tasks should **NOT** trigger full provider cooldown
4. Other sessions should remain responsive

## Impact
- **Severity:** CRITICAL 🔴
- **Affected users:** Anyone using isolated cron tasks (agentTurn payload)
- **Trigger:** Creating duplicate cron tasks or tasks with invalid commands
- **Recovery:** Manual intervention in web UI to disable the failed task

## Example Log Output
```
18:31:31 PM - cron task 32982bf7-... started (isolated)
18:31:31 PM - agentTurn failed (exit code 1)
18:31:32 PM - cron retry attempt 1
18:31:33 PM - HTTP 429 rate_limit_error
18:31:34 PM - cron retry attempt 2
18:31:34 PM - HTTP 429 rate_limit_error
18:31:35 PM - Provider anthropic is in cooldown
18:31:35 PM - [No response] API frozen
```

## Proposed Fix

### Option A: Failure limit + auto-disable
```typescript
// cron/scheduler.ts
const MAX_CONSECUTIVE_FAILURES = 3;

if (job.state.consecutiveFailures >= MAX_CONSECUTIVE_FAILURES) {
  job.enabled = false;
  logger.error(`Job ${job.id} disabled after ${MAX_CONSECUTIVE_FAILURES} failures`);
  notifyUser(`Cron job "${job.name}" disabled: max retries exceeded`);
  return;
}
```

### Option B: Exponential backoff on isolated tasks
```typescript
// cron/scheduler.ts
if (sessionTarget === "isolated") {
  const delay = Math.min(
    1000 * Math.pow(2, job.state.consecutiveFailures),
    3600000 // max 1 hour
  );
  scheduleRetry(job, delay);
}
```

### Option C: Rate limit circuit breaker
```typescript
// If a single cron task triggers rate_limit, pause all cron execution
// until rate limit clears (rather than keep hammering the API)
```

## Environment
- OpenClaw version: 2026-01-31
- Node: v22.22.0
- OS: macOS 24.6.0 (arm64)
- Provider: Anthropic

## Additional Context
This mirrors the documented issues #5159 (no exponential backoff) and #5744 (provider-wide cooldown), but is specific to the cron scheduler's handling of isolated task failures.

## Related Issues
- #5159 - No exponential backoff on 429 rate limit errors
- #5744 - Per-model rate limit triggers full provider cooldown
- #4410 - Auto-restart gateway on stuck sessions
## Logs or screenshots

Paste relevant logs or add screenshots (redact secrets).

<img width="624" height="1052" alt="Image" src="https://github.com/user-attachments/assets/06b16afc-a982-4c09-ac22-ea3a8827101b" />

<img width="1229" height="873" alt="Image" src="https://github.com/user-attachments/assets/4ea953b2-950f-4037-810f-38586dc5a1f2" />

<img width="1035" height="1217" alt="Image" src="https://github.com/user-attachments/assets/55b3d87b-450b-4347-a25d-0afff85888c7" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Bug]: ⚠️Cron isolated tasks without failure limit trigger infinite retry loop → API cooldown #8520

Summary

Steps to Reproduce

Current Behavior

Expected Behavior

Impact

Example Log Output

Proposed Fix

Option A: Failure limit + auto-disable

Option B: Exponential backoff on isolated tasks

Option C: Rate limit circuit breaker

Environment

Additional Context

Related Issues

Logs or screenshots

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

[Bug]: ⚠️Cron isolated tasks without failure limit trigger infinite retry loop → API cooldown #8520

Description

Summary

Steps to Reproduce

Current Behavior

Expected Behavior

Impact

Example Log Output

Proposed Fix

Option A: Failure limit + auto-disable

Option B: Exponential backoff on isolated tasks

Option C: Rate limit circuit breaker

Environment

Additional Context

Related Issues

Logs or screenshots

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions