Back to Skills
    šŸ¦ž

    smart-model-switching

    Auto-route tasks to the cheapest Claude model

    By @millibus
    View on GitHub
    SKILL.md
    ---
    name: smart-model-switching
    description: >-
      Auto-route tasks to the cheapest Claude model that works correctly.
      Three-tier progression: Haiku → Sonnet → Opus. Classify before responding.
      HAIKU (default): factual Q&A, greetings, reminders, status checks, lookups,
      simple file ops, heartbeats, casual chat, 1-2 sentence tasks.
      ESCALATE TO SONNET: code >10 lines, analysis, comparisons, planning, reports,
      multi-step reasoning, tables, long writing >3 paragraphs, summarization,
      research synthesis, most user conversations.
      ESCALATE TO OPUS: architecture decisions, complex debugging, multi-file
      refactoring, strategic planning, nuanced judgment, deep research, critical
      production decisions. Rule: If a human needs >30 seconds of focused thinking,
      escalate. If Sonnet struggles with complexity, go to Opus. Save 50-90% on
      API costs by starting cheap and escalating only when needed.
    author: "OpenClaw Community"
    version: 1.0.0
    homepage: https://clawhub.com
    metadata:
      openclaw:
        emoji: "šŸ’°"
    ---
    
    # Smart Model Switching
    
    **Three-tier Claude routing: Haiku → Sonnet → Opus**
    
    Start with the cheapest model. Escalate only when needed. Save 50-90% on API costs.
    
    ## The Golden Rule
    
    > If a human would need more than 30 seconds of focused thinking, escalate from Haiku to Sonnet.
    > If the task involves architecture, complex tradeoffs, or deep reasoning, escalate to Opus.
    
    ## Cost Reality
    
    | Model | Input | Output | Relative Cost |
    |-------|-------|--------|---------------|
    | Haiku | \$0.25/M | \$1.25/M | 1x (baseline) |
    | Sonnet | \$3.00/M | \$15.00/M | 12x |
    | Opus | \$15.00/M | \$75.00/M | 60x |
    
    **Bottom line:** Wrong model selection wastes money OR time. Haiku for simple, Sonnet for standard, Opus for complex.
    
    ---
    
    ## šŸ’š HAIKU — Default for Simple Tasks
    
    **Stay on Haiku for:**
    - Factual Q&A — "what is X", "who is Y", "when did Z"
    - Quick lookups — definitions, unit conversions, short translations
    - Status checks — calendar, file reads, session monitoring
    - Heartbeats — periodic checks, HEARTBEAT_OK responses
    - Memory & reminders — "remember this", "remind me to..."
    - Casual conversation — greetings, small talk, acknowledgments
    - Simple file ops — read, list, basic writes
    - One-liner tasks — anything answerable in 1-2 sentences
    
    ### NEVER do these on Haiku
    - āŒ Write code longer than 10 lines
    - āŒ Create comparison tables
    - āŒ Write more than 3 paragraphs
    - āŒ Do multi-step analysis
    - āŒ Write reports or proposals
    
    ---
    
    ## šŸ’› SONNET — Standard Work (The Workhorse)
    
    **Escalate to Sonnet for:**
    
    ### Code & Technical
    - Code generation — write functions, build features, scripts
    - Code review — PR reviews, quality checks
    - Debugging — standard bug investigation
    - Documentation — README, comments, user guides
    
    ### Analysis & Planning  
    - Analysis & evaluation — compare options, assess trade-offs
    - Planning — project plans, roadmaps, task breakdowns
    - Research synthesis — combining multiple sources
    - Multi-step reasoning — "first... then... finally"
    
    ### Writing & Content
    - Long-form writing — reports, proposals, articles (>3 paragraphs)
    - Creative writing — blog posts, descriptions, copy
    - Summarization — long documents, transcripts
    - Structured output — tables, outlines, formatted docs
    
    ---
    
    ## ā¤ļø OPUS — Complex Reasoning Only
    
    **Escalate to Opus for:**
    
    ### Architecture & Design
    - System architecture decisions
    - Major codebase refactoring
    - Design pattern selection with tradeoffs
    - Database schema design
    
    ### Deep Analysis
    - Complex debugging (multi-file, race conditions)
    - Security reviews
    - Performance optimization strategy
    - Root cause analysis of subtle bugs
    
    ### Strategic & Creative
    - Strategic planning — business decisions, roadmaps
    - Nuanced judgment — ethics, ambiguity, competing values
    - Deep research — comprehensive multi-source analysis
    
    ---
    
    ## šŸ”„ Implementation
    
    ### For Subagents
    \`\`\`javascript
    // Routine monitoring
    sessions_spawn(task="Check backup status", model="haiku")
    
    // Standard code work  
    sessions_spawn(task="Build the REST API endpoint", model="sonnet")
    
    // Architecture decisions
    sessions_spawn(task="Design the database schema for multi-tenancy", model="opus")
    \`\`\`
    
    ### For Cron Jobs
    \`\`\`json
    {
      "payload": {
        "kind": "agentTurn",
        "model": "haiku"
      }
    }
    \`\`\`
    Always use Haiku for cron unless the task genuinely needs reasoning.
    
    ---
    
    ## šŸ“Š Quick Decision Tree
    
    \`\`\`
    Is it a greeting, lookup, status check, or 1-2 sentence answer?
      YES → HAIKU
      NO ↓
    
    Is it code, analysis, planning, writing, or multi-step?
      YES → SONNET  
      NO ↓
    
    Is it architecture, deep reasoning, or critical decision?
      YES → OPUS
      NO → Default to SONNET, escalate if struggling
    \`\`\`
    
    ---
    
    ## šŸ“‹ Quick Reference Card
    
    \`\`\`
    ā”Œā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”
    │                  SMART MODEL SWITCHING                      │
    │                  Haiku → Sonnet → Opus                      │
    ā”œā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”¤
    │  šŸ’š HAIKU (cheapest)                                        │
    │  • Greetings, status checks, quick lookups                  │
    │  • Factual Q&A, definitions, reminders                      │
    │  • Simple file ops, 1-2 sentence answers                    │
    ā”œā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”¤
    │  šŸ’› SONNET (standard)                                       │
    │  • Code > 10 lines, debugging                               │
    │  • Analysis, comparisons, planning                          │
    │  • Reports, proposals, long writing                         │
    ā”œā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”¤
    │  ā¤ļø OPUS (complex)                                          │
    │  • Architecture decisions                                   │
    │  • Complex debugging, multi-file refactoring                │
    │  • Strategic planning, deep research                        │
    ā”œā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”¤
    │  šŸ’” RULE: If a human needs > 30 sec thinking → escalate     │
    │  šŸ’° COST: Haiku 1x → Sonnet 12x → Opus 60x                  │
    ā””ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”€ā”˜
    \`\`\`
    
    ---
    
    *Built for Claude-only setups with Haiku, Sonnet, and Opus.*
    *Inspired by save-money skill, extended with three-tier progression.*