Back to Skills
    🦞

    screen-monitor

    Dual-mode screen sharing and analysis.

    By @emasoudy
    View on GitHub
    SKILL.md
    ---
    name: screen-monitor
    description: Dual-mode screen sharing and analysis. Model-agnostic (Gemini/Claude/Qwen3-VL).
    metadata: {"clawdbot":{"emoji":"🖥️","requires":{"model_features":["vision"]}}}
    ---
    
    # Screen Monitor
    
    This skill provides two ways for the agent to see and interact with your screen.
    
    ## 🟢 Path A: Fast Share (WebRTC)
    *Best for: Quick visual checks, restricted browsers, or non-technical environments.*
    
    ### Tools
    - **`screen_share_link`**: Generates a local WebRTC portal URL.
    - **`screen_analyze`**: Captures the current frame from the portal and analyzes it with vision.
    
    **Usage:**
    ```bash
    # Get the link
    bash command:"{baseDir}/references/get-share-url.sh"
    
    # Analyze
    bash command:"{baseDir}/references/screen-analyze.sh"
    ```
    
    ---
    
    ## 🔵 Path B: Full Control (Browser Relay)
    *Best for: Deep debugging, UI automation, and clicking/typing in tabs.*
    
    ### Setup
    1. Run `clawdbot browser extension install`.
    2. Load the unpacked extension from `clawdbot browser extension path`.
    3. Click the Clawdbot icon in your Chrome toolbar to **Attach**.
    
    ### Tools
    - **`browser action:snapshot`**: Take a precise screenshot of the attached tab.
    - **`browser action:click`**: Interact with elements (requires `profile="chrome"`).
    
    ---
    
    ## Technical Details
    - **Port**: 18795 (WebRTC Backend)
    - **Files**: 
      - `web/screen-share.html`: The sharing portal.
      - `references/backend-endpoint.js`: Frame storage server.