Automated Browser Interaction

Execute complex web automation and UI testing directly from your Sypha-enhanced VS Code.

Sypha integrates browser automation, allowing the AI to interact with live websites without leaving your development environment. This capability is useful for UI testing, automating repetitive web tasks, and verifying deployments through screenshots.

[!IMPORTANT] This feature works best with high-reasoning models such as Claude 3.5/3.7 Sonnet or Claude 4.

Core Functionality

Sypha's integrated browser engine provides several automated capabilities:

  • Instant Access: Launches a session automatically upon receiving a URL-related request.
  • Visual Validation: Captures high-resolution screenshots for UI analysis.
  • Interactive Control: Allows the AI to navigate, interact with forms, and trigger page elements.
  • Background Execution: Operates efficiently in a headless state to minimize distraction.

Practical Examples

Direct Sypha to perform web-based tasks using natural language:

  • "Launch the browser and navigate to our staging dashboard."
  • "Verify that our primary site at https://sypha.ai is loading as expected."
  • "Access http://localhost:3000 and confirm the navigation menu is responsive."

Supported Actions

The browser_action utility manages the following operations:

  • launch: Initializes a browser session at a specified destination.
  • click: Performs precise interactions at defined screen coordinates.
  • type: Injects text into targeted input fields.
  • scroll_down / scroll_up: Moves through page content for full visibility.
  • close: Terminates the active session to conserve system resources.

Customization

Manage your automation preferences in Settings (gear icon) -> Browser / Computer Use.

  • Toggle Utility: Enable or disable the browser engine entirely.
  • Resolution Profiles: Switch between Desktop, Tablet, and Mobile viewports for responsive testing.
  • Image Compression: Balance screenshot detail against token usage (default: 75%).
  • External Integration: Synchronize Sypha with an existing Chrome instance for persistent logins or specific DevContainer environments.

Configurable Remote Debugging

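To have Sypha drive a visible Chrome window, start Chrome with the remote debugging flag. The --user-data-dir option points Chrome at a separate profile directory so the debugging session does not share your default profile: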

# macOS Example
/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome --remote-debugging-port=9222 --user-data-dir=/tmp/chrome-debug
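
Before pointing Sypha at the running instance, it can help to confirm that Chrome is actually listening on the debugging port. The following is a minimal sketch, not part of Sypha itself: the helper names devtools_url and check_devtools are hypothetical, and it assumes curl is installed and that Chrome was started on port 9222 as in the command above. It queries /json/version, the HTTP endpoint Chrome exposes when remote debugging is enabled.

```shell
#!/usr/bin/env bash

# Hypothetical helper: build the DevTools version-endpoint URL.
# Defaults to localhost:9222 (the port used in the example above).
devtools_url() {
  local host="${1:-localhost}"
  local port="${2:-9222}"
  printf 'http://%s:%s/json/version\n' "$host" "$port"
}

# Hypothetical helper: succeeds only if Chrome answers on the debugging port.
check_devtools() {
  curl --silent --fail --max-time 2 "$(devtools_url "$@")" > /dev/null
}

if check_devtools localhost 9222; then
  echo "Chrome is accepting remote debugging connections on port 9222"
else
  echo "No response on port 9222; is Chrome running with --remote-debugging-port?"
fi
```

If the check fails, make sure no other Chrome process was already running under the same profile directory when you launched the command, since an existing instance may ignore the debugging flag.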
