Resonix GUI Agent

AI-Powered Desktop Automation Agent

What is Resonix GUI Agent?

Resonix GUI Agent is an autonomous AI agent that can see your screen, understand what's happening, and help you with desktop tasks - automatically, proactively, and intelligently.

Key Features

👾 Autonomous Observation - Watches your screen in the background and detects when you need help
👁 AI Vision - Uses GPT-4V, Claude Vision, or local LLaVA to understand screen content
🖱️ Full Desktop Control - Click, type, scroll, hotkeys, drag & drop
🔄 Multi-Display Support - Works with multiple monitors and HiDPI displays
📝 OCR Built-in - Reads text from any screen
🧠 Smart Decision Making - Knows when to help and when to wait

How It Works

Resonix starts
    ↓
GUI Agent automatically begins observation 👾👾👾
    ↓
Every 15 seconds → Analyzes screen
    ↓
Smart scene detection:
    - Using other browser → "Use Resonix instead?"
    - Search intent → "I can help search"
    - Error detected → "I can fix that"
    - Working → waits quietly
    ↓
Proactively offers help when needed

Quick Start

# Install
npm install -g resonix

# Start (GUI Agent is built-in and auto-starts!)
resonix

# Or run GUI Agent directly
resonix gui-agent --mode auto

Commands

# Auto mode (AI makes decisions)
resonix gui-agent --mode auto

# Interactive mode (you give commands)
resonix gui-agent --mode interactive

# Disable vision
resonix gui-agent --no-vision

# Custom interval
resonix gui-agent --interval 3000

Environment Variables

# For AI Vision (choose one)
OPENAI_API_KEY=sk-...      # GPT-4V (best)
ANTHROPIC_API_KEY=sk-...   # Claude Vision

# For local (free)
# Just run Ollama with LLaVA model

Requirements

macOS (Windows/Linux coming soon)
For full functionality: Accessibility permission (auto-prompts)

Architecture

src/agents/tools/gui-agent/
├── smart-agent.ts       # Autonomous AI decision making
├── perception.ts       # Screen capture, OCR, multi-display
├── accessibility.ts    # macOS UI element detection
├── auto-vision.ts     # AI vision provider
├── executor/           # Cross-platform mouse/keyboard control
└── index.ts           # Built-in integration

Contributing

Open source under MIT license. Contributions welcome!

License

MIT

Resonix - AI That Actually Helps

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
src		src
.gitignore		.gitignore
README.md		README.md
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
tsconfig.json		tsconfig.json
tsconfig.plugin-sdk.dts.json		tsconfig.plugin-sdk.dts.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Resonix GUI Agent

What is Resonix GUI Agent?

Key Features

How It Works

Quick Start

Commands

Environment Variables

Requirements

Architecture

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Resonix GUI Agent

What is Resonix GUI Agent?

Key Features

How It Works

Quick Start

Commands

Environment Variables

Requirements

Architecture

Contributing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages