Skip to content

mangiapanejohn-dev/Resonix-GUI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Resonix GUI Agent

Resonix Logo
AI-Powered Desktop Automation Agent


What is Resonix GUI Agent?

Resonix GUI Agent is an autonomous AI agent that can see your screen, understand what's happening, and help you with desktop tasks - automatically, proactively, and intelligently.

Key Features

  • 👾 Autonomous Observation - Watches your screen in the background and detects when you need help
  • 👁 AI Vision - Uses GPT-4V, Claude Vision, or local LLaVA to understand screen content
  • 🖱️ Full Desktop Control - Click, type, scroll, hotkeys, drag & drop
  • 🔄 Multi-Display Support - Works with multiple monitors and HiDPI displays
  • 📝 OCR Built-in - Reads text from any screen
  • 🧠 Smart Decision Making - Knows when to help and when to wait

How It Works

Resonix starts
    ↓
GUI Agent automatically begins observation 👾👾👾
    ↓
Every 15 seconds → Analyzes screen
    ↓
Smart scene detection:
    - Using other browser → "Use Resonix instead?"
    - Search intent → "I can help search"
    - Error detected → "I can fix that"
    - Working → waits quietly
    ↓
Proactively offers help when needed

Quick Start

# Install
npm install -g resonix

# Start (GUI Agent is built-in and auto-starts!)
resonix

# Or run GUI Agent directly
resonix gui-agent --mode auto

Commands

# Auto mode (AI makes decisions)
resonix gui-agent --mode auto

# Interactive mode (you give commands)
resonix gui-agent --mode interactive

# Disable vision
resonix gui-agent --no-vision

# Custom interval
resonix gui-agent --interval 3000

Environment Variables

# For AI Vision (choose one)
OPENAI_API_KEY=sk-...      # GPT-4V (best)
ANTHROPIC_API_KEY=sk-...   # Claude Vision

# For local (free)
# Just run Ollama with LLaVA model

Requirements

  • macOS (Windows/Linux coming soon)
  • For full functionality: Accessibility permission (auto-prompts)

Architecture

src/agents/tools/gui-agent/
├── smart-agent.ts       # Autonomous AI decision making
├── perception.ts       # Screen capture, OCR, multi-display
├── accessibility.ts    # macOS UI element detection
├── auto-vision.ts     # AI vision provider
├── executor/           # Cross-platform mouse/keyboard control
└── index.ts           # Built-in integration

Contributing

Open source under MIT license. Contributions welcome!


License

MIT


Resonix - AI That Actually Helps

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors