Agents

Some tasks are too complex for a single prompt. Organizing thousands of files into folders, conducting deep research across multiple sources, building a feature across a codebase: all of these require planning, iteration, and persistence. Agents let you hand off complex work to an AI that can break it down, execute it step by step, and keep working even after you close the window.

What is an Agent?

An Agent is an autonomous AI mode that executes multi-step tasks with built-in planning and issue tracking. Unlike regular chat (which responds to one message at a time), Agents can:

Break down complex requests into trackable issues
Generate execution plans with up to 10 steps per issue
Perform file operations (read, write, edit, search, organize)
Use tools to browse the web, search online, and automate your Mac
Leverage skills for specialized methodologies
Ask for clarification when tasks are ambiguous
Continue working in the background

Think of an Agent as a capable assistant you can delegate tasks to. You describe what you want, it creates a plan, and then executes that plan while you focus on other work.

Features

Issue Tracking

Every Agent task is organized into issues with clear status tracking:

Status	Description
Open	Task is queued and waiting to be worked on
In Progress	Agent is actively working on this task
Blocked	Task is waiting on a dependency or clarification
Completed	Task finished successfully
Cancelled	Task was stopped by the user

Each issue can have:

Priority — High, Medium, or Low
Dependencies — Issues that must complete first
Description — Detailed explanation of what needs to be done
Plan — AI-generated steps to complete the task
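
As a rough illustration of how statuses, priorities, and dependencies fit together, the sketch below models an issue in Python. The names `Issue`, `Status`, `Priority`, and `ready_issues` are hypothetical and are not Osaurus's internal API. Picking the highest-priority ready issue first is also an assumption, not documented behavior.

```python
from dataclasses import dataclass, field
from enum import Enum


class Status(Enum):
    OPEN = "Open"
    IN_PROGRESS = "In Progress"
    BLOCKED = "Blocked"
    COMPLETED = "Completed"
    CANCELLED = "Cancelled"


class Priority(Enum):
    HIGH = 1
    MEDIUM = 2
    LOW = 3


@dataclass
class Issue:
    id: str
    description: str
    priority: Priority = Priority.MEDIUM
    status: Status = Status.OPEN
    dependencies: list[str] = field(default_factory=list)  # ids that must complete first
    plan: list[str] = field(default_factory=list)          # generated steps


def ready_issues(issues: list[Issue]) -> list[Issue]:
    """Return open issues whose dependencies are all completed, highest priority first."""
    done = {issue.id for issue in issues if issue.status is Status.COMPLETED}
    ready = [
        issue
        for issue in issues
        if issue.status is Status.OPEN and all(dep in done for dep in issue.dependencies)
    ]
    return sorted(ready, key=lambda issue: issue.priority.value)


issues = [
    Issue("scan", "Scan the folder", status=Status.COMPLETED),
    Issue("move", "Move files", priority=Priority.LOW, dependencies=["scan"]),
    Issue("report", "Write a summary", dependencies=["move"]),
    Issue("dupes", "Flag duplicates", priority=Priority.HIGH, dependencies=["scan"]),
]
ready_ids = [issue.id for issue in ready_issues(issues)]
print(ready_ids)
```

Here `report` stays out of the ready list because it depends on `move`, which has not completed yet.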

Parallel Tasks

Run multiple Agent tasks at the same time. While one Agent organizes your photo library, another can research vacation destinations; while one refactors code, another can write documentation.

Each task runs independently
Progress is tracked separately
Results are collected when all tasks complete
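
The three properties above can be illustrated with Python's standard thread pool. This is only an analogy for the behavior described, not how Osaurus schedules Agents; `run_task` and the task names are made up for the example.

```python
from concurrent.futures import ThreadPoolExecutor


def run_task(name: str, steps: list[str]) -> dict:
    """Stand-in for one Agent task: it only records each step as completed."""
    completed = []
    for step in steps:
        completed.append(step)  # a real task would do its work here
    return {"task": name, "completed": completed}


tasks = {
    "organize-photos": ["scan library", "group by date"],
    "research-trips": ["search destinations", "compare prices", "write notes"],
}

with ThreadPoolExecutor() as pool:
    # Each task is submitted independently and tracks its own progress.
    futures = {name: pool.submit(run_task, name, steps) for name, steps in tasks.items()}
    # result() blocks until that task finishes, so this gathers everything.
    results = {name: future.result() for name, future in futures.items()}

print(sorted(results))
```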

Execution Planning

When you create a task, the Agent generates a step-by-step plan before executing:

Analysis — Agent examines the request and working directory
Planning — Creates up to 10 concrete steps
Review — You can approve, modify, or reject the plan
Execution — Agent works through each step

Plans are transparent—you always see what the Agent intends to do before it does it.
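
A minimal sketch of the review gate described above, assuming a plan is simply a list of step descriptions. The function names are hypothetical; the only detail taken from this page is the limit of 10 steps and the rule that nothing runs before approval.

```python
MAX_STEPS = 10  # per-issue limit stated in this document


def validate_plan(steps: list[str]) -> list[str]:
    """Reject empty plans and plans longer than the documented limit."""
    if not steps:
        raise ValueError("a plan needs at least one step")
    if len(steps) > MAX_STEPS:
        raise ValueError(f"plan has {len(steps)} steps; the limit is {MAX_STEPS}")
    return list(steps)


def execute_plan(steps: list[str], approved: bool) -> list[str]:
    """Run steps in order, but only after the user has approved the plan."""
    if not approved:
        raise PermissionError("plan was not approved; nothing was executed")
    plan = validate_plan(steps)
    return [f"done: {step}" for step in plan]  # placeholder for real work


plan = ["scan folder", "create subfolders", "move files"]
results = execute_plan(plan, approved=True)
print(results)
```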

Working Directory

Select a folder for the Agent to work in:

Folder Detection — Automatically recognizes folder structure and file types
Scoped Operations — File operations are limited to the selected directory
Context Awareness — Agent understands folder organization and naming conventions
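
One common way to enforce scoped operations is to resolve every requested path and refuse anything that lands outside the selected folder. The sketch below shows that general technique in Python (3.9 or later, for `Path.is_relative_to`); it is an assumption about how such a check could work, not Osaurus's actual code, and `resolve_in_scope` and the demo paths are hypothetical.

```python
from pathlib import Path


def resolve_in_scope(working_dir: Path, requested: str) -> Path:
    """Return the absolute target path, or raise if it escapes working_dir."""
    root = working_dir.resolve()
    # Joining with an absolute path or with ".." segments can leave the root,
    # so resolve first and compare afterwards.
    target = (root / requested).resolve()
    if not target.is_relative_to(root):
        raise PermissionError(f"{requested!r} is outside {root}")
    return target


root = Path("/tmp/agent-demo")  # hypothetical working directory
inside = resolve_in_scope(root, "notes/todo.txt")

try:
    resolve_in_scope(root, "../secrets.txt")
    escaped = True
except PermissionError:
    escaped = False

print(inside.name, escaped)
```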

To set a working directory:

Open Agent Mode in the Chat window
Click Select Folder or drag a folder onto the window
The Agent now has context about your files

File Operations

Agents can interact with your filesystem to complete tasks:

Operation	Description
Read	View file contents to understand what's there
Write	Create new files (documents, notes, summaries)
Edit	Modify existing files with precise changes
Search	Find files and content matching patterns
Move	Organize files into folders
Rename	Batch rename files with consistent naming schemes

Every operation that changes your files (write, edit, move, rename) supports undo, so you can revert changes if something goes wrong.
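
Undo for file moves is often implemented by journaling each change and replaying the journal backwards. The following Python sketch shows that idea under stated assumptions: `MoveJournal` is a hypothetical class, it covers moves only, and it leaves any folders it created in place after an undo.

```python
import shutil
import tempfile
from pathlib import Path


class MoveJournal:
    """Records each move so the whole batch can be reverted in reverse order."""

    def __init__(self) -> None:
        self._moves: list[tuple[Path, Path]] = []

    def move(self, src: Path, dst: Path) -> None:
        dst.parent.mkdir(parents=True, exist_ok=True)
        shutil.move(str(src), str(dst))
        self._moves.append((src, dst))

    def undo_all(self) -> None:
        # Undo the most recent move first so earlier moves see the state they expect.
        while self._moves:
            src, dst = self._moves.pop()
            shutil.move(str(dst), str(src))


with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    original = root / "report.pdf"
    original.write_text("quarterly numbers")

    journal = MoveJournal()
    journal.move(original, root / "Documents" / "report.pdf")
    moved_ok = (root / "Documents" / "report.pdf").exists() and not original.exists()

    journal.undo_all()
    restored_ok = original.read_text() == "quarterly numbers"

print(moved_ok, restored_ok)
```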

Discovery

Agents automatically scan your folder to find:

File Types — Documents, images, spreadsheets, code, and more
Patterns — Naming conventions, folder structure, duplicates
Content — Keywords, dates, topics within files
Issues — Missing files, broken links, incomplete items

This discovery phase helps the Agent understand what you have and plan accordingly.
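
Duplicate detection, one of the patterns listed above, is typically done by comparing content hashes rather than file names. The sketch below illustrates that approach; `find_duplicates` is a hypothetical helper, and whether Osaurus uses hashing at all is not stated on this page. It reads each file fully into memory, which is fine for a demo but not for very large files.

```python
import hashlib
import tempfile
from collections import defaultdict
from pathlib import Path


def find_duplicates(folder: Path) -> list[list[Path]]:
    """Group files under folder that have byte-identical content."""
    by_digest: dict[str, list[Path]] = defaultdict(list)
    for path in sorted(folder.rglob("*")):
        if path.is_file():
            digest = hashlib.sha256(path.read_bytes()).hexdigest()
            by_digest[digest].append(path)
    return [paths for paths in by_digest.values() if len(paths) > 1]


with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "a.txt").write_text("same content")
    (root / "copy").mkdir()
    (root / "copy" / "a-copy.txt").write_text("same content")
    (root / "b.txt").write_text("different content")

    groups = [[p.name for p in group] for group in find_duplicates(root)]

print(groups)
```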

Clarification

When a task is ambiguous, the Agent pauses to ask questions rather than guessing:

"Should I organize photos by date or by event name?"
"I found 50 duplicate files. Should I move them to a 'Duplicates' folder or delete them?"
"The research has multiple angles. Should I focus on cost comparison or feature comparison?"

When you answer a clarification request, the Agent resumes with the updated context.

Background Execution

Tasks continue running after you close the Agent window:

Close the window and work on other things
Return later to check progress
Get notified when tasks complete (if notifications are enabled)

Long-running tasks don't require you to keep the window open.

Tools Integration

Agents have access to all your installed tools, enabling capabilities beyond file operations:

Tool Category	What Agents Can Do
Browser	Navigate websites, fill forms, take screenshots, extract data
Web Search	Search the internet, find articles, gather information
Git	Check status, view history, create commits
System	Interact with macOS, run scripts, automate workflows

Tools extend what Agents can accomplish—from researching topics online to automating repetitive Mac tasks.

Skills Integration

Agents can leverage your enabled skills for specialized methodologies:

Research Analyst — Structured approach to gathering and evaluating information
Debug Assistant — Systematic debugging methodology
Code Reviewer — Thorough code review with best practices
Custom Skills — Any skills you've created or imported

Skills give Agents domain expertise, making them more effective at specialized tasks.

Accessing Agents

Open Agent Mode from the Chat window:

Open the chat overlay with ⌘; or click the Osaurus menu bar icon
Click the Agent Mode tab at the top of the chat window
You're now in Agent Mode, ready to create tasks

Creating a Task

Enter Agent Mode — Click the Agent Mode tab in the chat window
Set Working Directory — Select the folder for file operations
Describe the Task — Type what you want to accomplish
Review the Plan — The Agent generates a step-by-step plan
Approve and Execute — Click Start to begin execution

Example Tasks

Everyday Task:

Organize my Downloads folder. Sort files by type into subfolders
(Documents, Images, Videos, Archives), and identify any duplicates.

Agent creates issues:

Scan Downloads folder and categorize all files by type
Create subfolders (Documents, Images, Videos, Archives, Other)
Move document files (.pdf, .doc, .txt) to Documents folder
Move image files (.jpg, .png, .gif) to Images folder
Move video files (.mp4, .mov) to Videos folder
Move archive files (.zip, .rar) to Archives folder
Identify and report duplicate files
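
The sorting rule behind these issues can be sketched as a lookup from file extension to subfolder. The extension lists come from the issues above; the function name `category_for` and the case-insensitive matching are assumptions made for the example.

```python
from pathlib import PurePath

# Extension lists taken from the example issues above.
CATEGORIES = {
    "Documents": {".pdf", ".doc", ".txt"},
    "Images": {".jpg", ".png", ".gif"},
    "Videos": {".mp4", ".mov"},
    "Archives": {".zip", ".rar"},
}


def category_for(filename: str) -> str:
    """Return the subfolder a file belongs in, or "Other" if nothing matches."""
    extension = PurePath(filename).suffix.lower()
    for category, extensions in CATEGORIES.items():
        if extension in extensions:
            return category
    return "Other"


for name in ["Report.PDF", "holiday.jpg", "clip.mov", "backup.zip", "notes"]:
    print(name, "->", category_for(name))
```

Files without a recognized extension fall through to "Other", which matches the fifth subfolder the Agent creates in the second issue.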

Developer Task:

Add user authentication to the Express app. Use JWT tokens,
create login and register endpoints, and add middleware to
protect the /api routes.

Agent creates issues:

Install dependencies (jsonwebtoken, bcrypt)
Create User model with email and password fields
Implement register endpoint with password hashing
Implement login endpoint with JWT generation
Create auth middleware for protected routes
Apply middleware to /api routes
Add error handling for auth failures

Each issue gets its own detailed plan, and the Agent works through them sequentially.
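
When issues depend on one another, a sequential order that respects those dependencies can be computed with a topological sort. The sketch below uses Python's standard `graphlib` (3.9 or later). The issue ids and the dependency edges are hypothetical, chosen to mirror the developer example; the page does not say which issues the Agent would actually mark as dependent.

```python
from graphlib import TopologicalSorter

# Maps each issue to the issues that must complete first (hypothetical edges).
depends_on = {
    "install-deps": set(),
    "user-model": {"install-deps"},
    "register-endpoint": {"user-model"},
    "login-endpoint": {"user-model"},
    "auth-middleware": {"login-endpoint"},
    "protect-api-routes": {"auth-middleware"},
    "auth-error-handling": {"register-endpoint", "login-endpoint"},
}

# static_order() yields every issue after all of its dependencies.
order = list(TopologicalSorter(depends_on).static_order())
print(order)
```

Several valid orders exist for this graph, so the exact sequence printed may vary; the only guarantee is that no issue appears before something it depends on.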

Managing Tasks

Viewing Progress

The Agent Mode interface shows:

Active Issues — Tasks currently being worked on
Completed Issues — Finished tasks with results
Pending Issues — Tasks waiting to start
Blocked Issues — Tasks waiting for clarification or dependencies

Pausing and Resuming

Click Pause to stop the Agent after the current step
Click Resume to continue execution
Pausing is useful when you need to make manual changes
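
Pausing "after the current step" implies that the pause flag is checked between steps, never in the middle of one. A minimal sketch of that loop, with a hypothetical `StepRunner` class and a callback that simulates the user clicking Pause while a step is running:

```python
from typing import Callable, Optional


class StepRunner:
    """Runs steps in order and checks the pause flag only between steps."""

    def __init__(self, steps: list[str],
                 on_step: Optional[Callable[["StepRunner", str], None]] = None) -> None:
        self.steps = list(steps)
        self.on_step = on_step
        self.next_index = 0
        self.paused = False
        self.completed: list[str] = []

    def run(self) -> None:
        """Start or resume execution from the first step that has not run yet."""
        self.paused = False
        while self.next_index < len(self.steps) and not self.paused:
            step = self.steps[self.next_index]
            self.completed.append(step)  # placeholder for real work
            self.next_index += 1
            if self.on_step:
                self.on_step(self, step)


def pause_during_edit(runner: StepRunner, step: str) -> None:
    # Simulates a Pause click that arrives while the "edit" step is running.
    if step == "edit":
        runner.paused = True


runner = StepRunner(["read", "edit", "test", "commit"], on_step=pause_during_edit)
runner.run()
after_pause = list(runner.completed)

runner.on_step = None  # no further pause requests
runner.run()           # Resume
after_resume = list(runner.completed)

print(after_pause, after_resume)
```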

Cancelling Tasks

Click Cancel on any issue to stop it
Completed steps are preserved
You can create a new task to continue from where it stopped

Viewing Results

After completion, each issue shows:

Files Modified — List of files that were changed
Steps Completed — Summary of what was done
Errors Encountered — Any issues that occurred
Undo Option — Revert all changes from this task

Use Cases

For Everyone

File Organization

Task: "Organize my Documents folder by year and category"

The Agent will:

Scan all files and identify creation dates
Create year-based folders (2024, 2025, etc.)
Create category subfolders (Work, Personal, Finance, etc.)
Move files to appropriate locations
Generate a summary of what was organized

Deep Research

Task: "Research the best laptops for video editing under $2000 and create a comparison document"

The Agent will:

Search the web for current laptop reviews and specifications
Visit manufacturer websites for detailed specs
Compare prices across retailers
Create a structured comparison document
Summarize pros and cons for each option

Content Compilation

Task: "Compile all my meeting notes from Q4 into a quarterly summary"

The Agent will:

Find all meeting note files from October-December
Read and extract key points from each
Organize by project or topic
Create a comprehensive summary document
Highlight action items and decisions

Web Automation

Task: "Check my favorite news sites and create a morning briefing document"

The Agent will:

Open and navigate to each news site
Extract top headlines and summaries
Compile everything into a formatted document
Save to your specified folder
Optionally include links to full articles

Photo Management

Task: "Sort my vacation photos by location and create albums"

The Agent will:

Scan photo metadata for location data
Group photos by city or landmark
Create folders for each location
Rename files with descriptive names
Identify and flag potential duplicates

For Developers

Building Features

Task: "Add a dark mode toggle to the settings page"

The Agent will:

Analyze your existing settings component
Create a theme context or state management
Add toggle UI to settings
Implement CSS variables or theme switching
Update components to use the theme

Refactoring Code

Task: "Convert all class components to functional components with hooks"

The Agent will:

Scan for class components
Create issues for each component
Convert state to useState
Convert lifecycle methods to useEffect
Update imports and exports

Debugging Issues

Task: "Fix the memory leak in the WebSocket connection handler"

The Agent will:

Analyze the WebSocket code
Identify potential leak sources
Check for missing cleanup in useEffect
Verify event listener removal
Test and validate the fix

Documentation

Task: "Generate JSDoc comments for all exported functions in src/utils"

The Agent will:

Find all exported functions
Analyze function signatures and implementation
Generate appropriate JSDoc comments
Add parameter and return type documentation

Agents vs Personas vs Skills

Aspect	Personas	Skills	Agents
Purpose	Define AI personality	Add domain expertise	Execute multi-step tasks
Scope	Chat behavior and tools	Instructions and context	Full task automation
Execution	Interactive conversation	Enhances responses	Autonomous background work
Best for	Different assistant modes	Specialized knowledge	Complex tasks across multiple files
Output	Conversational responses	Enhanced reasoning	Organized files and documents

When to use each:

Persona: You want a different AI personality (e.g., research-focused vs creative)
Skill: You need specialized methodology (e.g., research framework, writing style)
Agent: You have a complex task that requires multiple steps and file changes

Tips and Best Practices

Be specific — Detailed task descriptions yield better plans
Start small — Try simple tasks first to understand how Agents work
Review plans — Always check the generated plan before approving
Use working directories — Scoped operations are safer and more accurate
Break down large tasks — Multiple focused tasks work better than one massive task
Preview changes — Review what the Agent did before considering it final
Use undo — Don't hesitate to revert if something goes wrong
Provide context — Mention relevant files, folders, or patterns in your task description

Limitations

Max 10 steps per issue — Very complex tasks may need to be split
Single folder — Each Agent session works in one working directory
Tool availability — Web and automation features require the relevant tools to be installed
Review recommended — Always review results before relying on them
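
To work within the 10-step limit, a long list of steps can be split into several issues of at most 10 steps each. The helper below is a hypothetical illustration of that splitting, not a feature of the app:

```python
def split_steps(steps: list[str], max_steps: int = 10) -> list[list[str]]:
    """Split a long step list into consecutive chunks of at most max_steps."""
    if max_steps < 1:
        raise ValueError("max_steps must be at least 1")
    return [steps[i:i + max_steps] for i in range(0, len(steps), max_steps)]


steps = [f"convert component {n}" for n in range(1, 24)]  # 23 steps
chunks = split_steps(steps)
sizes = [len(chunk) for chunk in chunks]
print(sizes)
```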

For more about AI capabilities, see Personas and Skills.
