Agent Registry
Dexto comes with a curated collection of pre-built agents ready to use for common tasks. Each agent is optimized with specific tools, system prompts, and LLM configurations for its domain.
Want to add your Agent to this registry? Check out our Community Contribution Guide for step-by-step instructions.
Quick Start
# List all available agents
dexto list-agents
# Install an agent
dexto install <agent-name>
# Use an installed agent
dexto --agent <agent-name>
For detailed installation instructions, see the Installing Custom Agents guide.
Agent Catalog Overview
| Agent | Category | Best For | LLM |
|---|---|---|---|
| Podcast Agent | Content Creation | Multi-speaker audio, podcast intros | OpenAI GPT-5 Mini |
| Music Agent | Content Creation | Music composition, audio processing | OpenAI GPT-5 Mini |
| Nano Banana Agent | Content Creation | Image generation & editing | Google Gemini 2.5 Flash |
| Sora Video Agent | Content Creation | AI video generation | OpenAI GPT-5 Mini |
| Image Editor Agent | Content Creation | Image manipulation, face detection | OpenAI GPT-5 Mini |
| Coding Agent | Development | Software development, debugging | Anthropic Claude Haiku 4.5 |
| Database Agent | Data & Analysis | SQL queries, database operations | OpenAI GPT-5 Mini |
| Talk2PDF Agent | Data & Analysis | PDF analysis, document conversation | OpenAI GPT-5 Mini |
| GitHub Agent | DevOps | GitHub operations, PR analysis | OpenAI GPT-5 Mini |
| Product Researcher | Research | Product naming, branding research | Anthropic Claude Sonnet 4.5 |
| Triage Agent | Multi-Agent | Customer support routing | OpenAI GPT-5 |
| Default Agent | General Purpose | General tasks, file operations | Any |
Detailed Agent Information
Content Creation
🎙️ Podcast Agent
ID: podcast-agent
Best For: Multi-speaker audio content, podcast intros, voice synthesis
Create professional podcast content with realistic multi-speaker audio using Google Gemini TTS.
Key Features:
- Multi-speaker podcast generation
- High-quality voice synthesis
- Audio editing and production
- Various podcast format support
Example Use:
dexto --agent podcast-agent "Generate an intro for a tech podcast about AI"
Recommended LLM: OpenAI GPT-5 Mini
Requires: GOOGLE_GENERATIVE_AI_API_KEY
🎵 Music Agent
ID: music-agent
Best For: Music creation, audio processing, sound design
AI agent specialized in music composition and audio manipulation.
Key Features:
- Music composition and generation
- Audio processing and effects
- Sound design capabilities
- Multiple musical styles
Example Use:
dexto --agent music-agent "Create a lo-fi chill beat for studying"
Recommended LLM: OpenAI GPT-5 Mini
Tutorial: Music Agent Tutorial
🖼️ Nano Banana Agent
ID: nano-banana-agent
Best For: Image generation, editing, object removal, style transfer
Advanced image generation and editing using Google's Nano Banana (Gemini 2.5 Flash Image) with near-instantaneous processing.
Key Features:
- Image Generation – Create stunning images from text prompts with various styles
- Image Editing – Modify existing images using natural language
- Object Removal – Remove unwanted objects while preserving backgrounds
- Background Changes – Replace backgrounds seamlessly
- Image Fusion – Combine multiple images creatively
- Style Transfer – Apply artistic styles with character consistency
- Figurine Effect – Nano Banana's signature feature
- Multi-image Processing – Complex compositions support
Capabilities:
- Near-instantaneous processing with high visual coherence
- Character consistency across multiple edits
- Scene preservation with seamless blending
- SynthID watermarks for safety
- Supports JPG, PNG, WebP, GIF (max 20MB per image)
Example Use:
dexto --agent nano-banana-agent "Create a futuristic cityscape with flying cars"
dexto --agent nano-banana-agent "Remove the person from this photo"
Recommended LLM: Google Gemini 2.5 Flash (Required)
Requires: GOOGLE_GENERATIVE_AI_API_KEY
Demo: Image Generation Example
🎬 Sora Video Agent
ID: sora-video-agent
Best For: AI video generation, video remixing, video management
Create AI-generated videos using OpenAI's Sora technology with comprehensive video creation and management capabilities.
Key Features:
- Video Generation – Create videos from text prompts with custom duration, resolution, and style
- Reference-Based Creation – Use images or videos as reference for precise generation
- Video Management – Monitor progress, list all videos, organize creations
- Video Remixing – Create variations and extensions with new prompts
- File Management – Auto-download and organize generated videos
- Quality Control – Delete unwanted videos and manage storage
Supported Specifications:
- Durations: 4s, 8s, 16s, 32s
- Resolutions: 720x1280 (9:16), 1280x720 (16:9), 1024x1024 (1:1), 1024x1808 (9:16 HD), 1808x1024 (16:9 HD)
- Reference Formats: JPG, PNG, WebP, MP4, MOV, AVI, WebM
Example Use:
dexto --agent sora-video-agent "Create a 16s cinematic video of a sunset over mountains"
Recommended LLM: OpenAI GPT-5 Mini
Requires: OPENAI_API_KEY
🖌️ Image Editor Agent
ID: image-editor-agent
Best For: Image manipulation, face detection, OpenCV operations
General-purpose image editing and manipulation with computer vision capabilities.
Key Features:
- Image editing and transformation
- Face detection and annotation
- OpenCV-powered operations
- Graphics manipulation
Example Use:
dexto --agent image-editor-agent "Detect all faces in this image and draw bounding boxes"
Recommended LLM: OpenAI GPT-5 Mini
Demo: Face Detection Example
Development & Coding
👨💻 Coding Agent
ID: coding-agent
Best For: Software development, debugging, code review, refactoring
Expert software development assistant with comprehensive internal coding tools for building, debugging, and maintaining codebases.
Key Features:
- Codebase Analysis – Read and analyze code using glob and grep patterns
- File Operations – Write, edit, and organize code files
- Command Execution – Run shell commands for testing and building
- Debugging – Identify and fix bugs by examining errors and code structure
- Refactoring – Improve code following best practices
- Testing – Write and run unit tests
Internal Tools:
read_file,write_file,edit_file– File operationsglob_files,grep_content– Code searchbash_exec– Shell command executionask_user– Interactive clarification
Starter Prompts Include:
- 🔍 Analyze Codebase
- 🐛 Debug Error
- ♻️ Refactor Code
- 🧪 Write Tests
- ✨ Implement Feature
- ⚡ Optimize Performance
- 🚀 Setup Project
- 👀 Code Review
Example Use:
dexto --agent coding-agent "Analyze this codebase and suggest improvements"
dexto --agent coding-agent "Create a landing page for a coffee brand"
Recommended LLM: Anthropic Claude Haiku 4.5
File Support: 50+ programming languages and config formats
Demo: Snake Game Development
Data & Analysis
🗄️ Database Agent
ID: database-agent
Best For: SQL queries, database operations, data analysis
AI agent specialized in database operations and SQL query generation.
Key Features:
- SQL query generation
- Database schema analysis
- Data operations and transformations
- Query optimization suggestions
Example Use:
dexto --agent database-agent "Show me all users who signed up last month"
Recommended LLM: OpenAI GPT-5 Mini
Tutorial: Database Agent Tutorial
📄 Talk2PDF Agent
ID: talk2pdf-agent
Best For: PDF analysis, document conversation, content extraction
Conversational interface for analyzing and extracting information from PDF documents.
Key Features:
- PDF document analysis
- Natural language queries about content
- Information extraction
- Document summarization
Example Use:
dexto --agent talk2pdf-agent "Summarize the key findings in this research paper"
Recommended LLM: OpenAI GPT-5 Mini
Tutorial: Talk2PDF Tutorial
Collaboration & DevOps
🐙 GitHub Agent
ID: github-agent
Best For: GitHub operations, PR analysis, repository management
Specialized agent for GitHub operations including pull request analysis, issue management, and repository insights.
Key Features:
- Analyze pull requests and issues
- Repository insights and statistics
- Code review assistance
- GitHub workflow automation
- Collaboration features via MCP
Example Use:
dexto --agent github-agent "Analyze the open pull requests in this repo"
Recommended LLM: OpenAI GPT-5 Mini
Requires: GITHUB_TOKEN
Research & Branding
🔍 Product Researcher
ID: product-researcher
Best For: Product naming, branding research, market analysis
AI agent specialized in product name research, branding strategies, and market positioning.
Key Features:
- Product name generation and evaluation
- Brand identity research
- Competitive analysis
- Market positioning insights
- Naming conventions and trends
Example Use:
dexto --agent product-researcher "Suggest names for a sustainable fashion startup"
Recommended LLM: Anthropic Claude Sonnet 4.5
Tutorial: Product Name Scout Tutorial
Multi-Agent Systems
🎯 Triage Agent
ID: triage-agent
Best For: Customer support routing, multi-agent coordination
Demonstration of a multi-agent customer support triage system that routes queries to specialized agents.
System Architecture:
- Triage Agent (Main) – Routes queries to specialized agents
- Technical Support Agent – Handles technical issues
- Billing Agent – Manages billing and payment queries
- Product Info Agent – Answers product-related questions
- Escalation Agent – Handles complex cases requiring human intervention
Key Features:
- Intelligent query routing
- Multi-agent coordination
- Specialized domain handling
- Escalation workflows
Example Use:
dexto --agent triage-agent "I need help with my billing"
Recommended LLM: OpenAI GPT-5
Tutorial: Building Multi-Agent Systems
General Purpose
⚙️ Default Agent
ID: default-agent
Best For: General tasks, file operations, web automation
Default Dexto agent with filesystem and Playwright tools for general-purpose tasks.
Key Features:
- Filesystem operations
- Web browser automation via Playwright
- General conversation and assistance
- Task execution
Example Use:
dexto # Uses default-agent by default
Recommended LLM: Any supported provider Comes pre-installed: Yes
Installation & Usage
Installing Agents
# Install single agent
dexto install nano-banana-agent
# Install multiple agents
dexto install podcast-agent music-agent coding-agent
# Install with default LLM (skip preference injection)
dexto install nano-banana-agent --no-inject-preferences
# Install all agents
dexto install --all
Using Installed Agents
# Use specific agent
dexto --agent coding-agent
# Auto-install and use (if not installed)
dexto -a podcast-agent "Generate a podcast intro"
Managing Agents
# List installed agents
dexto list-agents --installed
# Find agent location
dexto which nano-banana-agent
# Uninstall agent
dexto uninstall music-agent
Setting Default Agent
Edit ~/.dexto/preferences.yml:
defaults:
defaultAgent: coding-agent # Change to your preferred agent
Agent Comparison
| Agent | Category | LLM Requirement | Special Requirements |
|---|---|---|---|
| podcast-agent | Content | Google Gemini | GOOGLE_GENERATIVE_AI_API_KEY |
| music-agent | Content | Any | - |
| nano-banana-agent | Content | Google Gemini (Required) | GOOGLE_GENERATIVE_AI_API_KEY |
| sora-video-agent | Content | OpenAI GPT | OPENAI_API_KEY |
| image-editor-agent | Content | Any | - |
| coding-agent | Development | Claude/GPT | - |
| database-agent | Data | Claude/GPT | - |
| talk2pdf-agent | Data | Claude/Gemini | - |
| github-agent | DevOps | Claude/GPT | GITHUB_TOKEN |
| product-researcher | Research | Claude/GPT | - |
| triage-agent | Multi-Agent | Claude/GPT | - |
| default-agent | General | Any | Pre-installed |
Choosing the Right Agent
For Content Creation
- Images: Use
nano-banana-agentfor fast, high-quality generation and editing - Videos: Use
sora-video-agentfor AI-generated video content - Audio/Podcasts: Use
podcast-agentfor multi-speaker content - Music: Use
music-agentfor composition and sound design
For Development
- Coding: Use
coding-agentfor comprehensive development assistance - GitHub: Use
github-agentfor repository management and PR analysis - Databases: Use
database-agentfor SQL and data operations
For Analysis & Research
- Documents: Use
talk2pdf-agentfor PDF analysis - Branding: Use
product-researcherfor naming and market research
For Complex Systems
- Multi-Agent: Use
triage-agentas a template for building agent coordination systems
API Key Requirements
Most agents require API keys for their LLM providers:
# Run setup to configure keys
dexto setup
# Or add to ~/.dexto/.env
OPENAI_API_KEY=sk-...
ANTHROPIC_API_KEY=sk-ant-...
GOOGLE_GENERATIVE_AI_API_KEY=...
GITHUB_TOKEN=ghp_...
Contributing Your Own Agent
Built something useful? Share it with the community!
- Create your agent following the agent.yml configuration guide
- Add documentation and examples
- Submit to our GitHub repository
- See Contributing Guide for details
Pre-installed status is available for high-quality, well-documented agents that serve common use cases.
Next Steps
- Get Started: Follow the Install Your First Agent tutorial
- Install Guide: Learn more about installing custom agents
- Create Your Own: Build custom agents with the Configuration Guide
- Examples: Explore examples and demos
- Tutorials: Deep-dive into agent-specific tutorials in the tutorials section