Skip to main content

Agent Registry

Dexto comes with a curated collection of pre-built agents ready to use for common tasks. Each agent is optimized with specific tools, system prompts, and LLM configurations for its domain.

Adding Custom Agents

Want to add your Agent to this registry? Check out our Community Contribution Guide for step-by-step instructions.

Quick Start

# List all available agents
dexto list-agents

# Install an agent
dexto install <agent-name>

# Use an installed agent
dexto --agent <agent-name>

For detailed installation instructions, see the Installing Custom Agents guide.


Agent Catalog Overview

AgentCategoryBest ForLLM
Podcast AgentContent CreationMulti-speaker audio, podcast introsOpenAI GPT-5 Mini
Music AgentContent CreationMusic composition, audio processingOpenAI GPT-5 Mini
Nano Banana AgentContent CreationImage generation & editingGoogle Gemini 2.5 Flash
Sora Video AgentContent CreationAI video generationOpenAI GPT-5 Mini
Image Editor AgentContent CreationImage manipulation, face detectionOpenAI GPT-5 Mini
Coding AgentDevelopmentSoftware development, debuggingAnthropic Claude Haiku 4.5
Database AgentData & AnalysisSQL queries, database operationsOpenAI GPT-5 Mini
Talk2PDF AgentData & AnalysisPDF analysis, document conversationOpenAI GPT-5 Mini
GitHub AgentDevOpsGitHub operations, PR analysisOpenAI GPT-5 Mini
Product ResearcherResearchProduct naming, branding researchAnthropic Claude Sonnet 4.5
Triage AgentMulti-AgentCustomer support routingOpenAI GPT-5
Default AgentGeneral PurposeGeneral tasks, file operationsAny

Detailed Agent Information

Content Creation

🎙️ Podcast Agent

ID: podcast-agent Best For: Multi-speaker audio content, podcast intros, voice synthesis

Create professional podcast content with realistic multi-speaker audio using Google Gemini TTS.

Key Features:

  • Multi-speaker podcast generation
  • High-quality voice synthesis
  • Audio editing and production
  • Various podcast format support

Example Use:

dexto --agent podcast-agent "Generate an intro for a tech podcast about AI"

Recommended LLM: OpenAI GPT-5 Mini

Requires: GOOGLE_GENERATIVE_AI_API_KEY


🎵 Music Agent

ID: music-agent Best For: Music creation, audio processing, sound design

AI agent specialized in music composition and audio manipulation.

Key Features:

  • Music composition and generation
  • Audio processing and effects
  • Sound design capabilities
  • Multiple musical styles

Example Use:

dexto --agent music-agent "Create a lo-fi chill beat for studying"

Recommended LLM: OpenAI GPT-5 Mini

Tutorial: Music Agent Tutorial


🖼️ Nano Banana Agent

ID: nano-banana-agent Best For: Image generation, editing, object removal, style transfer

Advanced image generation and editing using Google's Nano Banana (Gemini 2.5 Flash Image) with near-instantaneous processing.

Key Features:

  • Image Generation – Create stunning images from text prompts with various styles
  • Image Editing – Modify existing images using natural language
  • Object Removal – Remove unwanted objects while preserving backgrounds
  • Background Changes – Replace backgrounds seamlessly
  • Image Fusion – Combine multiple images creatively
  • Style Transfer – Apply artistic styles with character consistency
  • Figurine Effect – Nano Banana's signature feature
  • Multi-image Processing – Complex compositions support

Capabilities:

  • Near-instantaneous processing with high visual coherence
  • Character consistency across multiple edits
  • Scene preservation with seamless blending
  • SynthID watermarks for safety
  • Supports JPG, PNG, WebP, GIF (max 20MB per image)

Example Use:

dexto --agent nano-banana-agent "Create a futuristic cityscape with flying cars"
dexto --agent nano-banana-agent "Remove the person from this photo"

Recommended LLM: Google Gemini 2.5 Flash (Required)

Requires: GOOGLE_GENERATIVE_AI_API_KEY

Demo: Image Generation Example


🎬 Sora Video Agent

ID: sora-video-agent Best For: AI video generation, video remixing, video management

Create AI-generated videos using OpenAI's Sora technology with comprehensive video creation and management capabilities.

Key Features:

  • Video Generation – Create videos from text prompts with custom duration, resolution, and style
  • Reference-Based Creation – Use images or videos as reference for precise generation
  • Video Management – Monitor progress, list all videos, organize creations
  • Video Remixing – Create variations and extensions with new prompts
  • File Management – Auto-download and organize generated videos
  • Quality Control – Delete unwanted videos and manage storage

Supported Specifications:

  • Durations: 4s, 8s, 16s, 32s
  • Resolutions: 720x1280 (9:16), 1280x720 (16:9), 1024x1024 (1:1), 1024x1808 (9:16 HD), 1808x1024 (16:9 HD)
  • Reference Formats: JPG, PNG, WebP, MP4, MOV, AVI, WebM

Example Use:

dexto --agent sora-video-agent "Create a 16s cinematic video of a sunset over mountains"

Recommended LLM: OpenAI GPT-5 Mini

Requires: OPENAI_API_KEY


🖌️ Image Editor Agent

ID: image-editor-agent Best For: Image manipulation, face detection, OpenCV operations

General-purpose image editing and manipulation with computer vision capabilities.

Key Features:

  • Image editing and transformation
  • Face detection and annotation
  • OpenCV-powered operations
  • Graphics manipulation

Example Use:

dexto --agent image-editor-agent "Detect all faces in this image and draw bounding boxes"

Recommended LLM: OpenAI GPT-5 Mini

Demo: Face Detection Example


Development & Coding

👨‍💻 Coding Agent

ID: coding-agent Best For: Software development, debugging, code review, refactoring

Expert software development assistant with comprehensive internal coding tools for building, debugging, and maintaining codebases.

Key Features:

  • Codebase Analysis – Read and analyze code using glob and grep patterns
  • File Operations – Write, edit, and organize code files
  • Command Execution – Run shell commands for testing and building
  • Debugging – Identify and fix bugs by examining errors and code structure
  • Refactoring – Improve code following best practices
  • Testing – Write and run unit tests

Internal Tools:

  • read_file, write_file, edit_file – File operations
  • glob_files, grep_content – Code search
  • bash_exec – Shell command execution
  • ask_user – Interactive clarification

Starter Prompts Include:

  • 🔍 Analyze Codebase
  • 🐛 Debug Error
  • ♻️ Refactor Code
  • 🧪 Write Tests
  • ✨ Implement Feature
  • ⚡ Optimize Performance
  • 🚀 Setup Project
  • 👀 Code Review

Example Use:

dexto --agent coding-agent "Analyze this codebase and suggest improvements"
dexto --agent coding-agent "Create a landing page for a coffee brand"

Recommended LLM: Anthropic Claude Haiku 4.5

File Support: 50+ programming languages and config formats

Demo: Snake Game Development


Data & Analysis

🗄️ Database Agent

ID: database-agent Best For: SQL queries, database operations, data analysis

AI agent specialized in database operations and SQL query generation.

Key Features:

  • SQL query generation
  • Database schema analysis
  • Data operations and transformations
  • Query optimization suggestions

Example Use:

dexto --agent database-agent "Show me all users who signed up last month"

Recommended LLM: OpenAI GPT-5 Mini

Tutorial: Database Agent Tutorial


📄 Talk2PDF Agent

ID: talk2pdf-agent Best For: PDF analysis, document conversation, content extraction

Conversational interface for analyzing and extracting information from PDF documents.

Key Features:

  • PDF document analysis
  • Natural language queries about content
  • Information extraction
  • Document summarization

Example Use:

dexto --agent talk2pdf-agent "Summarize the key findings in this research paper"

Recommended LLM: OpenAI GPT-5 Mini

Tutorial: Talk2PDF Tutorial


Collaboration & DevOps

🐙 GitHub Agent

ID: github-agent Best For: GitHub operations, PR analysis, repository management

Specialized agent for GitHub operations including pull request analysis, issue management, and repository insights.

Key Features:

  • Analyze pull requests and issues
  • Repository insights and statistics
  • Code review assistance
  • GitHub workflow automation
  • Collaboration features via MCP

Example Use:

dexto --agent github-agent "Analyze the open pull requests in this repo"

Recommended LLM: OpenAI GPT-5 Mini

Requires: GITHUB_TOKEN


Research & Branding

🔍 Product Researcher

ID: product-researcher Best For: Product naming, branding research, market analysis

AI agent specialized in product name research, branding strategies, and market positioning.

Key Features:

  • Product name generation and evaluation
  • Brand identity research
  • Competitive analysis
  • Market positioning insights
  • Naming conventions and trends

Example Use:

dexto --agent product-researcher "Suggest names for a sustainable fashion startup"

Recommended LLM: Anthropic Claude Sonnet 4.5

Tutorial: Product Name Scout Tutorial


Multi-Agent Systems

🎯 Triage Agent

ID: triage-agent Best For: Customer support routing, multi-agent coordination

Demonstration of a multi-agent customer support triage system that routes queries to specialized agents.

System Architecture:

  • Triage Agent (Main) – Routes queries to specialized agents
  • Technical Support Agent – Handles technical issues
  • Billing Agent – Manages billing and payment queries
  • Product Info Agent – Answers product-related questions
  • Escalation Agent – Handles complex cases requiring human intervention

Key Features:

  • Intelligent query routing
  • Multi-agent coordination
  • Specialized domain handling
  • Escalation workflows

Example Use:

dexto --agent triage-agent "I need help with my billing"

Recommended LLM: OpenAI GPT-5

Tutorial: Building Multi-Agent Systems


General Purpose

⚙️ Default Agent

ID: default-agent Best For: General tasks, file operations, web automation

Default Dexto agent with filesystem and Playwright tools for general-purpose tasks.

Key Features:

  • Filesystem operations
  • Web browser automation via Playwright
  • General conversation and assistance
  • Task execution

Example Use:

dexto  # Uses default-agent by default

Recommended LLM: Any supported provider Comes pre-installed: Yes


Installation & Usage

Installing Agents

# Install single agent
dexto install nano-banana-agent

# Install multiple agents
dexto install podcast-agent music-agent coding-agent

# Install with default LLM (skip preference injection)
dexto install nano-banana-agent --no-inject-preferences

# Install all agents
dexto install --all

Using Installed Agents

# Use specific agent
dexto --agent coding-agent

# Auto-install and use (if not installed)
dexto -a podcast-agent "Generate a podcast intro"

Managing Agents

# List installed agents
dexto list-agents --installed

# Find agent location
dexto which nano-banana-agent

# Uninstall agent
dexto uninstall music-agent

Setting Default Agent

Edit ~/.dexto/preferences.yml:

defaults:
defaultAgent: coding-agent # Change to your preferred agent

Agent Comparison

AgentCategoryLLM RequirementSpecial Requirements
podcast-agentContentGoogle GeminiGOOGLE_GENERATIVE_AI_API_KEY
music-agentContentAny-
nano-banana-agentContentGoogle Gemini (Required)GOOGLE_GENERATIVE_AI_API_KEY
sora-video-agentContentOpenAI GPTOPENAI_API_KEY
image-editor-agentContentAny-
coding-agentDevelopmentClaude/GPT-
database-agentDataClaude/GPT-
talk2pdf-agentDataClaude/Gemini-
github-agentDevOpsClaude/GPTGITHUB_TOKEN
product-researcherResearchClaude/GPT-
triage-agentMulti-AgentClaude/GPT-
default-agentGeneralAnyPre-installed

Choosing the Right Agent

For Content Creation

  • Images: Use nano-banana-agent for fast, high-quality generation and editing
  • Videos: Use sora-video-agent for AI-generated video content
  • Audio/Podcasts: Use podcast-agent for multi-speaker content
  • Music: Use music-agent for composition and sound design

For Development

  • Coding: Use coding-agent for comprehensive development assistance
  • GitHub: Use github-agent for repository management and PR analysis
  • Databases: Use database-agent for SQL and data operations

For Analysis & Research

  • Documents: Use talk2pdf-agent for PDF analysis
  • Branding: Use product-researcher for naming and market research

For Complex Systems

  • Multi-Agent: Use triage-agent as a template for building agent coordination systems

API Key Requirements

Most agents require API keys for their LLM providers:

# Run setup to configure keys
dexto setup

# Or add to ~/.dexto/.env
OPENAI_API_KEY=sk-...
ANTHROPIC_API_KEY=sk-ant-...
GOOGLE_GENERATIVE_AI_API_KEY=...
GITHUB_TOKEN=ghp_...

Contributing Your Own Agent

Built something useful? Share it with the community!

  1. Create your agent following the agent.yml configuration guide
  2. Add documentation and examples
  3. Submit to our GitHub repository
  4. See Contributing Guide for details

Pre-installed status is available for high-quality, well-documented agents that serve common use cases.

Next Steps