AI/ML Integration, intermediate

voice-agents

Name: voice-agents
Author: antigravity

You are a voice AI architect who has shipped production voice agents handling millions of calls. You understand the physics of latency - every component adds milliseconds, and the sum determines wheth

✓ Works with OpenClaude

Your core insight: Two architectures exist. Speech-to-speech (S2S) models like OpenAI Realtime API preserve emotion and achieve lowest latency but are less controllable. Pipeline architectures (STT→LLM→TTS) give you control at each step but add latency. Mos

Capabilities

voice-agents
speech-to-speech
speech-to-text
text-to-speech
conversational-ai
voice-activity-detection
turn-taking
barge-in-detection
voice-interfaces

Patterns

Speech-to-Speech Architecture

Direct audio-to-audio processing for lowest latency

Pipeline Architecture

Separate STT → LLM → TTS for maximum control
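A minimal sketch of one pipeline turn, assuming each stage is a plain function. The names `run_turn`, `stub_stt`, `stub_llm`, and `stub_tts` are hypothetical stand-ins, not any vendor's API; a real system would swap in streaming clients. The point is that each stage is a separate seam where you can inspect, log, or time the data.

```python
import time
from dataclasses import dataclass, field
from typing import Callable, Dict


@dataclass
class PipelineResult:
    transcript: str
    reply_text: str
    reply_audio: bytes
    # Wall-clock time spent in each stage, in milliseconds.
    stage_ms: Dict[str, float] = field(default_factory=dict)


def run_turn(
    audio: bytes,
    stt: Callable[[bytes], str],
    llm: Callable[[str], str],
    tts: Callable[[str], bytes],
) -> PipelineResult:
    """Run one STT -> LLM -> TTS turn and record per-stage latency."""
    stage_ms: Dict[str, float] = {}

    def timed(name, fn, arg):
        start = time.perf_counter()
        out = fn(arg)
        stage_ms[name] = (time.perf_counter() - start) * 1000.0
        return out

    transcript = timed("stt", stt, audio)
    reply_text = timed("llm", llm, transcript)
    reply_audio = timed("tts", tts, reply_text)
    return PipelineResult(transcript, reply_text, reply_audio, stage_ms)


# Hypothetical stubs so the sketch runs without any external service.
def stub_stt(audio: bytes) -> str:
    return audio.decode("utf-8")


def stub_llm(text: str) -> str:
    return f"You said: {text}"


def stub_tts(text: str) -> bytes:
    return text.encode("utf-8")


result = run_turn(b"hello", stub_stt, stub_llm, stub_tts)
```

Because the intermediate transcript and reply text are ordinary strings, this is where filtering, tool calls, or logging would slot in, at the cost of the extra hops between stages.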

Voice Activity Detection Pattern

Detect when user starts/stops speaking
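As a rough illustration, start/stop detection can be sketched with a simple energy threshold plus a "hangover" so brief pauses do not end the utterance. This is a baseline under stated assumptions, not the skill's own implementation: the function names, the threshold of 0.1, and the 3-frame hangover are all illustrative, and production systems typically use a trained VAD model instead of raw energy.

```python
import math
from typing import Iterable, List, Sequence, Tuple


def rms(frame: Sequence[float]) -> float:
    """Root-mean-square energy of one frame of samples in [-1.0, 1.0]."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def detect_speech_events(
    frames: Iterable[Sequence[float]],
    threshold: float = 0.1,      # assumed energy threshold
    hangover_frames: int = 3,    # assumed silent frames needed to end speech
) -> List[Tuple[str, int]]:
    """Return ("start", i) / ("end", i) events by frame index."""
    events: List[Tuple[str, int]] = []
    speaking = False
    silent_run = 0
    for i, frame in enumerate(frames):
        if rms(frame) >= threshold:
            silent_run = 0
            if not speaking:
                speaking = True
                events.append(("start", i))
        elif speaking:
            silent_run += 1
            if silent_run >= hangover_frames:
                speaking = False
                events.append(("end", i))
    return events


quiet = [0.0] * 4
loud = [0.5] * 4
events = detect_speech_events(
    [quiet, loud, loud, quiet, loud, quiet, quiet, quiet, quiet]
)
```

The single quiet frame at index 3 does not end the utterance because the hangover counter resets when speech resumes.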

Anti-Patterns

❌ Ignoring Latency Budget

❌ Silence-Only Turn Detection

❌ Long Responses
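To illustrate why silence alone is a weak turn signal, here is a hedged sketch that combines silence duration with a crude check on whether the transcript looks finished. The word list, the function name `should_end_turn`, and both thresholds are assumptions chosen for illustration; a real semantic turn detector would use a model rather than a word list.

```python
# Words that often signal the speaker is mid-thought (illustrative list).
INCOMPLETE_ENDINGS = {"and", "but", "so", "because", "or", "to", "the", "um", "uh"}


def should_end_turn(
    transcript: str,
    silence_ms: float,
    base_silence_ms: float = 500.0,       # assumed threshold for finished-looking text
    extended_silence_ms: float = 1500.0,  # assumed threshold when text looks unfinished
) -> bool:
    """Decide whether the user's turn is over.

    Waits longer when the transcript ends on a word that suggests
    the speaker has not finished the sentence.
    """
    words = transcript.lower().strip().rstrip(".,!?").split()
    if not words:
        return False  # nothing said yet, keep listening
    looks_unfinished = words[-1] in INCOMPLETE_ENDINGS
    required = extended_silence_ms if looks_unfinished else base_silence_ms
    return silence_ms >= required


finished = should_end_turn("I want to book a flight to Paris.", silence_ms=700)
unfinished = should_end_turn("I want to book a flight to", silence_ms=700)
```

With the same 700 ms pause, the first utterance ends the turn and the second does not, which is the behaviour a silence-only detector cannot provide.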

⚠️ Sharp Edges

Issue	Severity	Solution
Issue	critical	Measure and budget latency for each component
Issue	high	Target jitter metrics
Issue	high	Use semantic VAD
Issue	high	Implement barge-in detection
Issue	medium	Constrain response length in prompts
Issue	medium	Prompt for spoken format
Issue	medium	Implement noise handling
Issue	medium	Mitigate STT errors
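The first row, measuring and budgeting latency per component, can be sketched as a small check. The component names and every millisecond figure below are made-up illustrative values, not measurements or recommended targets, and `check_latency_budget` is a hypothetical helper.

```python
from typing import Dict, List, Tuple


def check_latency_budget(
    measured_ms: Dict[str, float],
    budget_ms: Dict[str, float],
) -> Tuple[float, List[str]]:
    """Return total measured latency and the components over budget.

    A component with no budget entry is treated as over budget,
    so unbudgeted stages are flagged rather than silently ignored.
    """
    total = sum(measured_ms.values())
    over = sorted(
        name
        for name, ms in measured_ms.items()
        if name not in budget_ms or ms > budget_ms[name]
    )
    return total, over


# Illustrative numbers only.
measured = {"stt": 180.0, "llm": 420.0, "tts": 150.0, "network": 60.0}
budget = {"stt": 200.0, "llm": 350.0, "tts": 150.0, "network": 80.0}

total_ms, over_budget = check_latency_budget(measured, budget)
```

Tracking the total alongside the per-component breakdown makes it clear which stage to optimize first when the overall response feels slow.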

Related Skills

Works well with: agent-tool-builder, multi-agent-orchestration, llm-architect, backend

When to Use

Use this skill when you need to carry out the workflow or actions described in the overview.

Quick Info

Category: AI/ML Integration

Difficulty: intermediate

Version: 1.0.0

Author: antigravity

Tags: community, antigravity, openai
