Sub-Second Latency
A streaming STT → LLM → TTS pipeline: the AI starts speaking before it has finished its thought. Typical time to first audio is ~800 ms.
Build, deploy, and manage AI voice agents with sub-second latency. SaaS dashboard, multi-tenant auth, relational provider architecture, plan-based access control.
| Section | Description |
|---|---|
| Getting Started | Install, configure, seed data, and run locally in 10 minutes |
| Architecture | System overview, voice pipeline, data flow diagrams |
| Database Schema | All collections, fields, indexes, relationships, and defaults |
| Authentication | Signup/signin flow, JWT tokens, API keys, org status lifecycle |
| Providers | Unified provider system — global vs client, encryption, registry |
| Plans & Billing | Plan tiers, model access, features, Redis caching, pricing |
| Frontend | Next.js dashboard — pages, components, contexts, voice UI |
| REST API | All HTTP endpoints with request/response examples |
| WebSocket API | Voice protocol, message types, audio format, reconnection |
| Environment Variables | Every env var with description, defaults, and examples |
| Admin Scripts | Seed scripts, migrations, mongosh commands, bash helpers |
| Deployment | Docker, Nginx, scaling, production checklist |
| Client Integration | Embed voice in your app — WebSocket + REST from client code |
| Twilio | Phone call integration via Twilio Media Streams |
```bash
# Clone and install
git clone <your-repo-url> voicex
cd voicex
pnpm install

# Configure
cp backend/.env.example backend/.env.local
# Edit backend/.env.local with your API keys

# Seed database (plans, providers, test data)
bash scripts/seed-plans.sh
bash scripts/seed-global-providers.sh
bash scripts/seed.sh

# Run
pnpm dev
```

Open http://localhost:3000 → Sign in with `a@a.dev` / `12345678` (test account).
User speaks → Deepgram STT → LLM (Groq/OpenAI/Ollama) → TTS (ElevenLabs/OpenAI/Edge) → User hears AI

The entire pipeline streams in real time. Each sentence is spoken as soon as it is generated, while the LLM continues producing the next sentence.
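The sentence-level streaming described above can be sketched as a small chunking stage between the LLM and TTS. This is a minimal illustration, not the project's actual code: the token generator is a mock, and the TTS call is stubbed out — only the "emit each sentence as soon as it completes" logic is shown.

```typescript
// Mock LLM token stream (stand-in for a real streaming completion API).
async function* llmTokens(): AsyncGenerator<string> {
  const tokens = ["Hello", " there", ".", " How", " can", " I", " help", " today", "?"];
  for (const t of tokens) yield t;
}

// Accumulate tokens and yield each sentence the moment it is complete,
// so TTS synthesis can begin while the LLM is still generating.
// (Real segmentation needs more care, e.g. decimals like "3.5".)
async function* sentences(tokens: AsyncGenerator<string>): AsyncGenerator<string> {
  let buffer = "";
  for await (const tok of tokens) {
    buffer += tok;
    const match = buffer.match(/^(.*?[.!?])\s*(.*)$/s);
    if (match) {
      yield match[1].trim();
      buffer = match[2];
    }
  }
  if (buffer.trim()) yield buffer.trim(); // flush any trailing fragment
}

async function main() {
  for await (const sentence of sentences(llmTokens())) {
    // In the real pipeline this would hand the sentence to the TTS provider,
    // e.g. something like `await tts.stream(sentence)` (hypothetical name).
    console.log("speak:", sentence);
  }
}
main();
```

Because each completed sentence is dispatched immediately, the user hears the first sentence while later ones are still being generated — this overlap is what keeps time-to-first-audio low.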