v1.2.0 is out
MIT License
Build voice agents
that actually listen.
The open-source orchestration engine for production-grade AI voice agents. Low latency, provider agnostic, and built for scale.
import { createAgent, OpenAILLM, DeepgramSTT, ElevenLabsTTS } from 'sharyx-os';

const agent = createAgent({
  stt: new DeepgramSTT({ apiKey: process.env.DEEPGRAM_API_KEY }),
  llm: new OpenAILLM({ apiKey: process.env.OPENAI_API_KEY }),
  tts: new ElevenLabsTTS({ apiKey: process.env.EL_API_KEY }),
});

agent.start({ port: 3000 });
Core Capabilities
Everything you need for voice AI
Low Latency
Optimized streaming architecture for near-instant responses. Sub-500ms conversational loops.
Tool Calling
Enable agents to search, book, and act in real time with recursive tool calling support.
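As a rough illustration of what recursive tool calling involves, here is a minimal sketch. Every name in it (ChatMessage, LLMReply, runWithTools, maxDepth) is hypothetical and written for this example; it is not the sharyx-os API.

```typescript
// Hypothetical types for illustration only; not the sharyx-os API.
type ToolCall = { name: string; args: Record<string, unknown> };
type LLMReply = { text?: string; toolCall?: ToolCall };
type ChatMessage = { role: "user" | "assistant" | "tool"; content: string };
type ChatLLM = (history: ChatMessage[]) => Promise<LLMReply>;
type ToolFn = (args: Record<string, unknown>) => Promise<string>;

// Ask the model; if it requests a tool, run the tool, append the result
// to the history, and ask again. maxDepth guards against endless loops.
async function runWithTools(
  llm: ChatLLM,
  tools: Record<string, ToolFn>,
  history: ChatMessage[],
  maxDepth = 5,
): Promise<string> {
  if (maxDepth === 0) throw new Error("tool-call depth exceeded");
  const reply = await llm(history);
  if (!reply.toolCall) return reply.text ?? "";
  const tool = tools[reply.toolCall.name];
  if (!tool) throw new Error(`unknown tool: ${reply.toolCall.name}`);
  const result = await tool(reply.toolCall.args);
  const next: ChatMessage[] = [...history, { role: "tool", content: result }];
  return runWithTools(llm, tools, next, maxDepth - 1);
}
```

The depth limit is a design choice for the sketch: a model that keeps requesting tools would otherwise never return a spoken reply.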
Provider Agnostic
Swap between OpenAI, Gemini, Deepgram, and ElevenLabs with one line of code.
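The one-line swap usually rests on providers sharing an interface. The sketch below shows that idea with two stand-in providers; LLMProvider, EchoLLM, ShoutLLM, and makeAgent are hypothetical names for illustration, and the real sharyx-os provider types may differ.

```typescript
// Hypothetical interface; the real sharyx-os provider types may differ.
interface LLMProvider {
  complete(prompt: string): Promise<string>;
}

// Two stand-in providers that satisfy the same interface.
class EchoLLM implements LLMProvider {
  async complete(prompt: string): Promise<string> {
    return `echo: ${prompt}`;
  }
}

class ShoutLLM implements LLMProvider {
  async complete(prompt: string): Promise<string> {
    return prompt.toUpperCase();
  }
}

// The agent depends only on the interface, never on a concrete provider.
function makeAgent(llm: LLMProvider) {
  return { reply: (text: string) => llm.complete(text) };
}

// Swapping providers is a one-line change at construction time.
const agentA = makeAgent(new EchoLLM());
const agentB = makeAgent(new ShoutLLM());
```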
Multi-Channel
Native support for Twilio, Plivo, and WebRTC. Deploy your agent anywhere.
Persistent Memory
Redis-backed session management for stateful, long-running conversations.
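To show the shape of session-backed memory, here is a minimal sketch that uses an in-memory Map as a stand-in for Redis. The SessionStore interface and its load and append methods are assumptions made for this example, not the sharyx-os API; a Redis-backed class would implement the same interface.

```typescript
// Hypothetical types for illustration; not the sharyx-os API.
type Turn = { role: "user" | "assistant"; text: string };

interface SessionStore {
  load(sessionId: string): Promise<Turn[]>;
  append(sessionId: string, turn: Turn): Promise<void>;
}

// In-memory stand-in. A Redis-backed store would keep the same interface
// and persist each session's turns under a key derived from sessionId.
class InMemorySessionStore implements SessionStore {
  private sessions = new Map<string, Turn[]>();

  async load(sessionId: string): Promise<Turn[]> {
    // Return a copy so callers cannot mutate stored history.
    return [...(this.sessions.get(sessionId) ?? [])];
  }

  async append(sessionId: string, turn: Turn): Promise<void> {
    const turns = this.sessions.get(sessionId) ?? [];
    turns.push(turn);
    this.sessions.set(sessionId, turns);
  }
}
```

Keeping the store behind an interface means conversation state survives a process restart once the in-memory class is replaced by a persistent one.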
CRM Integrations
Built-in support for HubSpot, Google Calendar, and WhatsApp Cloud API.
The Orchestration Engine
Diagram components: User, Transport, STT (Deepgram), Sharyx OS, LLM (GPT-4), TTS (ElevenLabs)
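One plausible reading of the components above is a single conversational turn passing through speech-to-text, the language model, and text-to-speech in order. The sketch below assumes that ordering and uses hypothetical function types (SpeechToText, LanguageModel, TextToSpeech, handleTurn); it is non-streaming for brevity and does not reflect how Sharyx OS is actually implemented.

```typescript
// Hypothetical stage signatures; stand-ins for real STT, LLM, and TTS providers.
type AudioChunk = number[];
type SpeechToText = (audio: AudioChunk) => Promise<string>;
type LanguageModel = (text: string) => Promise<string>;
type TextToSpeech = (text: string) => Promise<AudioChunk>;

// Orchestrate one turn: caller audio in, synthesized reply audio out.
async function handleTurn(
  audio: AudioChunk,
  stt: SpeechToText,
  llm: LanguageModel,
  tts: TextToSpeech,
): Promise<AudioChunk> {
  const transcript = await stt(audio); // what the caller said
  const reply = await llm(transcript); // what the agent should say
  return tts(reply); // audio to send back over the transport
}
```

A production engine would stream partial results between stages rather than waiting for each to finish, which is where most latency savings come from.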
Easy Setup
Implementation Procedures
Simple steps to get your voice agent up and running.
Ready to start building?
Get started with our lightweight SDK today.
npm install sharyx-os