Skip to main content

Retell AI Teardown — $7M ARR YC W24 Voice Agent Platform

Copyable to YOU

Sign in with Google to see your personal Copyable Score - a 5-dimension breakdown of how likely you (with your budget, tech stack, channels, network, and timing) can replicate this product.

TL;DR

Retell AI is the second name people mention when they talk about AI voice agent infrastructure, and the gap between first and second is smaller than you would guess from the funding announcements. Vapi raised the louder round and gets the louder Twitter discourse. Retell, founded by Yifei Wang out of YC W24, quietly built to roughly $600K MRR through a different route: ship a working dashboard before shipping a working SDK, court the call-center buyer instead of the developer-evangelist, and plant a flag in India and Southeast Asia where the unit economics of voice automation are violent enough to make procurement teams move in weeks rather than quarters.

The product is a hosted runtime that stitches together speech-to-text, an LLM, text-to-speech, and a telephony layer into a single websocket pipeline with a sub-500ms target turn latency. You point it at a number, give it a prompt and some function-call tools, and it picks up the phone. Pricing is the same usage-based model the whole category settled on around mid-2024: roughly $0.07 per minute of conversation, which bundles the LLM cost, the voice synthesis, and the carrier minutes into one line item. The seed round closed in December 2024 at $4.6M led by Altimeter, with participation from a handful of operator angels who run actual call centers.

What makes Retell interesting as a teardown subject is not the technology. The technology is now a commodity — every team in this space is wiring the same four vendors in roughly the same order. What makes it interesting is the geographic concentration. Most US infra companies treat India and SEA as an afterthought, a region that will eventually convert. Retell treated it as the beachhead. Call center labor in Bangalore or Manila is not free, and the buyers there are sophisticated enough to evaluate latency, accent, and barge-in behavior with a stopwatch. Winning there is harder than winning the US developer audience, but the contracts close faster and the customers stick longer.

The headline for any founder reading this: the platform layer is crowded and capital-intensive. The vertical layer sitting on top of Retell or Vapi — voice agents for a specific industry with a specific workflow — is wide open, low-capital, and where the next twenty $5K MRR products will come from.

Dimension Score
Capital required 25 / 100
Stack reproducibility 35 / 100
Distribution channel 45 / 100
Network effect 30 / 100
Timing 60 / 100

Five Minutes Inside Retell AI

I signed up on a Tuesday evening with a throwaway Gmail and a US virtual number I keep around for these reviews. The dashboard loaded fast. The onboarding did not try to be clever — no animated mascot, no five-step tour with a progress bar. There was a sidebar with Agents, Phone Numbers, Knowledge Base, Call History, and a button that said Create Agent. I clicked it.

The agent builder is a single scrollable form. You pick a voice from a dropdown that pulls from ElevenLabs, PlayHT, and Cartesia. You write a system prompt in a textarea. You add tools — functions the agent can call mid-conversation to look something up or trigger a workflow. You set the LLM (GPT-4o-mini was the default, with options for Claude Haiku and a few Llama variants). You can attach a knowledge base, which is a vector store hidden behind a normal file uploader. The whole thing took me maybe six minutes to fill out for a fake dental appointment scheduler.

Then came the test. There is a Talk to Agent button at the top of the page that opens a browser-based call. I clicked it and it picked up. The first thing I noticed was that the agent did not wait for me to finish a sentence — it started responding while I was still trailing off, which is the right behavior for natural conversation but feels strange the first time. End-to-end turn latency felt like maybe 700-900ms in the browser, which is fine but not magical. The voice was an ElevenLabs Rachel-equivalent that sounded human enough that I forgot I was testing for about three turns.

Then I bought a phone number from the Numbers tab — $2 per month, instant provisioning through Twilio passthrough — and pointed my new agent at it. I dialed from my real phone. Over the actual PSTN, on a real cellular connection, latency was noticeably better than the browser test, probably because the audio codec was friendlier. The barge-in worked. When I tried to interrupt mid-sentence to change my appointment time, the agent stopped talking within maybe 200ms and adjusted. That is the moment that separates a working voice product from a demo.

The honest critiques: the analytics view is thin. You get call recordings and transcripts, you get a duration column, you do not get sentiment or topic clustering without bolting on your own pipeline. The Knowledge Base is fine for FAQs but does not handle structured data like business hours or pricing tables gracefully — I ended up putting all of that in the system prompt. The function-calling UI is clean but the error messages when a function fails mid-call are buried in the call log, not surfaced to the agent itself.

Three things stood out as well-engineered. First, the websocket reconnect logic — I deliberately killed my wifi mid-call to see what happened, and the agent gracefully degraded to a hold tone and picked up where it left off. Second, the prompt templates for common verticals (booking, qualification, support triage) are short and sharp rather than the bloated 2000-word system prompts most platforms ship as defaults. Third, the pricing meter on the dashboard ticks in real time during a call, which sounds like a small thing but is the kind of detail that makes finance teams trust the platform enough to put it in front of a CFO.

Business Model Deep Dive

Retell charges roughly $0.07 per minute of conversation. That number is not a list price you would find on a pricing page in 2023 — it is a 20

Sign in to read this report

You have read your 1 free report. Sign in with Google to unlock 2 more.

Sign in with Google