Replicate Teardown — Vercel for AI Inference ($50M+ ARR, Cog OSS Standard)
Copyable to YOU
Sign in with Google to see your personal Copyable Score - a 5-dimension breakdown of how likely you (with your budget, tech stack, channels, network, and timing) can replicate this product.
Replicate Teardown — Vercel for AI Inference ($50M+ ARR, Cog OSS Standard)
When Ben Firshman left Docker in 2019, he carried with him a thesis that was, at the time, almost contrarian: the next decade of software wouldn't be defined by who could train the best models, but by who could run them cheaply, predictably, and with one line of code. Five years later, Replicate is a roughly $50M ARR company sitting on top of a piece of open-source plumbing called Cog that thousands of ML researchers reach for by default, the same way web developers once reached for Docker without thinking about it. That is not a coincidence. It is the entire strategy.
TL;DR
Replicate is a YC W20 company founded by Ben Firshman (Docker Compose), Andreas Jansson (ex-Spotify ML infra), and Zeke Sikelianos (ex-GitHub, ex-npm). It raised $40M Series B in September 2023 led by a16z. Reported ARR has climbed from ~$34M in early 2024 to estimated $50M+ by late 2024 / early 2025, driven by usage pricing — pay per second of GPU compute.
The business sits on three layered assets: (1) Cog, open-source tool for packaging ML models into standardized containers, becoming the de facto packaging spec; (2) a model marketplace of hundreds of community-uploaded models including Stable Diffusion variants, Llama, FLUX; (3) a hosted inference API. Each layer feeds the next.
The strategic question for builders is not "can I build another Replicate" — that ship sailed when Cog hit critical mass around 2022. The question is "where does the Replicate model leak demand to a more focused player." That answer looks like a vertical inference service: a single model class (image upscaling, voice cloning, virtual try-on, face swap) wrapped as a vertical SaaS.
Quick Facts
- Company: Replicate, Inc.
- Founded: 2019, San Francisco
- Founders: Ben Firshman (CEO, ex-Docker Compose), Andreas Jansson (ex-Spotify), Zeke Sikelianos (ex-GitHub)
- YC batch: W20
- Funding: $2.5M seed → $17.8M Series A (2022, a16z) → $40M Series B (Sep 2023, a16z lead, Sequoia + YC)
- Total raised: ~$60M
- Reported ARR: ~$34M (early 2024) → ~$50M+ (late 2024/early 2025)
- Headcount: 35-50
- Pricing model: Pay-per-second GPU, no minimums
- Open source: Cog (Apache 2.0, ~8k+ GitHub stars)
- Core API: REST + Python client + Node.js client + model marketplace at replicate.com/explore
- Customer types: Indie devs, mid-market SaaS, consumer AI apps (PhotoAI, Magnific, Lensa-style clones)
The Data Story — Replicate vs Modal vs Banana vs Together AI vs Anyscale vs Fal
| Dimension | Replicate | Modal | Banana (shut 2024) | Together AI | Anyscale | Fal.ai |
|---|---|---|---|---|---|---|
| Founded | 2019 | 2021 | 2021 | 2022 | 2019 | 2021 |
| Reported ARR | ~$50M+ | ~$30M | n/a | ~$100M+ | $100M+ (Ray) | ~$15M growing |
| Total raised | ~$60M | ~$23M | ~$25M | ~$229M | ~$259M | ~$23M |
| Primary primitive | model = coSign in to read this reportYou have read your 1 free report. Sign in with Google to unlock 2 more. Sign in with Google |