Skip to main content

Replicate Teardown — Vercel for AI Inference ($50M+ ARR, Cog OSS Standard)

Copyable to YOU

Sign in with Google to see your personal Copyable Score - a 5-dimension breakdown of how likely you (with your budget, tech stack, channels, network, and timing) can replicate this product.

Replicate Teardown — Vercel for AI Inference ($50M+ ARR, Cog OSS Standard)

When Ben Firshman left Docker in 2019, he carried with him a thesis that was, at the time, almost contrarian: the next decade of software wouldn't be defined by who could train the best models, but by who could run them cheaply, predictably, and with one line of code. Five years later, Replicate is a roughly $50M ARR company sitting on top of a piece of open-source plumbing called Cog that thousands of ML researchers reach for by default, the same way web developers once reached for Docker without thinking about it. That is not a coincidence. It is the entire strategy.

TL;DR

Replicate is a YC W20 company founded by Ben Firshman (Docker Compose), Andreas Jansson (ex-Spotify ML infra), and Zeke Sikelianos (ex-GitHub, ex-npm). It raised $40M Series B in September 2023 led by a16z. Reported ARR has climbed from ~$34M in early 2024 to estimated $50M+ by late 2024 / early 2025, driven by usage pricing — pay per second of GPU compute.

The business sits on three layered assets: (1) Cog, open-source tool for packaging ML models into standardized containers, becoming the de facto packaging spec; (2) a model marketplace of hundreds of community-uploaded models including Stable Diffusion variants, Llama, FLUX; (3) a hosted inference API. Each layer feeds the next.

The strategic question for builders is not "can I build another Replicate" — that ship sailed when Cog hit critical mass around 2022. The question is "where does the Replicate model leak demand to a more focused player." That answer looks like a vertical inference service: a single model class (image upscaling, voice cloning, virtual try-on, face swap) wrapped as a vertical SaaS.

Quick Facts

  • Company: Replicate, Inc.
  • Founded: 2019, San Francisco
  • Founders: Ben Firshman (CEO, ex-Docker Compose), Andreas Jansson (ex-Spotify), Zeke Sikelianos (ex-GitHub)
  • YC batch: W20
  • Funding: $2.5M seed → $17.8M Series A (2022, a16z) → $40M Series B (Sep 2023, a16z lead, Sequoia + YC)
  • Total raised: ~$60M
  • Reported ARR: ~$34M (early 2024) → ~$50M+ (late 2024/early 2025)
  • Headcount: 35-50
  • Pricing model: Pay-per-second GPU, no minimums
  • Open source: Cog (Apache 2.0, ~8k+ GitHub stars)
  • Core API: REST + Python client + Node.js client + model marketplace at replicate.com/explore
  • Customer types: Indie devs, mid-market SaaS, consumer AI apps (PhotoAI, Magnific, Lensa-style clones)

The Data Story — Replicate vs Modal vs Banana vs Together AI vs Anyscale vs Fal

Dimension Replicate Modal Banana (shut 2024) Together AI Anyscale Fal.ai
Founded 2019 2021 2021 2022 2019 2021
Reported ARR ~$50M+ ~$30M n/a ~$100M+ $100M+ (Ray) ~$15M growing
Total raised ~$60M ~$23M ~$25M ~$229M ~$259M ~$23M
Primary primitive model = co

Sign in to read this report

You have read your 1 free report. Sign in with Google to unlock 2 more.

Sign in with Google