New: Run every model from your AI assistant via MCP

Ship AI features without a backend

Wrap any model into your own endpoint — set a custom prompt, expose only the inputs you choose, then call it from your app. ModelRunner runs the model and stores every output, so there's no server to stand up and no files to host.

No backend to build, deploy, or maintain
Customize the prompt and inputs, lock the rest
Swap the base model with a single parameter
Outputs hosted and saved to your asset library
One endpoint per wrapper · call it from any language
Wrapper: room-redesign
Endpoint config: prompt locked · 2 inputs exposed
Style: Scandinavian sanctuary (exposed)
Image: living-room.jpg (exposed)
Base model: swap with one param
Room redesigned with nano-banana-2
ug2WqZ6S.png · Highest quality
media.modelrunner.ai/ug2WqZpng
latency 3.0s · cost $0.060 · quality Best
Hosted & saved to your asset library
POST roomix/room-redesign
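One way to picture what a wrapper does is as a small record: a base model, a locked prompt, and a set of exposed inputs. The sketch below is illustrative only and is not ModelRunner's implementation. The `Wrapper` class, its field names, and the prompt text are hypothetical; the wrapper name, the two exposed inputs, and the `{"model", "input"}` payload shape come from this page.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Wrapper:
    """Illustrative model of a wrapper: base model, locked prompt, exposed inputs."""
    name: str
    base_model: str
    locked_prompt: str
    exposed_inputs: frozenset

    def to_run_payload(self, inputs):
        """Combine caller inputs with the locked prompt into a run payload."""
        unknown = set(inputs) - self.exposed_inputs
        if unknown:
            # Callers may only set what the wrapper exposes; "prompt" is not exposed.
            raise ValueError(f"inputs not exposed by {self.name}: {sorted(unknown)}")
        return {"model": self.base_model, "input": {**inputs, "prompt": self.locked_prompt}}


room_redesign = Wrapper(
    name="room-redesign",
    base_model="nano-banana-2",
    locked_prompt="Redesign this room in the requested style.",  # hypothetical prompt
    exposed_inputs=frozenset({"style", "image"}),
)

payload = room_redesign.to_run_payload(
    {"style": "Scandinavian sanctuary", "image": "living-room.jpg"}
)

# Swapping the base model touches one field; prompt and inputs stay as configured.
swapped = replace(room_redesign, base_model="black-forest-labs/flux-pro")
```

Calling `to_run_payload` with a `prompt` key raises `ValueError`, which mirrors the "prompt locked" behaviour described above.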
Run models from the providers you already trust
Google, ByteDance, OpenAI, Stability AI, Luma, Alibaba, xAI, Pruna AI, Inworld, Black Forest Labs, Wan Video, Recraft AI, Bria
Featured models

Production-ready models, one API

MCP server (New)

Give your AI assistant the whole catalog

Point Claude, Cursor, or any MCP client at ModelRunner and every model becomes a tool your assistant can call — generate an image, caption a clip, transcribe audio — without leaving the conversation.

Generate inside your editor
Ask Claude or Cursor for a hero image or a five-second clip and get a hosted URL back in the thread.
Models as agent tools
Hand an autonomous agent the catalog as callable tools, with spend capped by the key you issue.
One setup, every client
The same server runs in Claude Desktop, Cursor, Windsurf, or anything that speaks MCP.
claude_desktop_config.json · OAuth
1. Add the server
{
  "mcpServers": {
    "modelrunner": {
      "command": "npx",
      "args": ["-y", "mcp-remote", "https://mcp.modelrunner.ai/sse"]
    }
  }
}
No API key — your browser opens once to authorize via OAuth.
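If a Claude Desktop config already lists other MCP servers, the ModelRunner entry has to be merged in rather than pasted over the file. A minimal sketch of that merge follows, assuming the config has been loaded into a dict. The helper name `add_modelrunner_server` and the `filesystem` entry are hypothetical; the server entry itself is copied from the config above.

```python
import json

# Server entry copied from the config shown above.
MODELRUNNER_SERVER = {
    "command": "npx",
    "args": ["-y", "mcp-remote", "https://mcp.modelrunner.ai/sse"],
}


def add_modelrunner_server(config):
    """Return a copy of a Claude Desktop config with the ModelRunner server added."""
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers["modelrunner"] = MODELRUNNER_SERVER
    merged["mcpServers"] = servers
    return merged


# Hypothetical existing config with one unrelated server already registered.
existing = {
    "mcpServers": {"filesystem": {"command": "npx", "args": ["-y", "some-fs-server"]}}
}
updated = add_modelrunner_server(existing)
print(json.dumps(updated, indent=2))
```

The function copies the top-level dict and the `mcpServers` mapping, so the original config object is left unchanged and existing servers are kept.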
2. Ask in your assistant
you: Generate 3 reference frames for a misty-forest-at-dawn video.
mcp: run(nano-banana-2-pro) ×3 · 2.4s
     ref_01–03.png ↳ hosted
you: Animate those into a 5s clip with veo-3.1.
mcp: run(veo-3.1, images=ref_01–03) · 38s
     forest_dawn.mp4 · 1080p · 5.0s ↳ hosted URL
you: Now subtitle it from the voiceover track.
mcp: run(whisper-subtitle) · 4.1s
     captions.srt · burned into forest_dawn_sub.mp4 ↳ hosted
6 tools registered · run · caption · transcribe · upscale
Asset library

Every file you touch, one click away

Every upload, every input, and every generated output becomes a tracked asset in your library — so a model's result drops straight into the next run with no re-downloading and no re-uploading.

Three ways to fill any file field
Upload a file, paste a URL, or pick from your library — every model’s file input becomes a media picker.
Outputs become inputs
Generated results are auto-hosted and indexed, so a model’s output is one click away as the next model’s input.
Find anything fast
Filter by source, type, or favorites, tag what matters, and search across every run you’ve made.
reference_image · string · format: uri · Picker
Add image
Upload: drag & drop or browse (presigned S3)
Enter URL: paste an external link (rehosted on run)
Pick from your library: search assets
All 248 · Outputs 96 · Uploads 38 · Inputs 84 · Favorites
Selected: living-room.jpg ↳ media.modelrunner.ai
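Because file fields such as `reference_image` are plain URI strings, feeding one run's output into the next can be sketched as building a new payload around the hosted URL. This is a sketch under assumptions: the helper name, the example URL, and the idea that a result exposes its hosted file as a URL string are not confirmed by this page. The `{"model", "input"}` shape follows the Quickstart request.

```python
def next_run_payload(model, previous_output_url, field="reference_image", **extra_inputs):
    """Build the JSON body for a run that consumes an earlier run's hosted output."""
    if not previous_output_url.startswith(("http://", "https://")):
        # File fields are declared as "string, format: uri", so reject bare filenames.
        raise ValueError("file fields expect a URI, such as a hosted output URL")
    return {"model": model, "input": {field: previous_output_url, **extra_inputs}}


# Hypothetical hosted URL from an earlier image run.
frame_url = "https://media.modelrunner.ai/ref_01.png"

payload = next_run_payload("veo-3.1", frame_url, prompt="misty forest at dawn")
```

Nothing is downloaded or re-uploaded in between; only the URL string moves from one request to the next.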
Quickstart

Your first run in three lines

Grab a key, pick a model, send a request. Same shape for every modality, every provider.

Bearer-token auth on every endpoint
Typed SDKs for Python & TypeScript
Streaming, webhooks & polling built in
$ curl https://api.modelrunner.ai/v1/run \
    -H "Authorization: Bearer $MODELRUNNER_KEY" \
    -H "Content-Type: application/json" \
    -d '{
      "model": "black-forest-labs/flux-pro",
      "input": { "prompt": "a neon koi, ultra-detailed" }
    }'
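The same request can be written in Python with only the standard library. The typed SDKs mentioned above are not shown on this page, so this sketch uses `urllib` instead of guessing at their API. The `build_run_request` helper is hypothetical, and the JSON content-type header is an assumption. The request is built but not sent, so the snippet runs offline.

```python
import json
import os
import urllib.request

RUN_URL = "https://api.modelrunner.ai/v1/run"  # endpoint from the curl example


def build_run_request(model, model_input, api_key):
    """Build (but do not send) the POST request for a single run."""
    body = json.dumps({"model": model, "input": model_input}).encode("utf-8")
    return urllib.request.Request(
        RUN_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            # Assumption: the API expects a JSON content type.
            "Content-Type": "application/json",
        },
    )


api_key = os.environ.get("MODELRUNNER_KEY", "YOUR_KEY")
prompt = {"prompt": "a neon koi, ultra-detailed"}

flux_request = build_run_request("black-forest-labs/flux-pro", prompt, api_key)
# Switching models changes only the first argument; the request shape is the same.
# "nano-banana-2" is named on this page, but its exact API id is an assumption.
other_request = build_run_request("nano-banana-2", prompt, api_key)

# urllib.request.urlopen(flux_request) would send it; left out so this runs offline.
```

The response format is not documented on this page, so parsing the result is left out.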

Pay only for what you run

No subscriptions, no idle GPU bills. Per-request pricing — per image, per second of video, or per megapixel — quoted before you run and snapshotted at request time.
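To make "snapshotted at request time" concrete, here is a small sketch of a quote that copies the rate when the request is made, so a later price change does not alter it. The `Quote` class and the rate table are hypothetical. The $0.060 image figure is the cost shown in the wrapper example on this page; the other rates are placeholders, not published prices.

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass(frozen=True)
class Quote:
    """A price fixed at request time; later rate changes do not affect it."""
    unit: str          # "image", "second", or "megapixel"
    rate: Decimal      # price per unit when the request was made
    quantity: Decimal  # number of units billed

    @property
    def total(self):
        return self.rate * self.quantity


# Placeholder rate table, not ModelRunner's published prices.
rates = {
    "image": Decimal("0.060"),
    "second": Decimal("0.10"),
    "megapixel": Decimal("0.02"),
}


def quote(unit, quantity):
    """Snapshot the current rate for `unit` into an immutable quote."""
    return Quote(unit=unit, rate=rates[unit], quantity=Decimal(str(quantity)))


clip = quote("second", 5)          # a five-second video
rates["second"] = Decimal("0.20")  # a later price change
print(clip.total)                  # still computed from the snapshotted rate
```

`Decimal` is used instead of floats so that small per-unit prices add up without rounding drift.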

Frequently asked

What is ModelRunner?

ModelRunner is a unified API for running AI models — image, video, audio, 3D, and text — from providers like Google, OpenAI, ByteDance, Stability AI, and Black Forest Labs. One API key reaches every model, and each one uses the same request shape, so switching models is a one-line change.

Which AI models and providers can I run through ModelRunner?

Hundreds of models across image, video, audio, 3D, and text generation — from labs including Google, OpenAI, ByteDance, Stability AI, and Black Forest Labs, with new ones added regularly. You reach all of them through one API key, with no separate accounts or SDKs per provider.

How do I connect ModelRunner to my AI assistant with MCP?

Point any MCP client (Claude, Cursor, or your own) at the ModelRunner MCP server and authorize it with your API key or, as in the Claude Desktop setup above, via browser OAuth. Every model becomes a tool your assistant can call directly, with no extra integration code.

How do I turn an AI model into my own API endpoint (a wrapper)?

Create a wrapper: package any model with a preset prompt and only the inputs you choose, and ModelRunner gives you a custom endpoint to call from your app — with no backend to build or maintain. Every output is hosted and saved to your asset library automatically.

How much does ModelRunner cost?

You pay per request at each model’s published rate — per image, per second of video, or per megapixel. Pricing is snapshotted at request time with no subscriptions or hidden fees, so you only ever pay for what you run.

How is ModelRunner different from calling each AI provider directly?

One API key and one request shape replace a separate account, SDK, and billing relationship for every provider. You can compare or switch models — even across providers — by changing a single parameter, and ModelRunner hosts every output so there’s no server or storage to manage.

Start building today

One key unlocks every model, every modality, and the MCP server. Free to start — pay only when you run.