Lowest-cost media generation API
Transparent, pay-as-you-go pricing across image, video, audio and text. Search the full catalog below and see the exact cost for every configuration — no monthly fees, no surprises.
Showing 89 models
| Model | Type | Configuration | Price | Actions |
|---|---|---|---|---|
Recraft V3 recraft | Image → Image | Per output | $0.04 | |
Chatterbox Voice Conversion resemble-ai | Text → Audio | Per second | $0.00025 | |
Bria Eraser bria | Image → Image | Per output | $0.04 | |
DeepFilterNet 3 rikorose | Audio → Audio | Per second | $0.001 | |
Ideogram Character ideogram | Image → Image | Per output | $0.15 | |
Whisper Large v3 openai | Audio → Text | Per output | $0.01 | |
Kling 2.6 Pro Text-to-Video kuaishou | Text → Video | Per second | $0.14 | |
| Audio → Audio | Per second | $0.001667 | ||
GPT Image 2 Edit openai | Image → Image | Per output | $0.151 | |
Imagen 4 Fast google | Text → Image | Per output | $0.02 | |
Imagen 4 google | Text → Image | Per output | $0.04 | |
RIFE Video Interpolation megvii-research | Video → Video | Per second | $0.000225 | |
| Image → Image | Per output | $0.04 | ||
Florence-2 Large Caption microsoft | Image → Text | Per output | $0 | |
Clarity Upscaler philz1337x | Image → Image | Per megapixel | $0.03 | |
Florence-2 Large Object Detection microsoft | Image → Image | Per output | $0 | |
Lyria 2 google | Text → Audio | Per output | $0.1 | |
| Image → Image | Per output | $0 | ||
ElevenLabs Sound Effects V2 elevenlabs | Text → Audio | Per second | $0.002 | |
FLUX.1 Kontext [dev] black-forest-labs | Image → Image | Per megapixel | $0.025 | |
CodeFormer sczhou | Image → Image | Per megapixel | $0.0021 | |
| Image → Image | Per output | $0.018 | ||
Bria Expand bria | Image → Image | Per output | $0.04 | |
Recraft Vectorize recraft | Image → Image | Per output | $0.01 | |
Depth Anything V2 depth-anything | Image → Image | Per output | $0 |
Page 1 of 4
Frequently Asked Questions
Everything you need to know about ModelRunner
How is ModelRunner different from other AI providers?
ModelRunner provides a unified API for generative AI models across image and video. Instead of managing multiple provider integrations, you access Google, ByteDance, and other providers through a single endpoint. This simplifies development, reduces integration overhead, and gives you flexibility to switch between providers or models without changing your code.
What models does ModelRunner support?
ModelRunner supports models across four categories: text-to-image, image-to-image, text-to-video, and image-to-video. We integrate with leading providers including Google (Veo 3.1, Imagen) and ByteDance (Seedance, Seedream), as well as open-source models like FLUX running on serverless compute. New models are added regularly, and you can test any model instantly in our Playground before integrating.
How does pricing work?
ModelRunner offers transparent, pay-as-you-go pricing with no hidden fees. Pricing varies by model type: per-second GPU time for serverless models, per-output for simple generation, or per-output-second for video (based on duration). All pricing is clearly displayed for each model, including GPU costs for serverless inference. You purchase credits via Stripe and only pay for what you use.
Can I try models before integrating?
Yes. Every model on ModelRunner has an interactive Playground where you can test generation with full parameter control. You can explore example runs to see real inputs and outputs, then copy ready-to-use code snippets for your integration. The Playground mirrors the exact API behavior, so what you see is what you get in production.
How do I integrate ModelRunner into my application?
Integration is straightforward: sign up, purchase credits, and create an API key. You can then make requests via our REST API or use our JavaScript SDK for a more streamlined experience. All models share a consistent interface—same authentication, same request/response patterns—so switching between models requires minimal code changes.
Is my data private and secure?
Yes. ModelRunner does not use your inputs or outputs for training. We support secure authentication including passkeys (WebAuthn), OAuth (GitHub, Google), and two-factor authentication. Your API keys are managed securely, and all requests are processed through our infrastructure without data retention beyond what's needed to fulfill your request.
Can I use ModelRunner for commercial projects?
Yes. You can use ModelRunner-generated content for commercial purposes. However, licensing terms depend on the underlying model provider. We clearly document licensing for each model in our catalog. For open-source models, standard open-source licenses apply. For provider APIs like Google or ByteDance, their respective terms of service govern commercial use.
Does ModelRunner support enterprise workloads?
Yes. ModelRunner supports production applications requiring high throughput and reliability. Our async queue system handles requests at scale with real-time status updates via SSE. For enterprise needs like dedicated capacity, custom SLAs, or volume pricing, contact our sales team to discuss tailored solutions.
Do pro models ever use faster or cheaper models under the hood?
No. When you select a pro model, you always get that exact model—we never substitute it with a faster or cheaper alternative. Every request is processed by the model you chose, ensuring consistent quality and predictable results. This transparency is core to how ModelRunner operates: what you select is exactly what runs your generation.

