Model Details
Tongyi-MAI's Z-Image Turbo is a streamlined text-to-image generation model engineered for speed and economic scalability. Built on a 6-billion parameter architecture, it prioritizes throughput without sacrificing essential visual coherence. By compressing the diffusion process into a maximum of 8 inference steps, this model significantly reduces generation time compared to standard architectures that typically require 20 to 50 steps.
### Capabilities and Features
This model is specifically tuned for high-volume production environments. Users can generate images with resolutions up to 4 megapixels, supporting various aspect ratios from square to wide landscape. The 8-step pipeline is fully configurable; users can lower the step count to as few as 1 for ultra-fast thumbnail generation or utilize the full 8 steps for final production assets.
**Key benefits include:**
* **Rapid Iteration:** Support for batch sizes up to 4 images per request allows for quick side-by-side comparison of prompts and seeds. * **Flexible Output:** Customize image dimensions via standard presets (e.g., `landscape_4_3`) or specific pixel counts, with support for JPEG, PNG, and WebP formats. * **Cost-Effective Scaling:** The lightweight architecture makes it ideal for applications requiring thousands of assets, such as dynamic content generation or A/B testing visuals.
### When to use this model
Choose Z-Image Turbo when speed and volume are the primary constraints. It excels at rapid prototyping, storyboarding, and generating content variations where cost-per-pixel is a critical metric. While it offers robust prompt adherence, users requiring pixel-perfect photorealism or complex spatial reasoning might prefer larger, slower models.
To run via ModelRunner javascript client, use the following code:
```javascript import { modelrunner } from "@modelrunner/client";
const result = await modelrunner.subscribe('tongyi-mai/z-image/turbo', { input: { prompt: "Cinematic shot of a futuristic cyberpunk street, neon lights, rain on pavement, highly detailed", image_size: "landscape_16_9", num_inference_steps: 8, num_images: 1, enable_safety_checker: true, output_format: "png" } }); ```






