Model Details
Stable Diffusion 3.5 Large turns a text prompt into a high-quality image. It is Stability AI's flagship 8-billion-parameter Multimodal Diffusion Transformer (MMDiT), built for strong prompt adherence, accurate typography, and a wide range of visual styles from photorealism to illustration and 3D renders. Give it a descriptive prompt and pick an image size; it returns one or more generated images. Its standout strengths are complex-prompt understanding and stylistic diversity, making it a strong general-purpose text-to-image default.
## Best for

- Photorealistic scenes and portraits from a detailed text description
- Rendering legible text, signage, and typography inside an image
- Concept art, illustration, and stylized renders across many aesthetics
- Marketing visuals, product mockups, and social imagery from a prompt
- Complex multi-subject compositions that need faithful prompt adherence

## Choose another model when

- You want to edit, restyle, or inpaint an existing image rather than generate from scratch: use an image-editing model
- You need to enlarge or add detail to an existing image: use an upscaling model
- You need a video or animation: use a text-to-video model

## Tips

- Be specific about subject, setting, lighting, and style; SD 3.5 Large rewards detailed prompts
- Raise `guidance_scale` (default 3.5) toward 5-7 for tighter prompt adherence; lower it for more creative latitude
- Use `negative_prompt` to exclude unwanted elements (e.g. "blurry, extra fingers, watermark")
- Set `image_size` to a preset ("square_hd", "portrait_16_9", "landscape_4_3", …) or pass a custom `{ width, height }` object

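
The tips above can be folded into a small input builder. This is a minimal sketch, not part of the ModelRunner client: `buildInput` and its camelCase option names are hypothetical, the validation rules are illustrative assumptions, and the custom size shown is an example value rather than a documented limit. Only the field names `prompt`, `negative_prompt`, `guidance_scale`, and `image_size` come from this page.

```js
// Hypothetical helper (not provided by any client library): assembles the
// `input` object from the tips above. The checks are illustrative assumptions.
function buildInput({ prompt, negativePrompt, guidanceScale = 3.5, imageSize }) {
  if (typeof prompt !== "string" || prompt.trim() === "") {
    throw new Error("prompt must be a non-empty string");
  }

  // image_size is either a preset name or a custom { width, height } object.
  // The full preset list is not reproduced here, so any string is accepted.
  const isPreset = typeof imageSize === "string" && imageSize.length > 0;
  const isCustom =
    typeof imageSize === "object" &&
    imageSize !== null &&
    Number.isInteger(imageSize.width) &&
    Number.isInteger(imageSize.height);
  if (!isPreset && !isCustom) {
    throw new Error("image_size must be a preset name or { width, height }");
  }

  const input = {
    prompt,
    guidance_scale: guidanceScale, // 3.5 is the documented default
    image_size: imageSize,
  };
  // Only send negative_prompt when there is something to exclude.
  if (negativePrompt) {
    input.negative_prompt = negativePrompt;
  }
  return input;
}

// Tighter prompt adherence (guidance 6) with a custom size and exclusions.
const input = buildInput({
  prompt: "a neon sign reading OPEN in a rainy alley, cinematic lighting",
  negativePrompt: "blurry, extra fingers, watermark",
  guidanceScale: 6,
  imageSize: { width: 1024, height: 768 }, // example values only
});
console.log(input);
```

The returned object is intended to be passed as the `input` field of a request; whether the service accepts a given custom size is something to confirm against the API reference.
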
## Advanced Configuration

- `controlnet`: condition generation on a control image (structure/pose/edges) via a ControlNet model path plus a `control_image_url`. API-only.
- `ip_adapter`: image-prompt the generation from a reference image (`image_url`) via an IP-Adapter path, optionally masked. API-only.
- `loras`: apply one or more LoRA weights (by `path`, with an optional `scale`) for custom styles or subjects. API-only.

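
As a rough illustration of how these API-only options might be attached to a request, the sketch below merges `loras` and `controlnet` into a base input. The helper `withAdvancedOptions` is hypothetical, the `example.com` URLs are placeholders, and the exact nesting (an array of `{ path, scale }` objects for `loras`, a `{ path, control_image_url }` object for `controlnet`) is an assumption inferred from the descriptions above; check the API schema before relying on it.

```js
// Hypothetical helper: attaches API-only options to a base input object.
// Field names (`loras`, `path`, `scale`, `controlnet`, `control_image_url`)
// come from the list above; the nesting shown here is an assumption.
function withAdvancedOptions(baseInput, { loras = [], controlnet } = {}) {
  const input = { ...baseInput };

  if (loras.length > 0) {
    // `scale` is optional, so it is only included when the caller sets it.
    input.loras = loras.map(({ path, scale }) =>
      scale === undefined ? { path } : { path, scale }
    );
  }

  if (controlnet) {
    input.controlnet = {
      path: controlnet.path,
      control_image_url: controlnet.controlImageUrl,
    };
  }

  return input;
}

// Placeholder paths and URLs, for illustration only.
const advancedInput = withAdvancedOptions(
  { prompt: "a dancer mid-leap, watercolor style", image_size: "portrait_16_9" },
  {
    loras: [{ path: "https://example.com/watercolor-lora.safetensors", scale: 0.8 }],
    controlnet: {
      path: "https://example.com/pose-controlnet",
      controlImageUrl: "https://example.com/pose.png",
    },
  }
);
console.log(JSON.stringify(advancedInput, null, 2));
```

Keeping these options in a separate step leaves the base input usable on its own when no LoRA or control image is needed.
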
To run via the ModelRunner JavaScript client:

```js
import { modelrunner } from "@modelrunner/client";

const result = await modelrunner.subscribe("stability-ai/stable-diffusion-v3.5-large", {
  input: {
    prompt: "a serene mountain lake at golden hour, mist over the water, photorealistic",
    image_size: "landscape_4_3",
    num_inference_steps: 28,
    guidance_scale: 3.5,
  },
});
```




