Discover Amazing Models

Explore a curated collection of powerful models ready to solve your problems

All Models

31 models available
seedance-v1.5/image-to-video

bytedance / seedance-v1.5/image-to-video

Transform static images into dynamic videos with synchronized audio. Supports text-guided animations and start/end frame keying.

seedance-v1.5/text-to-video

bytedance / seedance-v1.5/text-to-video

Generate high-quality videos with synchronized audio directly from text prompts using Seedance 1.5.

seedream-v5/edit

bytedance / seedream-v5/edit

Edit and seamlessly compose images using text prompts and multiple reference images with the fast, high-quality Seedream 5.0 Lite model.

seedream-v5/text-to-image

bytedance / seedream-v5/text-to-image

Generate high-quality, intelligent images from text prompts using the fast Lite version of Seedream 5.0.

seedream-v4.5/edit

bytedance / seedream-v4.5/edit

Advanced image editing model by ByteDance that uses text prompts and up to 10 reference images to stylize, transform, and seamlessly composite visuals.

seedream-v4.5/text-to-image

bytedance / seedream-v4.5/text-to-image

A next-generation text-to-image model by ByteDance, capable of high-fidelity generation, precise text rendering, and complex stylistic control for highly detailed visual compositions.

nano-banana-2/edit

google / nano-banana-2/edit

Edit images with text prompts. Make targeted changes like adding or removing objects, changing styles, or modifying specific elements while preserving the rest of the image.

nano-banana-2

google / nano-banana-2

Nano Banana 2 is a fast and versatile text-to-image model. It excels at creating high-quality images, from photorealistic scenes to complex infographics with accurate text, and can optionally use Google Search to generate content based on real-time information.

bitdance

shallowdream204 / bitdance

Generate fast, high-resolution, photorealistic images from text prompts using an advanced autoregressive model for efficient, high-quality results.

firered-image-edit

fireredteam / firered-image-edit

An advanced image editing model that modifies images based on text prompts. It supports single-image edits and multi-image compositions for tasks like style transfer or virtual try-on.

upscale/image

topazlabs / upscale/image

Upscale and enhance your images using a variety of powerful AI models. Increase resolution up to 4x, restore details, and use specialized modes for photos, CGI, and text, with optional face enhancement for professional-quality results.

upscale/video

topazlabs / upscale/video

Enhance and enlarge your videos with professional-grade upscaling, detail recovery, and smooth frame interpolation.

z-image/turbo/image-to-image

tongyi-mai / z-image/turbo/image-to-image

Generate images from text and an initial image using Tongyi-MAI's super-fast Z-Image Turbo model.

z-image/base

tongyi-mai / z-image/base

Generate high-quality, stylistically diverse images with precise prompt adherence using the Z-Image foundation model.

z-image/turbo

tongyi-mai / z-image/turbo

High-speed 6B parameter text-to-image generation optimized for cost efficiency and volume. Produces up to 4MP images in an 8-step pipeline suitable for rapid prototyping.

veo-3.1-extend-video

google / veo-3.1-extend-video

Extend existing Veo-generated videos by seamlessly adding 7 seconds of high-fidelity footage and synchronized audio using text prompts.

veo-3.1-reference-to-video

google / veo-3.1-reference-to-video

Generate high-fidelity, cinematic videos with synchronized audio by using text prompts and up to three reference images to guide visual style and content.

veo-3.1-first-last-frame-to-video

google / veo-3.1-first-last-frame-to-video

Generate seamless 8-second video transitions by interpolating between a first and last frame with high-fidelity visuals and native audio.

veo-3.1-image-to-video

google / veo-3.1-image-to-video

Turn static images into high-fidelity 720p or 1080p videos with synchronized native audio using text prompts to guide the animation.

clothes-on-model

modelrunner / clothes-on-model

Generate realistic on-model fashion photos from your garment/accessory references and an optional model photo.

meituan-longcat / longcat-video-t2v-480p

Turn plain text into cinematic, on-brand video. Describe the scene and camera feel; LongCat generates smooth, consistent shots with adjustable length, FPS, and quality.

meituan-longcat / longcat-video-i2v-480p

LongCat-Video turns a single still image into minutes-long, smooth 480p, 30fps video with stable style, lighting, and identity — fast, consistent, production-ready animation from one frame.

jewelry-modeling

modelrunner / jewelry-modeling

Generate realistic jewelry modeling photos by applying provided jewelry images onto a model or person, with professional lighting and detailed textures.

google / veo-3.1-text-to-video

Create cinematic 8-second videos with Veo 3.1, Google’s latest text-to-video model in the Gemini API — now with native audio, frame control, and reference image support.

real-esrgan-upscaler

nightmareai / real-esrgan-upscaler

High-quality image upscaler with optional face enhancement

bytedance / seedance-v1-pro

Seedance 1.0 generates 1080P videos with smooth motion, rich detail, and diverse styles, while the pro version adds multi-shot narrative and advanced instruction following for cinematic results.

seedream-v4

bytedance / seedream-v4

Seedream 4.0 is a next-generation image creation model that unifies generation and editing in a single architecture, enabling advanced multimodal reasoning and reference consistency while delivering stunning 4K images with significantly faster inference.

nano-banana

google / nano-banana

State of the art image editing model from Google Gemini 2.5.

sdxl-lightning-4step

bytedance / sdxl-lightning-4step

SDXL-Lightning is a lightning-fast text-to-image generation model that produces high-quality 1024px images in just a few steps, distilled from Stable Diffusion XL.

musicgen

meta / musicgen

A fast, controllable auto-regressive Transformer for high-fidelity music generation.

inspyrenet

swook / inspyrenet

Helps find and highlight important objects in high-resolution images. It works without needing special high-quality training data and gives sharp, accurate results.