Skip to main content
Google avatar

google

Google

https://github.com/google

Models

lyria2

google / lyria2

Generate ~30 seconds of high-fidelity instrumental music from a text prompt, as a 48kHz WAV file.

nano-banana-2/edit

google / nano-banana-2/edit

Edit images with text prompts. Make targeted changes like adding or removing objects, changing styles, or modifying specific elements while preserving the rest of the image.

nano-banana-2

google / nano-banana-2

Nano Banana 2 is a fast and versatile text-to-image model. It excels at creating high-quality images, from photorealistic scenes to complex infographics with accurate text, and can optionally use Google Search to generate content based on real-time information.

veo-3.1/extend-video

google / veo-3.1/extend-video

Extend existing Veo-generated videos by seamlessly adding 7 seconds of high-fidelity footage and synchronized audio using text prompts.

google / veo-3.1/reference-to-video

Generate high-fidelity, cinematic videos with synchronized audio by using text prompts and up to three reference images to guide visual style and content.

google / veo-3.1/first-last-frame-to-video

Generate seamless 8-second video transitions by interpolating between a first and last frame with high-fidelity visuals and native audio.

google / veo-3.1/image-to-video

Turn static images into high-fidelity 720p or 1080p videos with synchronized native audio using text prompts to guide the animation.

google / veo-3.1/text-to-video

Create cinematic 8-second videos with Veo 3.1, Google’s latest text-to-video model in the Gemini API — now with native audio, frame control, and reference image support.

nano-banana

google / nano-banana

State of the art image editing model from Google Gemini 2.5.