Generate and edit images using Google's Gemini Nano Banana Pro model (gemini-3-pro-image-preview).
- Text-to-Image: Generate high-quality images from text descriptions
- Image Editing: Edit existing images with text prompts (add/remove elements, style transfer)
- Multi-Image Composition: Combine elements from multiple images
- High Resolution: Support for 1K, 2K, and 4K output resolutions
- Accurate Text Rendering: Generate images with legible, well-placed text
- Multiple Aspect Ratios: 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9
- Google Search Grounding: Generate images based on real-time information
- Multi-turn Editing: Iteratively refine images through conversation
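The resolution and aspect-ratio options listed above can be checked before a request is sent. The following is a minimal sketch, not the plugin's actual implementation: `build_image_config` is a hypothetical helper, and the `aspectRatio` and `imageSize` key names are assumptions based on the public Gemini REST API that should be verified against the current documentation.

```python
# Values copied from the feature list above.
SUPPORTED_ASPECT_RATIOS = {
    "1:1", "2:3", "3:2", "3:4", "4:3", "4:5", "5:4", "9:16", "16:9", "21:9",
}
SUPPORTED_RESOLUTIONS = {"1K", "2K", "4K"}


def build_image_config(aspect_ratio: str = "1:1", resolution: str = "1K") -> dict:
    """Return an image config dict, rejecting unsupported values early.

    Hypothetical helper: the returned key names are assumed, not confirmed
    by this README.
    """
    if aspect_ratio not in SUPPORTED_ASPECT_RATIOS:
        raise ValueError(f"Unsupported aspect ratio: {aspect_ratio!r}")
    if resolution not in SUPPORTED_RESOLUTIONS:
        raise ValueError(f"Unsupported resolution: {resolution!r}")
    return {"aspectRatio": aspect_ratio, "imageSize": resolution}
```

Calling `build_image_config("16:9", "2K")` returns a dict that could be nested in a request's generation config, while an unlisted value such as `"7:5"` raises `ValueError` before any API call is made.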
Before using this plugin, you must set your Gemini API key as an environment variable:
- Get a free API key from Google AI Studio
- Export the key in your shell profile (~/.zshrc, ~/.bashrc, etc.): export GEMINI_API_KEY="your_api_key_here"
- Restart your terminal or run source ~/.zshrc (or ~/.bashrc)
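A script that calls the API can confirm the key is visible before doing any work. This is a minimal sketch; `require_api_key` is a hypothetical helper name, and only the `GEMINI_API_KEY` variable name comes from the setup steps above.

```python
import os


def require_api_key(env_var: str = "GEMINI_API_KEY") -> str:
    """Return the API key from the environment, or fail with a clear message."""
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"{env_var} is not set. Export it in your shell profile "
            "and restart the terminal."
        )
    return key
```

Failing early with an explicit message tends to be easier to debug than an authentication error returned by the API later on.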
Invoke this skill for requests such as:
- "Generate an image of..."
- "Create a picture showing..."
- "Edit this image to..."
- "Add [element] to this image"
- "Transform this photo into [style]"
- "Create a logo for..."
- "Generate a product mockup..."
Generate a photorealistic image of a cozy coffee shop interior with warm lighting
Take this photo and add a small wizard hat to the cat
Transform this city photo into Van Gogh's Starry Night style
Create a minimalist logo for a coffee shop called 'The Daily Grind'
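For readers who want to see roughly what a text-to-image prompt like the ones above becomes on the wire, here is a hedged sketch that builds (but does not send) an HTTP request using only the standard library. The endpoint URL, the `x-goog-api-key` header, and the payload field names are assumptions based on the public Gemini REST API, not details taken from this plugin; `build_request` is a hypothetical helper.

```python
import json
import urllib.request

MODEL = "gemini-3-pro-image-preview"
# Assumed endpoint; verify against the current Gemini API documentation.
API_URL = (
    "https://generativelanguage.googleapis.com/v1beta/"
    f"models/{MODEL}:generateContent"
)


def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build a POST request for a text-to-image prompt without sending it."""
    payload = {
        "contents": [{"parts": [{"text": prompt}]}],
        # Assumed field names: ask for both text and image parts in the reply.
        "generationConfig": {"responseModalities": ["TEXT", "IMAGE"]},
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-goog-api-key": api_key},
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would perform the call. Keeping construction separate from sending makes the payload easy to inspect or log first.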
The gemini-3-pro-image-preview (Nano Banana Pro) model features:
- Advanced reasoning for complex prompts
- "Thinking" process for composition refinement
- Up to 14 reference images for input
- Google Search grounding for real-time data
- High-resolution output up to 4K
- Best performance with: EN, ar-EG, de-DE, es-MX, fr-FR, hi-IN, id-ID, it-IT, ja-JP, ko-KR, pt-BR, ru-RU, ua-UA, vi-VN, zh-CN
- No audio or video inputs
- Works best with up to 5 high-fidelity images (up to 14 total)
- All generated images include a SynthID watermark
- Version: 1.1.1
- Author: DAIR.AI