N

nano-banana-pro

Verified

by elizaOS · Updated Apr 16, 2026

claude-opus-4-6
20in3.1Kout
$0.364m 53s

Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro). Use when the user asks to create an image, generate a picture, produce AI-generated artwork, edit a photo, compose multiple images, or upscale an image to higher resolution. Supports text-to-image generation, single-image editing, and multi-image composition using the Gemini API.

image-generationimage-editinggeminiai-artmediaupscaling

See It In Action

1 turns·claude
The project codebase is in your current working directory. Explore the files to understand the code before making changes. need the hero mockup done for this afternoon's design review — combine the latest UI shot with the new device frame + branded bg and save the final image

Now let me look at the input images to understand what we're working with:

Now I have a clear picture of the three source assets. Let me compose them into the hero mockup using the generate_image script:

The model name needs updating. Let me check available models and fix the script:

I see gemini-3-pro-image-preview is available. Let me update the model name and run the composition:

The image was generated successfully. Let me verify the output:

The hero mockup is done and saved. Here's a summary:

Saved → assets/2026-04-15-hero-mockup.png (2048 × 2048, ~1.9 MB)

The final image composites all three source assets:

  • ui-screenshot.png — the latest UI screen placed inside the device
  • device-frame.png — the device frame wrapping the screenshot
  • branded-background.png — the branded background behind the device

The result is a polished hero mockup with the device centered on the branded background with a subtle shadow and depth — ready for this afternoon's design review.

Note: I also updated scripts/generate_image.py to use the current gemini-3-pro-image-preview model (the previous gemini-2.0-flash-preview-image-generation model was no longer available).

Claude Sonnet 4.6

External Tools

ToolType
uvbinary

Permissions

ScopeDescription
filesystem:read
filesystem:write
network:outbound
process:spawn
env:read

SKILL.md

Nano Banana Pro (Gemini 3 Pro Image)

Use the bundled script to generate or edit images.

Generate

uv run {baseDir}/scripts/generate_image.py --prompt "your image description" --filename "output.png" --resolution 1K

Edit (single image)

uv run {baseDir}/scripts/generate_image.py --prompt "edit instructions" --filename "output.png" -i "/path/in.png" --resolution 2K

Multi-image composition (up to 14 images)

uv run {baseDir}/scripts/generate_image.py --prompt "combine these into one scene" --filename "output.png" -i img1.png -i img2.png -i img3.png

API key

  • GEMINI_API_KEY env var
  • Or set skills."nano-banana-pro".apiKey / skills."nano-banana-pro".env.GEMINI_API_KEY in ~/.otto/otto.json

Notes

  • Resolutions: 1K (default), 2K, 4K.
  • Use timestamps in filenames: yyyy-mm-dd-hh-mm-ss-name.png.
  • The script prints a MEDIA: line for Otto to auto-attach on supported chat providers.
  • Do not read the image back; report the saved path only.

FAQ

What does nano-banana-pro do?

Generate or edit images via Gemini 3 Pro Image (Nano Banana Pro). Use when the user asks to create an image, generate a picture, produce AI-generated artwork, edit a photo, compose multiple images, or upscale an image to higher resolution. Supports text-to-image generation, single-image editing, and multi-image composition using the Gemini API.

When should I use nano-banana-pro?

Use it when you need a repeatable workflow that produces image output, code diff.

What does nano-banana-pro output?

In the evaluated run it produced image output, code diff.

How do I install or invoke nano-banana-pro?

Ask the agent to use this skill when the task matches its documented workflow.

Which agents does nano-banana-pro support?

Agent support is inferred from the source, but not explicitly declared.

What tools, channels, or permissions does nano-banana-pro need?

It uses uv; channels commonly include image, diff; permissions include filesystem:read, filesystem:write, network:outbound, process:spawn, env:read.

Is nano-banana-pro safe to install?

Static analysis marked this skill as medium risk; review side effects and permissions before enabling it.

How is nano-banana-pro different from an MCP or plugin?

A skill packages instructions and workflow conventions; tools, MCP servers, and plugins are dependencies the skill may call during execution.

Does nano-banana-pro outperform not using a skill?

About nano-banana-pro

When to use nano-banana-pro

When you need to create an AI-generated image from a text prompt. When you want to edit or upscale an existing image using Gemini. When you need to combine multiple input images into a single composed output.

When nano-banana-pro is not the right choice

When you need vector design editing or manual graphic design workflows. When network access or a Gemini API key is unavailable.

What it produces

Produces image output and code diff.