Single-file Gemini image generation CLI with batch and grid output.

Python 59.9%
HTML 40.1%

Find a file

dikkadev 3e13d605e4 feat(cli): add selectable Nano Banana 2 model variants This switches the CLI to Nano Banana 2 defaults while letting users opt into the non-lite image model explicitly, making the new model selection visible in both the command interface and the README.		2026-07-01 10:36:02 +02:00
.agents/context	docs(agents): add repository guidance for AI assistants and local context files	2026-04-21 23:59:53 +02:00
.gitignore	docs(agents): add repository guidance for AI assistants and local context files	2026-04-21 23:59:53 +02:00
AGENTS.md	docs(agents): add repository guidance for AI assistants and local context files	2026-04-21 23:59:53 +02:00
cafe-image.prompt	script	2025-12-01 13:01:40 +01:00
CLAUDE.md	docs(agents): add repository guidance for AI assistants and local context files	2026-04-21 23:59:53 +02:00
img.py	feat(cli): add selectable Nano Banana 2 model variants	2026-07-01 10:36:02 +02:00
prompt	chore(prompt): update prompt text and ignore prompt files	2026-03-01 11:48:42 +01:00
README.md	feat(cli): add selectable Nano Banana 2 model variants	2026-07-01 10:36:02 +02:00
viewer.html	fix(viewer): make auto-refresh resilient to failed image loads	2026-03-05 22:33:02 +01:00

README.md

Nano Banana 2

Single-file CLI for generating images with Google's Nano Banana 2 models. Defaults to Nano Banana 2 Lite (gemini-3.1-flash-lite-image) and can opt into the non-lite model (gemini-3.1-flash-image). Supports text-to-image, image-to-image, batch generation with parallel API calls, and automatic grid composition.

Setup

Requires Python 3.13+ and a Gemini API key.

export GEMINI_API_KEY="your-key-here"

Dependencies (requests, pillow) are declared inline via PEP 723 — uv run handles them automatically.

Usage

# text-to-image
uv run img.py -p "a cat wearing sunglasses"

# prompt from file
uv run img.py -f my-prompt.prompt -o portrait.png

# image-to-image with reference photos
uv run img.py -p "oil painting style" -i photo.jpg -o painting.png

# multiple reference images (shell glob)
uv run img.py -p "blend these" -i in_photos/* -o blended.png

# batch: generate 4 variations in parallel, get a grid
uv run img.py -p "sunset landscape" -n 4 -s 2K -a 16:9 -o sunset.png
# → sunset-1.png, sunset-2.png, sunset-3.png, sunset-4.png, sunset.png (grid)

# opt into the non-lite Nano Banana 2 model
uv run img.py -p "a cinematic robot portrait" --model non-lite

Options

Flag	Description	Default
`-p, --prompt`	Prompt text	—
`-f, --prompt-file`	Read prompt from file	—
`-i, --input`	Reference image(s), repeatable	none
`-o, --out`	Output filename	`output.png`
`-n, --num-images`	Number of images to generate in parallel	`1`
`-s, --size`	Resolution: `512px`, `1K`, `2K`, `4K`	`1K`
`-a, --aspect-ratio`	Aspect ratio (e.g. `16:9`, `3:2`, `9:16`)	API default
`--model`	Nano Banana 2 variant: `lite` or `non-lite`	`lite`

Batch output

When -n is greater than 1, individual images are written as <name>-1.png, <name>-2.png, etc. A grid image combining all results is saved to the base output name. The grid uses shelf-packing at original resolution — no resizing or cropping.

Viewer

Open viewer.html in a browser to live-monitor generation progress. It auto-refreshes and displays output images in a grid.