This skill helps you generate AI images across 50+ models via inference.sh, enabling text-to-image, inpainting, upscaling, and editing.
To add this skill to your agents, run `npx playbooks add skill openclaw/skills --skill ai-image-generation`.
---
name: ai-image-generation
description: "Generate AI images with FLUX, Gemini, Grok, Seedream, Reve and 50+ models via inference.sh CLI. Models: FLUX Dev LoRA, FLUX.2 Klein LoRA, Gemini 3 Pro Image, Grok Imagine, Seedream 4.5, Reve, ImagineArt. Capabilities: text-to-image, image-to-image, inpainting, LoRA, image editing, upscaling, text rendering. Use for: AI art, product mockups, concept art, social media graphics, marketing visuals, illustrations. Triggers: flux, image generation, ai image, text to image, stable diffusion, generate image, ai art, midjourney alternative, dall-e alternative, text2img, t2i, image generator, ai picture, create image with ai, generative ai, ai illustration, grok image, gemini image"
allowed-tools: Bash(infsh *)
---
# AI Image Generation
Generate images with 50+ AI models via the [inference.sh](https://inference.sh) CLI.

## Quick Start
```bash
# Install CLI
curl -fsSL https://cli.inference.sh | sh && infsh login
# Generate an image with FLUX
infsh app run falai/flux-dev-lora --input '{"prompt": "a cat astronaut in space"}'
```
> **Install note:** The [install script](https://cli.inference.sh) only detects your OS/architecture, downloads the matching binary from `dist.inference.sh`, and verifies its SHA-256 checksum. No elevated permissions or background processes. [Manual install & verification](https://dist.inference.sh/cli/checksums.txt) available.
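
For a manual install, the checksum comparison mentioned in the note can be sketched as below. This is a minimal sketch, not the install script's actual logic: `sha256_of` and `verify_sha256` are hypothetical helper names, and the expected digest is passed in directly because the layout of `checksums.txt` is not described here.

```bash
# sha256_of FILE: print the SHA-256 hex digest of FILE,
# using sha256sum where available and shasum otherwise.
sha256_of() {
  if command -v sha256sum >/dev/null 2>&1; then
    sha256sum "$1" | awk '{print $1}'
  else
    shasum -a 256 "$1" | awk '{print $1}'
  fi
}

# verify_sha256 FILE EXPECTED: succeed only if FILE's digest equals EXPECTED.
# An unreadable file yields an empty digest and is treated as a failure.
verify_sha256() {
  actual=$(sha256_of "$1" 2>/dev/null)
  [ -n "$actual" ] && [ "$actual" = "$2" ]
}
```

To use it, download the binary and the published checksums, look up the digest for your platform by hand, and call `verify_sha256 <downloaded-file> <expected-digest>` before making the file executable.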
## Available Models
| Model | App ID | Best For |
|-------|--------|----------|
| FLUX Dev LoRA | `falai/flux-dev-lora` | High quality with custom styles |
| FLUX.2 Klein LoRA | `falai/flux-2-klein-lora` | Fast with LoRA support (4B/9B) |
| Gemini 3 Pro | `google/gemini-3-pro-image-preview` | Google's latest |
| Gemini 2.5 Flash | `google/gemini-2-5-flash-image` | Fast Google model |
| Grok Imagine | `xai/grok-imagine-image` | xAI's model, multiple aspect ratios |
| Seedream 4.5 | `bytedance/seedream-4-5` | 2K-4K cinematic quality |
| Seedream 4.0 | `bytedance/seedream-4-0` | High quality 2K-4K |
| Seedream 3.0 | `bytedance/seedream-3-0-t2i` | Accurate text rendering |
| Reve | `falai/reve` | Natural language editing, text rendering |
| ImagineArt 1.5 Pro | `falai/imagine-art-1-5-pro-preview` | Ultra-high-fidelity 4K |
| Topaz Upscaler | `falai/topaz-image-upscaler` | Professional upscaling |
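
When it is unclear which model in the table suits a prompt, one option is to send the same payload to several app IDs and compare the results. The loop below is a rough sketch: `run_app` and `compare_models` are hypothetical helper names, and it assumes each listed app accepts a payload with only a `prompt` field, as the examples in this document do for these three models.

```bash
# run_app APP_ID JSON: run the app if infsh is on PATH,
# otherwise print the command that would have been run.
run_app() {
  if command -v infsh >/dev/null 2>&1; then
    infsh app run "$1" --input "$2"
  else
    echo "infsh not found; would run: infsh app run $1 --input '$2'"
  fi
}

# compare_models JSON APP_ID...: send one payload to each app in turn.
compare_models() {
  prompt_json=$1
  shift
  for app in "$@"; do
    echo "== $app =="
    run_app "$app" "$prompt_json"
  done
}

compare_models '{"prompt": "a cat astronaut in space"}' \
  falai/flux-dev-lora falai/flux-2-klein-lora bytedance/seedream-4-5
```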
## Browse All Image Apps
```bash
infsh app list --category image
```
## Examples
### Text-to-Image with FLUX
```bash
infsh app run falai/flux-dev-lora --input '{
"prompt": "professional product photo of a coffee mug, studio lighting"
}'
```
### Fast Generation with FLUX Klein
```bash
infsh app run falai/flux-2-klein-lora --input '{"prompt": "sunset over mountains"}'
```
### Google Gemini 3 Pro
```bash
infsh app run google/gemini-3-pro-image-preview --input '{
"prompt": "photorealistic landscape with mountains and lake"
}'
```
### Grok Imagine
```bash
infsh app run xai/grok-imagine-image --input '{
"prompt": "cyberpunk city at night",
"aspect_ratio": "16:9"
}'
```
### Reve (with Text Rendering)
```bash
infsh app run falai/reve --input '{
"prompt": "A poster that says HELLO WORLD in bold letters"
}'
```
### Seedream 4.5 (4K Quality)
```bash
infsh app run bytedance/seedream-4-5 --input '{
"prompt": "cinematic portrait of a woman, golden hour lighting"
}'
```
### Image Upscaling
```bash
infsh app run falai/topaz-image-upscaler --input '{"image_url": "https://..."}'
```
### Stitch Multiple Images
```bash
infsh app run infsh/stitch-images --input '{
"images": ["https://img1.jpg", "https://img2.jpg"],
"direction": "horizontal"
}'
```
## Related Skills
```bash
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@inference-sh
# FLUX-specific skill
npx skills add inference-sh/skills@flux-image
# Upscaling & enhancement
npx skills add inference-sh/skills@image-upscaling
# Background removal
npx skills add inference-sh/skills@background-removal
# Video generation
npx skills add inference-sh/skills@ai-video-generation
# AI avatars from images
npx skills add inference-sh/skills@ai-avatar-video
```
Browse all apps: `infsh app list`
## Documentation
- [Running Apps](https://inference.sh/docs/apps/running) - How to run apps via CLI
- [Image Generation Example](https://inference.sh/docs/examples/image-generation) - Complete image generation guide
- [Apps Overview](https://inference.sh/docs/apps/overview) - Understanding the app ecosystem

This skill generates AI images through the inference.sh CLI, with access to 50+ models including FLUX, Gemini, Grok, Seedream, and Reve. It supports text-to-image, image-to-image, inpainting, LoRA, upscaling, and text rendering for art, mockups, and marketing visuals. Pick a model by running its app ID, trading off speed against fidelity.

Install and authenticate the inference.sh CLI, then run model apps by their app IDs (for example `falai/flux-dev-lora` or `google/gemini-3-pro-image-preview`). Inputs are JSON payloads containing prompts, images, aspect ratios, and other parameters; models return generated or edited images. The skill exposes models suited to text rendering, LoRA-based custom styles, high-resolution cinematic output, and professional upscaling.
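
Because the input is a JSON string passed on the command line, prompts that contain double quotes or backslashes need escaping before they are embedded. The sketch below shows one way to build the payload from shell variables. `json_escape` and `build_payload` are hypothetical helpers, not part of the CLI, and the escaping handles only quotes and backslashes, so it assumes the prompt has no newlines or other control characters.

```bash
# json_escape STRING: escape backslashes and double quotes for use inside
# a JSON string. Assumption: STRING contains no control characters.
json_escape() {
  printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g'
}

# build_payload PROMPT [ASPECT_RATIO]: print a JSON payload with a prompt
# and, when given, an aspect_ratio field (as in the Grok Imagine example).
build_payload() {
  if [ -n "${2:-}" ]; then
    printf '{"prompt": "%s", "aspect_ratio": "%s"}' \
      "$(json_escape "$1")" "$(json_escape "$2")"
  else
    printf '{"prompt": "%s"}' "$(json_escape "$1")"
  fi
}

payload=$(build_payload 'A poster that says "HELLO WORLD" in bold letters' '16:9')
echo "$payload"

# Run the app only when the CLI is installed.
if command -v infsh >/dev/null 2>&1; then
  infsh app run xai/grok-imagine-image --input "$payload"
fi
```

Passing `"$payload"` in double quotes keeps the whole JSON document as a single argument, whatever the prompt contains.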

## FAQ

**Do I need an account to use the CLI?**
Yes. Install the inference.sh CLI and run `infsh login` to authenticate before running apps.

**Which model should I pick for fast iterations?**
Use FLUX.2 Klein LoRA or other Klein/Flash models for speed, then switch to Seedream or ImagineArt for final high-quality renders.
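
The draft-then-final workflow can be sketched as a small stage switch. `app_for_stage` and the `STAGE` variable are hypothetical names for illustration; the two app IDs come from the model table, and the choice of Seedream 4.5 for the final stage is just one of the options named above.

```bash
# app_for_stage STAGE: map a workflow stage to an app ID.
app_for_stage() {
  case "$1" in
    draft) echo "falai/flux-2-klein-lora" ;;  # fast iterations
    final) echo "bytedance/seedream-4-5" ;;   # high-quality render
    *) echo "unknown stage: $1 (expected draft or final)" >&2; return 1 ;;
  esac
}

# STAGE defaults to draft; set STAGE=final for the last render.
stage=${STAGE:-draft}
app=$(app_for_stage "$stage") || exit 1
echo "stage=$stage app=$app"

# Run the app only when the CLI is installed.
if command -v infsh >/dev/null 2>&1; then
  infsh app run "$app" --input '{"prompt": "sunset over mountains"}'
fi
```

Iterate on the prompt with the default stage, then rerun the same script with `STAGE=final` once the composition looks right.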