home / skills / openclaw / skills / ai-image-generation

ai-image-generation skill

needs review

This skill helps you generate AI images across 50+ models via inference.sh, enabling text-to-image, inpainting, upscaling, and editing.

npx playbooks add skill openclaw/skills --skill ai-image-generation

Review the files below or copy the command above to add this skill to your agents.

Files (2)

SKILL.md

4.4 KB

---
name: ai-image-generation
description: "Generate AI images with FLUX, Gemini, Grok, Seedream, Reve and 50+ models via inference.sh CLI. Models: FLUX Dev LoRA, FLUX.2 Klein LoRA, Gemini 3 Pro Image, Grok Imagine, Seedream 4.5, Reve, ImagineArt. Capabilities: text-to-image, image-to-image, inpainting, LoRA, image editing, upscaling, text rendering. Use for: AI art, product mockups, concept art, social media graphics, marketing visuals, illustrations. Triggers: flux, image generation, ai image, text to image, stable diffusion, generate image, ai art, midjourney alternative, dall-e alternative, text2img, t2i, image generator, ai picture, create image with ai, generative ai, ai illustration, grok image, gemini image"
allowed-tools: Bash(infsh *)
---

# AI Image Generation

Generate images with 50+ AI models via [inference.sh](https://inference.sh) CLI.

![AI Image Generation](https://cloud.inference.sh/app/files/u/4mg21r6ta37mpaz6ktzwtt8krr/01kg0v0nz7wv0qwqjtq1cam52z.jpeg)

## Quick Start

```bash
# Install CLI
curl -fsSL https://cli.inference.sh | sh && infsh login

# Generate an image with FLUX
infsh app run falai/flux-dev-lora --input '{"prompt": "a cat astronaut in space"}'
```

> **Install note:** The [install script](https://cli.inference.sh) only detects your OS/architecture, downloads the matching binary from `dist.inference.sh`, and verifies its SHA-256 checksum. No elevated permissions or background processes. [Manual install & verification](https://dist.inference.sh/cli/checksums.txt) available.

## Available Models

| Model | App ID | Best For |
|-------|--------|----------|
| FLUX Dev LoRA | `falai/flux-dev-lora` | High quality with custom styles |
| FLUX.2 Klein LoRA | `falai/flux-2-klein-lora` | Fast with LoRA support (4B/9B) |
| Gemini 3 Pro | `google/gemini-3-pro-image-preview` | Google's latest |
| Gemini 2.5 Flash | `google/gemini-2-5-flash-image` | Fast Google model |
| Grok Imagine | `xai/grok-imagine-image` | xAI's model, multiple aspects |
| Seedream 4.5 | `bytedance/seedream-4-5` | 2K-4K cinematic quality |
| Seedream 4.0 | `bytedance/seedream-4-0` | High quality 2K-4K |
| Seedream 3.0 | `bytedance/seedream-3-0-t2i` | Accurate text rendering |
| Reve | `falai/reve` | Natural language editing, text rendering |
| ImagineArt 1.5 Pro | `falai/imagine-art-1-5-pro-preview` | Ultra-high-fidelity 4K |
| Topaz Upscaler | `falai/topaz-image-upscaler` | Professional upscaling |

## Browse All Image Apps

```bash
infsh app list --category image
```

## Examples

### Text-to-Image with FLUX

```bash
infsh app run falai/flux-dev-lora --input '{
  "prompt": "professional product photo of a coffee mug, studio lighting"
}'
```

### Fast Generation with FLUX Klein

```bash
infsh app run falai/flux-2-klein-lora --input '{"prompt": "sunset over mountains"}'
```

### Google Gemini 3 Pro

```bash
infsh app run google/gemini-3-pro-image-preview --input '{
  "prompt": "photorealistic landscape with mountains and lake"
}'
```

### Grok Imagine

```bash
infsh app run xai/grok-imagine-image --input '{
  "prompt": "cyberpunk city at night",
  "aspect_ratio": "16:9"
}'
```

### Reve (with Text Rendering)

```bash
infsh app run falai/reve --input '{
  "prompt": "A poster that says HELLO WORLD in bold letters"
}'
```

### Seedream 4.5 (4K Quality)

```bash
infsh app run bytedance/seedream-4-5 --input '{
  "prompt": "cinematic portrait of a woman, golden hour lighting"
}'
```

### Image Upscaling

```bash
infsh app run falai/topaz-image-upscaler --input '{"image_url": "https://..."}'
```

### Stitch Multiple Images

```bash
infsh app run infsh/stitch-images --input '{
  "images": ["https://img1.jpg", "https://img2.jpg"],
  "direction": "horizontal"
}'
```

## Related Skills

```bash
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@inference-sh

# FLUX-specific skill
npx skills add inference-sh/skills@flux-image

# Upscaling & enhancement
npx skills add inference-sh/skills@image-upscaling

# Background removal
npx skills add inference-sh/skills@background-removal

# Video generation
npx skills add inference-sh/skills@ai-video-generation

# AI avatars from images
npx skills add inference-sh/skills@ai-avatar-video
```

Browse all apps: `infsh app list`

## Documentation

- [Running Apps](https://inference.sh/docs/apps/running) - How to run apps via CLI
- [Image Generation Example](https://inference.sh/docs/examples/image-generation) - Complete image generation guide
- [Apps Overview](https://inference.sh/docs/apps/overview) - Understanding the app ecosystem

Overview

This skill enables AI image generation using the inference.sh CLI with access to 50+ models including FLUX, Gemini, Grok, Seedream, Reve and more. It supports text-to-image, image-to-image, inpainting, LoRA, upscaling, and text rendering to produce art, mockups, and marketing visuals. Use the CLI to run specific app IDs and tune outputs across speed and fidelity trade-offs.

How this skill works

You install and authenticate the inference.sh CLI then run model apps by their app IDs (for example falai/flux-dev-lora or google/gemini-3-pro-image-preview). Inputs are JSON payloads containing prompts, images, aspect ratios and other parameters; models return generated images or edited outputs. The skill exposes models optimized for text rendering, LoRA fine-tuning, high-resolution cinematic output and professional upscaling.

When to use it

Creating concept art, illustrations, and stylized AI artwork from text prompts.
Generating high-quality product mockups and studio-style photos for marketing.
Editing or inpainting parts of an existing image with precise text instructions.
Upscaling low-res images to print or social media sizes with Topaz upscaler.
Producing social graphics or posters that require accurate text rendering.

Best practices

Choose the model that matches your goal: Seedream or ImagineArt for 4K fidelity, Klein variants for fast iterations, Reve for text rendering and editing.
Start with concise, descriptive prompts and iterate with negative prompts or style tokens for refinement.
Use LoRA-enabled models when you need custom styles or consistent character features.
Provide reference images for image-to-image or inpainting tasks to guide composition.
Batch test prompts at low cost/fast models before running final high-res renders.

Example use cases

Generate a photorealistic product photo: infsh app run falai/flux-dev-lora --input '{"prompt": "professional product photo of a coffee mug, studio lighting"}'
Fast scene drafts with Klein LoRA for concept exploration: infsh app run falai/flux-2-klein-lora --input '{"prompt": "sunset over mountains"}'
Create a poster with readable text using Reve: infsh app run falai/reve --input '{"prompt": "A poster that says HELLO WORLD in bold letters"}'
Produce cinematic portraits in 4K with Seedream 4.5 for marketing assets.
Upscale and enhance images before publishing: infsh app run falai/topaz-image-upscaler --input '{"image_url": "https://..."}'

FAQ

Do I need an account to use the CLI?

Yes. Install the inference.sh CLI and run infsh login to authenticate before running apps.

Which model should I pick for fast iterations?

Use FLUX.2 Klein LoRA or other Klein/Flash models for speed, then switch to Seedream or ImagineArt for final high-quality renders.