home / skills / openclaw / skills / ai-image-generation

ai-image-generation skill

/skills/okaris/ai-image-generation

This skill helps you generate AI images across 50+ models via inference.sh, enabling text-to-image, inpainting, upscaling, and editing.

npx playbooks add skill openclaw/skills --skill ai-image-generation

Review the files below or copy the command above to add this skill to your agents.

Files (2)
SKILL.md
4.4 KB
---
name: ai-image-generation
description: "Generate AI images with FLUX, Gemini, Grok, Seedream, Reve and 50+ models via inference.sh CLI. Models: FLUX Dev LoRA, FLUX.2 Klein LoRA, Gemini 3 Pro Image, Grok Imagine, Seedream 4.5, Reve, ImagineArt. Capabilities: text-to-image, image-to-image, inpainting, LoRA, image editing, upscaling, text rendering. Use for: AI art, product mockups, concept art, social media graphics, marketing visuals, illustrations. Triggers: flux, image generation, ai image, text to image, stable diffusion, generate image, ai art, midjourney alternative, dall-e alternative, text2img, t2i, image generator, ai picture, create image with ai, generative ai, ai illustration, grok image, gemini image"
allowed-tools: Bash(infsh *)
---

# AI Image Generation

Generate images with 50+ AI models via [inference.sh](https://inference.sh) CLI.

![AI Image Generation](https://cloud.inference.sh/app/files/u/4mg21r6ta37mpaz6ktzwtt8krr/01kg0v0nz7wv0qwqjtq1cam52z.jpeg)

## Quick Start

```bash
# Install CLI
curl -fsSL https://cli.inference.sh | sh && infsh login

# Generate an image with FLUX
infsh app run falai/flux-dev-lora --input '{"prompt": "a cat astronaut in space"}'
```

> **Install note:** The [install script](https://cli.inference.sh) only detects your OS/architecture, downloads the matching binary from `dist.inference.sh`, and verifies its SHA-256 checksum. No elevated permissions or background processes. [Manual install & verification](https://dist.inference.sh/cli/checksums.txt) available.

## Available Models

| Model | App ID | Best For |
|-------|--------|----------|
| FLUX Dev LoRA | `falai/flux-dev-lora` | High quality with custom styles |
| FLUX.2 Klein LoRA | `falai/flux-2-klein-lora` | Fast with LoRA support (4B/9B) |
| Gemini 3 Pro | `google/gemini-3-pro-image-preview` | Google's latest |
| Gemini 2.5 Flash | `google/gemini-2-5-flash-image` | Fast Google model |
| Grok Imagine | `xai/grok-imagine-image` | xAI's model, multiple aspects |
| Seedream 4.5 | `bytedance/seedream-4-5` | 2K-4K cinematic quality |
| Seedream 4.0 | `bytedance/seedream-4-0` | High quality 2K-4K |
| Seedream 3.0 | `bytedance/seedream-3-0-t2i` | Accurate text rendering |
| Reve | `falai/reve` | Natural language editing, text rendering |
| ImagineArt 1.5 Pro | `falai/imagine-art-1-5-pro-preview` | Ultra-high-fidelity 4K |
| Topaz Upscaler | `falai/topaz-image-upscaler` | Professional upscaling |

## Browse All Image Apps

```bash
infsh app list --category image
```

## Examples

### Text-to-Image with FLUX

```bash
infsh app run falai/flux-dev-lora --input '{
  "prompt": "professional product photo of a coffee mug, studio lighting"
}'
```

### Fast Generation with FLUX Klein

```bash
infsh app run falai/flux-2-klein-lora --input '{"prompt": "sunset over mountains"}'
```

### Google Gemini 3 Pro

```bash
infsh app run google/gemini-3-pro-image-preview --input '{
  "prompt": "photorealistic landscape with mountains and lake"
}'
```

### Grok Imagine

```bash
infsh app run xai/grok-imagine-image --input '{
  "prompt": "cyberpunk city at night",
  "aspect_ratio": "16:9"
}'
```

### Reve (with Text Rendering)

```bash
infsh app run falai/reve --input '{
  "prompt": "A poster that says HELLO WORLD in bold letters"
}'
```

### Seedream 4.5 (4K Quality)

```bash
infsh app run bytedance/seedream-4-5 --input '{
  "prompt": "cinematic portrait of a woman, golden hour lighting"
}'
```

### Image Upscaling

```bash
infsh app run falai/topaz-image-upscaler --input '{"image_url": "https://..."}'
```

### Stitch Multiple Images

```bash
infsh app run infsh/stitch-images --input '{
  "images": ["https://img1.jpg", "https://img2.jpg"],
  "direction": "horizontal"
}'
```

## Related Skills

```bash
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@inference-sh

# FLUX-specific skill
npx skills add inference-sh/skills@flux-image

# Upscaling & enhancement
npx skills add inference-sh/skills@image-upscaling

# Background removal
npx skills add inference-sh/skills@background-removal

# Video generation
npx skills add inference-sh/skills@ai-video-generation

# AI avatars from images
npx skills add inference-sh/skills@ai-avatar-video
```

Browse all apps: `infsh app list`

## Documentation

- [Running Apps](https://inference.sh/docs/apps/running) - How to run apps via CLI
- [Image Generation Example](https://inference.sh/docs/examples/image-generation) - Complete image generation guide
- [Apps Overview](https://inference.sh/docs/apps/overview) - Understanding the app ecosystem

Overview

This skill enables AI image generation using the inference.sh CLI with access to 50+ models including FLUX, Gemini, Grok, Seedream, Reve and more. It supports text-to-image, image-to-image, inpainting, LoRA, upscaling, and text rendering to produce art, mockups, and marketing visuals. Use the CLI to run specific app IDs and tune outputs across speed and fidelity trade-offs.

How this skill works

You install and authenticate the inference.sh CLI then run model apps by their app IDs (for example falai/flux-dev-lora or google/gemini-3-pro-image-preview). Inputs are JSON payloads containing prompts, images, aspect ratios and other parameters; models return generated images or edited outputs. The skill exposes models optimized for text rendering, LoRA fine-tuning, high-resolution cinematic output and professional upscaling.

When to use it

  • Creating concept art, illustrations, and stylized AI artwork from text prompts.
  • Generating high-quality product mockups and studio-style photos for marketing.
  • Editing or inpainting parts of an existing image with precise text instructions.
  • Upscaling low-res images to print or social media sizes with Topaz upscaler.
  • Producing social graphics or posters that require accurate text rendering.

Best practices

  • Choose the model that matches your goal: Seedream or ImagineArt for 4K fidelity, Klein variants for fast iterations, Reve for text rendering and editing.
  • Start with concise, descriptive prompts and iterate with negative prompts or style tokens for refinement.
  • Use LoRA-enabled models when you need custom styles or consistent character features.
  • Provide reference images for image-to-image or inpainting tasks to guide composition.
  • Batch test prompts at low cost/fast models before running final high-res renders.

Example use cases

  • Generate a photorealistic product photo: infsh app run falai/flux-dev-lora --input '{"prompt": "professional product photo of a coffee mug, studio lighting"}'
  • Fast scene drafts with Klein LoRA for concept exploration: infsh app run falai/flux-2-klein-lora --input '{"prompt": "sunset over mountains"}'
  • Create a poster with readable text using Reve: infsh app run falai/reve --input '{"prompt": "A poster that says HELLO WORLD in bold letters"}'
  • Produce cinematic portraits in 4K with Seedream 4.5 for marketing assets.
  • Upscale and enhance images before publishing: infsh app run falai/topaz-image-upscaler --input '{"image_url": "https://..."}'

FAQ

Do I need an account to use the CLI?

Yes. Install the inference.sh CLI and run infsh login to authenticate before running apps.

Which model should I pick for fast iterations?

Use FLUX.2 Klein LoRA or other Klein/Flash models for speed, then switch to Seedream or ImagineArt for final high-quality renders.