stability-ai

Generate high-quality images via Stability AI API (SDXL, SD3, Stable Image Core). Use when user asks to "generate image", "make a picture", or "draw this".

3,891 stars

byopenclaw

Complexity: easy

View on GitHub Installation ↓

About this skill

The Stability AI Skill empowers AI agents to generate diverse and high-quality images directly through the Stability AI API. It provides a robust interface to popular models like SDXL, SD3, and Stable Image Core, allowing users to create visuals ranging from realistic photographs to fantastical art. The skill is designed for programmatic image creation, enabling agents to interpret user prompts and execute sophisticated generation commands. This skill is highly versatile, supporting various parameters such as specific aspect ratios (e.g., 16:9, 1:1), numerous style presets (e.g., anime, cinematic, pixel-art), and advanced controls like negative prompts, seed values for reproducibility, and custom steps/CFG scales. It streamlines the process of integrating powerful image generation capabilities into AI workflows, making it ideal for content creation, digital art projects, and visual prototyping. Users would leverage this skill to quickly produce visual assets, explore creative concepts, or integrate dynamic image generation into larger automated tasks. The output includes a local image file in various formats (PNG, JPG, WebP) along with comprehensive JSON metadata, ensuring transparency and reproducibility of generated content.

Best use case

This skill's primary use case is for AI agents or users who need to generate custom images based on text prompts, controlling various artistic and technical parameters. It benefits digital artists, content creators, marketers, and developers looking to automate visual content creation or integrate powerful image generation into their applications and workflows.

Generate high-quality images via Stability AI API (SDXL, SD3, Stable Image Core). Use when user asks to "generate image", "make a picture", or "draw this".

A high-quality image file (PNG, JPG, or WebP) saved to a local path, accompanied by a JSON file containing all generation metadata.

Practical example

Example input

Generate a cyberpunk street at night with neon lights, in a 16:9 aspect ratio and a neon-punk style.

Example output

Image generated: `/path/to/generated_image_123.jpg`. Metadata saved: `/path/to/generated_image_123.json`

When to use this skill

When a user asks to "generate image", "make a picture", or "draw this".
To create visual content for websites, social media, or presentations from text descriptions.
When specific artistic styles, aspect ratios, or advanced generation parameters are required.
For rapid prototyping of visual concepts or generating diverse image variations.

When not to use this skill

When an existing image needs editing, modification, or upscaling (this skill is for generation).
If offline image generation is required, as it relies on an active API key and internet connection.
For tasks requiring real-time video generation or 3D model creation.
If the generated image quality or specific style presets do not meet highly niche, specialized requirements.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/stability-ai/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/1999azzar/stability-ai/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/stability-ai/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How stability-ai Compares

Feature / Agent	stability-ai	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	easy	N/A

Frequently Asked Questions

What does this skill do?

Generate high-quality images via Stability AI API (SDXL, SD3, Stable Image Core). Use when user asks to "generate image", "make a picture", or "draw this".

How difficult is it to install?

The installation complexity is rated as easy. You can find the installation instructions above.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

Best AI Skills for Claude

Explore the best AI skills for Claude and Claude Code across coding, research, workflow automation, documentation, and agent operations.

ChatGPT vs Claude for Agent Skills

Compare ChatGPT and Claude for AI agent skills across coding, writing, research, and reusable workflow execution.

Top AI Agents for Productivity

See the top AI agent skills for productivity, workflow automation, operational systems, documentation, and everyday task execution.

SKILL.md Source

# Stability AI Skill

## Setup
1. Copy `.env.example` to `.env`.
2. Set `STABILITY_API_KEY` in `.env`.
3. Optional: Set `API_HOST` if using a custom endpoint.

## Usage
- **Role**: Digital Artist.
- **Trigger**: "Draw a cat", "Generate cyberpunk city", "Create an image".
- **Output**: Local file path to the generated image and JSON metadata.

## Commands
(The script automatically handles dependencies on first run)

```bash
# Basic generation
scripts/generate "A futuristic city with neon lights"

# Aspect ratio (1:1, 16:9, 9:16, 21:9, 4:3, 3:4)
scripts/generate "Landscape painting" --ratio 16:9

# Style preset
scripts/generate "Portrait of a warrior" --style photographic

# Seed for reproducibility
scripts/generate "Abstract art" --seed 42

# Negative prompt
scripts/generate "Beautiful sunset" --negative "blurry, low quality"

# Output format (png, jpg, webp)
scripts/generate "Nature scene" --format webp

# Advanced: Custom model, steps, CFG scale
scripts/generate "Fantasy landscape" \
  --model stable-diffusion-3-medium \
  --steps 50 \
  --cfg 7.0 \
  --ratio 21:9

# V2 API (experimental)
scripts/generate "Modern architecture" --v2

# Combined options
scripts/generate "Cyberpunk street at night" \
  --ratio 16:9 \
  --style neon-punk \
  --seed 123 \
  --format jpg \
  --steps 45 \
  --cfg 6.5
```

## Features

### Style Presets
Available styles: enhance, anime, photographic, digital-art, comic-book, fantasy-art, line-art, analog-film, neon-punk, isometric, low-poly, origami, modeling-compound, cinematic, 3d-model, pixel-art, tile-texture

### Aspect Ratios
Supported: 1:1 (default), 16:9, 9:16, 21:9, 4:3, 3:4

### Output Formats
- PNG (default, lossless)
- JPEG/JPG (lossy, smaller size)
- WebP (modern, efficient)

### Metadata
Each generated image includes a JSON metadata file with:
- Prompt and negative prompt
- Model, parameters, and settings
- Generation timestamp
- API version used

## Models
- Default: SDXL 1.0 (`stable-diffusion-xl-1024-v1-0`)
- See `references/models.md` for complete model list, API versions, and credit costs.

## Auto-Cleanup
Automatically keeps the last 20 generated images. Older files and their metadata are removed to save disk space.

Content & Documentation

stability-ai

About this skill

Best use case

Practical example

Example input

Example output

When to use this skill

When not to use this skill

Installation

How stability-ai Compares

Frequently Asked Questions

What does this skill do?

How difficult is it to install?

Where can I find the source code?

Related Guides

Best AI Skills for Claude

ChatGPT vs Claude for Agent Skills

Top AI Agents for Productivity

SKILL.md Source

Related Skills

﻿---

humanizer

linkedin-cli

小红书长图文发布 Skill

openclaw-youtube

openclaw-media-gen

Cold Email Writer

Presentation Mastery — Complete Slide Design & Delivery System

ai-humanizer

Employee Handbook Generator

afrexai-copywriting-mastery

afrexai-conversion-copywriting

---