yollomi-ai-api

Generate AI images and videos using Yollomi API. Use when the user wants to create images from text, remove image backgrounds, or generate AI videos. Requires YOLLOMI_API_KEY and optionally YOLLOMI_BASE_URL.


by Demerzels-lab


Best use case

yollomi-ai-api is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using yollomi-ai-api should expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/yollomi/SKILL.md --create-dirs "https://raw.githubusercontent.com/Demerzels-lab/elsamultiskillagent/main/public/skills/anichikage/yollomi/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/yollomi/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How yollomi-ai-api Compares

Feature / Agent	yollomi-ai-api	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

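What does this skill do?

It generates AI images and videos through the Yollomi API: images from text prompts, background removal, and video generation.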

Where can I find the source code?

The source code is on GitHub in the Demerzels-lab/elsamultiskillagent repository, under public/skills/anichikage/yollomi/.

SKILL.md Source

# Yollomi AI API Skill

Generates images and videos via the Yollomi API. All models use a **single unified endpoint** with different `modelId` parameters.

## Setup

1. **API Key**: User must provide `YOLLOMI_API_KEY` (env var, .env file, or Cursor rule).
2. **Base URL** (optional): `YOLLOMI_BASE_URL` defaults to `https://yollomi.com`.

## Unified Endpoint

```
POST /api/v1/generate
```

**Headers**: `Authorization: Bearer ${YOLLOMI_API_KEY}` or `X-API-Key: ${YOLLOMI_API_KEY}`  
**Content-Type**: `application/json`

**Body**:
- `type` (required): `"image"` or `"video"`
- `modelId` (required): Model identifier
- Additional params depend on model (prompt, imageUrl, etc.)

**Response (image)**: `{ images: string[], remainingCredits: number }`  
**Response (video)**: `{ video: string, remainingCredits: number }`
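
As a minimal sketch of the request described above, the shell function below wraps the unified endpoint. The function name `yollomi_generate` is hypothetical, not part of the Yollomi API; it uses the `X-API-Key` header variant and takes the JSON body as its only argument. It does not parse the response.

```bash
# Hypothetical helper: POST a JSON body to the unified endpoint.
# Assumes YOLLOMI_API_KEY is set; YOLLOMI_BASE_URL is optional.
yollomi_generate() {
  curl -sS -X POST "${YOLLOMI_BASE_URL:-https://yollomi.com}/api/v1/generate" \
    -H "X-API-Key: $YOLLOMI_API_KEY" \
    -H "Content-Type: application/json" \
    -d "$1"
}

# Usage (makes a real request and spends credits):
# yollomi_generate '{"type":"image","modelId":"flux","prompt":"A cat in a hat"}'
```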

## List Models

```
GET /api/v1/models
```

Returns all available image and video modelIds.
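
A call to this endpoint might look like the sketch below. The function name `yollomi_list_models` is hypothetical, and sending the `Authorization` header is an assumption: the source does not say whether the models endpoint requires authentication.

```bash
# Hypothetical helper: fetch the model list from GET /api/v1/models.
# Assumption: the endpoint accepts the same Authorization header as /generate.
yollomi_list_models() {
  curl -sS "${YOLLOMI_BASE_URL:-https://yollomi.com}/api/v1/models" \
    -H "Authorization: Bearer $YOLLOMI_API_KEY"
}

# Usage:
# yollomi_list_models
```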

## Common Examples

**Generate image (Flux)**:
```bash
curl -X POST "${YOLLOMI_BASE_URL:-https://yollomi.com}/api/v1/generate" \
  -H "Authorization: Bearer $YOLLOMI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"type":"image","modelId":"flux","prompt":"A cat in a hat","aspectRatio":"1:1"}'
```

**Remove background**:
```bash
curl -X POST "${YOLLOMI_BASE_URL:-https://yollomi.com}/api/v1/generate" \
  -H "Authorization: Bearer $YOLLOMI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"type":"image","modelId":"remove-bg","imageUrl":"https://example.com/photo.jpg"}'
```

**Generate video**:
```bash
curl -X POST "${YOLLOMI_BASE_URL:-https://yollomi.com}/api/v1/generate" \
  -H "Authorization: Bearer $YOLLOMI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"type":"video","modelId":"kling-2-1","prompt":"A cat walking in the rain"}'
```

## Aspect Ratio (aspectRatio)

Aspect ratio values supported by text-to-image models:

| Ratio | Description |
|------|------|
| 1:1 | Square (default) |
| 16:9 | Landscape |
| 9:16 | Portrait |
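
A client can reject unsupported ratios before spending credits on a request. The check below is a sketch; the function name `is_supported_aspect_ratio` is hypothetical, and it covers only the three ratios in the table above.

```bash
# Hypothetical pre-flight check: succeeds only for the three listed ratios.
is_supported_aspect_ratio() {
  case "$1" in
    1:1|16:9|9:16) return 0 ;;
    *) return 1 ;;
  esac
}

# Usage:
# is_supported_aspect_ratio "16:9" && echo "ratio accepted"
```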

## Image ModelIds

| modelId | Credits | Required | aspectRatio |
|---------|---------|----------|-------------|
| flux | 4/img | prompt | 1:1, 16:9, 9:16 |
| flux-schnell | 2/img | prompt | same as above |
| flux-2-pro | 15/img | prompt | same as above |
| remove-bg | 0 | imageUrl | - |
| nano-banana | 4 | prompt | 1:1, 16:9, 9:16 |
| nano-banana-pro | 15 | prompt | same as above |
| flux-kontext-pro | 4 | prompt | same as above |
| z-image-turbo | 1 | prompt | width, height |
| imagen-4-ultra | 6 | prompt | same as above |
| image-4-fast | 3 | prompt | same as above |
| ideogram-v3-turbo | 3 | prompt | same as above |
| stable-diffusion-3-5-large | 7/img | prompt | same as above |
| seedream-4-5 | 4 | prompt | same as above |
| object-remover | 3 | image, mask | - |
| face-swap | 3 | swapImage, inputImage | - |
| image-upscaler | 1 | imageUrl, scale | - |
| photo-restoration | 4 | imageUrl | - |
| qwen-image-edit | 3 | image, prompt | - |
| qwen-image-edit-plus | 3 | image, prompt | - |
| virtual-try-on | 3 | clothImage, personImage | - |
| ai-background-generator | 5 | imageUrl | prompt |
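
For the rows priced per image (`/img`), a batch cost can be estimated before sending requests. The sketch below hard-codes those four prices from the table; the function name `estimate_image_credits` is hypothetical, and linear scaling with the number of images is an assumption the source does not confirm.

```bash
# Hypothetical estimator for the per-image ("/img") rows of the table above.
# Assumption: cost scales linearly with the number of images requested.
estimate_image_credits() {
  model=$1
  count=$2
  case "$model" in
    flux) per_image=4 ;;
    flux-schnell) per_image=2 ;;
    flux-2-pro) per_image=15 ;;
    stable-diffusion-3-5-large) per_image=7 ;;
    *) echo "no per-image price known for $model" >&2; return 1 ;;
  esac
  echo $((per_image * count))
}

# Usage:
# estimate_image_credits flux 3
```

Compare the estimate with the `remainingCredits` field of the most recent response before starting a large batch.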

## Video ModelIds

| modelId | Credits |
|---------|---------|
| openai-sora-2 | ~50+ |
| google-veo-3 | 10 |
| google-veo-3-fast | 9 |
| google-veo-3-1 | 10 |
| google-veo-3-2 | 10 |
| google-veo-3-1-fast | 9 |
| kling-2-1 | 9 |
| kling-v2-6-motion-control | 7/sec |
| minimax-hailuo-2-3 | 9 |
| minimax-hailuo-2-3-fast | 9 |
| bytedance-seedance-1-pro-fast | 8 |
| runway-gen4-turbo | varies |
| pixverse-5 | 9 |
| wan-2-5-i2v | 9 |
| wan-2-5-t2v | 9 |
| wan-2-6-i2v | 29 |
| wan-2-6-t2v | 29 |

## Workflow

1. **Generate image** → POST /api/v1/generate with `type: "image"`, `modelId`, and model params
2. **Generate video** → POST /api/v1/generate with `type: "video"`, `modelId`, `prompt`, optional `inputs`
3. **List models** → GET /api/v1/models
4. **401/402** → Check API key and credits
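
Step 4 can be sketched as a small status handler. The function name `explain_status` is hypothetical. Mapping 401 to the API key and 402 to credits follows the usual HTTP meanings and is an assumption: the source only pairs the two codes with "check API key and credits". Treating 200 as success is likewise assumed.

```bash
# Hypothetical helper: turn an HTTP status code into a next step.
# Assumptions: 200 is success, 401 is a rejected key, 402 is exhausted credits.
explain_status() {
  case "$1" in
    200) echo "ok" ;;
    401) echo "check YOLLOMI_API_KEY" ;;
    402) echo "check remaining credits" ;;
    *) echo "unexpected status $1" ;;
  esac
}
```

To get the status code, add curl's `-o response.json -w '%{http_code}'` options to a request, capture the output, and pass it to the function.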

## Reference

Full model list and params: [models-reference.md](models-reference.md) or GET /api/v1/models

Related Skills

paylock

from Demerzels-lab/elsamultiskillagent

Non-custodial SOL escrow for AI agent deals.

agent-reputation

from Demerzels-lab/elsamultiskillagent

Cross-platform AI agent reputation checker with trust scoring and PayLock escrow recommendations.

Telecom Agent Skill

from Demerzels-lab/elsamultiskillagent

Turn your AI Agent into a Telecom Operator. Bulk calling, ChatOps, and Field Monitoring.

OpenClaw-Finnhub

from Demerzels-lab/elsamultiskillagent

OpenClaw skill for real-time stock quotes and financials via the Finnhub API.

OpenClaw-Last.fm

from Demerzels-lab/elsamultiskillagent

security-operator

from Demerzels-lab/elsamultiskillagent

Runtime security guardrails for OpenClaw agents.

operator-humanizer

from Demerzels-lab/elsamultiskillagent

Transform AI-generated text into authentic human writing.

kit-email-operator

from Demerzels-lab/elsamultiskillagent

AI-powered email marketing for Kit (ConvertKit).

agora

from Demerzels-lab/elsamultiskillagent

Trade prediction markets on Agora — the prediction market exclusively for AI agents. Register, browse markets, trade YES/NO, create markets, earn reputation via Brier scores.

surf-check

from Demerzels-lab/elsamultiskillagent

Surf forecast decision engine.

jinko-flight-search

from Demerzels-lab/elsamultiskillagent

Search flights and discover travel destinations using the Jinko MCP server. Provides two core capabilities: (1) Destination discovery — find where to travel based on criteria like budget, climate, or activities when the user has no specific destination in mind, and (2) Specific flight search — compare flights between two known cities/airports with flexible dates, cabin classes, and budget filters. Use this skill when the user wants to: search for flights, find cheap flights, discover travel destinations, compare flight prices, plan a trip, find deals from a specific city, or explore where to go. Triggers on any flight-booking, travel-planning, or destination-discovery request. Requires the Jinko MCP server connected at https://mcp.gojinko.com.

mlx-whisper

from Demerzels-lab/elsamultiskillagent

Local speech-to-text with MLX Whisper (Apple Silicon optimized, no API key).