yollomi-ai-api
Generate AI images and videos using Yollomi API. Use when the user wants to create images from text, remove image backgrounds, or generate AI videos. Requires YOLLOMI_API_KEY and optionally YOLLOMI_BASE_URL.
Best use case
yollomi-ai-api is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Generate AI images and videos using Yollomi API. Use when the user wants to create images from text, remove image backgrounds, or generate AI videos. Requires YOLLOMI_API_KEY and optionally YOLLOMI_BASE_URL.
Teams using yollomi-ai-api should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/yollomi/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How yollomi-ai-api Compares
| Feature / Agent | yollomi-ai-api | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Generate AI images and videos using Yollomi API. Use when the user wants to create images from text, remove image backgrounds, or generate AI videos. Requires YOLLOMI_API_KEY and optionally YOLLOMI_BASE_URL.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Yollomi AI API Skill
Generates images and videos via the Yollomi API. All models use a **single unified endpoint** with different `modelId` parameters.
## Setup
1. **API Key**: User must provide `YOLLOMI_API_KEY` (env var, .env file, or Cursor rule).
2. **Base URL** (optional): `YOLLOMI_BASE_URL` defaults to `https://yollomi.com`.
## Unified Endpoint
```
POST /api/v1/generate
```
**Headers**: `Authorization: Bearer ${YOLLOMI_API_KEY}` or `X-API-Key: ${YOLLOMI_API_KEY}`
**Content-Type**: `application/json`
**Body**:
- `type` (required): `"image"` or `"video"`
- `modelId` (required): Model identifier
- Additional params depend on model (prompt, imageUrl, etc.)
**Response (image)**: `{ images: string[], remainingCredits: number }`
**Response (video)**: `{ video: string, remainingCredits: number }`
## List Models
```
GET /api/v1/models
```
Returns all available image and video modelIds.
## Common Examples
**Generate image (Flux)**:
```bash
curl -X POST "${YOLLOMI_BASE_URL:-https://yollomi.com}/api/v1/generate" \
-H "Authorization: Bearer $YOLLOMI_API_KEY" \
-H "Content-Type: application/json" \
-d '{"type":"image","modelId":"flux","prompt":"A cat in a hat","aspectRatio":"1:1"}'
```
**Remove background**:
```bash
curl -X POST "${YOLLOMI_BASE_URL:-https://yollomi.com}/api/v1/generate" \
-H "Authorization: Bearer $YOLLOMI_API_KEY" \
-H "Content-Type: application/json" \
-d '{"type":"image","modelId":"remove-bg","imageUrl":"https://example.com/photo.jpg"}'
```
**Generate video**:
```bash
curl -X POST "${YOLLOMI_BASE_URL:-https://yollomi.com}/api/v1/generate" \
-H "Authorization: Bearer $YOLLOMI_API_KEY" \
-H "Content-Type: application/json" \
-d '{"type":"video","modelId":"kling-2-1","prompt":"A cat walking in the rain"}'
```
## Aspect Ratio (aspectRatio)
文生图类模型支持的比例参数:
| 比例 | 说明 |
|------|------|
| 1:1 | 正方形(默认) |
| 16:9 | 横屏 |
| 9:16 | 竖屏 |
## Image ModelIds
| modelId | Credits | Required | aspectRatio |
|---------|---------|----------|-------------|
| flux | 4/img | prompt | 1:1, 16:9, 9:16 |
| flux-schnell | 2/img | prompt | 同上 |
| flux-2-pro | 15/img | prompt | 同上 |
| remove-bg | 0 | imageUrl | - |
| nano-banana | 4 | prompt | 1:1, 16:9, 9:16 |
| nano-banana-pro | 15 | prompt | 同上 |
| flux-kontext-pro | 4 | prompt | 同上 |
| z-image-turbo | 1 | prompt | width, height |
| imagen-4-ultra | 6 | prompt | 同上 |
| image-4-fast | 3 | prompt | 同上 |
| ideogram-v3-turbo | 3 | prompt | 同上 |
| stable-diffusion-3-5-large | 7/img | prompt | 同上 |
| seedream-4-5 | 4 | prompt | 同上 |
| object-remover | 3 | image, mask | - |
| face-swap | 3 | swapImage, inputImage | - |
| image-upscaler | 1 | imageUrl, scale | - |
| photo-restoration | 4 | imageUrl | - |
| qwen-image-edit | 3 | image, prompt | - |
| qwen-image-edit-plus | 3 | image, prompt | - |
| virtual-try-on | 3 | clothImage, personImage | - |
| ai-background-generator | 5 | imageUrl | prompt |
## Video ModelIds
| modelId | Credits |
|---------|---------|
| openai-sora-2 | ~50+ |
| google-veo-3 | 10 |
| google-veo-3-fast | 9 |
| google-veo-3-1 | 10 |
| google-veo-3-2 | 10 |
| google-veo-3-1-fast | 9 |
| kling-2-1 | 9 |
| kling-v2-6-motion-control | 7/sec |
| minimax-hailuo-2-3 | 9 |
| minimax-hailuo-2-3-fast | 9 |
| bytedance-seedance-1-pro-fast | 8 |
| runway-gen4-turbo | varies |
| pixverse-5 | 9 |
| wan-2-5-i2v | 9 |
| wan-2-5-t2v | 9 |
| wan-2-6-i2v | 29 |
| wan-2-6-t2v | 29 |
## Workflow
1. **Generate image** → POST /api/v1/generate with `type: "image"`, `modelId`, and model params
2. **Generate video** → POST /api/v1/generate with `type: "video"`, `modelId`, `prompt`, optional `inputs`
3. **List models** → GET /api/v1/models
4. **401/402** → Check API key and credits
## Reference
Full model list and params: [models-reference.md](models-reference.md) or GET /api/v1/modelsRelated Skills
paylock
Non-custodial SOL escrow for AI agent deals.
agent-reputation
summary: Cross-platform AI agent reputation checker with trust scoring and PayLock escrow recommendations.
Telecom Agent Skill
Turn your AI Agent into a Telecom Operator. Bulk calling, ChatOps, and Field Monitoring.
OpenClaw-Finnhub
OpenClaw skill for real-time stock quote, and financials via Finnhub API.
```markdown
# OpenClaw-Last.fm
security-operator
Runtime security guardrails for OpenClaw agents.
operator-humanizer
Transform AI-generated text into authentic human writing.
kit-email-operator
**AI-powered email marketing for Kit (ConvertKit)**.
agora
Trade prediction markets on Agora — the prediction market exclusively for AI agents. Register, browse markets, trade YES/NO, create markets, earn reputation via Brier scores.
surf-check
Surf forecast decision engine.
jinko-flight-search
Search flights and discover travel destinations using the Jinko MCP server. Provides two core capabilities: (1) Destination discovery — find where to travel based on criteria like budget, climate, or activities when the user has no specific destination in mind, and (2) Specific flight search — compare flights between two known cities/airports with flexible dates, cabin classes, and budget filters. Use this skill when the user wants to: search for flights, find cheap flights, discover travel destinations, compare flight prices, plan a trip, find deals from a specific city, or explore where to go. Triggers on any flight-booking, travel-planning, or destination-discovery request. Requires the Jinko MCP server connected at https://mcp.gojinko.com.
mlx-whisper
Local speech-to-text with MLX Whisper (Apple Silicon optimized, no API key).