ai-task-hub
AI task hub for image analysis, background removal, speech-to-text, text-to-speech, markdown conversion, and async execute/poll/presentation orchestration. Use when users need hosted AI outcomes while host runtime manages identity, credits, payment, and risk control.
Best use case
ai-task-hub is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
AI task hub for image analysis, background removal, speech-to-text, text-to-speech, markdown conversion, and async execute/poll/presentation orchestration. Use when users need hosted AI outcomes while host runtime manages identity, credits, payment, and risk control.
Teams using ai-task-hub should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/ai-task-hub/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How ai-task-hub Compares
| Feature / Agent | ai-task-hub | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
AI task hub for image analysis, background removal, speech-to-text, text-to-speech, markdown conversion, and async execute/poll/presentation orchestration. Use when users need hosted AI outcomes while host runtime manages identity, credits, payment, and risk control.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
SKILL.md Source
# AI Task Hub Formerly `skill-hub-gateway`. Public package boundary: - Only orchestrates `portal.skill.execute`, `portal.skill.poll`, and `portal.skill.presentation`. - Does not exchange `api_key` or `userToken` inside this package. - Does not handle recharge or payment flows inside this package. - Assumes host runtime injects short-lived task tokens and attachment URLs. Chinese documentation: `SKILL.zh-CN.md` ## When to Use This Skill Use this skill when the user asks to: - detect people, faces, hands, keypoints, or tags from images - remove backgrounds or generate cutout/matting results for products or portraits - transcribe uploaded audio into text (`speech to text`, `audio transcription`) - generate speech from text input (`text to speech`, `voice generation`) - convert uploaded files into markdown (`document to markdown`) - start async jobs and check status later (`poll`, `check job status`) - fetch rendered visual outputs such as `overlay`, `mask`, and `cutout` - run embedding or reranking tasks for retrieval workflows ## Common Requests Example requests that should trigger this skill: - "Detect faces in this image and return bounding boxes." - "Tag this image and summarize the main objects." - "Remove the background from this product photo." - "Create a clean cutout from this portrait image." - "Transcribe this meeting audio into text." - "Generate speech from this paragraph." - "Convert this PDF file into markdown." - "Start this job now and let me poll the run status later." - "Fetch overlay and mask files for run_456." - "Generate embeddings for this text list and rerank the candidates." ## Search-Friendly Capability Aliases - `vision` aliases: face detection, human detection, person detection, image tagging - `background` aliases: remove background, background removal, cutout, matting, product-cutout - `asr` aliases: speech to text, audio transcription, transcribe audio - `tts` aliases: text to speech, voice generation, speech synthesis - `markdown_convert` aliases: document to markdown, file to markdown, markdown conversion - `poll` aliases: check job status, poll long-running task, async run status - `presentation` aliases: rendered output, overlay, mask, cutout files - `embeddings/reranker` aliases: vectorization, semantic vectors, relevance reranking ## Runtime Contract Default API base URL: `https://gateway-api.binaryworks.app` Action to endpoint mapping: - `portal.skill.execute` -> `POST /agent/skill/execute` - `portal.skill.poll` -> `GET /agent/skill/runs/:run_id` - `portal.skill.presentation` -> `GET /agent/skill/runs/:run_id/presentation` ## Auth Contract (Host-Managed) Every request must include: - `X-Agent-Task-Token: <jwt_or_paseto>` Recommended token claims: - `sub` (user_id) - `agent_uid` - `conversation_id` - `scope` (`execute|poll|presentation`) - `exp` - `jti` CLI argument order for `scripts/skill.mjs`: - `[agent_task_token] <action> <payload_json> [base_url]` - If token arg is omitted, script reads `AGENT_TASK_TOKEN` from environment. - Host runtime should refresh and inject short-lived `AGENT_TASK_TOKEN` automatically to avoid user-facing auth friction. ## Payload Contract - `portal.skill.execute`: payload requires `capability` and `input`. - `payload.request_id` is optional and passed through. - `portal.skill.poll` and `portal.skill.presentation`: payload requires `run_id`. - `portal.skill.presentation` supports `include_files` (defaults to `true`). Attachment normalization: - Prefer explicit `image_url` / `audio_url` / `file_url`. - `attachment.url` is mapped to target media field by capability. - Local `file_path` is disabled in the published package. - Host must upload chat attachments first, then pass URL fields. - Example host upload endpoint: `/api/blob/upload-file`. ## Error Contract - Preserve gateway envelope: `request_id`, `data`, `error`. - Preserve `POINTS_INSUFFICIENT` and pass through `error.details.recharge_url`. ## Bundled Files - `scripts/skill.mjs` - `scripts/agent-task-auth.mjs` - `scripts/base-url.mjs` - `scripts/attachment-normalize.mjs` - `scripts/telemetry.mjs` (compatibility shim) - `references/capabilities.json` - `references/openapi.json` - `SKILL.zh-CN.md`
Related Skills
youtube-watcher
Fetch and read transcripts from YouTube videos. Use when you need to summarize a video, answer questions about its content, or extract information from it.
youtube-transcript
Fetch and summarize YouTube video transcripts. Use when asked to summarize, transcribe, or extract content from YouTube videos. Handles transcript fetching via residential IP proxy to bypass YouTube's cloud IP blocks.
youtube-auto-captions - YouTube 自动字幕
## 描述
youtube
YouTube Data API integration with managed OAuth. Search videos, manage playlists, access channel data, and interact with comments. Use this skill when users want to interact with YouTube. For other third party apps, use the api-gateway skill (https://clawhub.ai/byungkyu/api-gateway).
yahoo-finance
Get stock prices, quotes, fundamentals, earnings, options, dividends, and analyst ratings using Yahoo Finance. Uses yfinance library - no API key required.
xurl
A Twitter research and content intelligence skill focused on attracting WordPress and Shopify clients. Use to analyze Twitter profiles, threads, and conversations for: (1) Identifying what small agency founders and eCommerce brands are discussing; (2) Understanding pain points around WordPress performance, Shopify CRO, and development bottlenecks; (3) Extracting high-performing content angles; (4) Turning insights into authority-building posts; (5) Converting Twitter intelligence into business leverage for clear content angles, strong positioning, and qualified inbound leads.
xlsx
Use this skill any time a spreadsheet file is the primary input or output. This means any task where the user wants to: open, read, edit, or fix an existing .xlsx, .xlsm, .csv, or .tsv file (e.g., adding columns, computing formulas, formatting, charting, cleaning messy data); create a new spreadsheet from scratch or from other data sources; or convert between tabular file formats. Trigger especially when the user references a spreadsheet file by name or path — even casually (like "the xlsx in my downloads") — and wants something done to it or produced from it. Also trigger for cleaning or restructuring messy tabular data files (malformed rows, misplaced headers, junk data) into proper spreadsheets. The deliverable must be a spreadsheet file. Do NOT trigger when the primary deliverable is a Word document, HTML report, standalone Python script, database pipeline, or Google Sheets API integration, even if tabular data is involved.
xiaohongshu-mcp
Automate Xiaohongshu (RedNote) content operations using a Python client for the xiaohongshu-mcp server. Use for: (1) Publishing image, text, and video content, (2) Searching for notes and trends, (3) Analyzing post details and comments, (4) Managing user profiles and content feeds. Triggers: xiaohongshu automation, rednote content, publish to xiaohongshu, xiaohongshu search, social media management.
twitter-openclaw
Interact with Twitter/X — read tweets, search, post, like, retweet, and manage your timeline.
x-twitter-growth
X/Twitter growth engine for building audience, crafting viral content, and analyzing engagement. Use when the user wants to grow on X/Twitter, write tweets or threads, analyze their X profile, research competitors on X, plan a posting strategy, or optimize engagement. Complements social-content (generic multi-platform) with X-specific depth: algorithm mechanics, thread engineering, reply strategy, profile optimization, and competitive intelligence via web search.
akshare-online-alpha
Run Wyckoff master-style analysis from stock codes, holdings (symbol/cost/qty), cash, CSV data, and optional chart images. Use when users want online multi-source data fetching with source switching, strict Beijing-time trading-session checks, fixed system prompt analysis, single-stock analysis, holding rotation, holding add/reduce suggestions, or empty-position cash deployment suggestions.
writing-skills
Use when creating new skills, editing existing skills, or verifying skills work before deployment