caixu-ingest-materials
Import Personal Documents. Use when the user wants to import a local directory or explicit files into 材序 at the ingest stage, including “导入一批材料”“先把这个目录吃进去”“先建立可继续建库的解析上下文”. Prefer caixu-skill when the user asks for the full end-to-end mainline or is unsure which stage to run. This skill creates or loads a library, starts an ingest pipeline run, lets the agent choose per-file routes over low-level OCR and parser tools, persists ParsedFile records, and hands off library_id plus run_id to build-asset-library. Do not use it for asset extraction, lifecycle judgment, package building, or submission.
Best use case
caixu-ingest-materials is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Import Personal Documents. Use when the user wants to import a local directory or explicit files into 材序 at the ingest stage, including “导入一批材料”“先把这个目录吃进去”“先建立可继续建库的解析上下文”. Prefer caixu-skill when the user asks for the full end-to-end mainline or is unsure which stage to run. This skill creates or loads a library, starts an ingest pipeline run, lets the agent choose per-file routes over low-level OCR and parser tools, persists ParsedFile records, and hands off library_id plus run_id to build-asset-library. Do not use it for asset extraction, lifecycle judgment, package building, or submission.
Teams using caixu-ingest-materials should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/caixu-ingest-materials/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How caixu-ingest-materials Compares
| Feature / Agent | caixu-ingest-materials | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Import Personal Documents. Use when the user wants to import a local directory or explicit files into 材序 at the ingest stage, including “导入一批材料”“先把这个目录吃进去”“先建立可继续建库的解析上下文”. Prefer caixu-skill when the user asks for the full end-to-end mainline or is unsure which stage to run. This skill creates or loads a library, starts an ingest pipeline run, lets the agent choose per-file routes over low-level OCR and parser tools, persists ParsedFile records, and hands off library_id plus run_id to build-asset-library. Do not use it for asset extraction, lifecycle judgment, package building, or submission.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
AI Agents for Coding
Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.
Best AI Skills for Claude
Explore the best AI skills for Claude and Claude Code across coding, research, workflow automation, documentation, and agent operations.
AI Agents for Marketing
Discover AI agents for marketing workflows, from SEO and content production to campaign research, outreach, and analytics.
SKILL.md Source
# Import Personal Documents 在用户要“导入一批材料”“先把这个目录吃进去”“先建立可继续建库的解析上下文”时使用这个 skill。 ## Quick flow 1. 创建或加载 `library_id`,再创建 `ingest` pipeline run 2. 调 `list_local_files`,让 agent 为每个文件选择 route 3. 调低层工具提取文本,归一化成 `ParsedFile`,写入 `upsert_parsed_files` ## Read next only when needed - 输入是目录、混合格式、需要 route 判断时,读 [references/workflow.md](references/workflow.md) - 需要确认低层工具输入输出、route 含义或 `PipelineRun` 结构时,读 [references/tool-contracts.md](references/tool-contracts.md) - 需要最小 JSON 输出模板时,读 [references/output-patterns.md](references/output-patterns.md) - 遇到 route 决策失败、部分文件失败或写库失败时,读 [references/failure-modes.md](references/failure-modes.md) ## Required tools - `caixu-data-mcp.create_or_load_library` - `caixu-data-mcp.create_pipeline_run` - `caixu-ocr-mcp.list_local_files` - `caixu-ocr-mcp.read_local_text_file` - `caixu-ocr-mcp.extract_parser_text` - `caixu-ocr-mcp.extract_visual_text` - `caixu-ocr-mcp.render_pdf_pages` - `caixu-data-mcp.upsert_parsed_files` - `caixu-data-mcp.append_pipeline_step` - `caixu-data-mcp.complete_pipeline_run` ## Required input - `input_root` or explicit local files - `library_id?` - `owner_hint?` ## Workflow 1. 如果上下文里没有 `library_id`,先调 `create_or_load_library`。 2. 创建 `ingest` run;之后每个关键动作都要追加 step。 3. 先用 `list_local_files` 展开目录,再让 agent 为每个文件选择 `text | parser_lite | parser_export | ocr | vlm | skip`。 4. 只调低层工具,不自己做 OCR/Parser 大一统封装。 5. 把成功结果归一化成 `ParsedFile`,再统一写入 `upsert_parsed_files`。 6. 结束时完成 pipeline run,并返回单个 `ToolResult` 风格结果,至少包含: - `data.library_id` - `data.run_id` - `data.file_ids` - `data.parsed_count` - `data.failed_count` - `data.warning_count` - `data.skipped_count` - `data.parsed_files` - `data.failed_files` - `data.warning_files` - `data.skipped_files` 7. 成功或部分成功时,推荐下一步 `build-asset-library`。 ## Guardrails - 不要发明文件内容、OCR 结果、issuer、date 或任何资产字段。 - 不要把目录路径直接当文件列表使用;必须先 `list_local_files`。 - 这是 ingest skill,不负责 `asset_card` 抽取、归并、生命周期判断、打包或提交。 - 单文件失败不能阻断成功文件;但 route 决策失败必须明确记 step 与结构化错误。 - 如果 route 决策连续失败,允许 pipeline 保守回退到 `suggested_route`,但必须留下 step 和 warning。 - 低价值或不支持格式应走 `skip`,不要硬塞进 parser/OCR。 - 如果低层提取成功但 `upsert_parsed_files` 失败,返回结构化存储错误,并停止推荐下一步。
Related Skills
caixu-query-assets
Search Personal Asset Library. Use when the user wants to search or filter an existing 材序 asset library at the query stage, including “查我有哪些材料”“看哪些可复用”“按类型筛资产”“用自然语言找相关材料”. Prefer caixu-skill when the user asks for the full end-to-end mainline or is unsure which stage to run. This skill normalizes filters into precise retrieval inputs, uses agent tags plus FTS by default, and only calls the optional semantic vector retrieval function when the user explicitly asks for similar or related materials.
caixu-maintain-asset-library
Maintain Personal Asset Library. Use when the user wants to inspect, review, patch, archive, restore, or verify a personal material asset library after initial build, including “看库概览”“修一条资产”“归档这条不该进库的材料”“看待复核队列”. Prefer caixu-skill when the user asks for the full end-to-end mainline or is unsure which stage to run. This skill loads library overviews and review queues from caixu-data-mcp, applies minimal maintenance actions such as patching or archiving one asset at a time, and re-checks query results. Do not use it for fresh ingest, asset extraction, lifecycle judgment, package building, or submission.
caixu-build-asset-library
Build Personal Asset Library. Use when the user wants to convert parsed materials into asset cards and a queryable library at the asset-build stage, including “建资产库”“生成资产卡”“去重整理版本”. Prefer caixu-skill when the user asks for the full end-to-end mainline or is unsure which stage to run. This skill creates a build_asset_library pipeline run, reads parsed files from caixu-data-mcp, extracts canonical asset cards from trustworthy parsed text, merges likely duplicate versions conservatively, persists assets and merged groups, and completes the pipeline run with library_id. Do not use it for querying, lifecycle judgment, package creation, or submission.
caixu-skill
Asset Library Skill. Use when the user expresses the overall end-to-end intent in one request, including “把这些材料建成资产库”“列出未来 60 天需要续办或补办的事项”“生成暑期实习申请材料包”, when the user wants to search materials by natural language but is unsure whether the local retrieval environment is ready, when the user is unsure which phase skill to run, or when the environment may not be installed yet. This skill routes only to the current child skill, sends first-time installation agents to references/install.md when keys, retrieval config, or runtime setup are missing, explains stage boundaries and next steps, and does not directly call MCP tools, extract files, build packages, or submit forms.
agnost-ingestion
USE when implementing data ingestion for Agnost AI analytics. Contains API reference, SDK guides for Python and TypeScript, and code examples for tracking AI conversations, MCP server events, and user interactions.
ingestigate
Investigative intelligence — document search, entity extraction, and relationship graphing. Analyze document corpuses to find connections between people, organizations, and identifiers.
teaching-materials
中小学教学文档智能生成工具。当用户需要制作课件(PPT)、编写教学设计(教案)、或创建学生任务单(导学案)时使用此技能。支持数学、语文等学科的备课、课堂巩固、课后作业等教学场景的文档生成。
Intelligence Ingestion
Auto-analyze URLs/info for OpenClaw strategic value, classify, create Obsidian notes, update memory. Use when user shares a URL, article, tweet, or any external info source.
caixu-check-lifecycle
Check Document Renewal Requirements. Use when the user wants a lifecycle diagnosis for an existing asset library at the lifecycle stage, including “看未来 60 天续办”“查缺件”“判断能不能提交某个目标场景”. Prefer caixu-skill when the user asks for the full end-to-end mainline or is unsure which stage to run. This skill loads assets and a versioned RuleProfileBundle, asks an agent to produce a complete CheckLifecycleData decision, validates readiness and asset references with shared rules helpers, records an audit sidecar, and persists the lifecycle run only when the decision is structurally valid.
caixu-build-package
Build Application Document Package. Use when the user wants export files and a submission bundle from an existing library at the packaging stage, including “生成材料包”“导出资产总表和续办清单”“准备提交包”. Prefer caixu-skill when the user asks for the full end-to-end mainline or is unsure which stage to run. This skill loads the latest validated lifecycle result, a RuleProfileBundle, assets and parsed files, asks an agent to choose package contents, validates the decision, runs deterministic docgen exports, persists the package run with an audit sidecar, and prepares the final bundle for downstream use.
---
name: article-factory-wechat
humanizer
Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comprehensive "Signs of AI writing" guide. Detects and fixes patterns including: inflated symbolism, promotional language, superficial -ing analyses, vague attributions, em dash overuse, rule of three, AI vocabulary words, negative parallelisms, and excessive conjunctive phrases.