visual-asset-workflow
Generate distinctive educational visuals using creative brief methodology. Use when creating chapter illustrations, diagrams, or teaching visuals with Gemini.
Best use case
visual-asset-workflow is best used when you need a repeatable AI agent workflow instead of a one-off prompt. It is especially useful for teams working in multi. Generate distinctive educational visuals using creative brief methodology. Use when creating chapter illustrations, diagrams, or teaching visuals with Gemini.
Generate distinctive educational visuals using creative brief methodology. Use when creating chapter illustrations, diagrams, or teaching visuals with Gemini.
Users should expect a more consistent workflow output, faster repeated execution, and less time spent rewriting prompts from scratch.
Practical example
Example input
Use the "visual-asset-workflow" skill to help with this workflow task. Context: Generate distinctive educational visuals using creative brief methodology. Use when creating chapter illustrations, diagrams, or teaching visuals with Gemini.
Example output
A structured workflow result with clearer steps, more consistent formatting, and an output that is easier to reuse in the next run.
When to use this skill
- Use this skill when you want a reusable workflow rather than writing the same prompt again and again.
When not to use this skill
- Do not use this when you only need a one-off answer and do not need a reusable workflow.
- Do not use it if you cannot install or maintain the related files, repository context, or supporting tools.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/visual-asset-workflow/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How visual-asset-workflow Compares
| Feature / Agent | visual-asset-workflow | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Generate distinctive educational visuals using creative brief methodology. Use when creating chapter illustrations, diagrams, or teaching visuals with Gemini.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Visual Asset Workflow Skill
## Context & Problem
Educational visual generation converges toward generic infographics with technical specifications ("44pt Roboto Bold, 250px box") that activate prediction mode instead of reasoning mode. This produces bland, PowerPoint-default aesthetics instead of distinctive, pedagogically effective visuals.
This skill provides professional creative brief methodology to activate Gemini 3's reasoning capabilities.
---
## Core Principles
1. **Story activates reasoning** - Narrative intent produces distinctive visuals; technical specs produce generic ones
2. **Proficiency dictates complexity** - A2 students need <5 sec grasp; C2 professionals handle dense information
3. **Prerequisites gate content** - Visuals cannot assume knowledge students don't have yet
4. **Pedagogy drives hierarchy** - Visual weight teaches importance, not arbitrary aesthetics
---
## Dimensional Guidance
### Planning Before Execution
**Avoid:** Jumping into visual analysis without context
**Prefer:** Strategic planning phase (Q0)
Read FIRST:
- `apps/learn-app/docs/chapter-index.md` → Extract part, proficiency (A2/B1/C2), prerequisites
- `apps/learn-app/docs/[part]/[chapter]/README.md` → Understand lesson structure
Detect conflicts BEFORE work:
- Proficiency-complexity mismatch (complex visual for A2 beginners)
- Prerequisite violations (Python code when students haven't learned it)
- Pedagogical layer incoherence (Layer 1 content using Layer 4 approaches)
Output strategic plan, WAIT for approval before proceeding.
**Principle:** Plan prevents wasted work (Chapter 9 failure: 5 wrong lessons from skipping planning)
---
### Prompt Structure: Professional Creative Briefs
**Avoid:** Technical specifications
```
❌ "Title: 44pt Roboto Bold at (50, 20)"
❌ "Box: 250px × 90px, #aaaaaa, 8px corners"
❌ "Shadow: 4px offset, 8px blur"
```
**Prefer:** Story + Intent + Metaphor
```
✅ The Story: [1-2 sentence narrative of what's visualized]
✅ Emotional Intent: Should feel [exponential growth, surprising magnitude]
✅ Visual Metaphor: [Multiplication cascade - like compound interest]
✅ Key Insight: [ONE thing students must grasp]
✅ Color Semantics: Blue (#2563eb) = Authority (teaches governance concept)
✅ Typography Hierarchy: Largest = Key insight (not arbitrary sizing)
✅ Pedagogical Reasoning: Why these choices serve teaching
```
**Principle:** Creative briefs activate reasoning mode; specifications activate prediction mode
**Why it matters:** Gemini 3 reasons about HOW to achieve intent → Distinctive visuals instead of generic
---
### Token Conservation Strategy
**When:** Batch mode with >8 visuals OR continuation session
**Apply condensation while preserving reasoning activation:**
**ALWAYS KEEP:**
- Story (1-2 sentence narrative)
- Emotional Intent (what it should FEEL like)
- Visual Metaphor (universal concept)
- Key Insight (ONE thing students must grasp)
- Color semantics with hex codes (#2563eb)
- Pedagogical reasoning (why these choices)
**CONDENSE:**
- Long examples → Short labels
- Verbose descriptions → Bullet points
- Repeated patterns → Compact notation
**NEVER REMOVE:**
- Narrative elements
- Intent statements
- Reasoning explanations
**Example:**
```
FULL: "Top Layer shows the Coordinator at center top with label..."
CONDENSED: "Top Layer - Coordinator: Center top: 'Orchestrator'..."
```
**Target:** 60-70% token reduction, 100% reasoning activation preserved
**Principle:** Efficiency through compression, not through elimination of reasoning triggers
---
### Proficiency-Complexity Alignment
**Avoid:** One-size-fits-all complexity
**Prefer:** Proficiency-gated constraints
**A2 Beginner** (Non-negotiable limits):
- Max 5-7 elements
- <5 second grasp
- Static only (no interactive)
- Max 2×2 grids
- Clear hierarchy (largest = most important)
**B1 Intermediate**:
- Max 7-10 elements
- <10 second grasp
- Interactive Tier 1 OK (tap-to-reveal)
- Max 3×3 grids
**C2 Professional**:
- No artificial limits
- Dense infographics OK
- Full interactive architecture
- Production complexity
**Principle:** Overwhelming A2 students = learning failure; artificial simplicity for C2 = patronizing
---
### Prerequisite Validation Gate
**Avoid:** Assuming knowledge students don't have
**Prefer:** Validate against chapter prerequisites
**Detection:**
- Check Part number: Part 1-2 = no programming, Part 3 = markdown/prompts, Part 4+ = Python
- Check prerequisite list from chapter-index.md
**Example Violations:**
- ❌ Python code in Chapter 9 (Part 3 - students haven't learned it)
- ❌ Git commands in Part 2 (students haven't learned CLI)
**Exception:** Meta-level teaching OK
- ✅ Teaching "markdown code block syntax" by showing Python code block (teaches markdown, not Python)
**Principle:** Visual cannot require unknown knowledge
---
### Constitutional Alignment
**Avoid:** Decorative visuals without pedagogical purpose
**Prefer:** Every visual serves specific learning objective
**Principle 3 (Factual Accuracy):**
- Verify all statistics, dates, technical specs
- Enable Google Search grounding for factual claims
- Cite sources for data
**Principle 7 (Minimal Content):**
- Reject "let's add a visual for variety"
- Every element must teach something
- Remove non-teaching decoration
**Principle:** Visual decisions align with project constitution
---
### Pedagogical Layer Coherence
**Avoid:** Layer mismatch
**Prefer:** Visual approach matches chapter's pedagogical layer
**L1 (Manual Foundation):**
- Step-by-step diagrams
- Concrete examples
- Clear labeling (building vocabulary)
**L2 (AI Collaboration):**
- Before/after comparisons
- Iteration flows
- Three Roles Framework INVISIBLE (no role labels)
**L3 (Intelligence Design):**
- Architecture diagrams
- Reusable pattern illustrations
**L4 (Spec-Driven):**
- Specification → implementation flow
- Component composition diagrams
**Principle:** Visual design reinforces pedagogical approach
---
### Duplicate Prevention Protocol
**Avoid:** Generating different prompts that produce the same visual
**Prevent BEFORE generation:**
1. **Review existing visuals in chapter:**
- List all `*.png` files in target chapter directory
- Read corresponding `*.prompt.md` files
- Identify visual patterns already used
2. **Validate prompt distinctiveness:**
- Does this prompt's intent differ clearly from existing prompts?
- Example conflicts to detect:
- ❌ Timeline + Graph → Both might render as timeline
- ❌ Architecture + Workflow → Both might render as hierarchy
- ❌ Same metaphor, different names → Same visual result
3. **Differentiation strategy:**
- Make visual type explicit in story ("GRAPH showing exponential growth" not just "showing growth")
- Use distinct metaphors (cascade vs tree vs timeline vs curve)
- Specify unique structural elements (2D axes vs linear flow vs hierarchical pyramid)
**Detect AFTER generation (in image-generator):**
- Visual comparison with existing chapter images
- Prompt alignment check (does output match brief intent?)
**Principle:** Prevention cheaper than rework
---
## Anti-Patterns
**Never:**
- Generate visuals without reading chapter-index.md first (skipping context)
- Use pixel specifications, font sizes, coordinates in prompts (kills reasoning)
- Assume knowledge not in prerequisites (prerequisite violation)
- Create decorative visuals without learning objective (Principle 7 violation)
- Apply same complexity to A2 and C2 students (proficiency mismatch)
- Create prompts without checking for duplicate visual patterns (causes rework)
**Even if it seems reasonable:**
- Don't use Python examples in Part 3 (students don't know Python yet)
- Don't create complex multi-step visuals for A2 (cognitive overload)
- Don't specify "44pt Roboto Bold" (removes Gemini's judgment)
- Don't skip distinctiveness validation "because they have different filenames" (names differ, visuals might not)
---
## Creative Variance
You tend to default to comparison diagrams even with story-driven prompts. Vary visual types:
- **Timeline progressions** (evolution over time)
- **Multiplication cascades** (compound growth visualization)
- **Hierarchical authority flows** (governance models)
- **Transformation sequences** (before → after → impact)
- **Conceptual metaphors** (abstract → concrete mapping)
Match visual type to story, not habit.
---
## Post-Generation Reflection
After batch completion, analyze systematically (Q8):
**Success patterns:**
- Quality gate performance (which caught most issues?)
- Average iterations (efficiency indicator)
- Time vs estimate (planning accuracy)
**Failure analysis:**
- Deferred visuals root causes (layout? spelling? concept mismatch?)
- Guardrail gaps (what principle would have prevented this?)
- Planning effectiveness (conflicts caught early vs missed?)
**Continuous improvement:**
- Pattern-based updates (not one-off fixes)
- New guardrails from learnings
- Prompt template refinements
Document in: `history/visual-assets/reflections/chapter-{NN}-reflection.md`
**Principle:** Systematic reflection → Improved future performance
---
## Success Indicators
You'll know this skill is working when:
- ✅ Zero pixel specifications in prompts (creative briefs only)
- ✅ Strategic plan created before visual analysis (Q0 complete)
- ✅ Proficiency conflicts detected early (A2 limits enforced)
- ✅ Prerequisite violations prevented (no unknown concepts)
- ✅ Story/Intent/Metaphor in every prompt (reasoning activated)
- ✅ Token conservation applied in batch mode (60-70% reduction)
- ✅ Duplicate prevention validation passed (zero duplicate visuals)
- ✅ Visuals feel distinctive and compelling (not generic PowerPoint)
- ✅ Reflection document created after batch (systematic learning)
**Result:** Professional-quality visuals that teach effectively, generated efficiently through planning, with zero duplicates requiring rework.Related Skills
req-change-workflow
Standardize requirement/feature changes in an existing codebase (especially Chrome extensions) by turning "改需求/需求变更/调整交互/改功能/重构流程" into a repeatable loop: clarify acceptance criteria, confirm current behavior from code, assess impact/risk, design the new logic, implement with small diffs, run a fixed regression checklist, and update docs/decision log. Use when the user feels the change process is chaotic, when edits tend to sprawl across files, or when changes touch manifest/service worker/OAuth/storage/UI and need reliable verification + rollback planning.
visual-design-foundations
Apply typography, color theory, spacing systems, and iconography principles to create cohesive visual designs. Use when establishing design tokens, building style guides, or improving visual hierarchy and consistency.
defou-workflow
将原始想法转化为结构清晰、判断明确、具有长期价值的“得否”风格内容报告。
defou-stanley-workflow
Defou x Stanley 融合工作流:结合深度结构化思考与人性弱点洞察,生成极简、犀利且具有长期价值的爆款内容。
agentic-workflow
Practical AI agent workflows and productivity techniques. Provides optimized patterns for daily development tasks such as commands, shortcuts, Git integration, MCP usage, and session management.
workflow-patterns
Use this skill when implementing tasks according to Conductor's TDD workflow, handling phase checkpoints, managing git commits for tasks, or understanding the verification protocol.
workflow-orchestration-patterns
Design durable workflows with Temporal for distributed systems. Covers workflow vs activity separation, saga patterns, state management, and determinism constraints. Use when building long-running processes, distributed transactions, or microservice orchestration.
workflow-automation
Workflow automation is the infrastructure that makes AI agents reliable. Without durable execution, a network hiccup during a 10-step payment flow means lost money and angry customers. With it, workflows resume exactly where they left off. This skill covers the platforms (n8n, Temporal, Inngest) and patterns (sequential, parallel, orchestrator-worker) that turn brittle scripts into production-grade automation. Key insight: The platforms make different tradeoffs. n8n optimizes for accessibility
ui-visual-validator
Rigorous visual validation expert specializing in UI testing, design system compliance, and accessibility verification. Masters screenshot analysis, visual regression testing, and component validation. Use PROACTIVELY to verify UI modifications have achieved their intended goals through comprehensive visual analysis.
tdd-workflows-tdd-refactor
Use when working with tdd workflows tdd refactor
tdd-workflows-tdd-red
Generate failing tests for the TDD red phase to define expected behavior and edge cases.
tdd-workflows-tdd-green
Implement the minimal code needed to make failing tests pass in the TDD green phase.