prompt-engineering
Prompt design, chain-of-thought, few-shot learning, system prompts, and structured output patterns
Best use case
prompt-engineering is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Teams using prompt-engineering should expect more consistent output, faster repeated execution, and less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in `.claude/skills/prompt-engineering/SKILL.md` inside your project
- Restart your AI agent — it will auto-discover the skill
Frequently Asked Questions
What does this skill do?
It provides prompt design patterns covering chain-of-thought reasoning, few-shot learning, system prompts, and structured output.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Prompt Engineering Skill
## Purpose
Design effective prompts for large language models that produce accurate, consistent, and well-structured outputs. This skill covers system prompt design, chain-of-thought reasoning, few-shot learning, structured output, and prompt evaluation. Good prompts are precise about what they want, explicit about format, and include enough context for the model to succeed.
## Key Concepts
### Prompt Components
```
SYSTEM PROMPT:
- Role and persona definition
- Core instructions and constraints
- Output format specification
- Rules and boundaries
USER PROMPT:
- The specific task or question
- Context and background information
- Input data to process
- Examples (few-shot)
ASSISTANT PREFILL:
- Partial response to guide format
- First few tokens to steer output
- JSON opening bracket for structured output
```
### Prompting Techniques
```
ZERO-SHOT:
Just the instruction, no examples.
"Classify this email as spam or not spam."
FEW-SHOT:
Instruction + examples of input/output pairs.
"Classify these emails:
Email: 'Win a free iPhone!' -> spam
Email: 'Meeting at 3pm tomorrow' -> not spam
Email: '{input}' -> "
CHAIN-OF-THOUGHT (CoT):
Ask the model to think step by step.
"Think through this problem step by step before giving your answer."
SELF-CONSISTENCY:
Generate multiple CoT paths and take the majority answer.
TREE OF THOUGHTS:
Explore multiple reasoning branches, evaluate each, prune bad ones.
RETRIEVAL-AUGMENTED:
Provide relevant context from a knowledge base before asking the question.
```
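The self-consistency technique reduces to a majority vote once the chain-of-thought samples are collected. A small sketch of the voting step (sampling the model N times is assumed to happen elsewhere):

```python
from collections import Counter

def self_consistent_answer(answers: list[str]) -> str:
    """Take the majority answer across multiple chain-of-thought samples.

    Answers are normalized (stripped, lowercased) so that "42" and " 42 "
    count as the same vote; ties resolve to the first-seen answer.
    """
    votes = Counter(a.strip().lower() for a in answers)
    answer, _count = votes.most_common(1)[0]
    return answer
```

In practice you would sample the same CoT prompt at a nonzero temperature, extract only the final answer from each sample, and vote on those.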
## Patterns
### System Prompt Template
```
You are [ROLE] with expertise in [DOMAIN].
Your task is to [OBJECTIVE].
## Rules
- [Rule 1]
- [Rule 2]
- [Rule 3]
## Output Format
[Exactly how the response should be structured]
## Examples
[Input/output examples if needed]
## Constraints
- Do not [boundary 1]
- Always [boundary 2]
- If uncertain, [fallback behavior]
```
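When the same template is reused across many agents, it helps to render it from data rather than hand-edit the text. A sketch of filling the template above (the parameter names mirror the bracketed placeholders and are otherwise arbitrary):

```python
def render_system_prompt(role: str, domain: str, objective: str,
                         rules: list[str], constraints: list[str],
                         output_format: str) -> str:
    """Fill the system prompt template from structured fields,
    keeping rules and constraints as markdown bullet lists."""
    rules_md = "\n".join(f"- {r}" for r in rules)
    constraints_md = "\n".join(f"- {c}" for c in constraints)
    return (
        f"You are {role} with expertise in {domain}.\n"
        f"Your task is to {objective}.\n\n"
        f"## Rules\n{rules_md}\n\n"
        f"## Output Format\n{output_format}\n\n"
        f"## Constraints\n{constraints_md}"
    )
```

Rendering from data also makes prompts easy to version-control and diff, which the best practices below recommend.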
### Classification Prompt (Few-Shot)
```
Classify the following customer support ticket into exactly one category.
Categories:
- billing: Payment issues, invoices, refunds, pricing questions
- technical: Bugs, errors, feature requests, integration help
- account: Login issues, password reset, account settings
- general: Everything else
Examples:
Ticket: "I was charged twice for my subscription"
Category: billing
Ticket: "The export button returns a 500 error"
Category: technical
Ticket: "I cannot log in with my Google account"
Category: account
Ticket: "{ticket_text}"
Category:
```
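A few-shot classification prompt like this is easiest to maintain when the categories and examples live as data and the prompt is assembled at call time. A minimal builder sketch (structure follows the template above; names are illustrative):

```python
def build_classification_prompt(categories: dict[str, str],
                                examples: list[tuple[str, str]],
                                ticket: str) -> str:
    """Assemble the few-shot ticket-classification prompt from
    category descriptions and (ticket, category) example pairs."""
    cat_lines = "\n".join(f"- {name}: {desc}" for name, desc in categories.items())
    ex_lines = "\n\n".join(f'Ticket: "{t}"\nCategory: {c}' for t, c in examples)
    return (
        "Classify the following customer support ticket into exactly one category.\n\n"
        f"Categories:\n{cat_lines}\n\n"
        f"Examples:\n\n{ex_lines}\n\n"
        f'Ticket: "{ticket}"\nCategory:'
    )
```

Ending the prompt with a bare `Category:` label invites the model to complete with just the category name, which keeps parsing trivial.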
### Structured Output (JSON)
```
Extract the following information from the provided text and return it as JSON.
Required fields:
- name (string): Full name of the person
- email (string): Email address
- company (string): Company name
- role (string): Job title or role
- sentiment (string): "positive", "neutral", or "negative"
If a field cannot be determined from the text, use null.
Text: "{input_text}"
Return ONLY valid JSON, no other text.
```
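Even with "Return ONLY valid JSON", models occasionally wrap the object in prose or code fences, so the caller should parse defensively and validate the schema. A sketch of the receiving side (field names match the prompt above; the slicing heuristic is one common approach, not the only one):

```python
import json

REQUIRED_FIELDS = {"name", "email", "company", "role", "sentiment"}

def parse_extraction(raw: str) -> dict:
    """Parse the model's JSON reply, tolerating stray text or ``` fences,
    and verify that all required fields are present."""
    # Slice to the outermost braces to drop any surrounding prose/fences.
    start, end = raw.find("{"), raw.rfind("}")
    if start == -1 or end == -1:
        raise ValueError("no JSON object found in model output")
    data = json.loads(raw[start : end + 1])
    missing = REQUIRED_FIELDS - data.keys()
    if missing:
        raise ValueError(f"missing fields: {sorted(missing)}")
    return data
```

Fields the model could not determine arrive as `null` (per the prompt) and parse to `None`, so presence checks must use keys, not truthiness.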
### Chain-of-Thought for Complex Reasoning
```
Analyze the following code for security vulnerabilities.
For each vulnerability found:
1. First, identify the line(s) of code that are problematic
2. Then, explain what type of vulnerability this is (e.g., XSS, SQL injection)
3. Next, describe how an attacker could exploit it
4. Finally, provide the fixed code
Think through each potential vulnerability carefully before reporting it.
Do not report false positives. If the code is secure, say so.
Code:
{code}
```
### Self-Reflection Prompt
```
[After initial response]
Now review your answer:
1. Are there any errors in your reasoning?
2. Did you miss any important edge cases?
3. Is your answer complete and actionable?
4. Would a domain expert agree with your assessment?
If you find issues, provide a corrected answer. If your answer is correct, confirm it.
```
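Self-reflection is a two-pass conversation: the model answers, then reviews its own answer in the same context. A sketch of the orchestration, where `call_llm(messages)` is an assumed helper that sends a message list to your model of choice and returns its text:

```python
def reflect(call_llm, task_prompt: str, review_prompt: str) -> tuple[str, str]:
    """Run a task prompt, then ask the model to review its own answer.

    `call_llm` is an assumed helper: list[dict] messages -> str reply.
    Returns (initial_answer, reviewed_answer).
    """
    answer = call_llm([{"role": "user", "content": task_prompt}])
    # Replay the exchange so the review happens with full context.
    review = call_llm([
        {"role": "user", "content": task_prompt},
        {"role": "assistant", "content": answer},
        {"role": "user", "content": review_prompt},
    ])
    return answer, review
```

The review pass costs a second model call, so reserve it for tasks where correctness matters more than latency.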
## Prompt Optimization Techniques
### Be Specific, Not Vague
```
BAD: "Write something about databases"
GOOD: "Write a 3-paragraph explanation of PostgreSQL JSONB indexing
for a developer who knows SQL but has not used JSONB before.
Include one code example."
```
### Use Delimiters for Input Data
```
BAD: "Summarize this: The quick brown fox..."
GOOD: "Summarize the text between the <article> tags.
<article>
The quick brown fox...
</article>"
```
### Specify Output Format Explicitly
```
BAD: "List the pros and cons"
GOOD: "List the pros and cons in this exact format:
## Pros
1. [pro]
2. [pro]
## Cons
1. [con]
2. [con]
Limit to 3-5 items per section."
```
### Use Negative Instructions Sparingly
```
BAD: "Don't use jargon, don't be verbose, don't include examples"
GOOD: "Use plain language that a non-technical reader can understand.
Keep the response under 200 words.
Focus on concepts, not code."
```
## Evaluation
### Prompt Quality Metrics
```
ACCURACY: Does the output contain correct information?
RELEVANCE: Does it answer what was asked?
COMPLETENESS: Are all required parts present?
FORMAT: Does it match the specified output structure?
CONSISTENCY: Does it produce similar quality across different inputs?
SAFETY: Does it avoid harmful, biased, or inappropriate content?
```
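Of these metrics, FORMAT is the easiest to check automatically. A sketch of a heading-based format score for outputs that must follow a markdown section structure (like the pros/cons template above; the scoring scheme is one simple choice):

```python
import re

def format_score(output: str, required_headings: list[str]) -> float:
    """FORMAT metric: fraction of required `## Heading` sections present.

    A heading counts only if it occupies its own line, so partial or
    inlined matches do not score.
    """
    found = sum(
        1 for h in required_headings
        if re.search(rf"^##\s*{re.escape(h)}\s*$", output, re.MULTILINE)
    )
    return found / len(required_headings)
```

Cheap checks like this can gate a retry loop: if the score is below 1.0, re-ask the model with the format instructions repeated.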
### A/B Testing Prompts
```
Prompt A: "Summarize this article in 3 bullet points."
Prompt B: "Read the article carefully. Extract the 3 most important
takeaways. Present each as a single sentence."
Test both with 20+ diverse inputs. Compare on:
- Information coverage
- Clarity of output
- Consistency across inputs
- Token efficiency
```
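The A/B comparison above can be run as a small harness that scores both prompts over the same inputs. A sketch, where `run(prompt, text)` is an assumed helper that calls the model and `score(output)` is any metric like the ones in the previous section:

```python
def ab_test(prompt_a: str, prompt_b: str, inputs: list[str], run, score) -> dict:
    """Score two prompt variants over a shared input set.

    `run` and `score` are assumed helpers: run calls the model with a
    prompt and one input; score maps an output to a float.
    Returns mean scores keyed by variant.
    """
    totals = {"A": 0.0, "B": 0.0}
    for text in inputs:
        totals["A"] += score(run(prompt_a, text))
        totals["B"] += score(run(prompt_b, text))
    n = len(inputs)
    return {variant: total / n for variant, total in totals.items()}
```

Because both variants see the identical inputs, differences in mean score reflect the prompts rather than the sample; 20+ diverse inputs (as the template suggests) keeps the comparison from overfitting to one example.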
## Best Practices
1. **Be explicit about format** -- show the model exactly what the output should look like
2. **Provide examples** -- few-shot examples are the most reliable way to steer output
3. **Use chain-of-thought for reasoning** -- "think step by step" improves accuracy on complex tasks
4. **Separate instructions from data** -- use XML tags or delimiters to prevent prompt injection
5. **Start with the simplest prompt** -- add complexity only when simpler versions fail
6. **Test with diverse inputs** -- a prompt that works for one input may fail on edge cases
7. **Iterate systematically** -- change one thing at a time and measure the impact
8. **Use system prompts for persistent instructions** -- role, format, and rules go in system
9. **Avoid ambiguity** -- "a few" means different things to different people; use exact numbers
10. **Version control your prompts** -- prompts are code; track changes and roll back when needed
## Common Pitfalls
| Pitfall | Impact | Fix |
|---------|--------|-----|
| Vague instructions | Inconsistent, low-quality output | Be specific about task, format, and constraints |
| Too many instructions | Model ignores some rules | Prioritize; fewer clear rules beat many vague ones |
| No examples | Model guesses at format | Add 2-3 representative examples |
| Prompt injection vulnerability | User input manipulates behavior | Use delimiters, validate input |
| Not testing edge cases | Fails on unusual inputs | Test with 20+ diverse examples |
| Over-engineering | Simple task, complex prompt | Start simple, add complexity as needed |
Related Skills
promptfoo
LLM red teaming and security testing — automated vulnerability scanning for AI agents, RAGs, and LLM pipelines. Covers prompt injection, jailbreaks, data leaks, PII exposure, and 50+ vulnerability types.
prompt-caching
Prompt caching strategies for LLM APIs — cache breakpoints, system prompt caching, and cost optimization.
context-engineering
Context window optimization, token budget management, and information compression for AI-assisted workflows
ai-prompts
AI prompt library covering system prompts, few-shot templates, structured output schemas, and prompt engineering patterns for production LLM applications
ultrathink
UltraThink Workflow OS — 4-layer skill mesh with persistent memory and privacy hooks for complex engineering tasks. Routes prompts through intent detection to activate the right domain skills automatically.
ultrathink_review
Multi-pass code review powered by UltraThink's quality gate — checks correctness, security (OWASP), performance, readability, and project conventions in a single structured pass.
ultrathink_memory
Persistent memory system for UltraThink — search, save, and recall project context, decisions, and patterns across sessions using Postgres-backed fuzzy search with synonym expansion.
ui-design
Comprehensive UI design system: 230+ font pairings, 48 themes, 65 design systems, 23 design languages, 30 UX laws, 14 color systems, Swiss grid, Gestalt principles, Pencil.dev workflow. Inherits ui-ux-pro-max (99 UX rules) + impeccable-frontend-design (anti-AI-slop). Triggers on any design, UI, layout, typography, color, theme, or styling task.
Zod
TypeScript-first schema validation with static type inference.
webinar-registration-page
Build a webinar or live event registration page as a self-contained HTML file with countdown timer, speaker bio, agenda, and registration form. Triggers on: "build a webinar registration page", "create a webinar sign-up page", "event registration landing page", "live training registration page", "workshop sign-up page", "create a webinar page", "build an event page", "free webinar landing page", "live demo registration page", "online event page", "create a registration page for my webinar", "build a training event page".
webhooks
Webhook design patterns — delivery, retry with exponential backoff, HMAC signature verification, payload validation, idempotency keys
web-workers
Offload heavy computation from the main thread using Web Workers, SharedWorkers, and Comlink — structured messaging, transferable objects, and off-main-thread architecture patterns