assessment-item-development

Create valid, reliable assessment items across formats (multiple choice, constructed response, performance tasks) following psychometric best practices

509 stars

Best use case

assessment-item-development is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using assessment-item-development should expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/assessment-item-development/SKILL.md --create-dirs "https://raw.githubusercontent.com/a5c-ai/babysitter/main/library/specializations/domains/social-sciences-humanities/education/skills/assessment-item-development/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/assessment-item-development/SKILL.md inside your project
  3. Restart your AI agent; it will auto-discover the skill

How assessment-item-development Compares

| Feature / Agent | assessment-item-development | Standard Approach |
| --- | --- | --- |
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |

Frequently Asked Questions

What does this skill do?

It creates valid, reliable assessment items across formats (multiple choice, constructed response, performance tasks) following psychometric best practices.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Assessment Item Development

Create valid, reliable assessment items across formats including multiple choice, constructed response, and performance tasks following psychometric best practices.

## Overview

This skill enables the development of high-quality assessment items that accurately measure learning outcomes. It encompasses item writing across formats, alignment with objectives, and application of psychometric principles to create valid and reliable assessments.

## Capabilities

### Multiple Choice Items
- Write clear, unambiguous stems
- Develop plausible distractors
- Avoid item-writing flaws
- Address various cognitive levels
- Apply item analysis principles

### Constructed Response
- Design short-answer items
- Create essay prompts
- Develop case-based questions
- Write open-ended problems
- Create scoring guidelines

### Performance Tasks
- Design authentic tasks
- Develop task specifications
- Create rubrics and scoring guides
- Plan administration conditions
- Document task requirements

### Quality Assurance
- Review for bias and sensitivity
- Verify content alignment
- Apply item statistics
- Conduct item review
- Document item metadata
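
As an illustration of "Apply item statistics", the sketch below computes two classical test theory statistics from pilot data: item difficulty (the proportion of examinees answering correctly) and point-biserial discrimination (the correlation between the item score and the total score). The skill does not state which statistics it uses, so this choice, the function names, and the sample data are all assumptions.

```python
from statistics import mean, pstdev


def item_difficulty(item_scores):
    """Proportion of examinees who answered the item correctly (scores are 0 or 1)."""
    return mean(item_scores)


def point_biserial(item_scores, total_scores):
    """Point-biserial correlation between a 0/1 item score and the total test score.

    Uses r = (M1 - M0) / s * sqrt(p * q), where M1 and M0 are the mean totals of
    examinees who got the item right and wrong, s is the population standard
    deviation of the totals, and p is the item difficulty.
    """
    p = mean(item_scores)
    sd_total = pstdev(total_scores)
    if sd_total == 0 or p in (0, 1):
        # The statistic is undefined here; this sketch reports 0.0 (an assumption).
        return 0.0
    pairs = list(zip(item_scores, total_scores))
    mean_correct = mean(total for score, total in pairs if score == 1)
    mean_incorrect = mean(total for score, total in pairs if score == 0)
    return (mean_correct - mean_incorrect) / sd_total * (p * (1 - p)) ** 0.5


# Hypothetical pilot data: five examinees, one item, total scores out of 10.
item_scores = [1, 1, 0, 1, 0]
total_scores = [9, 8, 4, 7, 5]

print("difficulty:", item_difficulty(item_scores))
print("discrimination:", round(point_biserial(item_scores, total_scores), 3))
```

Note that the totals here include the item itself; on short tests a corrected total (with the item removed) is often preferred, and a sketch like this would need adapting for that.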

## Usage Guidelines

### Item Development Process
1. Review learning objectives
2. Select appropriate item format
3. Draft items following guidelines
4. Review and revise items
5. Pilot test when possible
6. Analyze and refine

### Multiple Choice Guidelines
- Single correct answer
- Parallel answer choices
- Avoid "all of the above"
- Place correct answer randomly
- Keep options similar length
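
Several of the guidelines above can be checked mechanically. The following is a minimal sketch, not the skill's own implementation: the `MultipleChoiceItem` class, the `find_flaws` and `shuffle_options` helpers, and the length-ratio threshold of 2.0 are hypothetical names and values chosen for illustration. Checking that answer choices are parallel in structure is left to a human reviewer.

```python
import random
from dataclasses import dataclass


@dataclass
class MultipleChoiceItem:
    stem: str
    options: list[str]
    correct: set[int]  # indices of the options keyed as correct


def find_flaws(item, max_length_ratio=2.0):
    """Return a list of guideline violations found in the item."""
    flaws = []
    if len(item.correct) != 1:
        flaws.append("must have exactly one correct answer")
    if any(opt.strip().lower() == "all of the above" for opt in item.options):
        flaws.append('avoid "all of the above"')
    lengths = [len(opt) for opt in item.options]
    # Assumed heuristic: flag when the longest option is over twice the shortest.
    if lengths and min(lengths) > 0 and max(lengths) / min(lengths) > max_length_ratio:
        flaws.append("options differ too much in length")
    return flaws


def shuffle_options(item, rng):
    """Return a copy of the item with options in random order and the key remapped."""
    order = list(range(len(item.options)))
    rng.shuffle(order)
    new_options = [item.options[old] for old in order]
    new_correct = {new for new, old in enumerate(order) if old in item.correct}
    return MultipleChoiceItem(item.stem, new_options, new_correct)


item = MultipleChoiceItem(
    stem="Which city is the capital of France?",
    options=["Paris", "Lyon", "Nice", "All of the above"],
    correct={0},
)
print(find_flaws(item))

shuffled = shuffle_options(item, random.Random())
print(shuffled.options, shuffled.correct)
```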

### Constructed Response Guidelines
- Clear task requirements
- Specific scoring criteria
- Appropriate scope
- Sufficient context
- Model responses available
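
One way to keep these guidelines visible during drafting is to store each constructed response item together with its scoring criteria and model response. The sketch below is an assumption about how that could look; the `Criterion` and `ConstructedResponseItem` classes and their methods are hypothetical and are not defined by the skill. Scope and context still need human judgment.

```python
from dataclasses import dataclass


@dataclass
class Criterion:
    description: str
    max_points: int


@dataclass
class ConstructedResponseItem:
    prompt: str
    criteria: list[Criterion]
    model_response: str = ""

    def max_score(self):
        """Total points available across all scoring criteria."""
        return sum(c.max_points for c in self.criteria)

    def review_gaps(self):
        """List the guideline elements that are still missing from the item."""
        gaps = []
        if not self.prompt.strip():
            gaps.append("prompt is empty")
        if not self.criteria:
            gaps.append("no scoring criteria")
        if not self.model_response.strip():
            gaps.append("no model response")
        return gaps

    def score(self, awarded):
        """Sum the points awarded per criterion, rejecting out-of-range values."""
        if len(awarded) != len(self.criteria):
            raise ValueError("one score is required per criterion")
        for points, criterion in zip(awarded, self.criteria):
            if not 0 <= points <= criterion.max_points:
                raise ValueError(f"score out of range for: {criterion.description}")
        return sum(awarded)


item = ConstructedResponseItem(
    prompt="Explain one cause of the water cycle and support it with evidence.",
    criteria=[Criterion("States a correct cause", 2), Criterion("Supports it with evidence", 3)],
)
print(item.max_score())
print(item.review_gaps())
```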

## Integration Points

### Related Processes
- Formative Assessment Design
- Summative Assessment Development
- Item Writing and Test Development

### Collaborating Skills
- learning-objectives-writing
- rubric-design-validation
- learning-analytics-interpretation

## References

- Item writing guidelines (Haladyna)
- ETS item development standards
- Psychometric principles
- Assessment best practices

Related Skills

vue-development

509
from a5c-ai/babysitter

Vue 3 development with Composition API, reactivity system, component patterns, TypeScript integration, and best practices.

react-development

509
from a5c-ai/babysitter

Specialized skill for React component development, hooks patterns, state management, context API, performance optimization, and modern React best practices.

angular-development

509
from a5c-ai/babysitter

Angular development patterns including modules, components, services, dependency injection, signals, and enterprise architecture.

REPL Development

509
from a5c-ai/babysitter

Expert skill for building interactive REPLs with rich editing and evaluation features

Swift/SwiftUI Development

509
from a5c-ai/babysitter

Expert skill for native iOS development with Swift and SwiftUI

React Native Development

509
from a5c-ai/babysitter

Deep integration with React Native ecosystem for cross-platform mobile development

Kotlin/Jetpack Compose Development

509
from a5c-ai/babysitter

Expert skill for native Android development with Kotlin and Jetpack Compose

Flutter/Dart Development

509
from a5c-ai/babysitter

Specialized skill for Flutter app development and Dart programming

unreal-development

509
from a5c-ai/babysitter

Unreal Engine integration skill for C++/Blueprint development, actor lifecycle management, plugin development, and editor automation. Enables LLMs to interact with Unreal Editor through MCP servers for level manipulation, Blueprint generation, and automated workflows.

unity-development

509
from a5c-ai/babysitter

Unity Engine integration skill for project setup, C# scripting, scene management, prefab creation, and editor automation. Enables LLMs to interact with Unity Editor through MCP servers for asset manipulation, script generation, and automated workflows.

godot-development

509
from a5c-ai/babysitter

Godot Engine integration skill for GDScript/C# development, scene composition, node management, and editor automation. Enables LLMs to interact with Godot Editor through MCP servers for asset manipulation, script generation, and automated workflows.

psychometric-assessment

509
from a5c-ai/babysitter

Develop, validate, and adapt measurement instruments including factor analysis, reliability testing, and cross-cultural validation