photo-composition-critic

Expert photography composition critic grounded in graduate-level visual aesthetics education, computational aesthetics research (AVA, NIMA, LAION-Aesthetics, VisualQuality-R1), and professional image analysis with custom tooling. Use for image quality assessment, composition analysis, aesthetic scoring, photo critique. Activate on "photo critique", "composition analysis", "image aesthetics", "NIMA", "AVA dataset", "visual quality". NOT for photo editing/retouching (use native-app-designer), generating images (use Stability AI directly), or basic image processing (use clip-aware-embeddings).

85 stars

Best use case

photo-composition-critic is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Expert photography composition critic grounded in graduate-level visual aesthetics education, computational aesthetics research (AVA, NIMA, LAION-Aesthetics, VisualQuality-R1), and professional image analysis with custom tooling. Use for image quality assessment, composition analysis, aesthetic scoring, photo critique. Activate on "photo critique", "composition analysis", "image aesthetics", "NIMA", "AVA dataset", "visual quality". NOT for photo editing/retouching (use native-app-designer), generating images (use Stability AI directly), or basic image processing (use clip-aware-embeddings).

Teams using photo-composition-critic should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/photo-composition-critic/SKILL.md --create-dirs "https://raw.githubusercontent.com/curiositech/some_claude_skills/main/.claude/skills/photo-composition-critic/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/photo-composition-critic/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How photo-composition-critic Compares

Feature / Agentphoto-composition-criticStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Expert photography composition critic grounded in graduate-level visual aesthetics education, computational aesthetics research (AVA, NIMA, LAION-Aesthetics, VisualQuality-R1), and professional image analysis with custom tooling. Use for image quality assessment, composition analysis, aesthetic scoring, photo critique. Activate on "photo critique", "composition analysis", "image aesthetics", "NIMA", "AVA dataset", "visual quality". NOT for photo editing/retouching (use native-app-designer), generating images (use Stability AI directly), or basic image processing (use clip-aware-embeddings).

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Photo Composition Critic

Expert photography critic with deep grounding in graduate-level visual aesthetics, computational aesthetics research, and professional image analysis.

## When to Use This Skill

**Use for:**
- Evaluating image composition quality
- Aesthetic scoring with ML models (NIMA, LAION)
- Photo critique with actionable feedback
- Analyzing color harmony and visual balance
- Comparing multiple crop options
- Understanding photography theory

**Do NOT use for:**
- Generating images → use **Stability AI** directly
- Photo editing/retouching → use **native-app-designer**
- Simple image similarity → use **clip-aware-embeddings**
- Collage creation → use **collage-layout-expert**

## MCP Integrations

| MCP | Purpose |
|-----|---------|
| **Firecrawl** | Research latest computational aesthetics papers |
| **Hugging Face** (if configured) | Access NIMA, LAION aesthetic models |

## Quick Reference

### Compositional Frameworks

| Framework | Key Points |
|-----------|------------|
| **Visual Weight** | Size, color warmth, isolation, intrinsic interest, position |
| **Gestalt** | Proximity, similarity, continuity, closure, figure-ground |
| **Dynamic Symmetry** | Root rectangles (√2, √3, φ), baroque/sinister diagonals |
| **Arabesque** | S-curve, spiral, diagonal thrust - eye flow through frame |

### Color Harmony Types

| Type | Score | Notes |
|------|-------|-------|
| Complementary | 0.9 | High visual interest |
| Monochromatic | 0.85 | Safe, cohesive |
| Triadic | 0.85 | Balanced, vibrant |
| Analogous | 0.8 | Natural, harmonious |
| Achromatic | 0.7 | B&W or desaturated |
| Complex | 0.6 | May be chaotic or intentional |

### ML Model Score Interpretation

| Score Range | Meaning |
|-------------|---------|
| 7.0+ | Exceptional (top ~1%) |
| 6.5+ | Great (top ~5%) |
| 5.0-5.5 | Mediocre (most images) |
| <5.0 | Below average |

## Analysis Protocol

```
1. FIRST IMPRESSION (2 seconds)
   └── Where does the eye go? Emotional hit? Anything "off"?

2. TECHNICAL SCAN
   └── Exposure, focus, noise, color, artifacts

3. COMPOSITIONAL ANALYSIS
   └── Subject clarity, structure, balance, flow, depth, edges

4. AESTHETIC EVALUATION
   └── Light quality, color harmony, decisive moment, story

5. CONTEXTUAL ASSESSMENT
   └── Genre success, photographer intent, audience fit

6. ACTIONABLE RECOMMENDATIONS
   └── Specific improvements, post-processing, alt crops
```

## Anti-Patterns

### "Just use rule of thirds"

| What it looks like | Why it's wrong |
|--------------------|----------------|
| Blindly placing subjects on thirds intersections | Oversimplification ignores visual weight, gestalt, dynamic symmetry |
| **Instead**: Analyze visual weight center, consider multiple frameworks |

### "Higher NIMA score = better photo"

| What it looks like | Why it's wrong |
|--------------------|----------------|
| Using ML score as sole quality metric | Models trained on averages, miss artistic intent, polarizing works |
| **Instead**: Use ML as one input alongside theoretical analysis |

### "Color harmony means matching colors"

| What it looks like | Why it's wrong |
|--------------------|----------------|
| Recommending monochromatic or matchy palettes | Ignores Itten's contrasts, Albers' interaction effects |
| **Instead**: Evaluate harmony type AND contextual appropriateness |

### Ignoring genre context

| What it looks like | Why it's wrong |
|--------------------|----------------|
| Applying portrait criteria to documentary | Different genres have different quality signals |
| **Instead**: Assess against genre-appropriate standards |

## Reference Files

Load these for detailed implementations:

| File | Contents |
|------|----------|
| `references/composition-theory.md` | Arnheim visual weight, Gestalt, Dynamic Symmetry, Arabesque |
| `references/color-theory.md` | Albers interaction, Itten's 7 contrasts, harmony detection algo |
| `references/ml-models.md` | AVA dataset, NIMA, LAION-Aesthetics, VisualQuality-R1 |
| `references/analysis-scripts.md` | PhotoCritic class, MCP server implementation |

## Key Sources

**Theory**: Arnheim (1974), Hambidge (1926), Itten (1961), Albers (1963), Freeman (2007)

**Research**: AVA dataset (Murray 2012), NIMA (Talebi 2018), LAION-5B (Schuhmann 2022), Q-Instruct (Wu 2024)

Related Skills

photo-content-recognition-curation-expert

85
from curiositech/some_claude_skills

Expert in photo content recognition, intelligent curation, and quality filtering. Specializes in face/animal/place recognition, perceptual hashing for de-duplication, screenshot/meme detection, burst photo selection, and quick indexing strategies. Activate on 'face recognition', 'face clustering', 'perceptual hash', 'near-duplicate', 'burst photo', 'screenshot detection', 'photo curation', 'photo indexing', 'NSFW detection', 'pet recognition', 'DINOHash', 'HDBSCAN faces'. NOT for GPS-based location clustering (use event-detection-temporal-intelligence-expert), color palette extraction (use color-theory-palette-harmony-expert), semantic image-text matching (use clip-aware-embeddings), or video analysis/frame extraction.

design-critic

85
from curiositech/some_claude_skills

Aesthetic assessment and remix partner with trained visual taste. Provides structured design critiques using a 6-dimension scoring system inspired by VisualQuality-R1 chain-of-thought reasoning.

skill-coach

85
from curiositech/some_claude_skills

Guides creation of high-quality Agent Skills with domain expertise, anti-pattern detection, and progressive disclosure best practices. Use when creating skills, reviewing existing skills, or when users mention improving skill quality, encoding expertise, or avoiding common AI tooling mistakes. Activate on keywords: create skill, review skill, skill quality, skill best practices, skill anti-patterns. NOT for general coding advice or non-skill Claude Code features.

3d-cv-labeling-2026

85
from curiositech/some_claude_skills

Expert in 3D computer vision labeling tools, workflows, and AI-assisted annotation for LiDAR, point clouds, and sensor fusion. Covers SAM4D/Point-SAM, human-in-the-loop architectures, and vertical-specific training strategies. Activate on '3D labeling', 'point cloud annotation', 'LiDAR labeling', 'SAM 3D', 'SAM4D', 'sensor fusion annotation', '3D bounding box', 'semantic segmentation point cloud'. NOT for 2D image labeling (use clip-aware-embeddings), general ML training (use ml-engineer), video annotation without 3D (use computer-vision-pipeline), or VLM prompt engineering (use prompt-engineer).

wisdom-accountability-coach

85
from curiositech/some_claude_skills

Longitudinal memory tracking, philosophy teaching, and personal accountability with compassion. Expert in pattern recognition, Stoicism/Buddhism, and growth guidance. Activate on 'accountability', 'philosophy', 'Stoicism', 'Buddhism', 'personal growth', 'commitment tracking', 'wisdom teaching'. NOT for therapy or mental health treatment (refer to professionals), crisis intervention, or replacing professional coaching credentials.

windows-95-web-designer

85
from curiositech/some_claude_skills

Modern web applications with authentic Windows 95 aesthetic. Gradient title bars, Start menu paradigm, taskbar patterns, 3D beveled chrome. Extrapolates Win95 to AI chatbots, mobile UIs, responsive layouts. Activate on 'windows 95', 'win95', 'start menu', 'taskbar', 'retro desktop', '95 aesthetic', 'clippy'. NOT for Windows 3.1 (use windows-3-1-web-designer), vaporwave/synthwave, macOS, flat design.

windows-3-1-web-designer

85
from curiositech/some_claude_skills

Modern web applications with authentic Windows 3.1 aesthetic. Solid navy title bars, Program Manager navigation, beveled borders, single window controls. Extrapolates Win31 to AI chatbots (Cue Card paradigm), mobile UIs (pocket computing). Activate on 'windows 3.1', 'win31', 'program manager', 'retro desktop', '90s aesthetic', 'beveled'. NOT for Windows 95 (use windows-95-web-designer - has gradients, Start menu), vaporwave/synthwave, macOS, flat design.

win31-pixel-art-designer

85
from curiositech/some_claude_skills

Expert in Windows 3.1 era pixel art and graphics. Creates icons, banners, splash screens, and UI assets with authentic 16/256-color palettes, dithering patterns, and Program Manager styling. Activate on 'win31 icons', 'pixel art 90s', 'retro icons', '16-color', 'dithering', 'program manager icons', 'VGA palette'. NOT for modern flat icons, vaporwave art, or high-res illustrations.

win31-audio-design

85
from curiositech/some_claude_skills

Expert in Windows 3.1 era sound vocabulary for modern web/mobile apps. Creates satisfying retro UI sounds using CC-licensed 8-bit audio, Web Audio API, and haptic coordination. Activate on 'win31 sounds', 'retro audio', '90s sound effects', 'chimes', 'tada', 'ding', 'satisfying UI sounds'. NOT for modern flat UI sounds, voice synthesis, or music composition.

wedding-immortalist

85
from curiositech/some_claude_skills

Transform thousands of wedding photos and hours of footage into an immersive 3D Gaussian Splatting experience with theatre mode replay, face-clustered guest roster, and AI-curated best photos per person. Expert in 3DGS pipelines, face clustering, aesthetic scoring, and adaptive design matching the couple's wedding theme (disco, rustic, modern, LGBTQ+ celebrations). Activate on "wedding photos", "wedding video", "3D wedding", "Gaussian Splatting wedding", "wedding memory", "wedding immortalize", "face clustering wedding", "best wedding photos". NOT for general photo editing (use native-app-designer), non-wedding 3DGS (use drone-inspection-specialist), or event planning (not a wedding planner).

websocket-streaming

85
from curiositech/some_claude_skills

Implements real-time bidirectional communication between DAG execution engines and visualization dashboards via WebSocket. Covers connection management, typed event protocols, reconnection with backoff, and React hook integration. Activate on "WebSocket", "real-time updates", "live streaming", "execution events", "state streaming", "push notifications". NOT for HTTP REST APIs, server-sent events (SSE), or general networking.

webapp-testing

85
from curiositech/some_claude_skills

Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots, and viewing browser logs. Activate on: Playwright, webapp testing, browser automation, E2E testing, UI testing. NOT for API-only testing without browser, unit tests, or mobile app testing.