AI Agent Skill HUB

ClaudeDevOps & Infrastructure

error-debugging-error-trace

You are an error tracking and observability expert specializing in implementing comprehensive error monitoring solutions. Set up error tracking systems, configure alerts, implement structured logging, and ensure teams can quickly identify and resolve production issues.

31,392 stars

Complexity: easy

View on GitHub Installation ↓

About this skill

This skill transforms the AI agent into a dedicated error tracking and observability expert. It empowers the agent to assist users in designing, implementing, and optimizing comprehensive error monitoring solutions. This includes guiding the setup of error tracking systems, configuring effective alert rules, implementing structured logging practices, and establishing efficient workflows to quickly identify, diagnose, and resolve production issues. The agent can provide expert advice, best practices, and step-by-step guidance to ensure robust system stability and rapid problem resolution.

Best use case

Guiding users through the process of setting up or enhancing error monitoring systems, configuring detailed alerts, implementing structured logging, and establishing efficient error triage workflows to quickly resolve production issues and improve system reliability.

You are an error tracking and observability expert specializing in implementing comprehensive error monitoring solutions. Set up error tracking systems, configure alerts, implement structured logging, and ensure teams can quickly identify and resolve production issues.

Users will receive expert guidance and actionable recommendations for establishing or enhancing their error tracking and observability infrastructure. This leads to a clearer understanding of best practices, a structured approach to alert configuration and logging, and an improved ability to quickly identify, diagnose, and resolve production issues, ultimately resulting in more stable and reliable software systems.

Practical example

Example input

Help me design an error monitoring strategy for my new microservices application, focusing on critical alerts and efficient log management.

Example output

As an error tracking and observability expert, I recommend starting with a tiered alerting system. For critical microservices, implement 'P0' alerts for full outages or data corruption, ensuring immediate notification. For P1 issues like performance degradation, use clear thresholds. Ensure all logs are structured (JSON format is ideal) and include essential contextual data such as `trace_id`, `service_name`, `environment`, and `severity`. We should also discuss setting up a centralized logging solution like ELK or Datadog, and defining clear escalation paths for different alert severities to ensure swift and appropriate responses.

When to use this skill

When a development or operations team needs expert guidance on establishing or refining error tracking and observability practices. Use it to get advice on setting up new monitoring systems, optimizing existing alert configurations, implementing structured logging, or improving the overall incident response workflow to minimize downtime and quickly address production problems.

When not to use this skill

When you need the AI agent to directly execute code, debug a live system, or perform actual error tracking operations. This skill provides expert *advice* and *guidance* on setting up and optimizing observability, but it does not *act* as the monitoring system or perform the debugging itself.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/error-debugging-error-trace/SKILL.md --create-dirs "https://raw.githubusercontent.com/sickn33/antigravity-awesome-skills/main/plugins/antigravity-awesome-skills-claude/skills/error-debugging-error-trace/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/error-debugging-error-trace/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How error-debugging-error-trace Compares

Feature / Agent	error-debugging-error-trace	Standard Approach
Platform Support	Claude	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	easy	N/A

Frequently Asked Questions

What does this skill do?

You are an error tracking and observability expert specializing in implementing comprehensive error monitoring solutions. Set up error tracking systems, configure alerts, implement structured logging, and ensure teams can quickly identify and resolve production issues.

Which AI agents support this skill?

This skill is designed for Claude.

How difficult is it to install?

The installation complexity is rated as easy. You can find the installation instructions above.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

AI Agents for Coding

Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.

Top AI Agents for Productivity

See the top AI agent skills for productivity, workflow automation, operational systems, documentation, and everyday task execution.

Cursor vs Codex for AI Workflows

Compare Cursor and Codex for AI coding workflows, repository assistance, debugging, refactoring, and reusable developer skills.

SKILL.md Source

# Error Tracking and Monitoring

You are an error tracking and observability expert specializing in implementing comprehensive error monitoring solutions. Set up error tracking systems, configure alerts, implement structured logging, and ensure teams can quickly identify and resolve production issues.

## Use this skill when

- Implementing or improving error monitoring
- Configuring alerts, grouping, and triage workflows
- Setting up structured logging and tracing

## Do not use this skill when

- The system has no runtime or monitoring access
- The task is unrelated to observability or reliability
- You only need a one-off bug fix

## Context
The user needs to implement or improve error tracking and monitoring. Focus on real-time error detection, meaningful alerts, error grouping, performance monitoring, and integration with popular error tracking services.

## Requirements
$ARGUMENTS

## Instructions

- Assess current error capture, alerting, and grouping.
- Define severity levels and triage workflows.
- Configure logging, tracing, and alert routing.
- Validate signal quality with test errors.
- If detailed workflows are required, open `resources/implementation-playbook.md`.

## Safety

- Avoid logging secrets, tokens, or personal data.
- Use safe sampling to prevent overload in production.

## Resources

- `resources/implementation-playbook.md` for detailed monitoring patterns and examples.

Related Skills

error-diagnostics-error-trace

from sickn33/antigravity-awesome-skills

You are an error tracking and observability expert specializing in implementing comprehensive error monitoring solutions. Set up error tracking systems, configure alerts, implement structured logging,

DevOps & InfrastructureClaude

error-debugging-error-analysis

from sickn33/antigravity-awesome-skills

You are an expert error analysis specialist with deep expertise in debugging distributed systems, analyzing production incidents, and implementing comprehensive observability solutions.

DevOps & InfrastructureClaude

linux-shell-scripting

from sickn33/antigravity-awesome-skills

Provide production-ready shell script templates for common Linux system administration tasks including backups, monitoring, user management, log analysis, and automation. These scripts serve as building blocks for security operations and penetration testing environments.

DevOps & InfrastructureClaude

iterate-pr

from sickn33/antigravity-awesome-skills

Iterate on a PR until CI passes. Use when you need to fix CI failures, address review feedback, or continuously push fixes until all checks are green. Automates the feedback-fix-push-wait cycle.

DevOps & InfrastructureClaude

istio-traffic-management

from sickn33/antigravity-awesome-skills

Comprehensive guide to Istio traffic management for production service mesh deployments.

DevOps & InfrastructureClaude

incident-runbook-templates

from sickn33/antigravity-awesome-skills

Production-ready templates for incident response runbooks covering detection, triage, mitigation, resolution, and communication.

DevOps & InfrastructureClaude

incident-response-smart-fix

from sickn33/antigravity-awesome-skills

[Extended thinking: This workflow implements a sophisticated debugging and resolution pipeline that leverages AI-assisted debugging tools and observability platforms to systematically diagnose and res

DevOps & InfrastructureClaudeGitHub Copilot

incident-responder

from sickn33/antigravity-awesome-skills

Expert SRE incident responder specializing in rapid problem resolution, modern observability, and comprehensive incident management.

DevOps & InfrastructureClaude

expo-cicd-workflows

from sickn33/antigravity-awesome-skills

Helps understand and write EAS workflow YAML files for Expo projects. Use this skill when the user asks about CI/CD or workflows in an Expo or EAS context, mentions .eas/workflows/, or wants help with EAS build pipelines or deployment automation.

DevOps & InfrastructureClaude

docker-expert

from sickn33/antigravity-awesome-skills

You are an advanced Docker containerization expert with comprehensive, practical knowledge of container optimization, security hardening, multi-stage builds, orchestration patterns, and production deployment strategies based on current industry best practices.

DevOps & InfrastructureClaude

devops-troubleshooter

from sickn33/antigravity-awesome-skills

Expert DevOps troubleshooter specializing in rapid incident response, advanced debugging, and modern observability.

DevOps & InfrastructureClaude

devops-deploy

from sickn33/antigravity-awesome-skills

DevOps e deploy de aplicacoes — Docker, CI/CD com GitHub Actions, AWS Lambda, SAM, Terraform, infraestrutura como codigo e monitoramento.

DevOps & InfrastructureClaudeCursorGemini