error-debugging-error-trace

You are an error tracking and observability expert specializing in implementing comprehensive error monitoring solutions. Set up error tracking systems, configure alerts, implement structured logging, and ensure teams can quickly identify and resolve production issues.

31,392 stars
Complexity: easy

About this skill

This skill transforms the AI agent into a dedicated error tracking and observability expert. It empowers the agent to assist users in designing, implementing, and optimizing comprehensive error monitoring solutions. This includes guiding the setup of error tracking systems, configuring effective alert rules, implementing structured logging practices, and establishing efficient workflows to quickly identify, diagnose, and resolve production issues. The agent can provide expert advice, best practices, and step-by-step guidance to ensure robust system stability and rapid problem resolution.

Best use case

Guiding users through the process of setting up or enhancing error monitoring systems, configuring detailed alerts, implementing structured logging, and establishing efficient error triage workflows to quickly resolve production issues and improve system reliability.

You are an error tracking and observability expert specializing in implementing comprehensive error monitoring solutions. Set up error tracking systems, configure alerts, implement structured logging, and ensure teams can quickly identify and resolve production issues.

Users will receive expert guidance and actionable recommendations for establishing or enhancing their error tracking and observability infrastructure. This leads to a clearer understanding of best practices, a structured approach to alert configuration and logging, and an improved ability to quickly identify, diagnose, and resolve production issues, ultimately resulting in more stable and reliable software systems.

Practical example

Example input

Help me design an error monitoring strategy for my new microservices application, focusing on critical alerts and efficient log management.

Example output

As an error tracking and observability expert, I recommend starting with a tiered alerting system. For critical microservices, implement 'P0' alerts for full outages or data corruption, ensuring immediate notification. For P1 issues like performance degradation, use clear thresholds. Ensure all logs are structured (JSON format is ideal) and include essential contextual data such as `trace_id`, `service_name`, `environment`, and `severity`. We should also discuss setting up a centralized logging solution like ELK or Datadog, and defining clear escalation paths for different alert severities to ensure swift and appropriate responses.

When to use this skill

  • When a development or operations team needs expert guidance on establishing or refining error tracking and observability practices. Use it to get advice on setting up new monitoring systems, optimizing existing alert configurations, implementing structured logging, or improving the overall incident response workflow to minimize downtime and quickly address production problems.

When not to use this skill

  • When you need the AI agent to directly execute code, debug a live system, or perform actual error tracking operations. This skill provides expert *advice* and *guidance* on setting up and optimizing observability, but it does not *act* as the monitoring system or perform the debugging itself.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/error-debugging-error-trace/SKILL.md --create-dirs "https://raw.githubusercontent.com/sickn33/antigravity-awesome-skills/main/plugins/antigravity-awesome-skills-claude/skills/error-debugging-error-trace/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/error-debugging-error-trace/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How error-debugging-error-trace Compares

Feature / Agenterror-debugging-error-traceStandard Approach
Platform SupportClaudeLimited / Varies
Context Awareness High Baseline
Installation ComplexityeasyN/A

Frequently Asked Questions

What does this skill do?

You are an error tracking and observability expert specializing in implementing comprehensive error monitoring solutions. Set up error tracking systems, configure alerts, implement structured logging, and ensure teams can quickly identify and resolve production issues.

Which AI agents support this skill?

This skill is designed for Claude.

How difficult is it to install?

The installation complexity is rated as easy. You can find the installation instructions above.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

SKILL.md Source

# Error Tracking and Monitoring

You are an error tracking and observability expert specializing in implementing comprehensive error monitoring solutions. Set up error tracking systems, configure alerts, implement structured logging, and ensure teams can quickly identify and resolve production issues.

## Use this skill when

- Implementing or improving error monitoring
- Configuring alerts, grouping, and triage workflows
- Setting up structured logging and tracing

## Do not use this skill when

- The system has no runtime or monitoring access
- The task is unrelated to observability or reliability
- You only need a one-off bug fix

## Context
The user needs to implement or improve error tracking and monitoring. Focus on real-time error detection, meaningful alerts, error grouping, performance monitoring, and integration with popular error tracking services.

## Requirements
$ARGUMENTS

## Instructions

- Assess current error capture, alerting, and grouping.
- Define severity levels and triage workflows.
- Configure logging, tracing, and alert routing.
- Validate signal quality with test errors.
- If detailed workflows are required, open `resources/implementation-playbook.md`.

## Safety

- Avoid logging secrets, tokens, or personal data.
- Use safe sampling to prevent overload in production.

## Resources

- `resources/implementation-playbook.md` for detailed monitoring patterns and examples.

Related Skills

error-diagnostics-error-trace

31392
from sickn33/antigravity-awesome-skills

You are an error tracking and observability expert specializing in implementing comprehensive error monitoring solutions. Set up error tracking systems, configure alerts, implement structured logging,

DevOps & InfrastructureClaude

error-debugging-error-analysis

31392
from sickn33/antigravity-awesome-skills

You are an expert error analysis specialist with deep expertise in debugging distributed systems, analyzing production incidents, and implementing comprehensive observability solutions.

DevOps & InfrastructureClaude

linux-shell-scripting

31392
from sickn33/antigravity-awesome-skills

Provide production-ready shell script templates for common Linux system administration tasks including backups, monitoring, user management, log analysis, and automation. These scripts serve as building blocks for security operations and penetration testing environments.

DevOps & InfrastructureClaude

iterate-pr

31392
from sickn33/antigravity-awesome-skills

Iterate on a PR until CI passes. Use when you need to fix CI failures, address review feedback, or continuously push fixes until all checks are green. Automates the feedback-fix-push-wait cycle.

DevOps & InfrastructureClaude

istio-traffic-management

31392
from sickn33/antigravity-awesome-skills

Comprehensive guide to Istio traffic management for production service mesh deployments.

DevOps & InfrastructureClaude

incident-runbook-templates

31392
from sickn33/antigravity-awesome-skills

Production-ready templates for incident response runbooks covering detection, triage, mitigation, resolution, and communication.

DevOps & InfrastructureClaude

incident-response-smart-fix

31392
from sickn33/antigravity-awesome-skills

[Extended thinking: This workflow implements a sophisticated debugging and resolution pipeline that leverages AI-assisted debugging tools and observability platforms to systematically diagnose and res

DevOps & InfrastructureClaudeGitHub Copilot

incident-responder

31392
from sickn33/antigravity-awesome-skills

Expert SRE incident responder specializing in rapid problem resolution, modern observability, and comprehensive incident management.

DevOps & InfrastructureClaude

expo-cicd-workflows

31392
from sickn33/antigravity-awesome-skills

Helps understand and write EAS workflow YAML files for Expo projects. Use this skill when the user asks about CI/CD or workflows in an Expo or EAS context, mentions .eas/workflows/, or wants help with EAS build pipelines or deployment automation.

DevOps & InfrastructureClaude

docker-expert

31392
from sickn33/antigravity-awesome-skills

You are an advanced Docker containerization expert with comprehensive, practical knowledge of container optimization, security hardening, multi-stage builds, orchestration patterns, and production deployment strategies based on current industry best practices.

DevOps & InfrastructureClaude

devops-troubleshooter

31392
from sickn33/antigravity-awesome-skills

Expert DevOps troubleshooter specializing in rapid incident response, advanced debugging, and modern observability.

DevOps & InfrastructureClaude

devops-deploy

31392
from sickn33/antigravity-awesome-skills

DevOps e deploy de aplicacoes — Docker, CI/CD com GitHub Actions, AWS Lambda, SAM, Terraform, infraestrutura como codigo e monitoramento.

DevOps & InfrastructureClaudeCursorGemini