error-debugging-error-trace
You are an error tracking and observability expert specializing in implementing comprehensive error monitoring solutions. Set up error tracking systems, configure alerts, implement structured logging, and ensure teams can quickly identify and resolve production issues.
About this skill
This skill configures the AI agent as an error tracking and observability expert. The agent helps users design, implement, and optimize error monitoring: setting up error tracking systems, configuring alert rules, adopting structured logging, and establishing workflows to identify, diagnose, and resolve production issues quickly. It provides best practices and step-by-step guidance.
Best use case
Guiding users through the process of setting up or enhancing error monitoring systems, configuring detailed alerts, implementing structured logging, and establishing efficient error triage workflows to quickly resolve production issues and improve system reliability.
Users will receive expert guidance and actionable recommendations for establishing or enhancing their error tracking and observability infrastructure. This leads to a clearer understanding of best practices, a structured approach to alert configuration and logging, and an improved ability to quickly identify, diagnose, and resolve production issues, ultimately resulting in more stable and reliable software systems.
Practical example
Example input
Help me design an error monitoring strategy for my new microservices application, focusing on critical alerts and efficient log management.
Example output
As an error tracking and observability expert, I recommend starting with a tiered alerting system. For critical microservices, implement 'P0' alerts for full outages or data corruption, ensuring immediate notification. For P1 issues like performance degradation, use clear thresholds. Ensure all logs are structured (JSON format is ideal) and include essential contextual data such as `trace_id`, `service_name`, `environment`, and `severity`. We should also discuss setting up a centralized logging solution like ELK or Datadog, and defining clear escalation paths for different alert severities to ensure swift and appropriate responses.
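The structured logging advice in the example output can be sketched with Python's standard `logging` module. This is a minimal illustration, not part of the skill itself: the `JsonFormatter` class, the `build_logger` helper, and the `checkout-service` name are hypothetical, and passing `trace_id` through `extra` is one assumed convention among several.

```python
import json
import logging
from datetime import datetime, timezone


class JsonFormatter(logging.Formatter):
    """Render each log record as a single JSON object (hypothetical helper)."""

    def __init__(self, service_name: str, environment: str) -> None:
        super().__init__()
        self.service_name = service_name
        self.environment = environment

    def format(self, record: logging.LogRecord) -> str:
        payload = {
            "timestamp": datetime.fromtimestamp(
                record.created, tz=timezone.utc
            ).isoformat(),
            "severity": record.levelname,
            "service_name": self.service_name,
            "environment": self.environment,
            # trace_id is supplied per call via `extra`; None if the caller omitted it.
            "trace_id": getattr(record, "trace_id", None),
            "message": record.getMessage(),
        }
        if record.exc_info:
            payload["exception"] = self.formatException(record.exc_info)
        return json.dumps(payload)


def build_logger(service_name: str, environment: str) -> logging.Logger:
    """Return a logger that writes one JSON line per event to stderr."""
    logger = logging.getLogger(service_name)
    logger.setLevel(logging.INFO)
    logger.propagate = False  # avoid duplicate output through the root logger
    if not logger.handlers:
        handler = logging.StreamHandler()
        handler.setFormatter(JsonFormatter(service_name, environment))
        logger.addHandler(handler)
    return logger


if __name__ == "__main__":
    log = build_logger("checkout-service", "production")
    log.error("payment provider timed out", extra={"trace_id": "abc123"})
```

Emitting one JSON object per line keeps the output easy to ship to a centralized logging backend, and a shared `trace_id` lets events from different services be correlated.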
When to use this skill
- When a development or operations team needs expert guidance on establishing or refining error tracking and observability practices. Use it to get advice on setting up new monitoring systems, optimizing existing alert configurations, implementing structured logging, or improving the overall incident response workflow to minimize downtime and quickly address production problems.
When not to use this skill
- When you need the AI agent to directly execute code, debug a live system, or perform actual error tracking operations. This skill provides expert *advice* and *guidance* on setting up and optimizing observability, but it does not *act* as the monitoring system or perform the debugging itself.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in `.claude/skills/error-debugging-error-trace/SKILL.md` inside your project
- Restart your AI agent; it will auto-discover the skill
How error-debugging-error-trace Compares
| Feature / Agent | error-debugging-error-trace | Standard Approach |
|---|---|---|
| Platform Support | Claude | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Easy | N/A |
Frequently Asked Questions
What does this skill do?
It sets up the agent as an error tracking and observability expert that helps you set up error tracking systems, configure alerts, implement structured logging, and quickly identify and resolve production issues.
Which AI agents support this skill?
This skill is designed for Claude.
How difficult is it to install?
The installation complexity is rated as easy. You can find the installation instructions above.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
AI Agents for Coding
Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.
Top AI Agents for Productivity
See the top AI agent skills for productivity, workflow automation, operational systems, documentation, and everyday task execution.
Cursor vs Codex for AI Workflows
Compare Cursor and Codex for AI coding workflows, repository assistance, debugging, refactoring, and reusable developer skills.
SKILL.md Source
# Error Tracking and Monitoring

You are an error tracking and observability expert specializing in implementing comprehensive error monitoring solutions. Set up error tracking systems, configure alerts, implement structured logging, and ensure teams can quickly identify and resolve production issues.

## Use this skill when

- Implementing or improving error monitoring
- Configuring alerts, grouping, and triage workflows
- Setting up structured logging and tracing

## Do not use this skill when

- The system has no runtime or monitoring access
- The task is unrelated to observability or reliability
- You only need a one-off bug fix

## Context

The user needs to implement or improve error tracking and monitoring. Focus on real-time error detection, meaningful alerts, error grouping, performance monitoring, and integration with popular error tracking services.

## Requirements

$ARGUMENTS

## Instructions

- Assess current error capture, alerting, and grouping.
- Define severity levels and triage workflows.
- Configure logging, tracing, and alert routing.
- Validate signal quality with test errors.
- If detailed workflows are required, open `resources/implementation-playbook.md`.

## Safety

- Avoid logging secrets, tokens, or personal data.
- Use safe sampling to prevent overload in production.

## Resources

- `resources/implementation-playbook.md` for detailed monitoring patterns and examples.
Related Skills
error-diagnostics-error-trace
You are an error tracking and observability expert specializing in implementing comprehensive error monitoring solutions. Set up error tracking systems, configure alerts, implement structured logging,
error-debugging-error-analysis
You are an expert error analysis specialist with deep expertise in debugging distributed systems, analyzing production incidents, and implementing comprehensive observability solutions.
linux-shell-scripting
Provide production-ready shell script templates for common Linux system administration tasks including backups, monitoring, user management, log analysis, and automation. These scripts serve as building blocks for security operations and penetration testing environments.
iterate-pr
Iterate on a PR until CI passes. Use when you need to fix CI failures, address review feedback, or continuously push fixes until all checks are green. Automates the feedback-fix-push-wait cycle.
istio-traffic-management
Comprehensive guide to Istio traffic management for production service mesh deployments.
incident-runbook-templates
Production-ready templates for incident response runbooks covering detection, triage, mitigation, resolution, and communication.
incident-response-smart-fix
[Extended thinking: This workflow implements a sophisticated debugging and resolution pipeline that leverages AI-assisted debugging tools and observability platforms to systematically diagnose and res
incident-responder
Expert SRE incident responder specializing in rapid problem resolution, modern observability, and comprehensive incident management.
expo-cicd-workflows
Helps understand and write EAS workflow YAML files for Expo projects. Use this skill when the user asks about CI/CD or workflows in an Expo or EAS context, mentions .eas/workflows/, or wants help with EAS build pipelines or deployment automation.
docker-expert
You are an advanced Docker containerization expert with comprehensive, practical knowledge of container optimization, security hardening, multi-stage builds, orchestration patterns, and production deployment strategies based on current industry best practices.
devops-troubleshooter
Expert DevOps troubleshooter specializing in rapid incident response, advanced debugging, and modern observability.
devops-deploy
DevOps and application deployment: Docker, CI/CD with GitHub Actions, AWS Lambda, SAM, Terraform, infrastructure as code, and monitoring.