multi-cloud-architecture
Decision framework and patterns for architecting applications across AWS, Azure, and GCP.
About this skill
This skill equips an AI agent with a robust decision framework and established architectural patterns for building, migrating, and optimizing applications across leading cloud providers: AWS, Azure, and GCP. It guides the agent in advising on cloud-agnostic solutions, making informed service selections, and formulating actionable strategies for complex multi-cloud deployments. The skill ensures that the architectural recommendations are based on best practices for scalability, resilience, cost-effectiveness, and vendor neutrality, helping users design solutions that leverage the strengths of each cloud environment.
Best use case
Designing new applications to be deployed across multiple cloud providers; migrating existing applications to a multi-cloud environment; optimizing cloud costs and performance by strategically distributing workloads; ensuring disaster recovery and business continuity through cloud provider diversification; seeking vendor neutrality in cloud infrastructure; evaluating and selecting appropriate cloud services from AWS, Azure, and GCP for specific architectural components.
Decision framework and patterns for architecting applications across AWS, Azure, and GCP.
A well-structured architectural recommendation that details patterns, service choices, and strategic considerations for a multi-cloud application. The output will include actionable steps, best practices, and validation criteria for the proposed design, potentially referencing a detailed implementation playbook if requested.
Practical example
Example input
I need to design a highly available, event-driven microservices architecture using serverless functions, a globally distributed NoSQL database, and message queuing across AWS, Azure, and GCP. What are the key architectural patterns and service recommendations to ensure vendor neutrality and disaster recovery?
Example output
To achieve a highly available, event-driven microservices architecture with vendor neutrality and disaster recovery across AWS, Azure, and GCP, consider the following:
**Architectural Patterns:**
1. **Abstraction Layer:** Use frameworks or custom adapters to abstract cloud-specific services, minimizing vendor lock-in.
2. **Multi-Cloud by Design:** Architect for active-active or active-passive deployment across regions/clouds for high availability and disaster recovery.
3. **Data Replication/Synchronization:** Implement robust mechanisms for data consistency across distributed databases.
4. **Service Meshes/API Gateways:** Standardize ingress/egress and inter-service communication regardless of cloud provider.
**Service Recommendations:**
* **Serverless Functions:**
* AWS: Lambda
* Azure: Azure Functions
* GCP: Cloud Functions
* **Recommendation:** Abstract function invocation and event triggers, possibly using a unified deployment tool like Serverless Framework.
* **NoSQL Database (Globally Distributed):**
* AWS: DynamoDB Global Tables
* Azure: Azure Cosmos DB (with multi-region writes)
* GCP: Firestore (Datastore mode) or Cloud Spanner (if relational aspects are critical)
* **Recommendation:** Cosmos DB offers strong global distribution capabilities. Ensure data synchronization and conflict resolution strategies are well-defined.
* **Message Queuing:**
* AWS: SQS/SNS
* Azure: Azure Service Bus/Event Hubs
* GCP: Cloud Pub/Sub
* **Recommendation:** Cloud Pub/Sub offers strong cross-cloud potential with its global reach. Consider a common messaging interface for your microservices.
**Key Considerations:**
* **Networking:** Implement secure cross-cloud connectivity (e.g., VPNs, direct connect) and define DNS strategies.
* **Identity & Access Management:** Centralize IAM or synchronize identities across clouds.
* **Monitoring & Logging:** Adopt a unified monitoring solution (e.g., Prometheus, Grafana, Datadog) that aggregates logs and metrics from all clouds.
* **Deployment:** Utilize Infrastructure as Code (IaC) tools like Terraform to manage resources uniformly.
**Actionable Steps:**
1. Define a clear 'cloud-agnostic' definition for each architectural component.
2. Pilot core services (e.g., a simple serverless function with a database entry) in each cloud to validate performance and integration.
3. Develop a disaster recovery plan detailing failover and failback procedures between clouds.
4. If detailed implementation examples are needed, consult the `resources/implementation-playbook.md` for specific code patterns and configurations.When to use this skill
- When you need to design an application that will operate across two or more major cloud platforms (AWS, Azure, GCP).
- When seeking guidance on best practices for achieving cloud-agnostic solutions, ensuring portability and interoperability.
- When requiring recommendations for service selection (e.g., compute, database, storage, networking) tailored for a multi-cloud context.
- When the goal is to build resilient, scalable, and cost-effective architectures that leverage the unique strengths of different cloud providers.
When not to use this skill
- When the task is strictly unrelated to cloud architecture or design.
- When your application exclusively targets a single cloud provider (e.g., AWS-only, Azure-only, GCP-only).
- When the required domain or tool is outside the scope of multi-cloud architectural patterns and decision frameworks (e.g., specific code debugging, deep-dive into non-cloud specific algorithms).
- When you need direct interaction or execution of commands against cloud APIs; this skill provides guidance, not operational control.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/multi-cloud-architecture/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How multi-cloud-architecture Compares
| Feature / Agent | multi-cloud-architecture | Standard Approach |
|---|---|---|
| Platform Support | Claude | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | medium | N/A |
Frequently Asked Questions
What does this skill do?
Decision framework and patterns for architecting applications across AWS, Azure, and GCP.
Which AI agents support this skill?
This skill is designed for Claude.
How difficult is it to install?
The installation complexity is rated as medium. You can find the installation instructions above.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
Best AI Skills for Claude
Explore the best AI skills for Claude and Claude Code across coding, research, workflow automation, documentation, and agent operations.
AI Agents for Coding
Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.
ChatGPT vs Claude for Agent Skills
Compare ChatGPT and Claude for AI agent skills across coding, writing, research, and reusable workflow execution.
SKILL.md Source
# Multi-Cloud Architecture
Decision framework and patterns for architecting applications across AWS, Azure, and GCP.
## Do not use this skill when
- The task is unrelated to multi-cloud architecture
- You need a different domain or tool outside this scope
## Instructions
- Clarify goals, constraints, and required inputs.
- Apply relevant best practices and validate outcomes.
- Provide actionable steps and verification.
- If detailed examples are required, open `resources/implementation-playbook.md`.
## Purpose
Design cloud-agnostic architectures and make informed decisions about service selection across cloud providers.
## Use this skill when
- Design multi-cloud strategies
- Migrate between cloud providers
- Select cloud services for specific workloads
- Implement cloud-agnostic architectures
- Optimize costs across providers
## Cloud Service Comparison
### Compute Services
| AWS | Azure | GCP | Use Case |
|-----|-------|-----|----------|
| EC2 | Virtual Machines | Compute Engine | IaaS VMs |
| ECS | Container Instances | Cloud Run | Containers |
| EKS | AKS | GKE | Kubernetes |
| Lambda | Functions | Cloud Functions | Serverless |
| Fargate | Container Apps | Cloud Run | Managed containers |
### Storage Services
| AWS | Azure | GCP | Use Case |
|-----|-------|-----|----------|
| S3 | Blob Storage | Cloud Storage | Object storage |
| EBS | Managed Disks | Persistent Disk | Block storage |
| EFS | Azure Files | Filestore | File storage |
| Glacier | Archive Storage | Archive Storage | Cold storage |
### Database Services
| AWS | Azure | GCP | Use Case |
|-----|-------|-----|----------|
| RDS | SQL Database | Cloud SQL | Managed SQL |
| DynamoDB | Cosmos DB | Firestore | NoSQL |
| Aurora | PostgreSQL/MySQL | Cloud Spanner | Distributed SQL |
| ElastiCache | Cache for Redis | Memorystore | Caching |
**Reference:** See `references/service-comparison.md` for complete comparison
## Multi-Cloud Patterns
### Pattern 1: Single Provider with DR
- Primary workload in one cloud
- Disaster recovery in another
- Database replication across clouds
- Automated failover
### Pattern 2: Best-of-Breed
- Use best service from each provider
- AI/ML on GCP
- Enterprise apps on Azure
- General compute on AWS
### Pattern 3: Geographic Distribution
- Serve users from nearest cloud region
- Data sovereignty compliance
- Global load balancing
- Regional failover
### Pattern 4: Cloud-Agnostic Abstraction
- Kubernetes for compute
- PostgreSQL for database
- S3-compatible storage (MinIO)
- Open source tools
## Cloud-Agnostic Architecture
### Use Cloud-Native Alternatives
- **Compute:** Kubernetes (EKS/AKS/GKE)
- **Database:** PostgreSQL/MySQL (RDS/SQL Database/Cloud SQL)
- **Message Queue:** Apache Kafka (MSK/Event Hubs/Confluent)
- **Cache:** Redis (ElastiCache/Azure Cache/Memorystore)
- **Object Storage:** S3-compatible API
- **Monitoring:** Prometheus/Grafana
- **Service Mesh:** Istio/Linkerd
### Abstraction Layers
```
Application Layer
↓
Infrastructure Abstraction (Terraform)
↓
Cloud Provider APIs
↓
AWS / Azure / GCP
```
## Cost Comparison
### Compute Pricing Factors
- **AWS:** On-demand, Reserved, Spot, Savings Plans
- **Azure:** Pay-as-you-go, Reserved, Spot
- **GCP:** On-demand, Committed use, Preemptible
### Cost Optimization Strategies
1. Use reserved/committed capacity (30-70% savings)
2. Leverage spot/preemptible instances
3. Right-size resources
4. Use serverless for variable workloads
5. Optimize data transfer costs
6. Implement lifecycle policies
7. Use cost allocation tags
8. Monitor with cloud cost tools
**Reference:** See `references/multi-cloud-patterns.md`
## Migration Strategy
### Phase 1: Assessment
- Inventory current infrastructure
- Identify dependencies
- Assess cloud compatibility
- Estimate costs
### Phase 2: Pilot
- Select pilot workload
- Implement in target cloud
- Test thoroughly
- Document learnings
### Phase 3: Migration
- Migrate workloads incrementally
- Maintain dual-run period
- Monitor performance
- Validate functionality
### Phase 4: Optimization
- Right-size resources
- Implement cloud-native services
- Optimize costs
- Enhance security
## Best Practices
1. **Use infrastructure as code** (Terraform/OpenTofu)
2. **Implement CI/CD pipelines** for deployments
3. **Design for failure** across clouds
4. **Use managed services** when possible
5. **Implement comprehensive monitoring**
6. **Automate cost optimization**
7. **Follow security best practices**
8. **Document cloud-specific configurations**
9. **Test disaster recovery** procedures
10. **Train teams** on multiple clouds
## Reference Files
- `references/service-comparison.md` - Complete service comparison
- `references/multi-cloud-patterns.md` - Architecture patterns
## Related Skills
- `terraform-module-library` - For IaC implementation
- `cost-optimization` - For cost management
- `hybrid-cloud-networking` - For connectivityRelated Skills
hybrid-cloud-architect
Expert hybrid cloud architect specializing in complex multi-cloud solutions across AWS/Azure/GCP and private clouds (OpenStack/VMware).
cloud-architect
Expert cloud architect specializing in AWS/Azure/GCP multi-cloud infrastructure design, advanced IaC (Terraform/OpenTofu/CDK), FinOps cost optimization, and modern architectural patterns.
multi-platform-apps-multi-platform
Build and deploy the same feature consistently across web, mobile, and desktop platforms using API-first architecture and parallel implementation strategies.
multi-agent-brainstorming
Simulate a structured peer-review process using multiple specialized agents to validate designs, surface hidden assumptions, and identify failure modes before implementation.
multi-advisor
Conselho de especialistas — consulta multiplos agentes do ecossistema em paralelo para analise multi-perspectiva de qualquer topico. Ativa personas, especialistas e agentes tecnicos simultaneamente, cada um pela sua otica unica, e consolida em sintese decisoria final.
lightning-architecture-review
Review Bitcoin Lightning Network protocol designs, compare channel factory approaches, and analyze Layer 2 scaling tradeoffs. Covers trust models, on-chain footprint, consensus requirements, HTLC/PTLC compatibility, liveness, and watchtower support.
hybrid-cloud-networking
Configure secure, high-performance connectivity between on-premises and cloud environments using VPN, Direct Connect, and ExpressRoute.
gcp-cloud-run
Specialized skill for building production-ready serverless applications on GCP. Covers Cloud Run services (containerized), Cloud Run Functions (event-driven), cold start optimization, and event-driven architecture with Pub/Sub.
error-debugging-multi-agent-review
Use when working with error debugging multi agent review
database-cloud-optimization-cost-optimize
You are a cloud cost optimization expert specializing in reducing infrastructure expenses while maintaining performance and reliability. Analyze cloud spending, identify savings opportunities, and implement cost-effective architectures across AWS, Azure, and GCP.
cloudformation-best-practices
CloudFormation template optimization, nested stacks, drift detection, and production-ready patterns. Use when writing or reviewing CF templates.
cloudflare-workers-expert
Expert in Cloudflare Workers and the Edge Computing ecosystem. Covers Wrangler, KV, D1, Durable Objects, and R2 storage.