Avanade is looking for a Cloud Resiliency Architect/Engineer to support customers through technical engagements focused on improving the reliability and recoverability of their cloud environments. You will assess cloud infrastructure and resiliency processes end-to-end, identifying risks and delivering actionable recommendations.
What You'll Do
- Assess the resiliency posture of customer environments from both technical and operational perspectives.
- Evaluate the effectiveness of disaster recovery (DR) plans, business continuity (BC) strategies, and Major Incident Response Plans (MIRPs).
- Conduct in-depth technical analysis of cloud infrastructure, application dependencies, observability configurations, and failover capabilities.
- Review and enhance MIRPs, including escalation protocols, incident communication plans, recovery sequencing, and impact containment strategies.
- Identify risks, vulnerabilities, and single points of failure across workloads and operational processes.
- Recommend improvements aligned with the Azure Well-Architected Framework, SRE principles, and ITIL practices.
- Engage customer teams to understand RTO/RPO targets, recovery workflows, and coordination models for major incidents.
- Deliver professional, customer-facing documentation summarizing technical findings and process maturity recommendations.
What We're Looking For
- Minimum 2+ years of hands-on experience.
- Deep experience with high availability (HA) and disaster recovery (DR) design in Microsoft Azure.
- Solid understanding of Azure architecture, including infrastructure, Availability Zones, backup/recovery, and monitoring services.
- Familiarity with cloud-native resiliency patterns and site reliability engineering (SRE) methods.
- Proven ability to assess and design effective Major Incident Response Plans (MIRPs) that align with operational SLAs and business risk tolerances.
- Experience in business continuity planning, incident response coordination, and process maturity assessments.
- Excellent communication and documentation skills for technical and executive stakeholders.
- Background in technical consulting or assessment-based delivery engagements.
- Microsoft Certified: Azure Solutions Architect Expert certification.
- Microsoft Certified: Azure Administrator Associate certification.
Nice to Have
- Experience contributing to or leading the development of enterprise-wide MIRP and DR testing programs.
- Familiarity with compliance frameworks such as ISO 22301, NIST SP 800-34, or SOC 2 Type II in the context of operational resilience.
- Prior experience supporting regulated industries (e.g., finance, healthcare, government) with stringent uptime, data protection, or continuity mandates.
- Hands-on experience with Azure BCDR tools such as Azure Site Recovery, Backup Vaults, Azure Automation, or Service Health Alerts.
- Working knowledge of multi-cloud or hybrid-cloud environments and related resiliency implications.
- Experience integrating DevOps pipelines with resiliency validation steps (e.g., backup validation, DR simulation, alerting thresholds).
- Exposure to incident simulation platforms, runbook automation, or response coordination tooling (e.g., Microsoft Sentinel, ServiceNow).
- Microsoft Certified: DevOps Engineer Expert certification.
- BC/DR certifications such as CBCP, MBCI, ISO 22301 Lead Implementer, or equivalent industry-recognized certifications.
- Microsoft Certified: Cybersecurity Architect Expert certification.
Technical Stack
- Microsoft Azure
- Azure Site Recovery
- Azure Backup Vaults
- Azure Automation
- Service Health Alerts
- Microsoft Sentinel
- ServiceNow
Work Mode
This is a local-country position open to candidates in British Columbia and Ontario.
Avanade is an equal opportunity employer.






