Canva is looking for an Engineering Manager (Infra) - AI Reliability to lead the team building the infrastructure that powers our in-house AI research.
What You'll Do
- Build AI infrastructure to support a 100+ person research team at the forefront of creative AI.
- Design and scale multi-cloud systems that support high-performance model training and inference.
- Partner across AWS, GCP, Cloudflare and GCore to optimise GPU compute environments.
- Improve CI/CD pipelines and developer velocity across our AI platform teams.
- Improve monitoring, alerting and system observability for AI workloads.
- Drive alignment on DevOps best practices across the AI platform and CORE engineering teams.
- Lead a high-impact engineering team.
What We're Looking For
- Experience leading DevOps or infrastructure teams, ideally in AI or high-performance computing environments.
- Hands-on experience with AWS (ECS, EC2, S3, IAM) and with multi-cloud environments that span providers such as GCP, Cloudflare or GCore.
- Experience with Kubernetes, SLURM, or similar distributed training infrastructure.
- Proficiency with infrastructure-as-code tools such as Terraform.
- Understanding of the lifecycle of AI models and how to support R&D at scale.
- A strong grasp of containerisation, Linux fundamentals, and cloud networking.
- A collaborative, curious, and passionate approach to enabling others.
Technical Stack
- AWS (ECS, EC2, S3, IAM)
- GCP, Cloudflare, GCore
- Kubernetes, SLURM
- Terraform
- Linux
Team & Environment
You will lead a team that is part of CORE (Canva Original Research & Exploration), our in-house AI research lab, which includes over 100 researchers.
Benefits & Compensation
- Equity packages
- Inclusive parental leave policy
- Annual Vibe & Thrive allowance for wellbeing, social connection, office setup & more
- Flexible leave options
Work Mode
This is a remote position open to candidates located in Australia and New Zealand.
Canva is an equal opportunity employer.

