Visa is hiring a Site Reliability Engineer as a senior individual contributor within the SRE Tribe. In this role, you will be responsible for owning and evolving the containerized platform that underpins critical workloads, serving as a technical reference for platform reliability, resilience, and automation.
What You'll Do
- Own the end-to-end lifecycle of core platform components, including cloud infrastructure primitives, Kubernetes clusters and services, networking, ingress, service discovery, and Service Mesh.
- Ensure platform components are resilient by design, applying SRE principles like fault isolation, capacity planning, and reduced operational toil.
- Continuously assess and mitigate reliability risks to proactively improve platform stability and operational readiness.
- Lead the design and implementation of infrastructure bootstrap orchestration, including automated cluster provisioning and dependency-aware orchestration.
- Drive a strong Infrastructure-as-Code and GitOps-first approach to ensure reproducible, auditable, and automated platform changes.
- Identify automation gaps and lead initiatives to significantly reduce human effort, onboarding time, and operational risk.
- Apply and promote SRE practices, including ownership of runbooks, participation in on-call rotation, and incident response and problem management.
- Improve platform operability by simplifying day-2 operations, standardizing upgrade strategies, and reducing Mean Time to Detect and Mean Time to Recover.
- Ensure platform operations align with security, compliance, and internal control requirements.
What We're Looking For
- Strong hands-on experience with Public Cloud platforms (AWS preferred, Azure).
- Strong hands-on experience administrating Kubernetes at scale in production environments.
- Strong hands-on experience with Service Mesh technologies (e.g., Istio preferred, App Mesh, Linkerd).
- Strong understanding of Observability tooling and Golden Signals concepts.
- Strong understanding of Incident management concepts and on-call operations.
- Strong understanding of Infrastructure as Code (e.g., Terraform).
- Strong understanding of Cloud-Native containerized micro-services architecture.
- Strong collaboration and communication skills.
Nice to Have
- AWS experience.
- Istio experience.
Technical Stack
- AWS, Azure
- Kubernetes
- Istio, App Mesh, Linkerd
- Terraform
Team & Environment
You will be a senior individual contributor within the SRE Tribe.
Work Mode
This is a remote position.
Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.




