West Monroe is looking for a Senior Site Reliability Engineer to join the Lamina product team. You will be responsible for designing, building, operating, and securing the platform that powers the product, owning its reliability, availability, and operational excellence in AWS.
What You'll Do
- Own the reliability, operability, and security of multiple platform services or a critical product domain in production across regions and environments.
- Design and implement reusable, production-ready platform architecture and automation, including Terraform modules, CI/CD, and container platforms.
- Design, build, and operate resilient, secure infrastructure and platform automation in Amazon Web Services (AWS).
- Implement and maintain Infrastructure as Code (IaC) using Terraform, including modules, state management, testing, and CI/CD.
- Instrument and improve observability across metrics, logging, tracing, dashboards, and alerting.
- Automate operational tasks and repetitive processes through scripting, tooling, and runbooks to reduce toil.
- Partner closely with Product Security to implement secure-by-design practices, vulnerability remediation, secrets management, and IAM best practices.
- Collaborate with engineering teams on reliability reviews, production readiness checks, and resilience testing.
- Drive tradeoffs between reliability, cost, performance, and security, and articulate their impact to stakeholders.
- Advocate for automation, standardization, and continuous improvement across engineering operations.
- Efficiently manage multiple priorities in a fast-paced, team-oriented Agile environment.
- Mentor junior team members and provide design and release reviews.
What We're Looking For
- 5+ years of experience in site reliability, systems engineering, platform engineering, or a related infrastructure role.
- Demonstrated experience designing, deploying, and operating production systems at scale in AWS.
- Deep hands-on experience with Terraform, including modules, remote state, and workspaces.
- Strong Linux system administration, networking fundamentals, and storage concepts.
- Experience with containerization and orchestration using Docker and building CI/CD pipelines.
- Strong knowledge of monitoring, observability, logging, distributed tracing, and alerting best practices.
- Experience owning incident response, postmortems, SLOs, and operational playbooks.
- Practical knowledge of collaborating on security—threat modeling, secure configuration, vulnerability remediation, and secrets management.
- Excellent collaboration, communication, and mentoring skills.
- Excellent communication ability with team members and clients in English.
Nice to Have
- Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience.
Technical Stack
- AWS: EC2, S3, RDS, VPC, IAM, Lambda, EKS
- Infrastructure: Terraform, Docker
- Observability: CloudWatch, Datadog, Prometheus, Grafana, OpenTelemetry, ELK, Splunk, Jaeger/Zipkin
- CI/CD & Tools: Jenkins, GitHub Actions, Azure DevOps, Terraform Cloud/Enterprise
- Languages & Scripting: Python, Go, Bash
- Security & Operations: AWS Secrets Manager, HashiCorp Vault, PagerDuty, ServiceNow/Jira, Runbook tooling
Team & Environment
You will partner closely with Product, Engineering, Product Security, and the Technical Architect on the Lamina product team.
West Monroe is an Equal Employment Opportunity Employer. We believe in treating each employee and applicant fairly and with dignity, basing decisions on merit, experience, and potential, without regard to race, color, national origin, sex, sexual orientation, gender identity, marital status, age, religion, disability, veteran status, or any other characteristic prohibited by law.



