JPMorgan Chase & Co. is looking for a Lead Site Reliability Engineer to assume a critical leadership role within our Enterprise Technology, Infrastructure Platforms team. You will define the future of our technology, champion SRE culture, and advise on technical and business issues across multiple domains.
What You'll Do
- Demonstrate and champion site reliability culture and practices, exerting technical influence throughout your team.
- Lead initiatives to improve application and platform reliability using data-driven analytics.
- Collaborate to define service level indicators and establish service level objectives and error budgets.
- Demonstrate high-level technical expertise and proactively solve technology bottlenecks.
- Act as the primary point of contact during major incidents, resolving issues quickly to avoid financial loss.
- Document and share knowledge within the organization via forums and communities of practice.
- Lead and conduct resiliency design reviews.
- Break down complex problems into manageable pieces of work for other engineers.
- Act as the technical lead for medium to large-sized products.
- Provide advice and mentoring to other engineers.
What We're Looking For
- Formal training or certification in SRE concepts and 5+ years of applied experience.
- Deep proficiency in reliability, scalability, performance, security, architecture, and toil reduction.
- Fluency in at least one programming language or framework, such as Python, Java (Spring Boot), or .NET.
- Deep knowledge of software applications and technical processes with emerging depth in specific disciplines.
- Proficiency in observability tools like Grafana, Dynatrace, Prometheus, Datadog, or Splunk.
- Proficiency in CI/CD and infrastructure-as-code tools such as Jenkins, GitLab, and Terraform.
- Experience with container and container orchestration technologies like ECS, Kubernetes, and Docker.
- Experience troubleshooting common networking technologies and issues.
- Ability to identify and solve problems related to complex data structures and algorithms.
Nice to Have
- Drive to self-educate and evaluate new technology.
- Ability to teach new programming languages to team members.
- Ability to build relationships and collaborate across different levels and stakeholder groups.
Technical Stack
- Languages & Frameworks: Python, Java (Spring Boot), .NET
- Observability: Grafana, Dynatrace, Prometheus, Datadog, Splunk
- CI/CD & IaC: Jenkins, GitLab, Terraform
- Containers & Orchestration: ECS, Kubernetes, Docker
Benefits & Compensation
- Comprehensive health care coverage
- On-site health and wellness centers
- Retirement savings plan
- Backup childcare
- Tuition reimbursement
- Mental health support
- Financial coaching
- Base salary determined by role, experience, skill set, and location
We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law.





