JPMorgan Chase & Co. is seeking a Lead Site Reliability Engineer to join our Consumer and Community Banking division. In this leadership role, you will be responsible for advising on technical and business issues, leading resiliency design reviews, and serving as a technical lead for medium to large-sized products.
What You'll Do
- Demonstrate and champion site reliability culture and practices, exerting technical influence across your team.
- Lead initiatives to improve application and platform reliability using data-driven analytics to enhance service levels.
- Collaborate with team members to identify service level indicators and establish objectives and error budgets with stakeholders.
- Demonstrate deep technical expertise in one or more domains and proactively solve technology-related bottlenecks.
- Act as the primary point of contact during major incidents for your application, identifying and resolving issues swiftly to mitigate financial impact.
- Document and share knowledge within the organization through internal forums and communities of practice.
What We're Looking For
- Formal training or certification in software engineering concepts and 5 years of applied experience.
- Deep proficiency in reliability, scalability, performance, security, enterprise architecture, toil reduction, and other SRE best practices.
- Fluency in at least one programming language such as Python, Java Spring Boot, or .Net.
- Deep knowledge of software applications and technical processes with emerging depth in one or more technical disciplines.
- Proficiency in observability, including monitoring, SLO alerting, and telemetry collection using tools like Grafana, Dynatrace, Prometheus, Datadog, or Splunk.
- Proficiency in continuous integration and continuous delivery tools such as Jenkins, GitLab, or Terraform.
- Experience with container and container orchestration technologies like ECS, Kubernetes, or Docker.
- Experience troubleshooting common networking technologies and issues.
Nice to Have
- Familiarity with observability tools like Prometheus, Grafana, or Open Telemetry.
- Familiarity with AWS cloud technologies, including deployment, management, and optimization.
- Ability to identify and solve problems related to complex data structures and algorithms.
- Ability to collaborate effectively across different levels and stakeholder groups.
- Proactive recognition of roadblocks and a demonstrated interest in learning technology that facilitates innovation.
- Ability to identify new technologies and relevant solutions to meet design constraints.
- Ability to initiate and implement ideas to solve business problems.
Technical Stack
- Languages: Python, Java Spring Boot, .Net
- Observability: Grafana, Dynatrace, Prometheus, Datadog, Splunk
- CI/CD & IaC: Jenkins, GitLab, Terraform
- Containers & Orchestration: ECS, Kubernetes, Docker
- Cloud: AWS
JPMorgan Chase & Co. is an equal opportunity employer.





