The Site Reliability Engineer drives improvements in system reliability and operational efficiency by reducing manual workloads and strengthening incident response. This role involves automation, observability, disaster recovery planning, and collaboration across teams. The engineer participates in on-call rotations and contributes to post-incident reviews and production readiness assessments.
Responsibilities
- Develops monitoring queries and defines service level objectives to track system performance
- Assists senior engineers during major system incidents
- Contributes to root cause analyses and post-mortem reports
- Takes part in disaster recovery exercises to validate system resilience
- Implements and runs automated code deployments in production environments
- Supports the creation of infrastructure diagrams and deployment workflows
- Tests system availability, reliability, and recovery capabilities in non-production settings
- Documents performance benchmarks to inform production readiness decisions
- Applies advanced DevOps practices across monitoring, networking, cloud storage, containerization, CI/CD, and security
- Participates in on-call rotations to support incident recovery for production systems
- Conducts failover testing across geographic regions
- Automates system recovery using Infrastructure-as-Code and configuration management tools
- Prepares and presents root cause analyses with executive summaries, timelines, impact assessments, and action plans
- Leads modeling exercises and designs workflows triggered by service level breaches
- Writes advanced automation scripts for incident response, including failover and rollback procedures
- Analyzes operational toil through ticket trends and recommends process improvements
- Creates reusable observability dashboards and configurations for team-wide use
- Influences the definition of service level objectives and error budgets
- Supports cross-functional teams in migrating applications to standardized platforms
- Provides guidance on implementing new platform features and standardized tooling
Requirements
- Proven hands-on experience in core SRE practices
- Understanding of distributed systems and their interdependencies
- Ability to automate recovery processes to maintain service levels
- Willingness to participate in on-call rotations and support incident response
- Experience driving process improvements
- Ability to provide informal guidance to less experienced team members
- Advanced proficiency with DevOps tools and methods, including monitoring, virtual networks, cloud storage, containers, orchestration, CI/CD, configuration management, and cloud security
- Experience working with Azure and AKS
- Proficiency in Terraform for infrastructure automation
- Familiarity with GitHub and CI/CD pipelines
- Ability to debug Java applications
- Experience using Helm charts for deployment
- Working knowledge of JFrog for artifact management
- Experience with the ELK stack for log management
- Familiarity with Akeyless or Vault for secrets management
Tech Stack
Azure, AKS, Terraform, GitHub, CI/CD, Java, Helm, JFrog, ELK, Akeyless, Vault
Benefits
- Eligibility for an annual incentive bonus
- Location-specific benefits, including well-being and happiness programs
- Access to additional benefits through the RELX careers portal
Compensation
U.S. National Base Pay Range: $95,300 - $158,800. Geographic differentials may apply in some locations to better reflect local market rates. This role is eligible for an annual incentive bonus.
Team
Cross-functional team responsible for application migration to standard platforms and implementation of Paved Road features and services
- Commitment to employee well-being and happiness
- Focus on system reliability and reduction of manual operational work
- Collaborative environment that encourages knowledge sharing
- Support for professional growth and training
Additional Information
- This role includes on-call responsibilities
- Participation in disaster recovery testing is required
- Geographic pay adjustments may apply based on location
- The position is eligible for an annual incentive bonus
- Reasonable accommodations are available for applicants with disabilities
- No requests for money or banking information will be made during the hiring process; candidates should be alert to potential scams
- Candidate data is subject to a privacy policy


