The Site Reliability Engineer drives improvements in system reliability and operational efficiency by reducing manual workloads and strengthening incident response. This role involves automation, observability, disaster recovery planning, and collaboration across teams. The engineer participates in on-call rotations and contributes to post-incident reviews and production readiness assessments.

Responsibilities

Develops monitoring queries and defines service level objectives to track system performance
Assists senior engineers during major system incidents
Contributes to root cause analyses and post-mortem reports
Takes part in disaster recovery exercises to validate system resilience
Implements and runs automated code deployments in production environments
Supports the creation of infrastructure diagrams and deployment workflows
Tests system availability, reliability, and recovery capabilities in non-production settings
Documents performance benchmarks to inform production readiness decisions
Applies advanced DevOps practices across monitoring, networking, cloud storage, containerization, CI/CD, and security
Participates in on-call rotations to support incident recovery for production systems
Conducts failover testing across geographic regions
Automates system recovery using Infrastructure-as-Code and configuration management tools
Prepares and presents root cause analyses with executive summaries, timelines, impact assessments, and action plans
Leads modeling exercises and designs workflows triggered by service level breaches
Writes advanced automation scripts for incident response, including failover and rollback procedures
Analyzes operational toil through ticket trends and recommends process improvements
Creates reusable observability dashboards and configurations for team-wide use
Influences the definition of service level objectives and error budgets
Supports cross-functional teams in migrating applications to standardized platforms
Provides guidance on implementing new platform features and standardized tooling

Requirements

Proven hands-on experience in core SRE practices
Understanding of distributed systems and their interdependencies
Ability to automate recovery processes to maintain service levels
Willingness to participate in on-call rotations and support incident response
Experience driving process improvements
Ability to provide informal guidance to less experienced team members
Advanced proficiency with DevOps tools and methods, including monitoring, virtual networks, cloud storage, containers, orchestration, CI/CD, configuration management, and cloud security
Experience working with Azure and AKS
Proficiency in Terraform for infrastructure automation
Familiarity with GitHub and CI/CD pipelines
Java debugging capabilities
Experience using Helm charts for deployment
Working knowledge of JFrog for artifact management
Experience with the ELK stack for log management
Familiarity with Akeyless or Vault for secrets management

Tech Stack

Azure, AKS, Terraform, GitHub, CI/CD, Java, Helm, JFrog, ELK, Akeyless, Vault

Benefits

Eligibility for an annual incentive bonus
Location-specific benefits, including well-being and happiness programs
Access to additional benefits through the RELX careers portal

Compensation

U.S. National Base Pay Range: $95,300 - $158,800. Geographic differentials may apply in some locations to better reflect local market rates. Annual incentive bonus

Team

Cross-functional team responsible for application migration to standard platforms and implementation of Paved Road features and services

Commitment to employee well-being and happiness
Focus on system reliability and reduction of manual operational work
Collaborative environment that encourages knowledge sharing
Support for professional growth and training

Additional Information

This role includes on-call responsibilities
Participation in disaster recovery testing is required
Geographic pay adjustments may apply based on location
The position is eligible for an annual incentive bonus
Reasonable accommodations are available for applicants with disabilities
No requests for money or banking information will be made during the hiring process — candidates should be alert to potential scams
Candidate data is subject to a privacy policy

LexisNexis Risk Solutions is hiring a Site Reliability Engineer

Responsibilities

Requirements

Tech Stack

Benefits

Compensation

Team

Additional Information

Similar Jobs

Senior DevOps Engineer

Senior DevOps Engineer

Senior DevOps Engineer

Senior DevOps Engineer

Senior DevOps Engineer

Senior Infrastructure Engineer

Related Articles

Network Configuration as Code: CI/CD for Automation | NVIDIA

CI/CD Testing Tools: 23 Best Options for 2026

Remote SRE Jobs: Vanguard’s Cloud Transformation