California, United States USD 95,300 - 158,800 Yearly

LexisNexis Risk Solutions is hiring a Site Reliability Engineer

The Site Reliability Engineer drives improvements in system reliability and operational efficiency by reducing manual workloads and strengthening incident response. This role involves automation, observability, disaster recovery planning, and collaboration across teams. The engineer participates in on-call rotations and contributes to post-incident reviews and production readiness assessments.

Responsibilities

  • Develops monitoring queries and defines service level objectives to track system performance
  • Assists senior engineers during major system incidents
  • Contributes to root cause analyses and post-mortem reports
  • Takes part in disaster recovery exercises to validate system resilience
  • Implements and runs automated code deployments in production environments
  • Supports the creation of infrastructure diagrams and deployment workflows
  • Tests system availability, reliability, and recovery capabilities in non-production settings
  • Documents performance benchmarks to inform production readiness decisions
  • Applies advanced DevOps practices across monitoring, networking, cloud storage, containerization, CI/CD, and security
  • Participates in on-call rotations to support incident recovery for production systems
  • Conducts failover testing across geographic regions
  • Automates system recovery using Infrastructure-as-Code and configuration management tools
  • Prepares and presents root cause analyses with executive summaries, timelines, impact assessments, and action plans
  • Leads modeling exercises and designs workflows triggered by service level breaches
  • Writes advanced automation scripts for incident response, including failover and rollback procedures
  • Analyzes operational toil through ticket trends and recommends process improvements
  • Creates reusable observability dashboards and configurations for team-wide use
  • Influences the definition of service level objectives and error budgets
  • Supports cross-functional teams in migrating applications to standardized platforms
  • Provides guidance on implementing new platform features and standardized tooling

Requirements

  • Proven hands-on experience in core SRE practices
  • Understanding of distributed systems and their interdependencies
  • Ability to automate recovery processes to maintain service levels
  • Willingness to participate in on-call rotations and support incident response
  • Experience driving process improvements
  • Ability to provide informal guidance to less experienced team members
  • Advanced proficiency with DevOps tools and methods, including monitoring, virtual networks, cloud storage, containers, orchestration, CI/CD, configuration management, and cloud security
  • Experience working with Azure and AKS
  • Proficiency in Terraform for infrastructure automation
  • Familiarity with GitHub and CI/CD pipelines
  • Java debugging capabilities
  • Experience using Helm charts for deployment
  • Working knowledge of JFrog for artifact management
  • Experience with the ELK stack for log management
  • Familiarity with Akeyless or Vault for secrets management

Tech Stack

Azure, AKS, Terraform, GitHub, CI/CD, Java, Helm, JFrog, ELK, Akeyless, Vault

Benefits

  • Eligibility for an annual incentive bonus
  • Location-specific benefits, including well-being and happiness programs
  • Access to additional benefits through the RELX careers portal

Compensation

U.S. National Base Pay Range: $95,300 - $158,800. Geographic differentials may apply in some locations to better reflect local market rates. Annual incentive bonus

Team

Cross-functional team responsible for application migration to standard platforms and implementation of Paved Road features and services

  • Commitment to employee well-being and happiness
  • Focus on system reliability and reduction of manual operational work
  • Collaborative environment that encourages knowledge sharing
  • Support for professional growth and training

Additional Information

  • This role includes on-call responsibilities
  • Participation in disaster recovery testing is required
  • Geographic pay adjustments may apply based on location
  • The position is eligible for an annual incentive bonus
  • Reasonable accommodations are available for applicants with disabilities
  • No requests for money or banking information will be made during the hiring process — candidates should be alert to potential scams
  • Candidate data is subject to a privacy policy
Required Skills
AzureAKSTerraformGitHubCI/CDJavaHelmJFrogELKAkeylessVault AzureAKSTerraformGitHubCI/CDJavaHelmJFrogELKAkeylessVault
About company
LexisNexis Risk Solutions
LexisNexis Risk Solutions is the essential partner across Financial Crime Compliance, Fraud & Identity and Payments. Within their Business Services vertical, they offer solutions focused on helping businesses drive higher revenue growth, maximize operational efficiencies, and improve customer experience. Their solutions help customers solve problems in Anti-Money Laundering/Counter Terrorist Financing, Identity Authentication & Verification, Fraud and Credit Risk mitigation, and Customer Data Management.
All jobs at LexisNexis Risk Solutions Visit website
Job Details
Department Software Development
Category infrastructure
Posted 2 months ago