Delhi or Gurugram Hybrid Employment

SITA Switzerland Sarl is hiring a Site Reliability Engineer

About the Role

SITA Switzerland Sarl is looking for a Site Reliability Engineer to provide proactive support for our products, ensuring high performance and driving continuous improvement. You will focus on identifying incident root causes, implementing solutions to improve stability, and managing automation and integration to enhance efficiency between development and operations.

What You'll Do

  • Define, build, and maintain support systems to ensure high availability and performance.
  • Handle complex cases and perform incident response with root cause analysis for critical system failures.
  • Implement automation for system provisioning, self-healing, auto-recovery, deployment, and monitoring.
  • Monitor system performance and establish Service-Level Indicators and Service-Level Objectives.
  • Collaborate with Development and Operations to integrate reliability best practices, including zero-downtime architecture.
  • Proactively identify and remediate performance issues.
  • Work with Product teams for new product productization as a technical expert.
  • Coordinate with internal and external stakeholders to improve service performance and ensure high availability.
  • Conduct thorough problem investigations and root cause analyses to diagnose recurring incidents.
  • Define, build, and maintain an event catalog specifying active events, thresholds, and remediation actions.
  • Develop event response protocols and provide training for efficient incident handling.
  • Collaborate with Customer Success Managers to implement initiatives that enhance customer satisfaction.
  • Prepare reports, documentation, and communication materials covering customer metrics and product changes.
  • Identify and implement improvements in internal processes and workflows.
  • Contribute to knowledge management resources such as FAQs and training materials.
  • Implement data governance policies and monitor data quality, consistency, and compliance.

What We're Looking For

  • Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field.
  • 5+ years of experience in IT operations, service management, or infrastructure management, including roles such as Site Reliability Engineer, Problem Manager, or DevOps Manager.
  • Proven experience managing high-availability systems and ensuring operational reliability.
  • Extensive experience in root cause analysis, incident management, and developing permanent solutions for recurring service disruptions.
  • Hands-on experience with CI/CD pipelines, automation, system performance monitoring, and infrastructure as code.
  • Strong background in collaborating with cross-functional teams to improve operational processes and service delivery.
  • Experience managing deployments, conducting risk assessments, and optimizing event and problem management processes.
  • Familiarity with cloud technologies, containerization, scalable architectures, and zero-downtime deployment strategies.
  • Strong AKS & On prem K8s skills and experience.
  • Scripting skills in Ansible, Bash, or Python.
  • Experience with Terraform.
  • Azure or AWS experience.
  • Basic database skills.
  • Strong problem-solving skills and the ability to learn quickly.
  • A demonstrated SRE mindset.

Technical Stack

  • AKS
  • On prem Kubernetes
  • Ansible
  • Bash
  • Python
  • Terraform
  • Azure
  • AWS

Benefits & Compensation

  • Flex Week: Work from home up to 2 days/week (depending on your team's needs).
  • Flex Day: Make your workday suit your life and plans.
  • Flex-Location: Take up to 30 days a year to work from any location in the world.
  • Employee Wellbeing: Employee Assistance Program for you and your dependents 24/7, 365 days/year. Access to the Champion Health platform.
  • Professional Development: Access to LinkedIn Learning, Microsoft's Enterprise Skills Initiative, Airport Council International, Pluralsight, Harvard Business Publishing, Stanford, and other learning platforms.

Work Mode

This role operates on a hybrid work model.

SITA is an Equal Opportunity Employer. We value a diverse workforce. In support of our Employment Equity Program, we encourage women, aboriginal people, members of visible minorities, and/or persons with disabilities to apply and self-identify in the application process.

Required Skills
AKSKubernetesAnsibleBashPythonTerraformAzureAWSCI/CDInfrastructure as CodeIncident ManagementRoot Cause AnalysisSystem MonitoringAutomation
Looking for a remote dev community?

200+ professionals, 37 countries, one network

Working remotely doesn't mean working alone. Iglu connects you with developers, designers, and digital experts worldwide. Collaborate, learn, and grow together.

Global professional network
Knowledge sharing & collaboration
Regular community events
Cross-project opportunities
Join the community
37 countries represented
About company
SITA Switzerland Sarl

SITA provides technology and communication innovations that power the success of the global air travel industry. They are present in 95% of international airports, working closely with over 2,500 transportation and government clients.

Visit website
Job Details
Department Information Technology
Category infrastructure
Posted 14 days ago