Dubai, Dubai, United Arab Emirates Hybrid Employment

Sana Commerce is hiring a Team Lead Site Reliability Engineer

About the Role

Sana Commerce is looking for a Team Lead Site Reliability Engineer to build and manage our global SRE team. You will be responsible for managing and monitoring all installed systems, environments, and infrastructure, resolving issues, and ensuring high reliability for our platform.

What You'll Do

  • Lead the SRE team, setting objectives and guiding the team towards achieving high reliability while balancing cost and performance SLAs.
  • Collaborate with platform & product engineering teams to embed reliability and operational best practices into the software development lifecycle.
  • Develop and implement SRE policies and practices, including service level objectives (SLOs), service level indicators (SLIs), and error budgets.
  • Drive automation across operations to reduce toil, improve system performance, and ensure scalability.
  • Oversee incident management, post-mortem analyses, and root cause investigations to prevent future outages and enhance system reliability.
  • Facilitate capacity planning and scalability exercises to manage growth and ensure efficient resource use.
  • Facilitate disaster recovery plans & testing to ensure business continuity for our customers’ webstores.
  • Encourage a culture of continuous improvement by mentoring team members and fostering innovation.
  • Stay up to date with the latest trends and technologies in SRE and advocate for their adoption where appropriate.

What We're Looking For

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field.
  • At least 5 years of experience in Site Reliability Engineering.
  • 2+ years in a leadership or management role.
  • Proven, hands-on expertise in Microsoft Azure, including designing, deploying, and managing cloud-native infrastructure.
  • Experience with container orchestration (e.g., Kubernetes).
  • A deep understanding of network protocols, load balancing, and high availability configurations.
  • Experience in applying software development solutions to SRE and familiarity with programming languages such as (preferably) PowerShell and C# or else Python, Go, Java etc.
  • Experience with automation tools, infrastructure as code (e.g., Terraform, Ansible).
  • Proficiency in monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack) and in implementing comprehensive monitoring solutions.
  • Excellent problem-solving skills, with a proven ability to tackle complex issues under pressure.
  • Outstanding leadership qualities, with a track record of mentoring and developing high-performing teams.
  • Exceptional communication and collaboration skills, capable of working effectively with cross-functional teams.

Nice to Have

  • Dynatrace knowledge is a plus.

Technical Stack

  • Microsoft Azure, Kubernetes, PowerShell, C#, Python, Go, Java, Terraform, Ansible, Prometheus, Grafana, ELK Stack, Dynatrace

Team & Environment

You will build and manage our global SRE team.

Benefits & Compensation

  • Up to 5 weeks “work from anywhere” per year.
  • A global and customized onboarding program.
  • A hybrid working model – 3 days from the office, 2 days from home.
  • Weekly company lunch.

Work Mode

This role follows a hybrid work model.

Sana Commerce is an equal opportunity employer.

Required Skills
Microsoft AzureKubernetesPowerShellC#PythonGoJavaTerraformAnsiblePrometheusSite Reliability EngineeringCloud InfrastructureContainer Orchestration
Invoicing holding you back?

Focus on work, not paperwork

Stop worrying about invoicing, taxes, and compliance. Glopay handles the business setup, you handle the client work. Get paid faster and look professional.

Auto-generated compliant invoices
Built-in expense management
Income reports for tax season
95% of earnings stay with you
Try Glopay free
No credit card needed
About company
Sana Commerce

Sana Commerce is an e-commerce platform designed to help manufacturers, distributors and wholesalers succeed by fostering lasting relationships with customers. Founded in 2007, they are a fast-growing SaaS company that allows employees to take ownership of their careers.

Visit website
Job Details
Department Engineering
Category infrastructure
Posted 14 days ago