Portugal Remote (Global) Full-time

EX Squared is hiring a Site Reliability Engineer (Remote - EMEA)

About the Role

Jobgether is hiring a Site Reliability Engineer to play a key role in ensuring the stability, performance, and scalability of complex cloud systems that power high-traffic digital platforms. You will collaborate with cross-functional engineering teams to design and operate resilient infrastructure, enhance observability, and streamline automation.

What You'll Do

  • Design, operate, and optimize AWS-based infrastructure using Terraform, Helm, and Kubernetes to ensure scalability and high availability.
  • Strengthen observability through effective monitoring, logging, and alerting systems to improve incident detection and resolution times.
  • Automate key workflows and reduce manual tasks to enhance engineering productivity and operational consistency.
  • Partner with software and cloud engineering teams to improve the resilience and performance of services under heavy workloads.
  • Participate in building the next-generation architecture supporting regional expansion and data residency requirements.
  • Contribute to on-call rotations, manage incidents calmly, and document learnings to prevent recurrence.
  • Experiment with and adopt AI tools to streamline workflows and increase efficiency in reliability operations.

What We're Looking For

  • Minimum of 4 years of experience in cloud engineering, systems administration, or site reliability engineering.
  • Strong proficiency with AWS (or other major cloud platforms such as GCP or Azure) and infrastructure-as-code tools.
  • Hands-on experience with Kubernetes, serverless technologies, and automation frameworks.
  • Proficiency in a programming language such as Python, Go, or TypeScript.
  • Solid understanding of observability practices and tools like Grafana, Datadog, Prometheus, and Sentry.
  • Excellent analytical and problem-solving skills with a passion for improving performance and reliability.
  • Strong communication and documentation abilities to share knowledge across teams.
  • Ability to work independently in a remote-first environment and collaborate effectively across regions.

Nice to Have

  • Eagerness to explore and apply AI technologies to improve operational processes.

Technical Stack

  • AWS, Terraform, Helm, Kubernetes
  • Python, Go, TypeScript
  • Grafana, Datadog, Prometheus, Sentry

Team & Environment

You will collaborate with cross-functional engineering teams.

Benefits & Compensation

  • Competitive salary packages, with equity and performance-based bonuses.
  • Comprehensive healthcare and wellness benefits.
  • Fully remote work model offering flexibility and autonomy.
  • Professional development opportunities and access to leading-edge tools and technologies.

Work Mode

This is a fully remote position open to candidates within the EMEA region.

Jobgether values diversity and is committed to creating an inclusive, collaborative, and global work environment.

Required Skills
AWSTerraformKubernetesHelmPythonGoTypeScriptGrafanaDatadogPrometheusSite Reliability EngineeringInfrastructure as CodeMonitoringObservabilityCloud Architecture
Ready to relocate and code from paradise?

Thailand or Vietnam — your office, your rules

Iglu offers relocation to Bangkok, Chiang Mai, Ho Chi Minh City, or Hong Kong. Full employment, legal setup, and a community of 200+ digital professionals.

Relocation to 5 countries
Full legal work setup
Developer community access
Work-life balance culture
Explore locations
Relocation support included
About company
EX Squared

Technology company focused on IT and software solutions

Visit website
Job Details
Category infrastructure
Posted 5 months ago