Jobgether is hiring a Site Reliability Engineer to play a key role in ensuring the stability, performance, and scalability of complex cloud systems that power high-traffic digital platforms. You will collaborate with cross-functional engineering teams to design and operate resilient infrastructure, enhance observability, and streamline automation.
What You'll Do
- Design, operate, and optimize AWS-based infrastructure using Terraform, Helm, and Kubernetes to ensure scalability and high availability.
- Strengthen observability through effective monitoring, logging, and alerting systems to improve incident detection and resolution times.
- Automate key workflows and reduce manual tasks to enhance engineering productivity and operational consistency.
- Partner with software and cloud engineering teams to improve the resilience and performance of services under heavy workloads.
- Participate in building the next-generation architecture supporting regional expansion and data residency requirements.
- Contribute to on-call rotations, manage incidents calmly, and document learnings to prevent recurrence.
- Experiment with and adopt AI tools to streamline workflows and increase efficiency in reliability operations.
What We're Looking For
- Minimum of 4 years of experience in cloud engineering, systems administration, or site reliability engineering.
- Strong proficiency with AWS (or other major cloud platforms such as GCP or Azure) and infrastructure-as-code tools.
- Hands-on experience with Kubernetes, serverless technologies, and automation frameworks.
- Proficiency in a programming language such as Python, Go, or TypeScript.
- Solid understanding of observability practices and tools like Grafana, Datadog, Prometheus, and Sentry.
- Excellent analytical and problem-solving skills with a passion for improving performance and reliability.
- Strong communication and documentation abilities to share knowledge across teams.
- Ability to work independently in a remote-first environment and collaborate effectively across regions.
Nice to Have
- Eagerness to explore and apply AI technologies to improve operational processes.
Technical Stack
- AWS, Terraform, Helm, Kubernetes
- Python, Go, TypeScript
- Grafana, Datadog, Prometheus, Sentry
Team & Environment
You will collaborate with cross-functional engineering teams.
Benefits & Compensation
- Competitive salary packages, with equity and performance-based bonuses.
- Comprehensive healthcare and wellness benefits.
- Fully remote work model offering flexibility and autonomy.
- Professional development opportunities and access to leading-edge tools and technologies.
Work Mode
This is a fully remote position open to candidates within the EMEA region.
Jobgether values diversity and is committed to creating an inclusive, collaborative, and global work environment.



