What You'll Do
Own the end-to-end reliability, scalability, and performance of production systems that operate with low latency and high availability. You'll design and manage cloud infrastructure on AWS, focusing on resilience and operational excellence across all environments. Collaborate with development teams to refine deployment workflows and enhance system observability through monitoring and alerting solutions.
Build and maintain infrastructure as code using Terraform, ensuring consistent and repeatable deployments. Implement GitOps practices with ArgoCD to streamline Kubernetes operations and manage deployments across clusters. Improve CI/CD pipelines using GitHub Actions to support rapid, safe releases.
Configure and maintain monitoring stacks with Prometheus, Grafana, and NewRelic to detect and resolve issues proactively. Troubleshoot infrastructure bottlenecks and lead incident response efforts, including root cause analysis and documentation. Manage networking components such as firewalls, load balancers, and VPNs to ensure secure connectivity.
Requirements
- Demonstrated experience with AWS infrastructure and services
- Proficiency in Kubernetes, Docker, Helm, and EKS for container orchestration
- Solid understanding of Linux internals, networking, and routing protocols
- Hands-on experience building highly available systems using infrastructure as code, specifically Terraform
- Experience with GitOps workflows, particularly ArgoCD
- At least two years in DevOps roles supporting production systems
- Minimum one year using Terraform for infrastructure management
- Proven experience with monitoring tools including Prometheus, Grafana, or NewRelic
- Familiarity with Agile methodologies and development cycles
- Ability to secure production environments and implement disaster recovery strategies
- Skilled in maintaining and optimizing CI/CD pipelines
Preferred Qualifications
- Background in fast-growing startup environments
- AWS certifications are a plus
Benefits
This role is fully remote, open to candidates in Egypt, Uzbekistan, and Pakistan. Work hours are aligned with Saudi Arabian Time (9:00 AM – 6:00 PM, Sunday to Thursday), allowing for structured collaboration across distributed teams. You’ll have the opportunity to shape infrastructure strategy and contribute to a growing technical ecosystem without being tied to a physical office location.


