About the Role
The role involves building and managing resilient infrastructure for data-heavy applications operating at scale, ensuring performance, reliability, and security across cloud environments.
Responsibilities
- Design and maintain scalable cloud infrastructure for high-traffic systems
- Automate deployment, monitoring, and recovery processes
- Optimize system performance and resource utilization
- Ensure reliability and uptime of production environments
- Collaborate with development teams to streamline CI/CD pipelines
- Implement and manage containerized workloads using Kubernetes
- Support secure handling of sensitive data across distributed systems
- Troubleshoot complex infrastructure and network issues
- Manage configuration and version control for infrastructure code
- Integrate observability tools for logs, metrics, and tracing
- Enforce compliance with security and operational standards
- Respond to incidents and lead post-mortem analyses
- Scale infrastructure in response to growing data demands
- Evaluate and adopt new cloud technologies and services
- Document architecture decisions and operational procedures
- Participate in on-call rotation for critical systems
- Ensure disaster recovery and backup strategies are effective
- Work with global teams across asynchronous schedules
- Maintain cost-efficiency in cloud resource usage
- Support migration of legacy systems to modern platforms
Nice to Have
- Experience with real-time data processing systems
- Contributions to open-source infrastructure projects
- Certifications in cloud or DevOps domains
- Background in fintech or data-intensive industries
- Knowledge of gRPC and API gateway patterns
- Experience with zero-downtime deployment strategies
- Familiarity with chaos engineering practices
- Exposure to edge computing or CDN architectures
Compensation
Competitive salary, negotiable based on experience
Work Arrangement
Remote within EU, flexible hours with overlap for European time zones
Team
Cross-functional engineering team focused on scalable infrastructure and data systems
Why This Role Stands Out
- Work on systems processing terabytes of data daily with strict SLAs
- Opportunity to influence technical direction in a growing infrastructure team
- Fully remote setup with emphasis on work-life balance
Tech Stack Highlights
- Primary cloud: Google Cloud Platform
- Kubernetes clusters managed via GKE
- CI/CD powered by GitLab and ArgoCD
- Monitoring stack: Prometheus, Loki, Tempo, Grafana
- Infrastructure as code using Terraform and Helm
Not available