We're looking for a skilled engineer to maintain and improve two complex infrastructure environments: a custom multi-cloud PaaS and a large-scale AWS serverless platform. You'll lead incident response, drive infrastructure modernization, enhance tooling, and support internal and external teams in a remote-first, collaborative engineering culture.
Responsibilities
- Maintain system stability through proactive patching, performance tuning, incident response, and ongoing infrastructure health monitoring
- Lead long-term initiatives including infrastructure migrations, security enhancements, monitoring improvements, and development of internal tools
- Design and evolve infrastructure architecture to ensure it remains secure, modern, and easy for developers to use
- Provide technical guidance and support to internal teams when complex issues arise
- Collaborate with external development teams to resolve operational challenges and improve workflows
Requirements
- Fluent in written and spoken English for clear communication across teams
- Extensive experience with Docker, AWS, and building cloud-native systems
- Proficient in programming, particularly with Python or TypeScript, though experience with Go, Java, or similar languages is also valuable
- Skilled in Infrastructure as Code and configuration management, especially using Ansible and AWS CDK
- Strong understanding of core system concepts including Linux, networking, TCP/IP, and load balancing
- Proactive and dependable with the ability to work independently and take ownership of systems
- Comfortable supporting clients and handling support responsibilities with professionalism
Nice to Have
- Practical experience administering and tuning Linux systems
- Familiarity with Django or other Python-based web frameworks
- Hands-on work with AWS CDK using TypeScript
- Operational experience with PostgreSQL, Redis, RabbitMQ, or Elasticsearch
- Exposure to any of the technologies in our stack is a strong plus
Tech Stack
Docker, Django, AWS, EC2, S3, RDS, OpenSearch, Ansible, Python, Datadog, Redis, Elasticsearch, Nessus, Gatsby, Storyblok, Lambda, AWS CDK, TypeScript, GitHub Actions, Cloudflare, DynamoDB, API Gateway
Work Arrangement
Remote-first with team members across Europe
Team
Team of 18 engineers operating in a remote-first, flat structure with minimal hierarchy
- Curiosity
- Ownership
- Clarity
- Collaboration
- Engineer-led decision making
- Flexibility across tech stacks
- Balancing immediate fixes with long-term improvements
Additional Information
- Fully remote organization with no central office
- The Divio Cloud team works in 2-week sprints with quarterly OKRs
- The enterprise AWS project follows a 3-week Scrum cycle including planning and retrospectives
- On-call and support duties rotate weekly among team members
- Team members must be able to adapt quickly between different technical contexts and solve practical engineering problems
- The organization emphasizes values such as curiosity, ownership, and clarity
- Decisions are made collaboratively with input from all engineers, reflecting a flat hierarchy

