About the Role
You will design, implement, and maintain robust CI/CD pipelines, manage cloud infrastructure, and ensure system reliability and security for an AI-powered platform.
Responsibilities
- Design and manage cloud infrastructure using modern IaC practices
- Develop, optimize, and maintain automated CI/CD workflows
- Monitor system performance and implement proactive alerting
- Ensure high availability and fault tolerance across environments
- Collaborate with development teams to improve deployment reliability
- Implement and enforce infrastructure security standards
- Troubleshoot and resolve infrastructure and deployment issues
- Support containerization and orchestration using Kubernetes
- Manage configuration and version control for infrastructure
- Integrate observability tools for logs, metrics, and traces
- Participate in incident response and on-call rotations
- Drive improvements in system scalability and efficiency
- Maintain documentation for systems and processes
- Evaluate and adopt new DevOps tools and technologies
- Work cross-functionally to align infrastructure with product goals
Nice to Have
- Experience with large-scale distributed systems
- Background in AI or machine learning infrastructure
- Familiarity with service mesh technologies like Istio
- Knowledge of compliance and audit requirements for cloud systems
- Certifications in cloud or DevOps technologies
Compensation
Competitive salary based on experience
Work Arrangement
Fully remote, working from Ukraine
Team
Collaborative engineering team focused on AI-driven solutions
Why This Role Matters
This position plays a critical role in ensuring the reliability, scalability, and security of core AI systems. Your work directly impacts product delivery speed and platform stability.
Tech Stack
AWS, Kubernetes, Terraform, GitLab CI, Docker, Prometheus, Grafana, Python, Helm


