About the Role
The ideal candidate will help maintain and scale cloud systems, improve CI/CD workflows, and ensure high availability of services through automation and monitoring.
Responsibilities
- Manage and optimize cloud infrastructure across multiple regions
- Design and maintain CI/CD pipelines for rapid and reliable deployments
- Implement infrastructure as code using modern configuration tools
- Monitor system performance and respond to incidents promptly
- Collaborate with development teams to streamline deployment processes
- Ensure system reliability, scalability, and security
- Troubleshoot and resolve infrastructure-related issues
- Automate routine operational tasks to improve efficiency
- Maintain documentation for systems and procedures
- Support compliance with security and operational standards
- Participate in on-call rotations for critical systems
- Evaluate and integrate new technologies to improve platform stability
- Work closely with engineers to refine deployment strategies
- Configure and manage containerized environments
- Optimize resource usage and reduce cloud spending
- Implement backup and disaster recovery protocols
- Enforce access controls and identity management policies
- Deploy and maintain monitoring and alerting systems
- Contribute to post-incident reviews and remediation plans
- Support integration of third-party services and APIs
- Promote best practices in configuration management
- Ensure environments are consistent across development, staging, and production
- Assist in capacity planning and system forecasting
- Maintain secure and auditable deployment workflows
- Drive improvements in system observability
Compensation
Competitive salary based on experience and location
Work Arrangement
Fully remote with international team members
Team
Collaborative engineering team focused on reliability and automation
Why This Role Matters
This position plays a critical role in maintaining the stability and efficiency of our global platform. Your work will directly impact system uptime, deployment speed, and developer productivity.
What We Value
We prioritize clear communication, proactive problem solving, and a commitment to operational excellence. Candidates who take initiative and focus on long-term system health will thrive.
Not available for this position