About the Role
In this role, you will own and improve our cloud infrastructure, build automation tools, and keep systems reliable through observability and incident response.
Responsibilities
- Design and manage cloud-based infrastructure using modern DevOps practices
- Develop and maintain CI/CD pipelines for rapid and safe deployments
- Implement infrastructure as code using configuration management tools
- Ensure high availability, scalability, and performance of production systems
- Lead incident response and conduct root cause analysis for outages
- Collaborate with development teams to optimize application performance
- Enforce security best practices across infrastructure and deployment workflows
- Monitor system health using observability platforms and alerting systems
- Automate operational tasks to reduce manual intervention and errors
- Support compliance and audit requirements for infrastructure systems
- Evaluate and integrate new technologies to improve platform capabilities
- Document architecture decisions and operational procedures
- Participate in on-call rotations for critical system support
- Optimize cloud resource usage to control costs
- Work closely with product teams to align infrastructure with business goals
- Improve disaster recovery and business continuity planning
- Manage containerized environments including orchestration platforms
- Maintain version control and change management processes
- Drive improvements in system reliability and mean time to recovery
- Contribute to technical decision-making at the platform level
- Ensure consistent deployment environments across stages
- Support integration of third-party services and APIs
- Troubleshoot complex distributed systems issues
- Promote a culture of operational excellence
- Mentor junior engineers in DevOps best practices
Compensation
Competitive salary and equity package
Work Arrangement
Hybrid with flexible remote options
Team
Part of a growing engineering team within an early-stage technology company
Why This Role Matters
- This position is critical to shaping the foundation of a growing technology platform.
- You will directly influence system stability, deployment speed, and engineering efficiency.
What We Value
- Ownership of systems from design to decommissioning
- Collaborative problem-solving with engineering teams
- Continuous learning and adoption of best practices
- Transparency in incident communication and resolution
Available for qualified candidates


