About the Role
This role involves managing cloud infrastructure, supporting scalable systems, and implementing operational best practices to ensure reliability and efficiency.
Responsibilities
- Monitor cloud infrastructure for performance and stability
- Respond to system alerts and resolve operational issues
- Implement automation to improve system reliability
- Collaborate with development teams on deployment strategies
- Maintain system documentation and runbooks
- Support incident response and root cause analysis
- Ensure compliance with security and operational standards
- Manage cloud resource provisioning and configuration
- Optimize system performance and resource utilization
- Participate in on-call rotation for critical systems
- Troubleshoot network and application-level problems
- Deploy and manage monitoring tools
- Support disaster recovery planning and testing
- Enforce change management processes
- Work with cross-functional teams to improve system design
- Maintain cloud security posture and access controls
- Assist in capacity planning and forecasting
- Implement backup and recovery procedures
- Support continuous integration and delivery pipelines
- Apply software patches and system updates
- Track and report on system uptime and reliability
- Evaluate new technologies for operational improvements
- Ensure adherence to service level agreements
- Contribute to post-incident reviews
- Promote a culture of operational excellence
Nice to Have
- Certification in cloud platforms such as AWS or GCP
- Experience with large-scale distributed systems
- Background in telecommunications or public safety systems
- Familiarity with regulatory compliance standards
- Advanced scripting or programming skills
- Experience with service mesh technologies
- Knowledge of CI/CD pipeline architecture
- Exposure to site reliability engineering practices
- Experience with multi-region cloud deployments
- Understanding of data privacy principles
Compensation
Competitive salary and benefits package
Work Arrangement
Remote position within the United States
Team
Part of a dynamic engineering team supporting cloud infrastructure
Why This Role Matters
- This position plays a key role in maintaining the reliability of critical communication systems used by public safety organizations.
- Engineers contribute to infrastructure that supports real-time operations during emergency responses.
What to Expect
- You will work in a fast-paced environment with a focus on uptime and system resilience.
- Collaboration with engineering teams is essential to deploy and maintain secure cloud services.
Not available for this position


