About the Role
This role involves ensuring the reliability, performance, and security of a critical transaction platform by building resilient systems, automating operations, and responding to incidents with a focus on minimizing downtime and risk.
Responsibilities
- Design and maintain scalable infrastructure for high availability
- Implement automated deployment and recovery processes
- Monitor system performance and respond to incidents
- Optimize cloud resource usage and costs
- Collaborate with development teams to improve service reliability
- Develop and enforce observability standards
- Troubleshoot complex production issues
- Maintain disaster recovery protocols
- Support secure handling of sensitive financial data
- Drive improvements in system uptime and response times
- Participate in on-call rotations
- Ensure compliance with security standards
- Manage configuration and change control processes
- Evaluate new tools and technologies for operational efficiency
- Document system architecture and operational procedures
- Lead post-incident reviews and implement corrective actions
- Integrate security practices into CI/CD pipelines
- Monitor and report on service level objectives
- Support capacity planning initiatives
- Improve system resilience under load
- Enforce infrastructure as code principles
- Collaborate on incident response planning
- Contribute to platform modernization efforts
- Ensure consistency across staging and production environments
- Promote best practices in monitoring and alerting
Nice to Have
- Master’s degree in a technical field
- Certifications in cloud or systems engineering
- Experience with Kubernetes in production
- Background in fintech or cybersecurity domains
- Contributions to open-source infrastructure projects
Compensation
Competitive salary and benefits package
Work Arrangement
Hybrid
Team
Collaborative engineering team focused on system stability and security
About Us
The company specializes in securing real estate transactions through identity verification and fraud prevention technology, serving title and real estate professionals nationwide.
Our Technology Stack
The platform leverages AWS, Docker, Kubernetes, Terraform, Prometheus, Grafana, Python, and PostgreSQL to deliver a secure, scalable service.
Available