About the Role
We are seeking an experienced engineer to lead improvements in database infrastructure stability and performance. The role combines deep technical ownership, mentoring, and leadership of reliability initiatives across production environments.
Responsibilities
- Design and maintain highly available database systems
- Optimize database performance across production workloads
- Lead incident response and root cause analysis for critical outages
- Develop automation tools for provisioning and maintenance
- Collaborate with engineering teams on schema and query design
- Monitor system health and implement proactive alerting
- Improve backup and disaster recovery procedures
- Support migration of database platforms and versions
- Enforce security and compliance standards for data storage
- Troubleshoot replication and clustering issues
- Contribute to capacity planning and scaling strategies
- Document architecture and operational runbooks
- Mentor junior engineers on best practices
- Evaluate new database technologies and tools
- Participate in on-call rotations
- Drive post-mortem follow-up actions
- Integrate database systems with CI/CD pipelines
- Ensure observability across data tiers
- Reduce operational toil through automation
- Collaborate on incident prevention strategies
- Maintain uptime and service level objectives
- Support hybrid and cloud infrastructure environments
- Work across time zones with global peers
Nice to Have
- Master’s degree in a technical field
- Experience with time-series or analytical databases
- Contributions to open-source database tools
- Public speaking or conference presentations
- Leadership in large-scale incident resolution
- Direct experience with financial data systems
Compensation
Competitive salary and benefits package
Work Arrangement
Remote-friendly with global team presence
Team
Team of 230+ professionals distributed across multiple countries, including the USA, Canada, Japan, Hungary, Nigeria, Brazil, and the UK
Why This Role Matters
Database reliability is critical to system uptime and user trust. This role directly impacts the resilience and scalability of core infrastructure serving global clients.
What You’ll Bring
A mindset focused on automation, measurement, and continuous improvement. You will challenge existing processes and implement robust, long-term solutions.
Available for qualified candidates