About the Role
The role involves maintaining and improving the reliability of critical database infrastructure by diagnosing complex issues, implementing automated solutions, and collaborating closely with engineering teams to optimize system performance and uptime.
Responsibilities
- Monitor database system health and respond to incidents promptly
- Diagnose performance bottlenecks and implement effective fixes
- Develop automation tools to streamline operational workflows
- Collaborate with developers to enhance database resilience
- Perform root cause analysis for production issues
- Design and maintain monitoring and alerting systems
- Support upgrades and migrations of database environments
- Ensure configurations follow best practices for reliability
- Conduct regular system reviews to prevent outages
- Optimize backup and recovery procedures
- Improve scalability of database infrastructure
- Document operational procedures and troubleshooting steps
- Participate in on-call rotations for incident response
- Evaluate new technologies for operational improvements
- Maintain security and access controls for database systems
- Work with distributed systems at large scale
- Troubleshoot replication and consistency issues
- Assist in capacity planning for future growth
- Implement disaster recovery strategies
- Contribute to post-mortem reviews after incidents
- Enforce reliability standards across environments
- Reduce technical debt in operational tooling
- Improve deployment reliability and rollback processes
- Support testing of high-availability configurations
- Analyze logs and metrics to detect anomalies
Compensation
Competitive salary with performance-based incentives
Work Arrangement
Hybrid work model with flexible scheduling options
Team
Collaborative engineering group focused on database infrastructure reliability
Why This Role Matters
- This position plays a central role in ensuring the database platform remains stable, fast, and resilient under heavy workloads.
- Your work directly impacts the trust users place in the system's consistency and availability.
Growth Opportunities
- Engineers in this role have pathways to advance into senior reliability, architecture, or leadership roles.
- Regular knowledge-sharing sessions and mentorship programs support professional development.
Available for qualified candidates requiring work authorization