What You'll Do
Design and maintain robust MySQL and MariaDB cluster environments in both on-premise and AWS cloud infrastructures. Ensure databases remain highly available, secure, and responsive in a 24/7 production setting. Monitor system performance and implement optimizations to meet evolving capacity demands.
Manage replication topologies to support fault tolerance and seamless failover. Develop and enforce backup and recovery protocols, regularly validating data integrity. Lead database upgrades, patching, and configuration tuning to maintain stability and security. Troubleshoot complex database issues, including deadlocks, replication lag, and performance bottlenecks, often in coordination with development teams.
Collaborate on query optimization efforts and support secure access controls, auditing, and user permission management. Contribute to disaster recovery planning and ensure compliance with internal and industry-specific data protection standards.
Requirements
- Minimum of three years as a production Database Administrator with hands-on experience in Galera or Percona Cluster environments
- Strong grasp of MySQL architecture, replication, and performance tuning methodologies
- Proven track record in managing backup, recovery, and data migration processes
- Experience configuring high-availability solutions such as clustering, log shipping, and database mirroring
- Proficiency in Bash, Python, or Perl for automating operational tasks
- Deep familiarity with UNIX-like server operating systems
- Knowledge of database security practices, access controls, and auditing
- Experience using Git or similar version control for managing scripts and configurations
Preferred Qualifications
- Background in FinTech or financial services
- Exposure to PCI DSS or PCI 3DS compliance frameworks
- Familiarity with AWS database services like Aurora DB and Global Databases
- Experience with FreeBSD and ZFS file system management
Work Mode
This role supports a globally distributed team. Success requires strong self-direction, consistent communication, and the ability to remain productive in a remote-first environment.