Alkami is hiring a Senior Site Reliability Engineer specializing in container orchestration. In this role, you will ensure the reliability, scalability, and performance of our infrastructure. You'll be part of a team dedicated to building robust platform solutions in a remote-first environment.
What You'll Do
- Participate in the architecture and implementation of scalable platform solutions that present long-term solutions meeting business requirements.
- Collaborate with software engineers, DevOps, Information Security, and other teams to integrate applications into the platform.
- Create and share guidance for other teams on the proper implementation of infrastructure including technical guides with best practices.
- Monitor system health, performance, and reliability, and implement proactive measures to prevent downtime.
- Implement processes to ensure security vulnerabilities are remediated within SLA.
- Investigate and troubleshoot complex system and performance issues, providing root cause analysis and solutions.
- Identify opportunities for process and system improvements and contribute to ongoing performance and cost optimization efforts.
- Stay informed about industry trends and best practices to continually improve the platform.
- Participate in an on-call schedule.
- Create system infrastructure and processes documentation.
What We're Looking For
- 4+ years experience in a DevOps, SRE, or Platform Engineering role.
- Direct experience with cloud platforms (e.g., AWS, Azure, GCP).
- Proficiency in scripting and automation using tools such as Python, Bash, Powershell, or Java/.NET development.
- Familiarity with creating/modifying infrastructure-as-code.
- Strong experience with modern CI/CD tooling focused around rapid container deployment.
- Strong understanding of networking, load balancing, and security principles.
- Experience using automation tools for build, provision, deploy, test, and monitor workflows.
- Familiarity with creating physical and logical infrastructure diagrams.
- Ability to communicate effectively both verbally and in written form, adapting style to different audiences.
- Effective presentation skills.
- Ability to work cross functionally.
- Provide mentorship to team members.
- Work is done independently and reviewed at critical points.
- Key stakeholder in projects of diverse scope from design to completion.
- Enhances relationships with internal/external partners.
- Ability to participate in an on-call rotation as assigned.
Nice to Have
- Master’s degree in computer science or a related field.
- Experience with containerization and orchestration technologies, such as Docker and Kubernetes.
- Understanding of regulatory standards or experience working in a PCI environment.
- Previous experience with Git or a similar source code management system.
- Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK Stack).
Technical Stack
- Cloud: AWS, Azure, GCP
- Languages/Scripting: Python, Bash, Powershell, Java/.NET
- Methodologies: Infrastructure-as-Code, CI/CD
- Platform: Docker, Kubernetes
- Tools: Git, Prometheus, Grafana, ELK Stack
Team & Environment
You will collaborate closely with software engineers, DevOps, Information Security, and other teams.
Benefits & Compensation
- Salary: $115,000 - $130,500
- Remote-first environment
- Unlimited paid time off
- 401(k) with employer match
Work Mode
This is a remote-first position open to candidates based in the US.
Alkami Technology is an Equal Opportunity Employer and Prohibits Discrimination and Harassment of Any Kind.


