Luma Financial Technologies is hiring a Site Reliability Engineer to join the team responsible for keeping our platform reliable, secure, and fast. You will own everything from AWS infrastructure and Kubernetes clusters to CI/CD pipelines, monitoring, and alerting, collaborating closely with product engineering teams.
What You'll Do
- Collaborate with product engineering teams to design and build the infrastructure their services run on.
- Keep Kubernetes clusters on AWS EKS running smoothly, secure, and ready to scale.
- Design and deliver resilience strategies covering multi-region architecture, backups, disaster recovery, and failover.
- Automate infrastructure with Terraform and Infrastructure-as-Code.
- Improve CI/CD pipelines and deployment practices to help teams ship faster.
- Monitor performance and reliability using modern observability tools.
- Support on-call rotations and lead incident response with a focus on long-term fixes.
What We're Looking For
- 5+ years of applicable experience in Site Reliability or Software Development Engineering.
- You code to solve problems and are comfortable in Java, Python, Bash, and Go.
- Strong experience with AWS (RDS, CloudFront, IAM, VPCs), Terraform, and Kubernetes.
- Resilience focused, with experience designing and running dependable systems during failures.
- Hands-on experience improving and operating CI/CD pipelines (e.g., CircleCI, GitHub Actions).
- Stay calm under pressure, bringing incident response expertise and strong root-cause analysis skills.
- A team player who brings clear communication, strong collaboration, and a mindset of continuous improvement.
Nice to Have
- Bachelor’s degree in Computer Science, Software Engineering or related concentration.
Technical Stack
- AWS, Kubernetes, Terraform, Java, Python, Bash, Go, CircleCI, GitHub Actions
Luma Financial Technologies is an equal opportunity employer.





