Kohl's is hiring a Senior DevOps / Site Reliability Engineer to lead the design, automation, and operation of cloud infrastructure for our fast-growing, tech-driven platform. You will join a brilliant team of engineers, ex-bankers, and startup superstars in a fast-moving environment.
What You'll Do
- Design, implement, and manage scalable infrastructure on AWS (ECS, RDS, S3, VPC, CloudWatch).
- Develop and maintain CI/CD pipelines to enable secure, reliable, and automated software delivery.
- Define and automate infrastructure as code with Terraform (or CloudFormation) to ensure consistency and repeatability.
- Monitor and optimize system health, performance, and reliability; proactively address incidents and capacity challenges.
- Enhance observability through monitoring, logging, alerting, and dashboards.
- Strengthen security, compliance, and disaster recovery practices across infrastructure.
- Collaborate with engineering and QA teams to improve deployments, system reliability, and performance.
- Mentor and guide junior DevOps/SRE engineers, promoting standards for automation, scalability, and reliability.
What We're Looking For
- 5+ years of professional experience in DevOps, SRE, or Cloud Engineering roles.
- Strong expertise in AWS services and infrastructure as code (Terraform required).
- Proficiency in containerization and orchestration (Docker, Kubernetes, ECS).
- Solid background with CI/CD tools (e.g., GitHub Actions, Jenkins, GitLab CI).
- Experience with observability stacks (Prometheus, Grafana, ELK, CloudWatch).
- Strong scripting/programming skills (Python, Bash, Go).
- Knowledge of networking, security best practices, and high-availability architectures.
- Excellent communication skills in English, with the ability to collaborate across teams.
- Experience using AI-assisted development/ops tools (e.g., Cursor AI) to increase efficiency and reliability.
- Thrives in a fast-moving startup environment—flexible, proactive, and self-directed.
Nice to Have
- CloudFormation knowledge is a plus.
Technical Stack
- AWS: ECS, RDS, S3, VPC, CloudWatch
- Infrastructure as Code: Terraform, CloudFormation
- Containers: Docker, Kubernetes, ECS
- CI/CD: GitHub Actions, Jenkins, GitLab CI
- Observability: Prometheus, Grafana, ELK, CloudWatch
- Languages/Scripting: Python, Bash, Go
Work Mode
This role operates in local-country work mode in Albania.

