KOMOJU is looking for a Site Reliability Engineer to work at the intersection of software engineering and infrastructure operations. This role is ideal for engineers passionate about automation, systems design, and building scalable, reliable platforms.
What You'll Do
- Actively participate in improving and maintaining our AWS infrastructure
- Continuously improve system performance, reliability, and security
- Design, implement, and maintain our observability stack (metrics, logging, tracing, dashboards)
- Collaborate with engineering teams to instrument applications for better observability
- Improve developer productivity with tooling
- Secure our systems and adhere to compliance requirements
- Be part of the team's on-call rotation
What We're Looking For
- 2+ years in SRE roles working with the AWS platform
- 2+ years of experience in a software development role
- Hands-on experience with observability tools, preferably Datadog
- Proficiency in Terraform
- Proficiency in at least one scripting or programming language (Ruby/Rails, Python, Go, Shell Script, etc.)
- Experience working with CI/CD tools such as GitHub Actions, Jenkins, CircleCI, etc.
Nice to Have
- Strong communication skills to work closely with outside companies and various internal departments
- Knowledge of TCP/IP and other networking protocols
- Experience with AWS Direct Connect
Technical Stack
- AWS, Terraform, Datadog
- Ruby/Rails, Python, Go, Shell Script
- GitHub Actions, Jenkins, CircleCI
Team & Environment
You will collaborate closely with developers, security engineers, and product stakeholders.
Benefits & Compensation
- Remote work is embraced, with office space available for those who prefer in-person collaboration
- 10 days of regular vacation, plus an additional 5 days of summer vacation and 5 days of winter vacation
- Paid birthday holiday
- Self-learning allowance
- Access to the O’Reilly Learning Platform
- Language training for Japanese/English
- Office lunch twice a week
Work Mode
This role follows a hybrid work model.



