Join Guidewire as a Site Reliability Engineer and become part of a passionate team dedicated to automating every process to ensure our systems run efficiently. You will play a key role in ensuring the stability of our flagship cloud platform products while building the tooling necessary for efficient operations and optimal availability of our SaaS multi-tenant, customer-focused systems.
What You'll Do
- Take a dedicated SRE approach to managing shared multi-tenant infrastructure for resilient SaaS microservice-based systems and customer-centric applications.
- Oversee and continuously enhance our team’s presence in AWS by automating deployment and operational tasks.
- Contribute to the development of our core infrastructure systems—adding features, fixing bugs, and implementing reliability enhancements.
- Engineer and maintain a complex single sign-on authentication platform based on SAML/OAuth to ensure secure, seamless access for our users.
- Build and maintain comprehensive observability tooling, metrics, and dashboards to support our global platform infrastructure.
- Improve our incident management lifecycle by identifying, mitigating, and learning from reliability risks, while helping to create a self-healing environment.
- Develop system documentation and training materials to educate and empower your teammates.
- Collaborate with various engineering teams, providing valuable feedback and contributing code when needed to enhance our products.
What We're Looking For
- Bachelor’s Degree in Computer Science or a related field.
- Proven software engineering and automation skills using Bash, Python, and/or Go.
- Well-versed in agile development methodologies.
- Deep background in Linux systems.
- Significant experience in automating and managing systems on Amazon Web Services.
- Experience supporting live production environments with Java/Apache/Tomcat.
- Proficient with Infrastructure as Code tools such as Terraform, Terragrunt, or Terraspace.
- Experience with devops/gitops tools like Git, Bitbucket, Flux CD, and TeamCity for smooth code promotions.
- Hands-on experience in containerization with Docker, Helm, Kubernetes/EKS, CNI, and Ingress networking.
- Strong understanding of Single-Sign On, SAML, and OAuth.
- Experienced with observability tools such as Datadog, CloudWatch, and PagerDuty.
- Familiar with event store/stream-processing technologies like Kafka or AWS SQS.
- Worked with relational databases such as Aurora Postgres or Oracle RDS.
- Possess advanced exposure to application development, web UI design, JSON, and overall application architecture.
- Outstanding troubleshooting skills, analytical mindset, and process-driven approach.
- Proactive team player with excellent communication skills, capable of explaining complex technical concepts to a varied audience.
Nice to Have
- Experience in production support for a SaaS platform.
- Comfortable working with cutting-edge, highly containerized, cloud-native environments in AWS.
- Experience with Okta.
- Exposure to Open Application Model systems like KubeVela or Crossplane.
Technical Stack
- Cloud & Infrastructure: AWS, Kubernetes/EKS, Terraform/Terragrunt/Terraspace
- Containers: Docker, Helm
- Databases: Aurora Postgres, Oracle RDS
- CI/CD & Version Control: Git, Bitbucket, Flux CD, TeamCity
- Languages & Runtime: Bash, Python, Go, Java, Apache, Tomcat
- Observability: Datadog, CloudWatch, PagerDuty
- Messaging & Auth: Kafka, AWS SQS, SAML, OAuth
- Operating System: Linux
Team & Environment
You will join a platform team that collaborates closely with core product developers.
We operate by our core values of integrity, rationality, and collegiality in a fun work environment. We are passionate about working together to deliver quality products and support, and champion a culture of reliability by promoting practices such as blameless postmortems, SLO tracking, and continuous learning from incidents.
Guidewire Software, Inc. is proud to be an equal opportunity and affirmative action employer. We are committed to an inclusive workplace, and believe that a diversity of perspectives, abilities, and cultures is a key to our success.




