Full-time

Guidewire is hiring a Site Reliability Engineer

About the Role

Join Guidewire as a Site Reliability Engineer and become part of a passionate team dedicated to automating every process to ensure our systems run efficiently. You will play a key role in ensuring the stability of our flagship cloud platform products while building the tooling necessary for efficient operations and optimal availability of our SaaS multi-tenant, customer-focused systems.

What You'll Do

  • Take a dedicated SRE approach to managing shared multi-tenant infrastructure for resilient SaaS microservice-based systems and customer-centric applications.
  • Oversee and continuously enhance our team’s presence in AWS by automating deployment and operational tasks.
  • Contribute to the development of our core infrastructure systems—adding features, fixing bugs, and implementing reliability enhancements.
  • Engineer and maintain a complex single sign-on authentication platform based on SAML/OAuth to ensure secure, seamless access for our users.
  • Build and maintain comprehensive observability tooling, metrics, and dashboards to support our global platform infrastructure.
  • Improve our incident management lifecycle by identifying, mitigating, and learning from reliability risks, while helping to create a self-healing environment.
  • Develop system documentation and training materials to educate and empower your teammates.
  • Collaborate with various engineering teams, providing valuable feedback and contributing code when needed to enhance our products.

What We're Looking For

  • Bachelor’s Degree in Computer Science or a related field.
  • Proven software engineering and automation skills using Bash, Python, and/or Go.
  • Well-versed in agile development methodologies.
  • Deep background in Linux systems.
  • Significant experience in automating and managing systems on Amazon Web Services.
  • Experience supporting live production environments with Java/Apache/Tomcat.
  • Proficient with Infrastructure as Code tools such as Terraform, Terragrunt, or Terraspace.
  • Experience with devops/gitops tools like Git, Bitbucket, Flux CD, and TeamCity for smooth code promotions.
  • Hands-on experience in containerization with Docker, Helm, Kubernetes/EKS, CNI, and Ingress networking.
  • Strong understanding of Single-Sign On, SAML, and OAuth.
  • Experienced with observability tools such as Datadog, CloudWatch, and PagerDuty.
  • Familiar with event store/stream-processing technologies like Kafka or AWS SQS.
  • Worked with relational databases such as Aurora Postgres or Oracle RDS.
  • Possess advanced exposure to application development, web UI design, JSON, and overall application architecture.
  • Outstanding troubleshooting skills, analytical mindset, and process-driven approach.
  • Proactive team player with excellent communication skills, capable of explaining complex technical concepts to a varied audience.

Nice to Have

  • Experience in production support for a SaaS platform.
  • Comfortable working with cutting-edge, highly containerized, cloud-native environments in AWS.
  • Experience with Okta.
  • Exposure to Open Application Model systems like KubeVela or Crossplane.

Technical Stack

  • Cloud & Infrastructure: AWS, Kubernetes/EKS, Terraform/Terragrunt/Terraspace
  • Containers: Docker, Helm
  • Databases: Aurora Postgres, Oracle RDS
  • CI/CD & Version Control: Git, Bitbucket, Flux CD, TeamCity
  • Languages & Runtime: Bash, Python, Go, Java, Apache, Tomcat
  • Observability: Datadog, CloudWatch, PagerDuty
  • Messaging & Auth: Kafka, AWS SQS, SAML, OAuth
  • Operating System: Linux

Team & Environment

You will join a platform team that collaborates closely with core product developers.

We operate by our core values of integrity, rationality, and collegiality in a fun work environment. We are passionate about working together to deliver quality products and support, and champion a culture of reliability by promoting practices such as blameless postmortems, SLO tracking, and continuous learning from incidents.

Guidewire Software, Inc. is proud to be an equal opportunity and affirmative action employer. We are committed to an inclusive workplace, and believe that a diversity of perspectives, abilities, and cultures is a key to our success.

Required Skills
AWSKubernetesEKSTerraformHelmDockerPostgreSQLOracle RDSGitBitbucketFlux CDTerragruntTerraspaceSite Reliability EngineeringInfrastructure as Code
Scaling your freelance income?

Invoice multiple clients effortlessly

Managing 3+ international clients? Glopay streamlines everything. One EU company, unlimited invoices, automatic compliance. You just send and get paid.

Unlimited clients & invoices
Multi-currency support
Automated tax compliance
Client portal for easy payments
Scale with Glopay
Trusted by 10,000+ freelancers
About company
Guidewire

Guidewire is the platform P&C insurers trust to engage, innovate, and grow efficiently. They combine digital, core, analytics, and AI to deliver their platform as a cloud service. More than 540+ insurers in 40 countries run on Guidewire.

Visit website
Job Details
Category infrastructure
Posted 7 months ago