Responsibilities
- Work directly with developers to debug infrastructure and deployment issues in Temporal environments (Kubernetes, Docker, cloud services).
- Investigate and resolve performance bottlenecks, networking problems, and availability incidents.
- Provide clear, actionable guidance to help developers run Temporal reliably at scale.
- Once ramped up, independently drive technical solutions, whether debugging complex production issues or proactively improving systems.
- Contribute to observability and monitoring solutions using Prometheus, Grafana, and related tools.
- Help improve reliability, load balancing, and ingress/egress networking for Temporal deployments.
- Partner closely with engineering, product, and developer advocacy teams to relay field issues and feedback.
- Document best practices, troubleshooting playbooks, and infrastructure guides for the developer community.
- Participate in on-call rotations to support production deployments and provide timely responses to critical issues.
Requirements
- 5+ years of experience in cloud-based environments as an infrastructure engineer
- Proficiency with AWS, GCP, or Azure cloud platforms
- Hands-on experience deploying and managing containerized services (Kubernetes, Docker, EKS, GKE)
- Familiarity with cloud infrastructure and networking (load balancing, DNS, TLS, ingress/egress)
- Demonstrated ability to manage services across on-premises and cloud infrastructures
- Experience with monitoring/observability tools such as Prometheus, Grafana, or OpenTelemetry
- Prior experience troubleshooting production performance and availability issues
- Strong written and verbal communication skills; ability to explain complex technical concepts clearly
- Comfort working on a remote team and collaborating across time zones
Nice to Have
- Experience with infrastructure-as-code (Terraform, Ansible, or AWS CDK)
- Knowledge of security certificate management and implementation
- Background in customer-facing roles such as professional services, solutions architecture, or developer advocacy
- Fluency in one or more of: Python, Go, Java, or TypeScript
- Familiarity with distributed systems concepts and Temporal workflows
Work Arrangement
Remote (Worldwide)
Additional Information
- Occasional travel may be required for company events, team offsites, and other meaningful moments that bring the team together.
- Employee benefits and perks apply to full-time employees; part-time or temporary positions are excluded.
- Temporal is committed to providing access, equal opportunity, and reasonable accommodation for individuals with disabilities in employment.
- Temporal is not working with external recruitment agencies.
- Company-issued hardware includes laptop, monitor, keyboard, mouse, trackpad, and extension power cable at no cost to the employee.
- Work-from-home meals, internet stipend, in-home office setup, lifestyle spending, professional memberships, and learning & development are provided as perks.


