Work with a team of six engineers to build and maintain cloud-native systems, ensuring high reliability, scalability, and security through close collaboration with security and engineering teams.
Responsibilities
- Design and manage cloud infrastructure using Infrastructure as Code methodologies to ensure consistency and scalability.
- Develop and enhance CI/CD pipelines to support automated software delivery for engineering teams.
- Troubleshoot, optimize, and maintain Kubernetes-based workloads while adopting platform best practices.
- Improve system observability by building dashboards, setting up alerts, and making production environments easier to monitor and understand.
- Collaborate on GitOps workflows and contribute to platform-wide improvements.
- Expand and refine a self-service engineering platform to increase team efficiency and autonomy.
Requirements
- 2 to 5 years of experience in Site Reliability Engineering, DevOps, or a related infrastructure role.
- Hands-on experience with public cloud platforms such as AWS or GCP.
- Working knowledge of containerization technologies and Kubernetes, with production experience preferred.
- Proficiency in writing automation scripts using languages like Python or Go.
- Solid understanding of Linux operating system fundamentals.
- Ability to communicate technical concepts clearly to both technical and non-technical stakeholders.
- Demonstrated curiosity and a strong desire to continuously learn and improve.
Nice to Have
- Experience with Infrastructure as Code tools such as Terraform or Ansible.
- Familiarity with the Cloud Native Computing Foundation (CNCF) ecosystem and cloud-native technologies.
- Background in CI/CD systems and GitOps practices.
- Interest in security principles and experience building secure, resilient systems.
- Hands-on experience with observability tools for logging, metrics, and alerting.
Tech Stack
AWS, GCP, Kubernetes, Terraform, Ansible, CI/CD, GitOps, Python, Go, Linux, CNCF, logging tools, metrics tools, alerting tools
Benefits
- Flexible paid time off policy including 14 paid holidays annually.
- Annual $1,500 Learning & Development stipend for professional growth.
- Regular company-sponsored team events and social gatherings.
- Access to an Employee Assistance Program for personal support.
- Subscription to Headspace, a personalized mental wellness app.
- Flat 3% contribution to retirement accounts regardless of employee contribution.
- High flexibility to accommodate personal appointments and emergencies.
- Competitive base salary and total compensation package.
- Generous parental, medical, and bereavement leave policies.
- 401(k) contributions and stock options as part of total rewards.
- Comprehensive medical, dental, and vision insurance coverage.
- Welcome swag and IT equipment provided for new hires.
- Structured semi-annual 360-degree performance reviews for career development.
Compensation
$125,000 to $155,000 base salary. The package also includes stock options, 401(k) contributions, and a flat 3% retirement contribution.
Work Arrangement
Not specified
Team
The SRE team has six engineers and works closely with the Security team and multiple engineering teams to ensure system reliability and security.
- Prioritizes wellbeing across occupational, mental, social, physical, and environmental dimensions.
- Committed to diversity, equity, inclusion, and fostering a sense of belonging.
- Emphasizes psychological safety in all interactions and decision-making.
- Supports continuous learning and professional development.
- Encourages social connection through regular team celebrations and events.
Additional Information
- Commitment to creating an inclusive workplace for individuals of all backgrounds.
- Equal Opportunity Employer: does not discriminate based on race, color, gender, sexual orientation, gender identity, religion, disability, national origin, protected veteran status, age, or other legally protected characteristics.
- Structured onboarding with mentorship and ramp-up time provided for new team members.