United States of America Remote (Global) CAD 200,000 - 300,000 Yearly

Hopper is hiring a Site Reliability Engineer

The position involves improving cloud infrastructure at scale, with a focus on cost efficiency, automation, reliability, and security. The engineer will work within the Cloud FinOps team to optimize systems used by hundreds of engineers and millions of users worldwide.

Responsibilities

  • Lead initiatives to improve cost efficiency, such as minimizing network egress expenses by eliminating redundant data headers
  • Evaluate data storage usage and assign appropriate storage classes, including cold storage for infrequently accessed data
  • Optimize autoscaling configurations for databases and compute resources to balance performance and cost
  • Enhance cost attribution systems to provide teams with transparent and accurate cost visibility
  • Respond to platform incidents as part of an on-call rotation and provide technical support during outages
  • Assist engineering teams in resolving infrastructure-related challenges and interpreting platform policies
  • Review and approve pull requests requiring platform-level oversight
  • Collaborate within a compact, high-impact team of SREs focused on cloud efficiency and reliability

Requirements

  • Proven experience in site reliability engineering, DevOps, software engineering, or systems engineering
  • Strong troubleshooting abilities in complex distributed systems
  • Experience in system design with solid analytical reasoning
  • Effective communication skills for cross-team collaboration
  • Familiarity with major cloud platforms, with a preference for Google Cloud
  • Proficiency in SQL for data querying and analysis
  • Hands-on experience with containerization, Kubernetes, and configuration tools like Helm and Kustomize
  • Knowledge of service mesh technologies, particularly Istio
  • Understanding of networking concepts including DNS, TLS, certificates, and ingress routing
  • Experience with observability tools for logs, metrics, and APM, especially Datadog
  • Working knowledge of security practices such as IAM, RBAC, and network security
  • Familiarity with authentication and authorization systems
  • Experience with CI/CD pipelines and automation
  • Understanding of database technologies and their operational characteristics
  • Proficiency in scripting with Bash, Python, or equivalent languages

Tech Stack

Google Cloud, Kubernetes, Helm, Kustomize, Istio, Datadog, SQL, Bash, Python, CI/CD, IAM, RBAC, TLS, DNS, Ingress, APM, Log collection, Metrics, Network security, Authentication and authorization technologies, Database technologies

Benefits

  • Backed by a well-funded startup with ambitious growth goals
  • Offers competitive salary packages
  • Provides pre-IPO equity with significant upside potential
  • Covers 100% of group insurance plan premiums
  • Includes life insurance and short- and long-term disability coverage
  • Provides access to an HSA for qualified medical and dental expenses
  • Grants employees and dependents 24/7 access to telemedicine via Dialogue
  • Offers an RRSP plan with automatic pre-tax contributions
  • Features generous parental leave exceeding industry norms
  • Provides unlimited paid time off
  • Includes a travel stipend through Carrot Cash
  • Grants access to on-demand co-working spaces via FlexDesk
  • Offers a work-from-home stipend
  • Promotes an entrepreneurial culture that encourages risk-taking and innovation
  • Encourages open dialogue with leadership and management
  • Operates with small, agile teams to maximize individual impact

Compensation

competitive salary. Equity: pre-IPO equity packages. Carrot Cash travel stipend, work-from-home stipend

Work Arrangement

global — America, Europe — Team is scattered across America and Europe, so you can sleep at night!

Team

small and highly efficient team of SREs within the Cloud FinOps team

  • Entrepreneurial culture where pushing limits and taking risks is everyday business
  • Open communication with management and company leadership
  • Small, dynamic teams = massive impact

Additional Information

  • On-call rotation for platform incidents is part of the role
  • Team spans America and Europe, allowing for reasonable working hours and better work-life balance
  • The role is tagged as #LI-REMOTE, indicating eligibility for remote work
Required Skills
Google CloudKubernetesKustomizeHelmIstioDatadogSQLBashPythonSREDevOpsSoftware EngineeringSystems engineeringTroubleshootingSystem design Google CloudKubernetesHelmKustomizeIstioDatadogSQLBashPythonCI/CDIAMRBACTLSDNSIngress
About company
Hopper
Hopper is a leading travel platform that powers its mobile app, website, and B2B business (HTS) using data and machine learning. It offers travel agency services and proprietary fintech products like Cancel for Any Reason and Flight Disruption Assistance. Hopper serves hundreds of millions of travelers globally and partners with major brands like Capital One, Air Canada, and Uber through its HTS division to integrate fintech and travel inventory into their direct channels.
All jobs at Hopper Visit website
Job Details
Department Engineering
Category infrastructure
Posted 3 months ago