Responsibilities
- Define and guide the overall platform strategy, including architectural direction and technology choices, to support petabyte-scale data and billions of daily events.
- Own and manage the platform engineering roadmap, focusing on reliability, performance, security, and developer productivity.
- Lead Site Reliability Engineering efforts by establishing and enforcing Service Level Objectives and Indicators, managing on-call processes, and reducing manual work through automation.
- Implement operational best practices using comprehensive monitoring, observability tools, automated infrastructure provisioning, and resilient disaster recovery plans.
- Manage and mentor a multidisciplinary team of platform, data, and infrastructure engineers, promoting accountability, technical excellence, and continuous learning.
- Partner with product and security operations teams to convert business and operational requirements into scalable, efficient platform solutions.
- Champion DevSecOps principles by standardizing secure CI/CD pipelines, infrastructure-as-code practices, and automated security testing.
- Drive innovation by evaluating and integrating emerging technologies, cloud-native architectures, and advanced distributed systems to maintain technical leadership.
Compensation
Competitive salary and equity package
Work Arrangement
Flexible, hybrid or remote options available
Team
Cross-functional engineering organization focused on security and data-intensive systems
Responsibilities
- Define Platform Strategy & Architecture: Overview the entire service platform, overseeing architectural design, technology selection, and strategic planning to ensure the platform can scale to petabytes of security data and billions of daily events.
- Own the Platform Engineering Roadmap: Prioritize initiatives that maximize reliability, performance, security, and developer efficiency across our core systems.
- Lead Site Reliability Engineering (SRE): Define and driving adherence to critical Service Level Objectives (SLOs) and Service Level Indicators (SLIs), managing on-call rotations, and minimizing toil through automation.
- Drive Operational Excellence: Drive the platform, implementing advanced monitoring, observability (logs, metrics, tracing), automated provisioning, and disaster recovery strategies.
- Lead & Mentor Platform Engineering Team: Engineering managers and a diverse team of platform, data, and infrastructure engineers. Drive a culture of engineering rigor, operational ownership, and continuous improvement.
- Collaborate with Product Management & Security Operations: Translate new product requirements and operational needs into robust, scalable, and cost-effective platform solutions.
- Establish and Enforce DevSecOps Practices: Standarize CI/CD pipelines, infrastructure-as-code (IaC), security testing, and deployment mechanisms to ensure rapid, secure, and reliable software delivery.
- Foster Innovation: Identify and driving the adoption of new engineering methodologies and infrastructure technologies. Push the frontier—leverage cloud-native patterns, advanced data stores, and distributed computing frameworks to maintain a competitive advantage.
Available for qualified candidates