Responsibilities
- Define the long-term technical direction and system architecture for a platform handling petabytes of security data and billions of daily events.
- Ensure system designs support scalability, reliability, and security for autonomous detection and response workflows.
- Design and implement the AI infrastructure layer enabling automated threat detection, retrieval-augmented investigations, and self-driven remediation.
- Build and deploy large language models, graph-based reasoning systems, and real-time feature pipelines processing massive volumes of security events.
- Lead evaluation frameworks and ensure system reliability through prompt management, fine-tuning, red-team exercises, latency controls, and fallback mechanisms.
- Guarantee high operational standards including observability, performance, security, and resilience in production environments processing billions of events daily.
- Work closely with Product, Detection Engineering, and Customer Success teams to convert real-world attack patterns into effective detection logic and technical requirements.
- Drive innovation by testing emerging AI methods such as retrieval-augmented generation, agent tool use, and multimodal modeling across text, logs, and graphs.
- Provide technical leadership and mentorship to engineering teams in cloud architecture, secure development, and system reliability.
Work Arrangement
Hybrid
Other
This position requires on-site presence Monday through Thursday at one of the company's offices in San Jose, CA, Sarasota, FL, or Kansas City, MO, with remote work permitted on Fridays.