Thales is looking for a Site Reliability Engineering Manager to lead the SRE function within the Cybersecurity and Digital Identity (CDI) Cloud Center of Excellence (CCoE). In this role, you will manage SREs responsible for the reliability, observability, and incident excellence of five core products—Synapse, Jarvis, Oxygen, Photon, and Foundations—using Datadog as the strategic observability platform.
What You'll Do
- Manage career development, performance reviews, and mentorship for SRE engineers, building growth plans with strong Datadog capability development.
- Foster a culture of blameless post-mortems, psychological safety, and sustainable on-call practices.
- Partner with product teams to define and govern SLOs, error budgets, and reliability targets.
- Improve incident response and MTTR through automated safeguards and runbook maturity.
- Define reliability standards and operational excellence processes.
- Drive Datadog adoption and maturity across the organization.
- Guide reliability architecture for the core product suite.
- Ensure high-quality observability coverage and reliable incident intelligence.
- Support adoption of advanced Datadog practices like SLO/error budgets, forecasting, and anomaly detection.
- Co-own reliability commitments with Product Owners and Engineering Managers.
- Act as an executive point of contact during incidents and reliability reviews.
- Escalate risks early and negotiate innovation versus stability trade-offs.
What We're Looking For
- Proven experience managing SRE or platform engineering teams, including direct people management.
- Strong background in site reliability engineering, observability, and distributed systems.
- Hands-on experience with Datadog or an equivalent large-scale observability platform.
- Demonstrated experience owning SLO frameworks, error budgets, and production incident management.
- Fluent in English.
Nice to Have
- Master's degree in engineering or graduation from an engineering school.
- Advanced expertise in the Datadog ecosystem (enterprise governance, reusable templates, workflows).
- Experience supporting AI-centric platforms and managing third-party dependency risks.
- Familiarity with FinOps-informed reliability planning and cloud cost-performance trade-offs.
- French language skills.
Technical Stack
- Datadog
- AWS, Azure, GCP
- Kubernetes, EKS, AKS
Team & Environment
You will lead SRE engineers within the Cloud Center of Excellence (CCoE) department.
Benefits & Compensation
- Attractive compensation package
- Continuous skills development via training paths, academies, and internal communities
- Inclusive, caring environment that respects work-life balance
- Recognized social and environmental commitment
Work Mode
This is an onsite position located in La Ciotat, France.
Thales, entreprise Handi-Engagée, reconnait tous les talents. La diversité est notre meilleur atout.





