Site Reliability Engineer (SRE) - Compute at Scaleway (Expired)

Join a team dedicated to building and maintaining resilient, high-performance infrastructure for a sovereign cloud platform. As a Site Reliability Engineer, you'll play a key role in shaping the foundation that powers a wide range of cloud and AI services. Your work will directly influence system reliability, security, and scalability across multiple regions.

What You’ll Do

Design, standardize, and maintain core infrastructure supporting a diverse product portfolio.
Automate deployment and configuration using GitOps and Infrastructure as Code tools like ArgoCD, FluxCD, and Ansible.
Enhance observability by managing monitoring, alerting, and metrics pipelines with Prometheus, Thanos, and Grafana.
Support product teams during onboarding, ensuring smooth integration with shared platforms.
Deploy and maintain systems across geographic regions, ensuring consistency and compliance.
Participate in a weekly on-call rotation, contributing to incident response and post-mortem analysis.
Strengthen security practices, including secret management with Vault and adherence to compliance standards.
Refine CI/CD pipelines and improve technical documentation to support sustainable operations.
Collaborate across disciplines to bridge infrastructure and development, fostering shared ownership and reliability.
Lead initiatives for continuous improvement, focusing on efficiency, resilience, and operational excellence.

What We’re Looking For

Proven background in systems administration, with at least 7 years of hands-on experience.
Deep familiarity with Kubernetes and modern GitOps workflows.
Strong scripting and automation skills using tools such as Ansible, Salt, or GitLab CI.
Experience with observability stacks and monitoring best practices.
Solid understanding of networking, security principles, and Infrastructure as Code methodologies.
Comfort with on-call responsibilities and a clear grasp of SLAs, SLOs, and error budgeting.
Active listener who values collaboration and clear communication across teams.
Practical problem-solver who balances innovation with operational stability.
Detail-oriented mindset with a commitment to precision in configuration and deployment.
Open to feedback and new ideas, with a drive to continuously learn and improve systems.

Work Environment

This role follows a hybrid model, allowing up to three days of remote work per week. Offices are located in Paris, Lille, Toulouse, Rennes, Rouen, Bordeaux, and Lyon. Each offers modern workspaces, outdoor areas, and easy access to public transit. The team is diverse and international, and English is the primary working language.

Our Commitment

Support for work-life balance, including flexible schedules and wellness benefits.
Access to fitness facilities, childcare support, and discounted personal services.
Internal mobility across a growing tech ecosystem.
Commitment to sustainable computing: our data centers run entirely on renewable energy.
Industry-recognized certifications in ecological responsibility.
A platform powered by modern bare metal and cloud-native technologies, serving over 100 public cloud products.
