Berlin, Germany Hybrid Employment

Upvest is hiring an SRE Lead (f/m/d)

About the Role

As the SRE Lead, you'll establish and lead a dedicated reliability engineering practice from the ground up. Your focus will be on building systems that are inherently stable, scalable, and fault-tolerant, ensuring seamless performance as the platform grows. You'll set the technical and cultural foundation for reliability across engineering teams, driving practices that balance innovation with operational excellence.

Key Responsibilities

Define and implement service-level objectives, indicators, and error budgets to align engineering speed with system stability
Design and run chaos engineering exercises to proactively uncover weaknesses before they impact users
Develop comprehensive runbooks and foster a blameless post-incident learning culture
Lead performance testing initiatives, including load simulations and architectural assessments for 10x scale
Automate repetitive operational tasks to reduce toil and free engineers for higher-value work
Establish defensive design patterns such as circuit breakers, rate limiting, and graceful degradation
Validate system behavior under stress using synthetic and adversarial workloads
Integrate observability and resilience practices early in the development lifecycle
Create frameworks that enable rapid iteration without sacrificing system integrity
Recruit, develop, and lead a small team of SREs who amplify reliability practices across engineering
Collaborate with engineering leaders and product teams to elevate how reliability is prioritized and measured

What You Bring

Proven experience in high-pressure domains like FinTech, payments, or critical SaaS platforms where uptime is essential
Deep technical fluency in SLO design, observability tooling, automation, and chaos engineering
Strong grasp of resilient architecture and systems thinking, with a focus on graceful failure modes
Ability to influence stakeholders and align teams without direct authority
Leadership experience in hiring, coaching engineers, and setting technical direction

Nice to Have

Familiarity with investment technology concepts to better align reliability with business needs
Hands-on experience with Golang, Kubernetes, Google Cloud Platform, Postgres, Kafka, or Datadog

Environment & Culture

The organization values transparency, empowerment, and collective progress. You'll work in a flexible hybrid model with team hubs in Berlin, Tallinn, and London, involving periodic in-person collaboration in Berlin. The culture emphasizes curiosity, inclusivity, and shared success, and treats simplifying complexity and owning outcomes as core principles. Continuous learning is supported through development budgets and access to coaching. Diversity and inclusion are actively upheld across all levels of the company.

Required Skills

Golang, Kubernetes, GCP, Postgres, Kafka, Datadog, SLOs, Chaos Engineering, Observability, Automation, Resilience Architecture, Systems Thinking, Stakeholder Management, Technical Leadership, Hiring and Mentoring

About the Company

At Upvest, we are on a mission to make investing as easy as spending money. Upvest empowers businesses to offer a wide range of investment products and the best experience in the field of capital market investment and retirement planning. Upvest’s Investment API is easy to integrate, so fintechs and financial institutions can save resources and focus fully on their core business. We are proud to partner with Europe’s leading fintechs and financial institutions, such as DKB, Revolut, N26, and Raisin. Founded in 2017 by Martin Kassing, Upvest now brings together over 270 talented professionals from more than 70 nationalities. Upvest is backed by €280M in total funding from world-class investors, including BlackRock, Tencent, Sapphire Ventures, Bessemer Venture Partners, Earlybird, Notion Capital, and Motive. Our latest €105M funding round in March 2026, led by Sapphire and Tencent, serves as a massive catalyst for our growth, allowing us to offer a premier investment experience.

Job Details

Department: Engineering

Category: Infrastructure

Posted: 4 months ago
