Prague, Prague, Czech Republic Remote (City)

Barclays is hiring a Site Reliability Engineer

Responsibilities

Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning.
Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring.
Development of tools and scripts to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience.
Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning.
Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure smooth and efficient operations.
Stay informed of industry technology trends and innovations, and actively contribute to the organization's technology communities to foster a culture of technical excellence and growth.

Requirements

Hands-on experience with Elastic Stack (Elasticsearch, Kibana, Logstash/Beats).
Strong understanding of observability & monitoring (metrics, logs, traces, APM).
Experience with defining and configuring dashboards, alerts, and SLI/SLOs.
Basic infrastructure-management exposure (capacity planning, performance insights, scaling, monitoring).

Nice to Have

Experience with DevOps tools: GitLab, TeamCity, CI/CD pipelines.
Scripting/programming in Python, Java or C#.
Basic Linux experience.
Exposure to additional monitoring tools (Grafana, Prometheus, Splunk, etc.).

About company

All jobs at Barclays Visit website

Job Details

Department Information Technology

Category infrastructure

Posted 4 months ago

Similar Jobs

Other opportunities you might be interested in

Database Platform Engineer

Proton

Contact Center Production Control Engineer (Amazon Connect preferable)

Miratech

Gurgaon Remote (Global)

DevOps & Site Reliability Engineer

Oowlish Technology

Brasília Remote (Global)

Software Engineer - Observability

Scaleway

KTO - Platform Engineer - SRE - Lever

KTO

Porto Alegre Remote (Country)

Senior Engineer - Site Reliability Engineering

Relativity

Kraków Remote (Global)

Related Articles

Insights related to this role

Data center rack with network switches and fiber connections, illustrating automated network deployment using CI/CD and network configuration as code.

Network Configuration as Code: CI/CD for Automation | NVIDIA

4 min 3 months ago

Workspace setup for an AI developer, showing dual monitors with code and neural networks, symbolizing the AI developer career path.

Become an AI Developer: Your Career Guide

5 min 2 months ago