Remote (Global) Full-time

Red Hat is hiring a Senior Site Reliability Engineer

About the Role

Red Hat is hiring a Senior Site Reliability Engineer to develop, scale, and operate Azure Red Hat OpenShift managed cloud services. You will contribute to running OpenShift at scale by enabling customer self-service, improving monitoring, and automating tasks.

What You'll Do

  • Contribute code to increase the scalability and reliability of the service
  • Contribute software tests and participate in peer review to increase the quality of our codebase
  • Help develop peers’ capabilities through knowledge sharing, mentoring, and collaboration
  • Participate in a regular on-call schedule, including occasional paid weekends and holidays
  • Practice sustainable incident response and blameless postmortems
  • Resolve customer issues escalated from the Red Hat Global Support team
  • Work within a small agile team to develop and improve SRE software, support your peers, plan work, and continuously improve
  • Explore and experiment with emerging AI technologies relevant to software development, proactively identifying opportunities to incorporate new AI capabilities into existing workflows and tooling

What We're Looking For

  • Bachelor’s degree in Computer Science, Engineering, or related field; equivalent practical experience will also be considered
  • Strong experience (5+ years) in at least one programming language (Golang, C, C++, Python, Java) and software life cycles
  • Hands-on experience with public cloud platforms (AWS, GCP, Azure), preferably Azure
  • Experience with Docker-based containers
  • Strong collaboration and problem-solving skills in distributed, team-based environments
  • Experience troubleshooting as-a-service offerings (SaaS/PaaS) and working with complex distributed systems
  • Working knowledge of Linux/Unix operating systems
  • Proven ability to automate repetitive tasks and debug performance issues
  • Ability to collaboratively troubleshoot and solve problems in a remote and distributed team setting

Nice to Have

  • Direct experience with Kubernetes or OpenShift is a major plus
  • Demonstrated ability to debug and optimize code and to automate routine tasks; 4+ years of experience is desired
  • Strong experience managing Linux servers running Red Hat Enterprise Linux (RHEL), CentOS, or Fedora hosted at a cloud provider such as Microsoft Azure, Amazon Web Services (AWS) or Google Compute Engine (GCE)
  • Strong experience with enterprise systems monitoring; knowledge of Prometheus is a plus
  • Experience with enterprise configuration management software like Ansible by Red Hat, Puppet, or Chef
  • Strong experience delivering a hosted service
  • Demonstrated ability to quickly and accurately troubleshoot system issues
  • Solid understanding of standard TCP/IP networking and common protocols like DNS and HTTP
  • Solid communication skills and experience working directly with and presenting to customers

Technical Stack

  • Languages: Golang, C, C++, Python, Java
  • Cloud: AWS, GCP, Azure
  • Platforms: Kubernetes, OpenShift, Docker
  • Operating Systems: Linux/Unix
  • Monitoring & Automation: Prometheus, Ansible, Puppet, Chef

Team & Environment

You will join a small agile SRE team that is part of a larger global team. Our culture relies on teamwork and openness for its success, and we strive to cultivate a transparent environment that makes room for different voices. We learn from our failures in a blameless environment to support the continuous improvement of the team. Individual contributions have more visibility than at most large companies, and you are encouraged to bring your best ideas, no matter your title or tenure. This open and inclusive environment is built on the open source principles of transparency, collaboration, and inclusion.

Work Mode

This is a global role. The team is spread across 40+ countries.

Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, veteran status, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.

Required Skills
Golang, C, C++, Python, Java, AWS, GCP, Azure, Kubernetes, OpenShift, Site Reliability Engineering, Distributed Systems, Observability, Automation, Linux
About company
Red Hat

Red Hat is the world’s leading provider of enterprise open source software solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies.

Job Details
Category: infrastructure
Posted: 4 months ago