Dearborn, Michigan, United States On-site Employment

Ford is hiring a Site Reliability Engineer (SRE)

About the Role

Ford Motor Company is seeking a Site Reliability Engineer to join our team. In this role, you will be instrumental in elevating the performance and dependability of our Marketing and Sales Tech platform and applications. Your work will directly impact the smooth operation and evolutionary growth of our technology landscape.

What You'll Do

  • Participate in a 24/7 on-call rotation, providing rapid response to critical incidents.
  • Diagnose, troubleshoot, and resolve complex production issues to reduce Mean Time to Recovery.
  • Execute and contribute to the continuous improvement of operational runbooks and Standard Operating Procedures.
  • Lead and participate in blameless post-mortems and Root Cause Analysis sessions.
  • Partner with cross-functional teams to architect long-term reliability solutions.
  • Define and track Service Level Indicators and Objectives to measure service health.
  • Collaborate with Product Owners to establish service levels and manage Error Budgets.
  • Provide critical analysis during monthly release reviews on service health impact.
  • Leverage and optimize Ford’s observability suite to monitor system health and proactively identify anomalies.
  • Identify observability blind spots and implement solutions for comprehensive system visibility.
  • Manage metric collection, dashboard creation, and alert definitions using Terraform.
  • Design robust notification strategies and thresholds for KPI/SLO violations.
  • Champion automation by developing scripts, tools, and streamlined workflows to eliminate manual tasks.
  • Design and implement self-healing mechanisms to automatically remediate common failures.
  • Implement and manage AI-driven observability solutions for proactive monitoring.
  • Coordinate with platform and engineering teams to resolve production bottlenecks.
  • Deliver clear, data-driven status reports on system health and SRE initiatives to leadership.

What We're Looking For

  • Bachelor’s degree in computer science or a related field.
  • Minimum of 5 years of professional experience in Site Reliability Engineering or DevOps.
  • Deep hands-on experience with Google Cloud Platform (specifically Cloud Run and GKE) and with OpenShift.
  • Advanced proficiency in Terraform, including writing reusable modules and automating infrastructure.
  • Experience building comprehensive system observability from the primary telemetry types: metrics, events, logs, and traces.
  • Hands-on experience with Dynatrace or similar APM tools for distributed tracing and profiling.
  • Proficiency in at least one high-level programming language (Java, Node.js, Python, or Go).
  • Proven experience managing high-severity incidents and the full incident lifecycle.

Technical Stack

  • Google Cloud Platform (GCP), Cloud Run, GKE, OpenShift
  • Terraform
  • Dynatrace
  • Java, Node.js, Python, Go

Team & Environment

You will collaborate with diverse teams across the organization, including cross-functional development and platform teams.

Work Mode

This position is on-site.

Ford is an equal opportunity employer.

Required Skills
Google Cloud Platform, Cloud Run, GKE, OpenShift, Terraform, Dynatrace, Java, Node.js, Python, Go, Site Reliability Engineering, DevOps, Infrastructure as Code, System Observability
About company
Ford

Ford Motor Company is an established global automotive manufacturer building a better world through innovative, exciting, and sustainable products and services. The company advances technologies in autonomy, electrification, and smart mobility.

Job Details
Department: Engineering
Category: Infrastructure
Posted: 14 days ago