New York City, Hybrid, Full-time, USD 170,000 – 213,000 / year

FanDuel is hiring a Staff Observability Engineer

Responsibilities

  • Help shape and guide the organization's observability strategy and long-term roadmap in alignment with business objectives and technical priorities.
  • Design and enhance scalable observability solutions that deliver meaningful insights into system performance, health, and user interactions.
  • Define and promote standardized practices for monitoring, alerting, incident response, and post-incident reviews across teams.
  • Advance operational excellence by refining incident management processes, on-call procedures, and postmortem follow-ups to drive systemic improvements.
  • Lead collaborative efforts across teams to strengthen end-to-end system reliability by identifying and resolving systemic risks.
  • Use automation and AI-powered tools to speed up root cause analysis and reduce repetitive operational tasks at scale.
  • Collaborate with engineering and product leaders to turn observability data into actionable inputs for strategic planning.
  • Analyze patterns in system and user behavior to anticipate, prevent, and minimize widespread outages or issues.
  • Improve observability platforms for efficiency, cost-effectiveness, and sustainable growth.
  • Coach and guide engineers to elevate the organization’s overall reliability and observability capabilities.
  • Perform additional duties as assigned to support operational adaptability and changing business demands.

Work Arrangement

Hybrid

About company
FanDuel

FanDuel is redefining the way the world engages with digital, tech-driven gaming and entertainment by delivering memorable experiences that bring fans even closer to the games they love.

We’re America’s #1 Sportsbook and the premier mobile gaming company in North America, consisting of a portfolio of leading brands across mobile wagering, including FanDuel Sportsbook, FanDuel Casino, and FanDuel Racing, the industry’s unquestioned leader in horse racing. FanDuel also operates FanDuel TV, its broadly distributed television network with hit shows “Up & Adams” and “Run It Back.”

FanDuel Group is a subsidiary of Flutter Entertainment, the world’s largest sports betting and gaming operator, which has a portfolio of globally recognized brands and is traded on the New York Stock Exchange (NYSE: FLUT).

Job Details
Department Platform Engineering
Category infrastructure
Posted 2 hours ago