About the Role
This role manages complex technical programs in observability, spanning architecture design, migration planning, and execution across engineering teams. The person in this role coordinates cross-functional efforts, tracks progress, and keeps the work aligned with long-term platform goals.
Responsibilities
- Lead end-to-end planning and execution of observability migration projects
- Collaborate with engineering teams to define scalable architecture solutions
- Develop detailed project roadmaps and timelines
- Identify risks and implement mitigation strategies
- Facilitate communication between technical and non-technical stakeholders
- Monitor program KPIs and report on progress
- Ensure compliance with security and operational standards
- Coordinate integration of monitoring tools across platforms
- Manage dependencies across multiple teams and initiatives
- Drive alignment on technical decisions with senior leadership
- Support incident response readiness during migration phases
- Document system architecture changes and decisions
- Optimize workflows for observability data pipelines
- Lead post-implementation reviews and lessons learned
- Work closely with SRE and platform teams on scalability needs
- Advocate for best practices in telemetry and monitoring
- Track budget and resource allocation for programs
- Manage external vendor coordination when applicable
- Ensure documentation is maintained and accessible
- Promote consistency in tooling and observability frameworks
- Drive adoption of new observability standards
- Support change management processes
- Evaluate new technologies for integration potential
- Maintain alignment with cloud infrastructure strategies
- Foster collaboration across geographically distributed teams
Compensation
Competitive market rate
Work Arrangement
Hybrid
Team
Cross-functional engineering and product organization
What We Value
- Collaborative problem solving
- Transparency in communication
- Ownership of project outcomes
- Continuous learning and improvement
- Inclusive team culture
Why This Role Matters
- Critical to scaling observability infrastructure
- Enables faster incident detection and resolution
- Supports reliability and performance goals
- Drives consistency across engineering teams
- Positions the platform for future growth
Available for qualified candidates