Responsibilities
- Define and document the target observability architecture for a greenfield environment, including configuration templates, dashboard layouts, and telemetry polling settings for the chosen monitoring platform.
- Develop intelligent alerting frameworks with threshold tuning, suppression policies, and dependency mapping to reduce false alarms and maintain NOC efficiency.
- Create standardized provisioning templates and API-driven integrations to enable automatic onboarding of sites into the monitoring system post-migration.
- Work with ITSM/ITIL design leads to automate alert escalation to incident tickets and coordinate with service delivery teams to align runbooks with new telemetry models.
- Support architecture governance by participating in design reviews, approving technical blueprints, and shaping enterprise-wide architecture standards and reference models.
- Engage with security, application, and platform architects to ensure cohesive, cross-functional design decisions across technology domains.