Responsibilities
- Establish and document the target-state observability framework, including configuration blueprints, dashboard layouts, and telemetry collection mechanisms for the client's chosen monitoring platform.
- Develop intelligent alerting systems with calibrated thresholds, suppression policies, and dependency relationships to reduce false alarms and maintain operational focus in the NOC.
- Create standardized provisioning blueprints and API-based integrations to enable seamless, hands-off onboarding of sites into the monitoring platform post-migration.
- Work with ITSM/ITIL design leads to define automated workflows from alert to incident ticket, and coordinate with service delivery leadership to align NOC runbooks with new monitoring logic.
- Support architecture governance activities such as design evaluations, approval processes, and the formulation of enterprise-wide architectural standards and reference models.
- Engage with architects across security, applications, and platforms to ensure cohesive, cross-functional system designs.