Responsibilities
- Establish and document the future-state observability framework for a new environment, including technical setup templates, dashboard layouts, and data collection methods for the chosen monitoring system
- Develop intelligent alerting mechanisms, including threshold settings, alert suppression policies, and dependency relationships to reduce false alarms and operator overload in the network operations center
- Create standardized provisioning blueprints and API-based connections to enable automatic enrollment of migrated sites into the monitoring platform without requiring manual input
- Work with IT service management design leads to define rules for routing alerts to incident tickets, and coordinate with service delivery leadership during deployment to align operational procedures with updated monitoring data
- Support architecture governance activities such as design evaluations, approval processes, and the creation of enterprise-wide architectural guidelines, standards, and reference frameworks
- Partner with architects across security, application, and platform domains to ensure cohesive and interoperable system designs