Oversee the ongoing reliability and performance of hybrid server platforms, ensuring seamless integration between on-premises systems and cloud-hosted services. This role centers on maintaining secure, scalable infrastructure through proactive administration, incident leadership, and continuous optimization of both Windows and Linux environments.
What You'll Do
- Manage and troubleshoot server infrastructure across Windows and Linux, covering deployment, configuration, patching, and system upgrades.
- Provision and monitor Azure resources including virtual machines, virtual networks, storage, backup, and monitoring services to support workload availability and performance.
- Respond to critical infrastructure incidents, lead root-cause investigations, and implement long-term fixes to improve system resilience.
- Enforce security standards by identifying vulnerabilities in server and network components and coordinating timely remediation with security teams.
- Automate repetitive operational tasks using PowerShell and command-line tools to increase efficiency and reduce manual intervention.
- Support hybrid connectivity through identity synchronization, DNS, and network integration between cloud and on-prem environments.
- Collaborate with network, security, and application teams to deliver infrastructure changes, deployments, and lifecycle upgrades.
- Maintain and enhance monitoring, backup validation, and disaster recovery procedures to ensure system recoverability.
- Follow structured IT service processes for incident, change, and problem management, keeping documentation and diagrams up to date.
- Lead or support infrastructure projects such as cloud migrations, standardization initiatives, and platform refreshes.
Requirements
- Proven experience administering Windows Server and Linux systems, including configuration, patching, and troubleshooting.
- Strong hands-on background with Azure infrastructure services such as VMs, VNets, NSGs, load balancers, Backup, and Monitor.
- Experience diagnosing and resolving complex infrastructure incidents and leading post-mortem analyses.
- Working knowledge of hybrid environments, including identity, networking, and on-prem virtualization (VMware or Hyper-V).
- Proficiency in PowerShell and command-line automation for operational workflows.
- Familiarity with ITIL practices for incident, change, and problem management.
- Ability to coordinate with internal teams and vendors to deliver infrastructure solutions.
- Experience securing server environments and remediating vulnerabilities in collaboration with security stakeholders.
- Strong documentation skills, including runbooks, risk tracking, and technical diagrams.
Preferred Qualifications
- Understanding of on-prem virtualization platforms such as VMware or Hyper-V.
- Exposure to Azure PaaS services that support server-based applications, including managed databases.
- Contributions to operational documentation and cross-team remediation planning.