Responsibilities
- Support and operate physical datacenter infrastructure, including servers, storage, networking, and rack-and-stack activities.
- Manage and maintain VMware vSphere / ESXi environments, including host builds, upgrades, patching, and troubleshooting.
- Administer enterprise storage platforms such as Pure Storage and/or NetApp, including provisioning, snapshots, replication, and performance monitoring.
- Perform hardware lifecycle activities: installs, replacements, firmware upgrades, and decommissioning.
- Participate in planned maintenance, change control, and incident response for datacenter systems.
- Build infrastructure-as-code software and other tools that give software teams a foundation for rapidly building large-scale cloud systems.
- Establish a golden AMI pipeline and maintain a continuous patching schedule for all cloud resources.
- Operationalize the resiliency and disaster recovery processes for the IaaS landscape.
- Automate key operational and security processes for IaaS workloads.
- Manage infrastructure through the configuration management platform to ensure security and full-stack automation for applications, and drive adoption of the platform and its standards across all applications.
- Drive investigation and resolution of security incidents impacting cloud infrastructure.
- Work closely with the security team to address vulnerabilities, gather evidence, and secure our infrastructure.
- Handle the security hardening efforts for major OS distributions to ensure overall compliance and a streamlined service for internal clients.
- Participate in the rotating on-call schedule.
- Ensure that user emergencies, platform alerts, and support requests are addressed.
- Identify and solve problems in cloud and hybrid environments, creating and implementing new solutions from scratch where needed.
- Mentor and develop less experienced engineers.
Requirements
- 3+ years’ hands-on experience working in a physical datacenter environment.
- Minimum of 2 days on-site in our datacenter; additional days may be required based on business needs.
- Based in the Raleigh-Durham (RDU) area, North Carolina.
- Strong experience with VMware (vSphere / ESXi) in a production setting.
- Experience administering enterprise storage platforms, preferably Pure Storage and/or NetApp.
- Familiarity with server hardware (Dell, HPE, or similar), RAID, BIOS/firmware management.
- Experience following change management and operational best practices.
- Knowledge of and experience with high availability and scalability.
- Experience with AWS foundations, including compute, networking, storage, and security.
- Experience architecting containerized solutions in cloud environments using services like ECS or EKS.
- A strong background in systems engineering, especially with tools like Docker and Kubernetes in Linux and containerized environments.
- Expertise in implementing content delivery solutions at the edge.
- A thorough understanding of IAM roles, access management, DNS, load balancing, routing, firewalls, and monitoring tools for cloud and hybrid environments.
- Experience deploying and supporting AWS network services, including VPC, Subnets, Route Tables, NACLs, Security Groups, TGW, GWLB, VPC Endpoints, and Route 53.
- Knowledge of AWS compute, data source, and security technologies and services, including EC2, S3, IAM, ECS, EKS, Load Balancers, SCP, RAM, CloudWatch, CloudTrail, and WAF.
- Experience architecting and securing regulated, enterprise-class cloud services under compliance frameworks like SOC 2, NIST, and ISO.
- Experience with logging and monitoring systems like Datadog and CloudWatch.
- Familiarity with large systems and the ability to diagnose them: how they work and operate at scale, including their edge cases, failure modes, and behaviors.
- Proficient in writing code/scripting in languages like.
- Experience automating infrastructure with CDK, Terraform, and Ansible, and containerizing workloads using EKS or ECS.
- Ability to diagnose complex distributed systems problems, whether they originate in the system, network, or code.
Nice to Have
- You have a bachelor's degree or higher in Computer Science or a related field.
- You are capable of operating in a highly collaborative/Agile environment with other team members as well as all software engineering teams.
- You have the ability to operate independently and self-prioritize work.
- You are a team player and an exceptional communicator.
- You enjoy learning new technologies and help foster a collegial environment of continuous improvement and innovation.
Work Arrangement
Hybrid
Additional Information
- Travel Required: Up to 10%
- Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities required of the employee for this job. Duties, responsibilities, and activities may change at any time with or without notice.