About the Role
Role details below.
Responsibilities
- Using automation and Infrastructure as a Code (IaC) to continuously improve the reliability, scalability, and performance of services deployed on AWS.
- Performance tuning and configuration of both Linux system and application parameters supporting highly concurrent web stacks.
- Manage infrastructure through code using configuration management and IaC templating software such as Terraform and Puppet.
- Documenting procedures and knowledge base articles throughout problem resolution and architecture development processes.
- Monitoring the availability, performance and health of production systems in support of meeting service level objectives using monitoring systems such as Icinga, Prometheus, Grafana, CloudWatch, and Loki.
- Participating in emergency incident response on-call rosters.
- Practicing blameless postmortems that lead to improvements in resiliency and reductions in alert fatigue.
Requirements
- In depth experience of AWS services (RDS (Relational Database Service), EC2, Autoscaling groups, S3, Deploying Lambda, Aurora PostgreSQL, WAF, NAT GW, ALB, etc.)
- Analytical problem-solving methodology with an outstanding ability to communicate and document effectively
- Minimum of three years in Linux system administration with experience automating system processes with a variety of scripting languages or equivalent skills
- Practical experience analyzing and troubleshooting large-scale, multi-region deployments in a public cloud (i.e. AWS)
- Practical experience with IaC, CI/CD, structured configuration such as JSON or YAML, and version control solutions (git)
- Experience in cloud deployment and management tools (e.g. Terraform, Puppet, Chef, Ansible)
- Familiarity with one or more programming or scripting languages (Python/ PHP)
- Experience with LAMP stack: Linux, Apache, MariaDB/PostgreSQL/Aurora MySQL
- Experience in database administration along with a fundamental understanding of structured query language
- Knowledge of standard network/application protocols like HTTPS, SMTP, DNS, VPN
- A BS in Computer Science or a related field such as engineering or mathematics and 3+ years of work experience in Information Technology, or 5+ years of work experience in Information Technology overall
- Fluency in written and spoken English
Nice to Have
- Proven experience optimizing cloud infrastructure and reducing cloud hosting costs, ideally in AWS
- Experience using tools such as CloudHealth or AWS Cost Explorer.
- A BA in Computer Science or related field, or equivalent work experience
- Demonstrable scripting experience, preferably in PHP
- Experience with network and/or Infrastructure security
- Prior experience within the education industry and/or with e-learning technologies
- Experience managing large scale backend M