Responsibilities
- Shape and implement the organization's DevOps strategy in alignment with business and technical objectives.
- Supervise and guide a team of DevOps engineers, site reliability engineers, and automation experts.
- Foster a culture centered on automation, ongoing enhancement, and collaboration between development and operations.
- Architect, deploy, and maintain cloud environments that are scalable, secure, and cost-efficient across AWS, GCP, or Azure.
- Manage containerization and orchestration systems using Docker, Kubernetes, and Helm.
- Establish and enforce infrastructure-as-code standards using tools such as Terraform, CloudFormation, or Pulumi.
- Build and sustain CI/CD workflows to enable fast and dependable software delivery via platforms like GitHub Actions, Jenkins, GitLab CI, or ArgoCD.
- Advance automation across deployment processes, monitoring, and self-correcting system behaviors.
- Implement comprehensive observability, monitoring, and log management using ELK, Prometheus, Grafana, or Datadog.
- Integrate security practices into the development lifecycle through DevSecOps principles.
- Enforce security standards including identity and access management, vulnerability detection, and secure handling of secrets via Vault or AWS Secrets Manager.
- Ensure adherence to regulatory and compliance frameworks such as SOC 2, ISO 27001, and GDPR.
- Define and track service-level agreements, objectives, and indicators for system performance and availability.
- Lead incident response, disaster recovery planning, and continuity of operations.
- Conduct performance optimization and capacity forecasting for cloud-based systems.
Other
- All employees must adhere to established information security policies and procedures.
- Security must remain a primary consideration in every aspect of work.