Responsibilities
- Architect new and existing systems to enhance performance, reliability, and scalability
- Build, implement, iterate over CI/CD pipelines
- Assist with the Management, Development, Design, and Deployment of microservice and containerized applications
- Implement strong security controls in distributed systems/agents
- Coordinate with engineers and developers to automate deployments and configurations across various platforms
- Abstract the complexity of Observability implementation by writing scalable automation
- Identify opportunities for improvement around observability and process
- Standardization and development of alerts/notifications and response to monitoring tools
- Work alongside application teams to implement Observability in day-to-day operations
- Contribute to post-mortems and provide root cause analysis and implementation of resulting action items
- Promote DevOps best-practices within the team
- Participate and promote Agile/Scrum
- Contribute to hybrid cloud production containerization service offering
- Design and implement standards, policies, and procedures for automation and integrations
- Working alongside application subject matter experts, learn our toolsets and suggest/implement new features to streamline operations
Requirements
- Bachelor’s Degree with 7 years’ experience; Master’s Degree with 6 years’ experience’ PhD with 2 years’ experience.
- Treat best practices for security as a requirement, not an afterthought
- Knowledge of Cloud Platform administration (AWS, GCP, Azure)
- Familiarity with Observability pillars
- Experience in working in high-scale environments and understanding of distributed architectures
- Knowledge of Agile / DevOps methodologies
- Experience with CI/CD tools (Github Actions, Bamboo, Jenkins, Azure DevOps)
- Familiarity with running docker workloads using orchestration tools (Kubernetes / Amazon ECS)
- Ability to work both independently without direction and within a group for day-to-day activities
- Passion for learning new concepts and processes quickly, and adapting to a changing environment
- Comfortable working in and administering Linux and Windows environments
Nice to Have
- Exposure and implementation of SPIRE/SPIFFE
- Direct experience with Terraform/Crossplane
- Proficiency working with development tools and scripting languages (git / mercurial / subversion; Python / Elixir / Go)
- Integrating MCP Servers with authorization controls
- Knowledge of database management systems (NoSQL, Relational Databases, and associated query languages)
- AWS Cloud Practitioner / Azure AZ-900 Certification
- Deep experience in implementation and design of serverless architecture solutions
- Demonstrated experience in deployment of containerized applications (Kubernetes, etc)
- Experience with data management and pipeline technologies (Apache Storm, Kafka, Flink, Spark, Hadoop, etc)
- Prior experience working in an Agile team
- Solid understanding of observability solutions using OpenTelemetry, Prometheus/Grafana or similar application
- Excellent understanding of distributed system architectures and telemetry
- Excellent Experience in Deploying and managing large Kubernetes Distributed Platforms
- Proficiency in GitOps practices and Infrastructure as Code systems (such as Terraform, ArgoCD, Helm)
Team
Structure: The CSE Team, working within the Cyber Security Operations (CSO) function, is responsible for designing, deploying, maintaining, and optimizing the tool sets in use by the Information Security teams.
Additional Information
- This job is eligible to participate in our short-term incentive programs.
- We offer a comprehensive package of benefits including paid time off (vacation, holidays, sick), medical/dental/vision insurance and 401(k) to eligible employees.


