Extreme Networks is looking for a Principal Cloud Operations Engineer to provide technical leadership in cloud architecture, operational excellence, and cost optimization for our large-scale production environments. You will stay current with industry trends and leverage AI technologies and cloud platforms to improve efficiency, reliability, security, and scalability.
What You'll Do
- Provide technical leadership in cloud architecture, operational excellence, reliability, and cost optimization across large-scale production environments.
- Stay current with industry trends and best practices, and leverage AI technologies and cloud service provider platforms (AWS, Google Cloud, and Azure) to improve operational efficiency, scalability, security, and resiliency.
- Design and maintain secure, reliable, and high-performance communication across multiple regions and cloud service providers.
- Configure, tune, and operate middleware services, including SQL and NoSQL databases, messaging and streaming platforms, and related infrastructure components.
- Evaluate, recommend, and lead the adoption of CloudOps and DevOps tools, platforms, and automation solutions.
- Troubleshoot complex production infrastructure and application issues, providing deep technical expertise and hands-on support when required.
- Drive root cause analysis (RCA), implement corrective actions, and establish preventive measures to avoid recurrence.
- Collaborate closely with engineering and cloud architects in system design discussions, architecture reviews, and whiteboard sessions.
- Partner with Development, QA, SRE, and external service providers or carriers to resolve issues and improve system reliability.
- Design, implement, and evolve deployment automation platforms for Kubernetes-based microservices.
- Improve service availability, performance, and scalability through automation, tooling, capacity planning, and process improvements.
- Analyze system and service performance, identify bottlenecks, and deliver actionable recommendations to improve efficiency and resilience.
What We're Looking For
- BS-level technical degree required; Computer Science or Engineering background preferred.
- 8+ years of experience in a CloudOps/DevOps role.
- Hands-on experience with AWS or another public cloud (Azure, GCP, etc.).
- Knowledge of Linux, security, and networking fundamentals.
- Working knowledge of container-based architecture and deployment (Docker, Kubernetes).
- Working knowledge of deployment automation development (Terraform, Helm, ArgoCD).
- Experience in diagnosing and resolving complex application problems.
- Working knowledge of Elasticsearch, PostgreSQL, Redis, Ignite, Flink, Kafka, and RabbitMQ.
- Experience with monitoring tools (Nagios, Grafana, Prometheus).
- Strong follow-through and initiative to stay with issues until they are resolved.
- Comfortable working within a distributed team located in multiple time zones.
Nice to Have
- Experience with cloud security and compliance implementation.
Technical Stack
- Cloud: AWS, Google Cloud, Azure
- OS & Infrastructure: Linux, Docker, Kubernetes
- Automation & IaC: Terraform, Helm, ArgoCD
- Data & Messaging: Elasticsearch, PostgreSQL, Redis, Ignite, Flink, Kafka, RabbitMQ
- Monitoring: Nagios, Grafana, Prometheus
Team & Environment
This role is part of the Cloud Operations team, collaborating with Development, QA, SRE, and external service providers. It operates within a globally distributed team across multiple time zones.
Work Mode
This is a global position.
Extreme Networks is committed to fostering an inclusive workplace that embraces our differences. We encourage people from underrepresented groups to apply. In keeping with our values, no employee or applicant will face discrimination or harassment based on race, color, ancestry, national origin, religion, age, gender, marital or domestic partner status, sexual orientation, gender identity, disability status, or veteran status.