Wipro is looking for a Site Reliability Engineer to design, analyze, develop, and troubleshoot highly-distributed, large-scale production systems and cloud-based services. In this role, you will take ownership of system reliability, security, cost, and performance, and be a key advocate for Infrastructure as Code practices. We’re building a modern Wipro as an end-to-end digital transformation partner and need people inspired by reinvention—of themselves, their careers, and their skills.
What You'll Do
- Perform hands-on design, analysis, development, and troubleshooting of highly-distributed, large-scale production systems and event-driven, cloud-based services.
- Manage a fleet of Linux and Windows VMs as part of application solutions, with a primary focus on Linux Administration.
- Participate in Pull Requests to advance site reliability goals.
- Advocate for and implement IaC (Infrastructure as Code) and CaC (Configuration as Code) practices.
- Own system reliability, uptime, security, cost, operations, capacity, and performance analysis.
- Monitor and report on service level objectives, working with business, technology teams, and product owners to establish key service level indicators.
- Ensure the repeatability, traceability, and transparency of infrastructure automation.
- Support on-call rotations for operational duties not yet addressed by automation.
- Support healthy software development practices, including Agile methodologies, code review standards, and work packaging.
- Create and maintain monitoring technologies and processes to improve application performance visibility and manage operational workload.
- Partner with security engineers to develop plans and automation for responding to new risks and vulnerabilities.
- Develop, communicate, and monitor standard processes for the long-term health of operational development tasks.
What We're Looking For
- 5+ years of experience in system administration, application development, infrastructure development, or related areas.
- 5+ years of experience programming in languages like Javascript, Python, PHP, Go, Java, or Ruby.
- 3+ years reading, understanding, and writing code in the same.
- 3+ years mastery of infrastructure automation technologies like Terraform, Code Deploy, Puppet, Ansible, or Chef.
- 3+ years expertise in container and container-fleet-orchestration technologies like Kubernetes, Openshift, AKS, EKS, Docker, Vagrant, etcd, or zookeeper.
- 5+ years of cloud and container-native Linux administration, build, and management skills.
Technical Stack
- Languages: Javascript, Python, PHP, Go, Java, Ruby
- Infrastructure Automation: Terraform, Code Deploy, Puppet, Ansible, Chef
- Container & Orchestration: Kubernetes, Openshift, AKS, EKS, Docker, Vagrant, etcd, zookeeper
- Platform: Linux





