Cognizant is looking for a Site Reliability Engineer (Infra Dev Specialist) to optimize and maintain our infrastructure systems. You will ensure seamless operations, drive innovation and efficiency, and enhance media domain capabilities within our hybrid work environment.
What You'll Do
- Lead the administration and optimization of Splunk environments to ensure high availability and performance.
- Oversee the implementation of Site Reliability Engineering practices to enhance system reliability and scalability.
- Provide expert guidance in configuring and maintaining Grafana dashboards for real-time monitoring and visualization.
- Collaborate with cross-functional teams to integrate ELK stack solutions for effective log management and analysis.
- Utilize Dynatrace AppMon to monitor application performance and proactively address potential issues.
- Develop and implement strategies to improve infrastructure efficiency and reduce downtime.
- Conduct regular assessments of system health and performance, identifying areas for improvement.
- Drive innovation by researching and implementing new technologies and methodologies.
- Ensure compliance with industry standards and best practices in infrastructure management.
- Facilitate knowledge sharing and training sessions to enhance team capabilities and understanding.
- Support media domain projects by leveraging technical expertise to optimize infrastructure solutions.
- Maintain documentation of infrastructure processes and configurations for future reference.
What We're Looking For
- Extensive experience in Splunk Admin, demonstrating ability to manage complex environments.
- Strong background in Site Reliability Engineering, showcasing skills in system reliability enhancement.
- Proficiency in Grafana with a proven track record of creating effective monitoring solutions.
- Expertise in the ELK stack, highlighting ability to manage and analyze large volumes of data.
- Knowledge in Dynatrace AppMon with experience in application performance monitoring.
- AWS core services expertise (EC2, S3, IAM, VPC, CloudWatch, Lambda).
- Snowflake operations tuning, cost governance, and incident response.
- dbt operations & model dependency debugging with Grafana and Splunk.
Nice to Have
- Experience in the media domain, providing insights into industry-specific infrastructure needs.
- Adaptability to a hybrid work model, ensuring effective collaboration and productivity in diverse environments.
Technical Stack
- Monitoring & Observability: Splunk, Grafana, ELK stack, Dynatrace AppMon
- Cloud: AWS EC2, AWS S3, AWS IAM, AWS VPC, AWS CloudWatch, AWS Lambda
- Data: Snowflake, dbt
Work Mode
This role follows a hybrid work model.
Cognizant is an equal opportunity employer.


