Responsibilities
- Lead the management and tuning of Splunk systems to maintain consistent performance and uptime.
- Guide the adoption of Site Reliability Engineering principles to improve system stability and growth capacity.
- Offer technical leadership in setting up and managing Grafana dashboards for live system monitoring and data display.
- Work with diverse teams to deploy ELK stack tools for efficient log handling and insight generation.
- Use Dynatrace AppMon to track application health and resolve performance concerns before escalation.
- Create and execute plans to boost infrastructure effectiveness and minimize service interruptions.
- Perform ongoing evaluations of system performance to detect opportunities for enhancement.
- Promote technical advancement by exploring and adopting emerging tools and methods.
- Ensure infrastructure operations follow recognized industry standards and recommended procedures.
- Organize training and knowledge transfer activities to strengthen team expertise.
- Support initiatives in the media sector by applying technical skills to refine system architectures.
- Advance organizational goals by improving system dependability, leading to better media service delivery.
- Keep updated records of system setups and operational workflows for continuity and reference.
Work Arrangement
Hybrid