Responsibilities
- Builds and improves performance testing systems that produce actionable metrics and reports, using proven observability practices to detect system constraints and problems.
- Analyzes system architecture, deployment environment layout, and Kubernetes platform data using observability tools to verify sufficient capacity and system uptime.
- Implements Agile methodologies to continuously refine workflows, boost efficiency, and maintain high-quality outputs.
- Collaborates across organizational levels to support enterprise-wide site reliability engineering (SRE) efforts and communicate progress to stakeholders.
- Manages task prioritization in rapidly changing environments while consistently meeting project deadlines.
- Promotes teamwork and drives the integration of new technologies and procedures within the engineering group.
- Conducts independent, in-depth technical evaluations across several projects that support business-wide objectives.
- Contributes to the team's shared knowledge base and technical expertise in the SRE domain.
- Provides guidance and mentorship to less experienced team members.


