Fidelity is hiring a Principal Site Reliability Engineer to design and maintain performance test frameworks, implement monitoring solutions, and ensure the scalability of our industry-leading financial platforms. You will solve complex performance and stability challenges to keep our systems high-performing.
What You'll Do
- Create and maintain performance test cases, suites, and frameworks using tools like LoadRunner, CloudTest, JMeter, Locust, and k6.
- Integrate performance tools into CI/CD pipelines using Jenkins and GitHub.
- Support the migration and tuning of cloud-based applications on AWS, including ECS and EKS.
- Build monitoring frameworks and dashboards to improve visibility using Splunk and Datadog.
- Apply a thorough understanding of microservices architecture on Docker and Kubernetes to provide tuning recommendations and triage incidents.
- Build and improve standard methodologies for performance, load, stress, and chaos testing, along with analytics and reporting.
- Establish and enhance application performance testing frameworks to provide actionable metrics.
- Review environment topology, software architecture, and Kubernetes platform metrics to ensure adequate capacity headroom and availability.
- Apply Agile principles for continuous improvement on processes, efficiency, and quality.
- Coordinate cross-business SRE initiatives to track and report status.
- Prioritize workloads in fast-paced dynamic environments and meet deadlines.
- Champion collaboration and adoption of new tools and processes within the team.
- Perform independent and complex technical analysis for multiple projects across business units.
- Contribute to the team’s documented knowledge base and expertise.
- Coach junior members of the team.
What We're Looking For
- A Bachelor’s degree in Computer Science, Engineering, Information Technology, or a closely related field (or foreign education equivalent) and five (5) years of experience as a Principal Site Reliability Engineer executing software performance engineering of online financial systems in a DevOps environment.
- Or, a Master’s degree in a related field and three (3) years of the same experience.
- Demonstrated expertise orchestrating software performance benchmarking on online financial web and mobile applications across scrum teams, deployed on AWS and Azure, using CloudTest, JMeter, Locust, and k6 with Jenkins and GitHub in a cloud Kubernetes environment.
- Demonstrated expertise creating and monitoring dashboards using Splunk and Datadog for performance benchmarking, and analyzing hardware and system performance metrics on Apache, Java, Angular, and Node.js platforms deployed on Unix and Windows.
- Demonstrated expertise identifying performance gaps and triaging incidents in capacity and infrastructure configurations, providing corrective recommendations using observability and performance tools.
- Demonstrated expertise conducting resiliency, chaos, and failure testing on financial software applications using Chaos Mesh, Gremlin, AWS Fault Injection Service, Splunk, and Datadog, and virtualizing backends using WireMock.
Technical Stack
- Performance Tools: LoadRunner, CloudTest, JMeter, Locust, k6
- CI/CD & Source Control: Jenkins, GitHub
- Cloud & Infrastructure: Amazon Web Services (AWS), Elastic Container Service (ECS), Elastic Kubernetes Service (EKS), Docker, Kubernetes
- Monitoring & Observability: Splunk, Datadog
- Software Platforms: Apache, Java, Angular, Node.js
- Operating Systems: Unix, Windows
- Chaos & Testing Tools: Chaos Mesh, Gremlin, AWS Fault Injection Service, WireMock
Work Mode
This role follows a hybrid work model.
Fidelity is an equal opportunity employer.