This role is central to ensuring the robustness, scalability, and operational excellence of our global payments network. As a Lead Network Engineer, you will drive initiatives that enhance network reliability, automate responses to failures, and improve system observability across hybrid and cloud environments. You will lead incident response efforts, mentor engineering teams, and influence long-term architectural decisions to support rapid business growth. Your work will directly impact system uptime, customer experience, and the resilience of mission-critical financial transactions.
Responsibilities
- Lead ongoing evaluations of network infrastructure to assess performance, health, and capacity for critical applications.
- Work with development teams to anticipate scaling needs and align infrastructure with future growth.
- Conduct incident post-mortems with support teams to identify root causes and prevent recurrence.
- Design and execute strategies to address risks from software-infrastructure incompatibilities or recurring failures.
- Promote observability by identifying gaps in monitoring and alerting across environments and technologies.
- Implement solutions to integrate infrastructure telemetry into a unified monitoring dashboard.
- Use automation and AI-driven tools to improve early detection and enable self-healing of network issues.
- Reduce time to detect and mitigate incidents across the network ecosystem.
- Create testing plans for new environments, disaster recovery, and maintenance to validate readiness before production cutover.
- Foster continuous learning and knowledge exchange across networking and infrastructure teams.
- Lead training sessions for engineers and developers on networking components of the platform.
- Assess vendor roadmaps for hardware, firmware, and software upgrades.
- Run proof-of-concept trials to evaluate risks and improvements in upcoming technology releases.
Requirements
- 5 to 10 years of hands-on experience in network engineering, covering operations or design.
- Minimum of 3 years supporting e-commerce, financial services, or large-scale SaaS environments.
- Expert knowledge of TCP/IP, including LAN and WAN architectures.
- Strong understanding of HTTP, DNS, and TLS protocols with troubleshooting capabilities at transport and application layers.
- Solid Linux skills, including system configuration, networking, and diagnostics.
- In-depth knowledge of load balancing and traffic management techniques.
- Proven experience with automation and infrastructure-as-code tools like Ansible and Terraform, and structured data formats such as JSON and YAML.
- Experience using APM tools like Dynatrace or Datadog, and familiarity with OpenTelemetry.
- Proficient in packet-level analysis using tcpdump and Wireshark.
- Strong communication skills with demonstrated ability to lead root cause analysis and coordinate complex troubleshooting.
Nice to Have
- Experience working with private MPLS networks and SD-WAN technologies.
- Advanced skills or certifications in Cisco, Arista, or Check Point platforms.
- Knowledge of Content Delivery Networks and DDoS protection services.
- Familiarity with network telemetry solutions such as SolarWinds and Netscout.
Tech Stack
TCP/IP, LAN, WAN, MPLS, SD-WAN, HTTP, DNS, TLS, CDN, DDoS protection, Linux, Load balancing, Ansible, Terraform, JSON, YAML, Dynatrace, Datadog, OpenTelemetry, tcpdump, Wireshark, SolarWinds, Netscout, Cisco, Arista
Compensation
Competitive salary and benefits package commensurate with experience
Work Arrangement
Hybrid work model with periodic on-call duties due to 24/7 operations
Team
Part of the Payments Network SRE team within Site Reliability Engineering
- Committed to creating an inclusive digital economy that serves people worldwide
- Focused on delivering secure
Additional Information
- We offer flexible work hours and support professional development through training and certification programs.
- Our engineering teams follow Agile methodologies with a strong emphasis on collaboration and innovation.
- We prioritize mental health and wellness with access to counseling services and wellness stipends.
- The role may involve occasional travel for team summits or on-site infrastructure reviews.
- We are committed to environmental sustainability and carbon-neutral operations.
