Remote (Country) USD 140,000 – 180,000 / year

OnePay is hiring a Software Engineer- SRE

About the Role

The Software Engineer in Site Reliability will design, implement, and maintain systems that support scalable and resilient services. This includes automating operations, managing incidents, and improving system performance.

Responsibilities

  • Design and maintain infrastructure for high availability and scalability
  • Implement automated deployment and rollback procedures
  • Monitor system performance and respond to alerts promptly
  • Diagnose and resolve production issues across distributed systems
  • Collaborate with development teams to improve code deployability
  • Optimize system reliability and reduce incident frequency
  • Develop scripts and tools to automate operational tasks
  • Manage configuration and provisioning of cloud resources
  • Participate in on-call incident response rotations
  • Ensure compliance with security and operational standards
  • Track and analyze system metrics to guide improvements
  • Support disaster recovery planning and testing
  • Contribute to post-incident reviews and action follow-ups
  • Maintain documentation for systems and procedures
  • Evaluate new technologies for operational efficiency
  • Work closely with product teams to align infrastructure with business goals
  • Implement and manage CI/CD pipelines
  • Troubleshoot network and service connectivity issues
  • Enforce observability standards across services
  • Drive adoption of best practices in system design
  • Assist in capacity planning and resource forecasting
  • Improve monitoring coverage and alerting precision
  • Reduce technical debt in operational systems
  • Promote a culture of reliability and accountability
  • Stay current with industry trends in cloud and systems engineering

Nice to Have

  • Master’s degree in Computer Science or related field
  • Certifications in cloud platforms or DevOps practices
  • Experience with large-scale production systems
  • Background in security engineering or compliance
  • Contributions to open-source projects
  • Public speaking or technical writing experience
  • Leadership in incident response scenarios
  • Experience with service mesh technologies
  • Knowledge of regulatory standards such as SOC2 or PCI-DSS
  • Familiarity with machine learning operations

Compensation

Competitive salary and benefits package

Work Arrangement

Hybrid work model with flexible scheduling

Team

Collaborative engineering team focused on system reliability and performance

About the Team

The team operates at the intersection of development and operations, ensuring systems are robust, observable, and efficient. Members work closely with engineers across the organization to build reliable infrastructure and promote best practices in deployment and monitoring.

What We Value

We prioritize transparency, continuous learning, and proactive problem-solving. Candidates should demonstrate accountability, adaptability, and a commitment to maintaining high system standards.

Available for qualified candidates

Required Skills
KubernetesAWSTerraformPythonIncident ManagementCloud ArchitectureDistributed Systems
About company
OnePay
OnePay is an all-in-one financial services platform that brings together banking, high-yield savings, credit cards, point-of-sale lending, investing, and crypto in one place. It also partners with employers, HCM providers, gig platforms, and others to deliver embedded financial services to millions of employees and frontline workers.
All jobs at OnePay Visit website
Job Details
Category infrastructure
Posted 7 months ago