Remote (Global)

P2P.org is hiring a Staff SRE - Solana

About the Role

The role involves owning and improving the reliability of critical systems within a decentralized network environment, ensuring high availability and rapid incident response.

Responsibilities

  • Monitor and maintain the health of Solana-based blockchain infrastructure
  • Design and implement scalable automation for system operations
  • Lead incident response and conduct post-mortem analyses
  • Optimize system performance and reduce latency across distributed nodes
  • Collaborate with engineering teams to enhance system resilience
  • Develop and maintain comprehensive monitoring and alerting systems
  • Ensure infrastructure meets security and compliance standards
  • Drive improvements in deployment reliability and rollback procedures
  • Support on-call operations with minimal service disruption
  • Document system architecture and operational runbooks
  • Troubleshoot complex production issues across multiple environments
  • Evaluate and integrate new tools for observability and diagnostics
  • Improve CI/CD pipelines for reliability and speed
  • Contribute to capacity planning and resource forecasting
  • Enforce best practices in configuration management
  • Participate in system design reviews for new features
  • Maintain uptime and service level objectives
  • Reduce mean time to detection and resolution
  • Promote a culture of operational discipline
  • Mentor engineers in reliability practices
  • Work closely with developers to refine service ownership
  • Ensure systems are resilient under high-load scenarios
  • Implement proactive failure testing and chaos engineering
  • Support audit processes and system validations
  • Stay current with blockchain and infrastructure trends

Compensation

Competitive salary and equity package commensurate with experience

Work Arrangement

Remote-first with flexible scheduling; some global coordination required

Team

Part of a distributed engineering team focused on blockchain infrastructure reliability

Why Solana?

Solana offers a high-performance blockchain platform enabling fast, secure, and scalable decentralized applications, making it a key focus for infrastructure development.

Our Engineering Culture

We emphasize ownership, transparency, and continuous learning, with a strong focus on production excellence and collaborative problem-solving.

Required Skills

Linux, Ansible, Docker, Terraform, GCP, AWS, Go, Rust, Python, Site Reliability Engineering, Blockchain, Solana, Monitoring, CI/CD, Distributed Systems

About company
P2P.org
P2P.org is the largest institutional staking provider, with a TVL of over $10B and a market share exceeding 20% in restaking. The company researches and improves infrastructure to maximize APR and security across networks such as ETH, SOL, and DOT, as well as new launches including TON, Avail, Monad, Babylon, Story, and Berachain. It offers products such as a unified API, widgets, custom dApps, and yield aggregators, and is expanding into RWA, data, yield, and service products for banks, exchanges, custodians, and wallets.
Job Details
Category: Infrastructure
Posted: 6 months ago