About the Role
The role involves owning and improving the reliability of critical systems within a decentralized network environment, ensuring high availability and rapid incident response.
Responsibilities
- Monitor and maintain the health of Solana-based blockchain infrastructure
- Design and implement scalable automation for system operations
- Lead incident response and conduct post-mortem analyses
- Optimize system performance and reduce latency across distributed nodes
- Collaborate with engineering teams to enhance system resilience
- Develop and maintain comprehensive monitoring and alerting systems
- Ensure infrastructure meets security and compliance standards
- Drive improvements in deployment reliability and rollback procedures
- Support on-call operations with minimal service disruption
- Document system architecture and operational runbooks
- Troubleshoot complex production issues across multiple environments
- Evaluate and integrate new tools for observability and diagnostics
- Improve CI/CD pipelines for reliability and speed
- Contribute to capacity planning and resource forecasting
- Enforce best practices in configuration management
- Participate in system design reviews for new features
- Maintain uptime and service level objectives
- Reduce mean time to detection and resolution
- Promote a culture of operational discipline
- Mentor engineers in reliability practices
- Work closely with developers to refine service ownership
- Ensure systems are resilient under high-load scenarios
- Implement proactive failure testing and chaos engineering
- Support audit processes and system validations
- Stay current with blockchain and infrastructure trends
Compensation
Competitive salary and equity package commensurate with experience
Work Arrangement
Remote-first with flexible scheduling; some global coordination required
Team
Part of a distributed engineering team focused on blockchain infrastructure reliability
Why Solana?
Solana offers a high-performance blockchain platform enabling fast, secure, and scalable decentralized applications, making it a key focus for infrastructure development.
Our Engineering Culture
We emphasize ownership, transparency, and continuous learning, with a strong focus on production excellence and collaborative problem-solving.
Not available


