About the Role
In this role, you will design, maintain, and optimize streaming infrastructure for high availability, scalability, and fault tolerance. You will collaborate with cross-functional teams to improve system observability, incident response, and automation practices.
Responsibilities
- Design and implement reliable streaming data pipelines
- Monitor system performance and respond to incidents
- Improve fault detection and resolution workflows
- Collaborate with engineering teams to enhance system resilience
- Develop automation tools for operational tasks
- Troubleshoot complex production issues
- Maintain documentation for infrastructure and processes
- Optimize resource utilization and cost efficiency
- Support disaster recovery planning and execution
- Ensure compliance with security and operational standards
- Lead post-incident reviews and implement preventive measures
- Drive improvements in observability and alerting systems
- Evaluate and integrate new technologies into the streaming stack
- Participate in on-call rotations
- Mentor junior engineers and share technical expertise
- Contribute to capacity planning and scalability strategies
- Work closely with product teams to understand data requirements
- Implement and maintain CI/CD pipelines for streaming services
- Enforce best practices in configuration management
- Analyze system metrics to identify performance bottlenecks
- Support deployment of high-throughput data systems
- Ensure data consistency and delivery guarantees
- Collaborate on architectural design decisions
- Promote reliability as a shared team responsibility
- Respond to escalations during critical outages
Compensation
Competitive salary and benefits package
Work Arrangement
Hybrid or remote options available
Team
Part of the infrastructure and reliability team focused on streaming systems
Why This Role Matters
Streaming infrastructure is critical to real-time data delivery across the organization. This role ensures systems remain stable, scalable, and efficient as data volume grows.
What You’ll Bring
Deep technical expertise in streaming platforms, operational discipline, and a proactive mindset for anticipating and resolving system issues before they impact service.
Available for qualified candidates