About the Role
This position involves maintaining and improving software systems in production environments. The engineer will diagnose and resolve live issues, implement fixes, and work closely with development and operations teams to ensure high availability and performance.
Responsibilities
- Monitor and respond to system alerts and incidents in real time
- Diagnose and troubleshoot production software issues
- Deploy and roll back code changes as needed
- Collaborate with engineering teams to identify root causes
- Implement hotfixes and patches for critical issues
- Maintain system uptime and service reliability
- Document incidents and resolution steps
- Participate in on-call rotations
- Support CI/CD pipelines and deployment processes
- Analyze logs and metrics to detect anomalies
- Work with QA to validate fixes in staging environments
- Ensure compliance with security and operational policies
- Optimize system performance and scalability
- Assist in automating operational workflows
- Coordinate with DevOps for infrastructure changes
- Track and report on incident resolution times
- Contribute to post-mortem reviews after outages
- Improve monitoring and alerting systems
- Test disaster recovery procedures
- Support third-party integrations in production
- Update runbooks and operational documentation
- Escalate complex issues to senior engineers
- Verify data consistency across services
- Maintain version control for production configurations
- Assist in onboarding new team members
Nice to Have
- Master’s degree in Computer Science or related field
- Experience in e-commerce or subscription-based platforms
- Background in site reliability engineering
- Knowledge of Terraform or infrastructure as code
- Familiarity with service mesh technologies
- Experience with large-scale data pipelines
- Certifications in cloud platforms
- Prior work in agile development environments
- Exposure to machine learning systems in production
- Contributions to open-source projects
Compensation
Competitive salary and benefits package
Work Arrangement
Hybrid work model
Team
Engineering team focused on product delivery and system reliability
Why This Role Matters
The deployed engineer plays a critical role in maintaining system integrity and user trust. By ensuring rapid resolution of live issues, this position directly impacts customer experience and platform reliability.
What to Expect
You will work in a fast-paced environment where quick decision-making and technical precision are essential. Expect regular interaction with production systems and collaboration across engineering disciplines to prevent and resolve issues.
Available for qualified candidates