Booking Holdings is hiring a Site Reliability Engineer II to take end-to-end ownership of services and build software applications, systems, and tools. This role is central to improving system reliability, automating operations, and applying critical thinking for process improvement across our technology landscape.
What You'll Do
- Build software applications using relevant development languages, systems, services, and tools.
- Refactor and simplify code by introducing design patterns when necessary.
- Ensure application quality by following standard testing techniques and methods.
- Write readable and reusable code by applying standard patterns and using standard libraries.
- Maintain data security, integrity, and quality by following company standards and best practices.
- Evaluate possible architecture solutions considering cost, business requirements, technology requirements, and emerging technologies.
- Describe the implications of changing or adding systems by understanding the infrastructure and architecture.
- Help grow the business and accelerate software development by applying engineering techniques.
- Design solutions that meet current requirements and are adaptable for future enhancements.
- Own a service end-to-end by monitoring application health and performance, setting metrics, and acting when violated.
- Reduce business continuity risks by applying practices and tools, and writing documentation like runbooks and OpDocs.
- Reduce risk and obtain customer feedback by using continuous delivery and experimentation frameworks.
- Independently manage an application or service through deployment and operations in production.
- Address and resolve live production issues by mitigating customer impact within SLA.
- Improve overall system reliability by producing long-term solutions through root cause analysis.
- Keep track of incidents by contributing to postmortem processes and logging live issues.
- Ensure infrastructure stays current by reducing technical debt, searching for bottlenecks, and preparing for scaling.
- Reduce cost of operations and maintenance by leveraging new technologies, automation, and partnering with vendors.
- Reduce human labor by writing software features that address availability, scalability, latency, and efficiency.
- Review and verify performance of production systems and network infrastructure by monitoring observability metrics, business KPIs, and capacity planning.
- Improve application reliability by advising development teams on setting appropriate observability metrics.
- Systematically identify patterns and underlying issues in complex situations and find solutions by applying logical and analytical thinking.
- Constructively evaluate and develop ideas, plans, and solutions by reviewing them, taking into account external knowledge, and articulating rationale.
- Identify opportunities for process, system, and structural improvements by examining current process flows, methods, and standards.
- Design and implement relevant improvements by defining adapted or new process flows, standards, and practices.
- Deliver clear, well-structured, and meaningful information to a target audience using suitable communication mediums.
- Achieve mutually agreeable solutions by staying adaptable, communicating ideas clearly, and practicing active listening.
- Ask relevant follow-up questions to properly engage with the speaker and understand them.
- Advise product teams towards a technical solution that meets functional, nonfunctional, and architectural requirements.
- Set a clear direction for a technical capability by evaluating and aligning target architecture improvements for varied stakeholders.
What We're Looking For
- A Bachelor's degree.
- Broad job knowledge with 3-5 years of relevant experience.
- Demonstrated experience in building software applications.
- Proficiency in software system design.
- Experience with end-to-end system ownership.
- Skills in technical incident management.
- Background in operations, including automation and reducing toil.
- Expertise in observability, monitoring, and alerting.
- Strong critical thinking and continuous quality & process improvement abilities.
- Effective communication skills.
- Ability to provide architectural guidance.

