Booking Holdings is hiring a Site Reliability Engineer I in Bangalore, India. In this role, you will specialize in treating operations as a software problem, focusing on the reliability of systems and services: availability, performance, scalability, latency, observability, and efficiency. You will be responsible for producing quality technical solutions for problems defined by the Engineering Manager and Product Owner within a global e-commerce company where data drives decisions and technology is at the core.
What You'll Do
- Build software applications using relevant development languages and knowledge of systems, services, and tools.
- Refactor and simplify code by introducing design patterns when necessary.
- Ensure application quality by following standard testing techniques and adhering to the test strategy.
- Write readable and reusable code by applying standard patterns and libraries.
- Maintain data security, integrity, and quality by following company standards and best practices.
- Evaluate possible architecture solutions considering cost, business requirements, technology needs, and emerging tech.
- Describe the implications of changing existing systems or adding new ones.
- Help grow the business and accelerate development by applying engineering techniques like prototyping and vendor evaluation.
- Design solutions meeting current needs and adaptable for future enhancements.
- Own a service end-to-end by monitoring health and performance, setting metrics, and acting when those metrics are violated.
- Reduce business continuity risks by applying best practices and writing documentation like runbooks and OpDocs.
- Reduce risk and gather customer feedback using continuous delivery and experimentation frameworks.
- Independently manage an application or service through deployment and operations in production.
- Address and resolve live production issues, mitigating customer impact within SLA.
- Improve overall system reliability by producing long-term solutions via root cause analysis.
- Keep track of incidents by contributing to postmortem processes and logging live issues.
- Ensure infrastructure stays current by reducing technical debt, searching for bottlenecks, and preparing for scaling.
- Reduce operational and maintenance costs by leveraging new technologies, automation, and vendor partnerships.
- Reduce human labor by writing software features that address availability, scalability, latency, and efficiency.
- Review performance of production systems and network infrastructure by monitoring observability metrics, KPIs, and capacity.
- Improve application reliability by partnering with development teams to advise on setting observability metrics.
- Systematically identify patterns and underlying issues in complex situations to find logical solutions.
- Constructively evaluate and develop ideas, plans, and solutions by reviewing them and initiating SMART improvements.
- Identify opportunities for process, system, and structural improvements by examining current workflows and standards.
- Design and implement relevant improvements by defining new or adapted process flows, standards, and practices.
- Deliver clear, well-structured information to a target audience using suitable communication mediums and language.
- Achieve mutually agreeable solutions by staying adaptable, communicating clearly, and practicing active listening.
- Ask relevant follow-up questions to properly engage and understand speakers using listening and reflection techniques.
- Advise product teams on technical solutions that meet functional, nonfunctional, and architectural requirements.
What We're Looking For
- Sufficient knowledge to build software applications using relevant languages, systems, services, and tools.
- Sufficient knowledge to refactor and simplify code using design patterns.
- Sufficient knowledge to ensure application quality with standard testing techniques.
- Basic knowledge to write readable, reusable code using standard patterns and libraries.
- Basic knowledge to maintain data security, integrity, and quality following standards and best practices.
- Basic knowledge to evaluate architecture solutions considering cost, requirements, and emerging technologies.
- Basic knowledge to describe the implications of system changes.
- Basic knowledge to apply engineering techniques like prototyping and vendor evaluation to aid business growth.
- Basic knowledge to design solutions meeting current needs and adaptable for the future.
- Basic knowledge to own a service end-to-end by monitoring metrics and health.
- Basic knowledge to reduce business continuity risks with best practices and documentation.
- Basic knowledge to use continuous delivery and experimentation frameworks to reduce risk and get feedback.
- Sufficient knowledge to independently manage an application or service through deployment and production operations.
- Basic knowledge to address and resolve live production issues within SLA.
- Basic knowledge to improve system reliability through root cause analysis and long-term solutions.
- Basic knowledge to keep track of incidents via postmortems and logging.
- Basic knowledge to keep infrastructure current by reducing technical debt and preparing for scale.
- Basic knowledge to reduce operational costs via new technologies, automation, and vendor partnerships.
- Basic knowledge to reduce human labor by writing software for availability, scalability, latency, and efficiency.
- Sufficient knowledge to review performance by monitoring observability metrics, KPIs, and capacity planning.
- Basic knowledge to improve application reliability by advising development teams on observability metrics.
- Sufficient knowledge to systematically identify patterns and issues in complex situations and find logical solutions.
- Sufficient knowledge to constructively evaluate ideas and develop SMART improvements.
- Basic knowledge to identify improvement opportunities by examining current processes and standards.
- Basic knowledge to design and implement improvements by defining new process flows and practices.
- Sufficient knowledge to deliver clear, structured information tailored to the audience.
- Sufficient knowledge to achieve agreeable solutions through adaptability, clear communication, and active listening.
- Sufficient knowledge to ask relevant follow-up questions using listening and reflection techniques.
- Basic knowledge to advise product teams on technical solutions that meet requirements by challenging designs and providing architectural context.
Team & Environment
You will be part of a business service owner team, own a piece of infrastructure, and/or consult with product development teams on matters within Site Reliability Engineering scope. You will report to an Engineering Manager and Product Owner.
Booking Holdings is an equal opportunity employer in accordance with all applicable federal, state and local laws. We ensure equal employment opportunity to all employees and applicants without discrimination or harassment based on race, religious creed, color, age, sex, sexual orientation, and other protected characteristics.

