Google seeks a Systems Engineer to join its Site Reliability Engineering (SRE) organization. SRE combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems, ensuring our services are reliable and rapidly improving. You'll manage the complex challenges of scale unique to Google, applying your expertise in coding, algorithms, and large-scale system design.
What You'll Do
- Design, build, and maintain high-reliability, scalable, and secure on-premise and hybrid cloud infrastructures.
- Oversee critical manufacturing floor systems, including servers, storage, networking, and industrial control systems.
- Develop automation tools and scripts (using Python, Go) to manage repetitive tasks, system maintenance, and data management processes.
- Define and implement comprehensive monitoring and alerting for all critical manufacturing systems, including both IT and OT components.
- Manage Identity and Access Management (IAM) for factory floor systems, on-prem infrastructure, and cloud resources, enforcing the principle of least privilege.
- Manage and integrate on-premise infrastructure (VMware clusters, physical servers, storage arrays, network hardware) with Google Cloud Platform (GCP) services.
What We're Looking For
- Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
- 2 years of experience with scripting in one or more languages (e.g., Python, JavaScript, PowerShell, Bash/Shell, Perl).
- 2 years of experience working with system administration (e.g., filesystems, inodes, system calls) or networking (e.g., TCP/IP, routing, network topologies and hardware, SDN).
- Experience developing automation in cloud environments.
- Experience writing scripts or playbooks in one or more technologies (e.g., Terraform, Ansible, Shell scripts).
Nice to Have
- Master's degree in Computer Science or Engineering.
- 2 years of experience designing, analyzing, and troubleshooting large-scale distributed systems.
- 2 years of experience with data structures and algorithms.
- Experience with Google Cloud Platform or other similar cloud technologies.
- Experience implementing and supporting internet-based applications, web/mail servers, and operating systems, with a proven track record in managing Identity Management or Directory Services (LDAP, AD).
- Experience as an administrator and familiarity with the Linux environment.
Technical Stack
- Languages: Python, Go, JavaScript, PowerShell, Bash/Shell, Perl
- Infrastructure as Code: Terraform, Ansible
- Cloud & Platforms: Google Cloud Platform, VMware
- Identity & OS: LDAP, Active Directory, Linux
Team & Environment
You will be part of Google's Site Reliability Engineering (SRE) team, working in a culture of intellectual curiosity, problem solving, and openness. We encourage collaboration, thinking big, and taking risks in a blame-free environment, while providing the support and mentorship needed to learn and grow.
Benefits & Compensation
- Compensation range: $147,000-$211,000
- Equity grants
- Performance bonus
- Comprehensive health benefits
Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status.




