Responsibilities
- Build, lead, and mentor multiple DevOps and Site Reliability Engineering team members, including managers, with a focus on operational excellence, reliability, and automation.
- Define and monitor KPIs and SLIs/SLOs to measure reliability, system performance, and operational maturity.
- Implement robust observability frameworks encompassing logging, metrics, and tracing to ensure proactive monitoring and incident response.
- Foster a culture of continuous improvement by driving efficiency, reducing technical debt, and simplifying complex systems, while engaging in day-to-day team efforts and escalations.
- Architect, implement, and optimize scalable and secure cloud environments (AWS, GCP, Azure) to support enterprise-grade applications and platforms.
- Define and execute the enterprise DevOps strategy to align with broader engineering and business objectives.
- Drive the design and deployment of CI/CD pipelines and automated release processes to enhance development velocity and reliability.
- Lead large-scale modernization initiatives, including cloud migrations, containerization (Kubernetes), and platform consolidation.
- Leverage data and analytics to measure and improve system health, deployment efficiency, and incident response performance.
- Integrate security best practices throughout the SDLC, aligning with standards such as SOC2, FedRAMP, and GDPR.
- Ensure the creation and maintenance of technical documentation, workflows, and knowledge base articles.
- Instill quality-of-work standards and clear expectations for projects and tasks.
- Proactively identify issues and enact solutions that have a significant and quantifiable impact on OKRs and business objectives.
- Evaluate and adopt emerging technologies, frameworks, and methodologies that enhance performance, scalability, and developer productivity.
- Partner closely with engineering, security, and product leadership to ensure cohesive execution and alignment with organizational goals.
- Stay up to date with the latest Azure developments, regulatory changes, and industry trends, advising teams on potential impacts and necessary adjustments.
- Own financial budgeting, business case analysis, and annual planning for the department.
- Troubleshoot outages and manage on-call coverage.
- Lead team members through technical hurdles and challenges.
Requirements
- BS/BA in Computer Science, Engineering, or a related field, or equivalent work experience.
- 10+ years of software-related experience required (Site Reliability, DevOps, Release Engineering).
- 4+ years building and managing high-performing engineering teams, or in similar roles across development or operations teams.
- Experience working for a cloud service provider (CSP), managed service provider (MSP), or enterprise SaaS company.
- Advanced knowledge of SaaS application architecture and design.
- Experience running and monitoring large-scale distributed systems.
- Deep understanding of cloud-native concepts including elasticity, interconnectivity, security, and identity management.
- Experience leading teams with the following technologies, tools, and concepts:
  - Deploying and managing solutions hosted on major public cloud providers (Azure, AWS, GCP).
  - Hybrid Microsoft and Linux-based technology stack, including Windows Server, .NET/C#, IIS, and SQL Server, alongside Linux (Alpine, Ubuntu) and container orchestration via Kubernetes.
  - Automating processes using PowerShell, CLI, Bash, Python, or other scripting languages.
  - Strong understanding of Azure Kubernetes Service (AKS) with container-based deployment skills, or of other platforms such as OpenShift, GKE, or EKS.
  - Identity management using Azure AD, Okta, OpenID Connect (OIDC), and SAML.
  - Working knowledge of various cryptographic algorithms and protocols (TLS, mTLS, SSH, AES).
  - Hands-on experience with orchestration, configuration management, and CI/CD tools (e.g., Terraform, ArgoCD, Azure DevOps Pipelines, Git).
Nice to Have
- Experience working for a cloud service provider (CSP), managed service provider (MSP), or enterprise SaaS company.
- Comfortable leading complex projects and seeing them through to delivery.
- Experience working with geographically distributed teams.
- Public cloud certifications or training (Azure, AWS, or GCP).
- Certifications such as Information Technology Infrastructure Library (ITIL) Foundation, Microsoft Certified Professional (MCP), CompTIA Security+, or Microsoft Certified: Azure Administrator Associate.
Benefits
- Vision
- Medical
- Life
- Dental
- 401(k)
Who We Are
OneStream is how today’s Finance teams can go beyond just reporting on the past and Take Finance Further™ by steering the business to the future. It’s the only enterprise finance platform that unifies financial and operational data, embeds AI for better decisions and productivity, and empowers the CFO to become a critical driver of business strategy and execution. Our vision is to be the operating system for modern finance, digitizing core financial functions. To learn more, visit www.onestream.com.
Why Join the OneStream Team
- Transparency around corporate structure, salary, and benefits
- Core value of customer success
- Variety of project work (not industry-specific)
- Strong culture and camaraderie
- Multiple training opportunities
Additional Information
- Travel Requirement: Travel is not expected to exceed 5%.
- All candidates must be legally authorized to work for any company in the country where this position is located without sponsorship.
- OneStream is an Equal Opportunity Employer.


