Loadsmart is looking for a Senior Site Reliability Engineer to design our infrastructure, networking, and software platform architecture. You will be responsible for building and maintaining core systems, internal tooling, and our observability platform.
What You'll Do
- Design infrastructure, networking, and software platform architecture.
- Define platform guidelines, requirements, and processes considering DevOps methodology.
- Build and maintain infrastructure automation using Infrastructure as Code tools.
- Ensure auditable delivery of infrastructure definition and changes.
- Automate Continuous Integration and Continuous Deployment pipelines.
- Develop and maintain Developer Experience and Productivity initiatives, service catalogs, and service maturity.
- Build and maintain the application platform used by all engineering teams.
- Manage multiple Kubernetes clusters.
- Design, develop, and maintain core systems using common programming languages.
- Build and maintain internal tooling used by all engineering teams.
- Troubleshoot infrastructure, internal applications, networking, and security issues.
- Build and maintain an observability platform, guidelines, and standards.
- Define the internal platform SLI/SLO/SLAs.
- Manage backup policies and operation.
- Maintain the fleet of databases, including upgrades, security patches, performance analysis, optimizations, and troubleshooting.
- Conduct security risk assessments, vulnerability scans, VPNs, and tests.
What We're Looking For
- Bachelor’s or foreign equivalent in Computer Science, Computer Engineering, or Information Technology.
- 2 years experience as a Site Reliability Engineer, Reliability Engineer, Cloud Engineer, Software Engineer, or related occupation.
Technical Stack
- Linux
- Python
- Go
- JavaScript
- Shell script
Work Mode
This position is based locally in Chicago, IL.
Loadsmart is an equal opportunity employer.


