Phiture is hiring a Senior Software Engineer to build and operate Upbound Spaces, the multi-control plane management software at the heart of the Upbound Platform. In this role, you will help scale Upbound to support thousands of control planes and extend enterprise control plane management across cloud and on-premises environments.
What You'll Do
- Actively build and operate Upbound Spaces in production, troubleshooting and resolving issues across multi-tenant SaaS environments.
- Contribute to Upbound's open-source projects, including Crossplane.
- Take ownership of building features in high demand by Upbound's customers.
- Investigate and debug complex issues in customer environments, including multi-control plane scenarios, resource reconciliation problems, and performance bottlenecks.
- Communicate through thoughtful design documents for new initiatives and detailed post-incident reviews.
- Support the full project lifecycle for highly scalable and reliable services running in a cloud environment.
- Write and maintain Go code that interfaces with the Kubernetes API, with a focus on observability, debuggability, and operational excellence.
- Deploy, manage, and troubleshoot Kubernetes services in production, using metrics, logs, and traces.
- Build and maintain operational tooling for debugging customer environments, analyzing control plane health, and automating incident response.
- Author documentation, user guides, runbooks, and blog posts to support and promote new features.
- Support the software release cycle for Spaces self-hosted distributions.
- Participate in an on-call rotation to support Upbound Cloud, responding to incidents and driving them to resolution.
What We're Looking For
- Experience operating production cloud services at scale: monitoring, alerting, incident response, post-mortems, and continuous improvement of service reliability.
- Strong debugging skills across distributed systems, including experience with observability tools like Prometheus, Grafana, OpenTelemetry, and distributed tracing.
- Experience building and operating controllers that interact with the Kubernetes API server, including troubleshooting reconciliation loops, managing API rate limits, and optimizing controller performance.
- Comfortable working directly with customers to understand, reproduce, and resolve complex technical issues in their environments.
- Take responsibility and ownership for solving problems even if they are outside your lane, especially during incidents affecting customer workloads.
- Demonstrate excellence in your work, constantly trying to improve your skills and the operational posture of the systems you build.
- Have empathy for customers and keep them in mind as you build solutions, understanding that reliability and debuggability are features.
- Realize the importance of clear communication and effective collaboration to work as a team, deliver great results, and support customers through technical challenges.
- Help create a safe environment where everyone can contribute, learn from failures, share on-call knowledge, and help each other grow as operators and engineers.
Technical Stack
- Go
- Kubernetes API
- Prometheus
- Grafana
- OpenTelemetry
Team & Environment
You will be part of the Spaces team.
Work Mode
This is a remote position with a global scope. A listed location is Atlanta, Georgia, United States.



