Responsibilities
- Ensure high availability and low latency across Google Cloud Platform systems, including Kubernetes, managed services, and data workflows
- Manage end-to-end incident response with on-call duties, documented procedures, post-incident reviews, and preventive follow-up actions
- Design and implement scalable database solutions using sharding, replication, query optimization, and capacity forecasting for MySQL and Postgres
- Collaborate with product teams to convert feature needs into resilient, scalable infrastructure architectures
- Maintain and enhance infrastructure-as-code configurations using Terraform for GCP and Kubernetes environments
- Oversee Elasticsearch clusters in production, including performance tuning, index lifecycle, sharding, upgrades, and capacity planning
- Develop comprehensive observability solutions with metrics, dashboards, alerts, and distributed tracing
- Enhance the stability and speed of CI/CD pipelines managed through GitHub Actions
- Strengthen security practices by improving identity access management, secret storage, and network isolation as part of routine infrastructure maintenance
Benefits
- Competitive pay and significant equity compensation
- Full health coverage including medical, dental, and vision plans
- 401(k) retirement savings plan
- Wellness offerings such as Wellhub membership and mental wellness support
- Paid leave for new parents and benefits for fertility and maternal health
- Generous paid time off policy
- Daily meals and commuter allowances available at the New York City office
- Financial support for learning and professional growth
Work Arrangement
On-site — NYC
Other
Daily meals and commuter benefits at our NYC HQ in Flatiron