Responsibilities
- Build and operate large-scale, high-throughput data pipelines powering AI-ready web search
- Design and maintain distributed ETL / ELT workflows for web indexing, embeddings, and analytics
- Develop and optimize data ingestion and transformation systems across structured and unstructured sources
- Manage and scale data storage layers (SQL, NoSQL, and object stores) for performance and reliability
- Participate in architecture discussions and contribute to system design for data infrastructure
- Collaborate with ML and backend teams to deliver high-quality, real-time data for ranking and retrieval
- Write clean, maintainable code and participate in code reviews
Tech Stack
Node.js, NestJS, Python, Java, PostgreSQL, OpenSearch, Vespa, Apache Storm, Kubernetes, GitHub Actions, Terraform
Benefits
- Flexible remote-work policy
- Opportunity to play a central role in engineering team and partner with founders
- Spearhead groundbreaking products shaping the future of AI-powered applications
Work Arrangement
Remote (Worldwide)
Team
Team size: small. Structure: technical team working shoulder to shoulder