About the Role
The role involves building and improving core database functionality, collaborating with the open-source community, and enhancing system reliability and developer experience.
Responsibilities
- Design and implement core database features for vector storage and retrieval
- Optimize query performance and indexing mechanisms
- Write clean, maintainable, and well-tested code
- Respond to community pull requests and issue reports
- Collaborate with contributors across time zones
- Maintain documentation for developers and users
- Participate in architectural discussions and technical planning
- Improve tooling for local development and testing
- Support integration with popular machine learning frameworks
- Monitor and address security vulnerabilities
- Contribute to release planning and version management
- Engage in code reviews with team members
- Troubleshoot complex system behavior in production environments
- Enhance observability and logging across components
- Work on compatibility with cloud storage backends
- Develop benchmarks to measure system performance
- Assist in debugging user-reported issues
- Help shape project priorities based on community feedback
- Ensure backward compatibility during major updates
- Support the growth of the contributor community
Nice to Have
- Contributions to database or storage-related open-source projects
- Experience with Apache Arrow or related ecosystem tools
- Background in distributed systems design
- Knowledge of machine learning operations workflows
- Prior work with vector indexing algorithms like HNSW or IVF
- Experience mentoring other developers in open-source settings
- Public speaking or conference presentation experience
- Involvement in developer advocacy or community outreach
Compensation
Competitive salary with equity and benefits
Work Arrangement
Remote-first with flexible hours
Team
Small, distributed team focused on developer tools and open-source innovation
Open Source Philosophy
- We believe in transparent development and community-driven progress
- All major changes are discussed publicly before implementation
- Contributors are treated as core team members regardless of affiliation
Tech Stack
- Primary language: Rust
- Frontend components in Python and JavaScript
- Built on Apache Arrow for in-memory data representation
- Uses Parquet for persistent storage
- Deployed on AWS with Terraform and GitHub Actions
Available for qualified candidates