About the Role

The role involves building and improving core database functionality, collaborating with the open-source community, and enhancing system reliability and developer experience.

Responsibilities

Design and implement core database features for vector storage and retrieval
Optimize query performance and indexing mechanisms
Write clean, maintainable, and well-tested code
Respond to community pull requests and issue reports
Collaborate with contributors across time zones
Maintain documentation for developers and users
Participate in architectural discussions and technical planning
Improve tooling for local development and testing
Support integration with popular machine learning frameworks
Monitor and address security vulnerabilities
Contribute to release planning and version management
Engage in code reviews with team members
Troubleshoot complex system behavior in production environments
Enhance observability and logging across components
Work on compatibility with cloud storage backends
Develop benchmarks to measure system performance
Assist in debugging user-reported issues
Help shape project priorities based on community feedback
Ensure backward compatibility during major updates
Support the growth of the contributor community

Nice to Have

Contributions to database or storage-related open-source projects
Experience with Apache Arrow or related ecosystem tools
Background in distributed systems design
Knowledge of machine learning operations workflows
Prior work with vector indexing algorithms like HNSW or IVF
Experience mentoring other developers in open-source settings
Public speaking or conference presentation experience
Involvement in developer advocacy or community outreach

Compensation

Competitive salary with equity and benefits

Work Arrangement

Remote-first with flexible hours

Team

Small, distributed team focused on developer tools and open-source innovation

Open Source Philosophy

We believe in transparent development and community-driven progress
All major changes are discussed publicly before implementation
Contributors are treated as core team members regardless of affiliation

Tech Stack

Primary language: Rust
Frontend components in Python and JavaScript
Built on Apache Arrow for in-memory data representation
Uses Parquet for persistent storage
Deployed on AWS with Terraform and GitHub Actions

Available for qualified candidates

LanceDB is hiring an Open Source Engineer
