About the Role
The engineer will develop robust backend systems that generate and manage fingerprints for large-scale data collections, ensuring accuracy, performance, and integration across platforms.
Responsibilities
- Design and build backend services for data fingerprinting
- Implement algorithms that generate unique identifiers for data sets
- Ensure system scalability under high data throughput
- Optimize performance of fingerprint computation and storage
- Collaborate with research teams on novel identification methods
- Maintain high standards for code quality and testing
- Integrate fingerprinting systems with existing data pipelines
- Troubleshoot and resolve production issues promptly
- Document system architecture and technical decisions
- Support deployment and monitoring in cloud environments
- Work closely with frontend teams for seamless API delivery
- Evaluate new technologies for potential adoption
- Contribute to security practices for data integrity
- Participate in code reviews and technical planning
- Ensure compliance with data handling standards
- Improve system reliability and fault tolerance
- Monitor system performance and usage patterns
- Refactor legacy components for better maintainability
- Assist in defining product requirements from technical insights
- Develop internal tools for operational efficiency
- Support reproducibility of data processing workflows
- Maintain backward compatibility during system upgrades
- Engage in cross-team knowledge sharing
- Respond to feedback from internal and external users
- Contribute to long-term technical roadmap
Nice to Have
- Experience with data provenance systems
- Background in digital fingerprinting or hashing techniques
- Work with metadata-rich data environments
- Prior role in research or academic computing
- Familiarity with formal verification methods
- Knowledge of information retrieval systems
- Experience in high-assurance software domains
- Involvement in open-source projects
- Understanding of reproducibility challenges in data science
- Prior work with immutable data structures
Compensation
Competitive salary with equity and benefits
Work Arrangement
Hybrid or remote options available
Team
Small, interdisciplinary team focused on research and development
About the Project
This role supports a research initiative focused on creating reliable methods to identify and track data collections across systems. The work enables verification of data origin and integrity in dynamic environments.
Technology Environment
The team uses modern backend frameworks, cloud-native architectures, and automated tooling. Development emphasizes correctness, auditability, and long-term maintainability over rapid feature delivery.
Available for qualified candidates