About the Role
Develop robust, scalable software solutions for acquiring and processing diverse data sources in distributed environments.
Responsibilities
- Design and implement data ingestion pipelines for structured and unstructured content
- Optimize data flow and storage across distributed systems
- Collaborate with cross-functional teams to integrate data sources into core platforms
- Ensure data quality, consistency, and timely delivery
- Troubleshoot and resolve issues in production data pipelines
- Write clean, maintainable, and well-tested code
- Support scalability and reliability of data acquisition infrastructure
- Work with large datasets from various formats and protocols
- Monitor system performance and implement improvements
- Contribute to architectural decisions for data processing systems
- Maintain documentation for data workflows and system designs
- Follow engineering best practices for version control and code reviews
- Participate in planning and prioritization within the engineering team
- Respond to operational alerts and support on-call rotations
- Evaluate new tools and technologies for data handling
- Ensure compliance with security and privacy standards
- Support integration of real-time and batch data sources
- Improve error handling and recovery mechanisms in data pipelines
- Collaborate on schema design and data modeling
- Assist in onboarding new data sources and formats
Nice to Have
- Master’s degree in computer science or related field
- Experience with search engines or information retrieval systems
- Contributions to open-source data processing projects
- Knowledge of natural language processing concepts
- Experience with large-scale data indexing systems
- Familiarity with data privacy regulations
- Prior work with enterprise software platforms
- Exposure to machine learning pipelines
- Background in building high-throughput data systems
- Understanding of network protocols and data transfer optimization
Compensation
Competitive salary and benefits package
Work Arrangement
Hybrid work model with flexibility for remote and office-based work
Team
Part of the engineering team focused on data systems and scalable infrastructure
Why This Role Matters
This position plays a key role in enabling accurate and efficient data access across the platform, directly impacting the quality of search and analytics capabilities for end users.
What We Offer
Opportunities for professional growth, a collaborative environment, access to cutting-edge technologies, and a focus on innovation in data systems.
Available for qualified candidates