About the Role
The role involves building and optimizing data infrastructure to support analytics and machine learning initiatives across the organization.
Responsibilities
- Design and implement data pipelines for large-scale datasets
- Ensure data accuracy, reliability, and accessibility across systems
- Collaborate with data scientists and analysts to understand requirements
- Develop and maintain ETL processes and data transformation workflows
- Optimize data storage and query performance
- Monitor data pipeline health and troubleshoot issues
- Support data governance and compliance standards
- Work with cloud-based data platforms and services
- Integrate data from multiple sources into centralized repositories
- Document data models, schemas, and system architecture
- Improve data security and access controls
- Evaluate and adopt new data technologies and tools
- Participate in code reviews and technical design discussions
- Contribute to data warehouse design and maintenance
- Automate routine data operations and monitoring tasks
- Ensure scalability and reliability of data infrastructure
- Assist in defining best practices for data engineering
- Support business intelligence reporting needs
- Collaborate on data quality assurance initiatives
- Help onboard teams to new data platforms
Nice to Have
- Master’s degree in a technical field
- Experience with real-time data streaming platforms
- Background in retail or e-commerce data environments
- Contributions to open-source data projects
- Certifications in cloud data services
Compensation
Competitive salary and benefits package
Work Arrangement
Hybrid work model with flexibility for remote work
Team
Collaborative environment within a data-driven organization
Our Culture
We value transparency, innovation, and continuous learning. Team members are encouraged to experiment, share ideas, and drive improvements across the data ecosystem.
Growth Opportunities
Engineers are supported in pursuing professional development, attending conferences, and exploring emerging technologies relevant to data infrastructure.