About the Role
In this role, you will lead data engineering initiatives, mentor team members, design robust data architectures, and ensure high data quality across systems.
Responsibilities
- Lead the design and implementation of scalable data pipelines
- Collaborate with cross-functional teams to define data requirements
- Mentor engineers and promote best practices in data development
- Oversee data modeling and database design for analytics use cases
- Ensure data accuracy, consistency, and timely delivery
- Drive improvements in data observability and monitoring
- Evaluate and integrate new data tools and technologies
- Support production data systems with minimal downtime
- Work closely with analytics and product teams to deliver insights
- Define and enforce data governance standards
- Optimize query performance and storage efficiency
- Contribute to architectural decisions for data platforms
- Troubleshoot complex data issues across environments
- Promote automation in data workflows and testing
- Lead code reviews and ensure code quality standards
- Document technical designs and system changes
- Align data strategy with business objectives
- Participate in incident response for data-related outages
- Manage technical debt in data systems
- Support compliance with data privacy regulations
Nice to Have
- Master’s degree in computer science or a related field
- Experience with real-time data processing systems
- Knowledge of machine learning pipelines
- Background in financial or investment technology
- Contributions to open-source data projects
- Experience with Terraform or infrastructure as code
- Exposure to data mesh or domain-driven data architectures
Compensation
Competitive salary and benefits package
Work Arrangement
Remote position based in India
Team
Part of the engineering team focused on data systems and infrastructure
About the Team
This role is embedded within a distributed engineering organization that values clarity, ownership, and technical excellence. The data team enables analytics, reporting, and product features by building and maintaining reliable data infrastructure.
Technology Stack
Primary tools include Python, SQL, BigQuery, Airflow, and GitHub. Infrastructure is cloud-native, built on Google Cloud Platform and managed with infrastructure-as-code (IaC) practices. The team uses modern observability and monitoring solutions to maintain data health.
Not available for this role