About the Role
This role involves building and optimizing data infrastructure to support real-time and batch processing. The engineer will work closely with data scientists and analysts to ensure data accuracy, availability, and performance across platforms.
Responsibilities
- Design and implement scalable data pipelines for ingestion and transformation
- Monitor and troubleshoot data workflows to ensure reliability and uptime
- Optimize data storage and query performance across distributed systems
- Collaborate with data teams to understand requirements and deliver solutions
- Maintain documentation for data architecture and operational procedures
- Automate deployment and configuration of data infrastructure
- Ensure data quality through validation and monitoring systems
- Support compliance with data governance and security standards
- Integrate new data sources into existing pipelines
- Improve system observability with logging and alerting frameworks
- Work with cloud-based data platforms and services
- Implement version control for data pipeline code
- Contribute to disaster recovery and backup strategies
- Evaluate and adopt new data technologies and tools
- Participate in code reviews and system design discussions
- Assist in capacity planning for data systems
- Troubleshoot cross-system data inconsistencies
- Ensure efficient data lineage tracking
- Support data pipeline scalability during peak loads
- Coordinate with DevOps for CI/CD integration
- Maintain uptime and SLA adherence for data services
- Refactor legacy data workflows for improved performance
- Implement access controls for sensitive data
- Conduct root cause analysis for data incidents
- Promote best practices in data engineering across teams
Nice to Have
- Master’s degree in a technical field
- Experience with real-time data streaming technologies
- Knowledge of machine learning data lifecycle
- Contributions to open-source data projects
- Certifications in cloud data services
- Experience in high-growth startup environments
- Familiarity with data observability platforms
- Background in mobile or advertising data domains
Compensation
Competitive salary and benefits package commensurate with experience
Work Arrangement
Hybrid work model with flexible remote options
Team
Collaborative data engineering team focused on scalable data infrastructure
Tech Stack
Airflow, Kafka, Snowflake, AWS, Docker, Kubernetes, Terraform, Prometheus, Grafana, Git, Python, SQL
Impact
Your work will directly influence data reliability and speed, enabling faster insights and better decision-making across the organization.
Available for qualified candidates


