Responsibilities
- Develop and manage both batch and streaming data pipelines for ingestion and transformation across various business units.
- Partner with data engineering, analytics, and data science teams to deliver trustworthy, high-quality, and business-ready data assets.
- Utilize cloud and data platforms such as Google Cloud Platform, Confluent Cloud with Kafka, Dataform, Airflow, Terraform, and GitLab to build robust production data solutions.
- Enable seamless integration of diverse source systems using Change Data Capture tools like Debezium and real-time streaming technologies.
- Operate in an agile team environment emphasizing automation, iterative improvement, and consistent delivery.
- Create, deploy, and oversee scalable data workflows supporting both real-time and batch data processing.
- Design, implement, and manage data models and warehouse schemas within BigQuery.
- Construct and maintain integrations for data exchange between internal systems and external brands, both incoming and outgoing.
- Engage in code reviews, testing practices, and knowledge sharing to uphold strong engineering standards.
- Configure and maintain monitoring and alerting systems to track data pipeline performance and data integrity.
- Work closely with analytics, data science, and finance departments to support data-driven decision-making.
- Continuously enhance the efficiency, scalability, and reliability of data infrastructure and services.
- Maintain clear documentation of data pipelines, schemas, and architectural designs.
Work Arrangement
Remote (Country) — CEE