Responsibilities
- Design and implement reusable data pipelines driven by metadata, emphasizing efficient database architecture and advanced SQL optimization
- Work deeply with datasets and data systems hosted on Amazon Web Services
- Support end-to-end development and tuning of ETL workflows
- Automate and enhance processes within the data platform for greater efficiency
- Develop reliable connections between data sources and downstream applications
- Identify and resolve data quality and system performance issues proactively
- Use DBT for modeling data transformations and Jenkins or GitHub Actions to manage CI/CD pipelines
- Assist in maintaining platform documentation and operational runbooks
- Suggest and execute architectural improvements for the data platform