Responsibilities
- Manage full-lifecycle data workflows, from initial requirements and architectural design to deployment and ongoing operational support, covering data ingestion, validation standards, and integration with data science systems.
- Develop and maintain high-performance batch and streaming data pipelines, along with lakehouse architectures, optimized for massive data volumes.
- Investigate and implement emerging technologies to expand the data platform’s capacity in response to accelerating data growth.
- Work closely with data science teams to operationalize machine learning and artificial intelligence models within production environments.
- Coordinate with cloud infrastructure, DevOps, software development, and client-facing teams to build reliable, secure, and scalable data solutions that address key business needs.
- Assess and integrate new tools, frameworks, and architectural patterns to advance the data ecosystem as demands for scale and complexity increase.