Nashville, Tennessee, United States
USD 46,000 - 111,000 Yearly

Capgemini is hiring an Associate Data Engineer

Responsibilities

  • Design and manage ETL and ELT workflows using Databricks with PySpark and Delta Lake.
  • Enhance data pipeline efficiency, cost-effectiveness, and scalability on Google Cloud Platform.
  • Develop both batch and real-time data processing systems using Spark Streaming and related tools.
  • Build data solutions leveraging BigQuery, Cloud Storage, Dataflow, Cloud Composer, and Vertex AI.
  • Follow cloud security standards, including IAM policies, monitoring setups, and cost controls.
  • Create and maintain data models such as dimensional schemas and data vault architectures.
  • Establish data quality processes, validation checks, and automated testing frameworks.
  • Handle data versioning, governance, and lineage tracking using Unity Catalog or GCP Data Catalog.
  • Work with diverse teams to convert business needs into technical data designs.
  • Offer technical direction and promote engineering best practices across projects.
  • Support the creation of documentation, system diagrams, and internal knowledge resources.
Required Skills
Databricks, PySpark, Spark, Delta Lake, GCP, BigQuery, Cloud Storage, Dataflow, Dataproc, Cloud Composer, Python, SQL, Data Warehousing, Data Architecture
About company
Capgemini
Our Client is one of the United States’ largest insurers, providing a wide range of insurance and financial services products, with gross written premiums well over US$25 billion (P&C). They proudly serve more than 10 million U.S. households with more than 19 million individual policies across all 50 states through the efforts of over 48,000 exclusive and independent agents and nearly 18,500 employees. Our Client is also part of one of the largest insurance groups in the world.
Job Details
Department Data and Analytics
Category data
Posted 2 months ago