Poland or New York

Capstone Integrated Solutions is hiring a Data Engineer

Responsibilities

  • Design and manage ETL pipelines using AWS Glue, Python, and Apache Spark.
  • Construct and refine data lakes on AWS with S3, Lake Formation, and Glue Data Catalog.
  • Apply efficient data partitioning, schema management, and performance optimization in distributed systems.
  • Work closely with data scientists, analysts, and business teams to provide accurate and timely data.
  • Establish and uphold metadata frameworks, data lineage tracking, and governance policies in AWS environments.
  • Oversee, debug, and enhance ETL workflows for improved scalability, dependability, and cost efficiency.
  • Consolidate structured and unstructured data from diverse sources into unified storage platforms.
  • Maintain adherence to data security protocols, privacy standards, and regulatory mandates.
  • Support the design and strategic direction of enterprise-wide data infrastructure and analytics systems.

Benefits

  • Remote work

Work Arrangement

Remote

Other

No Agencies Please!

Required Skills
PythonApache SparkpySparkScalaETLData Engineering
About company
Capstone Integrated Solutions
Capstone Integrated Solutions is a comprehensive services provider. Our team consists of outstanding professionals, highly experienced in designing, building, and supporting retail software. We see ourselves as a build-as-a-service provider who follows a repeatable business pattern that can be applied to a variety of platforms and verticals. Having a culture built on outcomes and delivery at the core of the business, Capstone is providing its customers with a complete suite of services for software development, system analysis, integration, implementation, and support, as well as the option to engage a single team to perform all the services they require.
All jobs at Capstone Integrated Solutions Visit website
Job Details
Department Information Technology
Category data
Posted 3 months ago