Remote (Global)

People Data Labs is hiring a Data Engineer

About the Role

The role involves developing robust data workflows, ensuring data integrity, and enabling efficient data access across systems. The engineer will work closely with data scientists and analysts to deliver reliable datasets for business insights and product development.

Responsibilities

  • Design and implement data pipelines for large-scale datasets
  • Ensure data accuracy and consistency across sources
  • Optimize data storage and retrieval performance
  • Collaborate with cross-functional teams to define data needs
  • Maintain data infrastructure reliability and scalability
  • Monitor data workflows for errors and inefficiencies
  • Support data governance and compliance standards
  • Troubleshoot and resolve data-related issues
  • Improve data processing efficiency and automation
  • Document data models, schemas, and pipeline logic
  • Integrate third-party data sources into internal systems
  • Develop tools for data quality validation
  • Work with streaming and batch data processing
  • Use version control for data pipeline code
  • Participate in code reviews and system design discussions
  • Contribute to data architecture strategy
  • Ensure secure handling of sensitive data
  • Evaluate new data technologies and tools
  • Assist in onboarding and training team members
  • Support data warehouse operations and updates

Nice to Have

  • Experience with real-time data streaming platforms
  • Familiarity with machine learning pipelines
  • Knowledge of data observability tools
  • Experience in a fast-paced startup environment
  • Contributions to open-source data projects
  • Advanced degree in a technical field
  • Experience with data lineage tracking
  • Background in handling large public datasets

Compensation

Competitive salary and benefits package

Work Arrangement

Remote-friendly with flexible scheduling

Team

Collaborative team focused on data infrastructure and quality

Our Data Philosophy

We believe high-quality data is foundational to ethical AI and meaningful insights. Our systems prioritize accuracy, transparency, and responsible use.

Engineering Culture

We value clean code, thoughtful design, and continuous improvement. Engineers are encouraged to propose solutions and lead technical initiatives.

Available for qualified candidates

Required Skills
  • Spark
  • SQL
  • AWS
  • Databricks
  • Python
  • Apache Spark
  • Airflow
  • dbt
  • dagster
  • Delta Lake
  • Data Engineering
  • Data Infrastructure
  • ETL
  • Data Modeling
  • Scalable Systems

About the Company
People Data Labs
People Data Labs (PDL) is the provider of people and company data. We do the heavy lifting of data collection and standardization so our customers can focus on building and scaling innovative, compliant data solutions. Our sole focus is on building the best data available by integrating thousands of compliantly sourced datasets into a single, developer-friendly source of truth.
Job Details
Category: Data
Posted: 6 months ago