The Senior Data Engineer at Absentia Labs will own the architecture of end-to-end data systems for large-scale biomedical datasets. This role is central to shaping the data infrastructure of an AI-driven biomedical platform: making long-term architectural decisions and ensuring data is modeled, validated, versioned, and served reliably across scientific and machine learning workflows.
What You'll Do
- Architect and lead the design of end-to-end data systems for large-scale biomedical datasets (chemical, biological, toxicology, omics, assay, clinical, and experimental data).
- Define and evolve schema-driven data models that reconcile noisy, semi-structured, and heterogeneous sources into coherent, interoperable representations.
- Establish best practices for data quality, validation, provenance, lineage, and versioning suitable for scientific and ML workflows.
- Build and maintain cloud-native data infrastructure (data lakes, warehouses, object storage, streaming systems) with an emphasis on scalability and reliability.
- Design pipelines that support both batch and streaming access for ML training, evaluation, and inference.
- Partner closely with ML engineers, scientists, and product leads to translate research needs into durable data abstractions.
- Make principled trade-offs around performance, cost, flexibility, and correctness in production systems.
- Provide technical leadership through design reviews, architectural guidance, and mentorship of other engineers.
- Identify and proactively address systemic risks in data integrity, scalability, and operational complexity.
What We're Looking For
- 5+ years of experience in data engineering, platform engineering, or ML infrastructure roles, with clear ownership of production systems.
- Proven experience designing and operating large-scale, production-grade data pipelines.
- Strong proficiency in Python and data-centric software engineering practices.
- Deep experience with cloud platforms (AWS, GCP, or Azure), including storage, compute, and security primitives.
- Familiarity with distributed data processing and orchestration systems (e.g., Spark, Beam, Ray, Airflow, Dagster).
- Experience supporting ML/AI workloads, including dataset generation, feature pipelines, and reproducible training workflows.
- Strong architectural judgment and the ability to communicate technical decisions clearly across disciplines.
Nice to Have
- Prior work with biomedical or life-science data (e.g., omics, assays, molecular representations, clinical or toxicology data).
- Experience with streaming platforms (Kafka, Pub/Sub, Kinesis).
- Exposure to ontology-aware data modeling or schema evolution in scientific domains.
- Infrastructure-as-code and systems experience (Terraform, Docker, Kubernetes).
- Experience in early-stage startups or research-heavy environments.
- Open-source contributions or technical publications.
Technical Stack
- Python, AWS, GCP, Azure, Spark, Beam, Ray, Airflow, Dagster, Kafka, Pub/Sub, Kinesis, Terraform, Docker, Kubernetes
Benefits & Compensation
- A chance to architect the data backbone of an AI-driven biomedical platform.
- Direct impact on how scientific data is translated into machine intelligence.
- High autonomy, high trust, and ownership over critical systems.
- Flexible remote or hybrid work arrangements.
- A deeply technical, low-ego culture focused on learning and rigor.
- Competitive compensation and meaningful equity participation.
Absentia Labs is an equal opportunity employer. We value diversity and are committed to creating an inclusive environment for all employees.