As a Senior Data Engineer at Clario, you will play a critical role in designing and building the modern data infrastructure that powers advanced analytics, machine learning, and AI-driven innovation across our clinical technology platform. You will architect cloud-native, scalable, and secure data systems that support regulated clinical environments, ensuring data flows are reliable, compliant, and optimized for next-generation clinical insights.
What You'll Do
- Design, build, and maintain scalable ETL/ELT pipelines for structured and unstructured clinical data
- Develop and optimize data models supporting analytics, reporting, and machine learning workflows
- Build and maintain cloud-native data architectures within AWS environments
- Develop pipelines that support AI and machine learning model development and deployment
- Operationalize and productionize machine learning models developed by Data Science teams
- Ensure data quality, integrity, governance, and regulatory compliance
- Improve performance, reliability, and scalability of large-scale data platforms
- Collaborate closely with data scientists, AI engineers, software engineers, and product teams
- Translate clinical and business requirements into scalable data engineering solutions
- Implement monitoring, observability, and automated validation across data pipelines
- Contribute to data engineering standards, architecture design, and platform evolution
What We're Looking For
- Bachelor’s degree in Computer Science, Engineering, Mathematics, or a related quantitative field
- 5+ years of experience in data engineering or data platform development
- Strong proficiency in Python and SQL
- Experience designing and maintaining scalable data pipelines in cloud environments
- Hands-on experience with AWS services such as S3, Redshift, Glue, Lambda, EMR, or similar
- Strong understanding of data modeling, schema design, and performance optimization
- Experience supporting machine learning or AI workflows in production environments
- Experience working with distributed or large-scale data architectures
- Strong analytical, problem-solving, and communication skills
Nice to Have
- Experience in regulated industries such as healthcare, life sciences, or clinical research
- Experience with AI/ML data pipelines or generative AI workflows
- Experience handling large-scale or high-volume datasets
- Experience working with medical imaging data or complex healthcare data structures
Technical Stack
- Python, SQL
- AWS, S3, Redshift, Glue, Lambda, EMR
Clario is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.




