Data Science/Machine Learning Engineer – Remote (Continental U.S.)
Role Overview
This position focuses on applying advanced machine learning and natural language processing techniques to deliver impactful solutions for federal research and strategic initiatives. You'll play a central role in building, refining, and maintaining AI systems that turn data into actionable insights.
Key Responsibilities
- Partner with stakeholders to identify opportunities where data science can advance mission-driven objectives
- Extract, clean, and analyze high-volume datasets using statistical modeling and machine learning methods
- Evaluate and integrate new data sources, ensuring quality and relevance for modeling tasks
- Apply state-of-the-art large language models (LLMs) to text processing, information extraction, and document analysis workflows
- Develop and fine-tune LLMs using frameworks like Hugging Face Transformers, LangChain, and Llama Index
- Design evaluation strategies to measure model accuracy, reliability, and performance over time
- Create clear demonstrations and presentations that communicate technical findings to diverse audiences
- Iterate on models to adapt to changing requirements and improve real-world outcomes
- Implement monitoring systems to track data integrity and model behavior in production
Required Qualifications
- Minimum of 5 years in data science, analytics, or a related technical domain
- 2–3 years of direct experience with Large Language Models, including prompt engineering, fine-tuning, and instruction tuning
- Degree in Computer Science, Statistics, Mathematics, Data Science, or a quantitative field
- Strong programming skills in Python, with experience in R or SQL
- Hands-on experience with NLP techniques such as named entity recognition, text classification, and document parsing
- Familiarity with AWS tools including S3, Athena, SageMaker, Glue, and Bedrock
- Working knowledge of deep learning, regression, and time series analysis
- Ability to obtain Public Trust Clearance
Preferred Skills
- Experience with MLOps pipelines and large-scale data processing tools like Spark or Hadoop
- Background in document processing systems and structured data extraction
- Exposure to cloud-based AI development environments beyond AWS
Work Environment
This is a fully remote role open to candidates anywhere in the continental United States. While the team is distributed, collaboration is central to our approach. Candidates located in the DMV area may have more opportunities for in-person coordination.
Benefits
- Employer-paid medical insurance (three plan options)
- Dental and vision coverage
- Health and flexible spending accounts
- Life and disability insurance
- 401(k) plan with company match
- Paid vacation, sick leave, and holidays
- Support for continuing education and professional growth
- Remote work flexibility
Company Culture
We emphasize results, collaboration, and sustainable work practices. Our team values technical rigor and clear communication, with a shared commitment to using data science for public benefit. Work-life balance is built into our operating model, and we foster an inclusive, supportive environment for all team members.
Equal Opportunity Employer
We are committed to fair and inclusive hiring practices. All qualified applicants receive consideration regardless of race, color, religion, sex, gender identity, sexual orientation, national origin, veteran status, disability, age, or other protected characteristics under applicable law. This policy covers every aspect of employment, from recruitment to advancement.

